Programming Pig
by Alan F Gates
Publisher: O'Reilly Media, 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books

by Chris Eaton, et al. - McGraw-Hill
Big Data represents a new era in data exploration and utilization, and IBM helps clients navigate this transformation. The book reveals how to use Big Data technology to deliver a robust, secure, highly available, enterprise-class Big Data platform.
(10356 views)

by Eric Redmond - GitBook
This is a free little book about Riak, a scalable, high-availability NoSQL datastore. Riak is an open-source, distributed key/value database designed for high availability and near-linear scalability. Riak has remarkably high uptime and grows with you.
(7508 views)

by Allen Wyatt
Access for beginners: getting started, creating a database, sorting and filtering, queries, printing, simple reports, custom forms, Web features, data relationships, importing and exporting, data security, OLE, macros, dialog boxes and menus, and more.
(23432 views)

by J. C. Anderson, J. Lehnardt, N. Slater - O'Reilly Media
CouchDB's creators show you how to use this document-oriented database as a standalone application framework or with high-volume, distributed applications. CouchDB is ideal for web applications that handle huge amounts of loosely structured data.
(9899 views)