
Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books
Learn Access Nowby Allen Wyatt
Access for beginners: getting started, creating database, sorting and filtering, queries, printing, simple reports, custom forms, Web features, data relationships, importing and exporting, data security, OLE, macros, dialog boxes and menus, and more.
(27429 views)
A Little Riak Bookby Eric Redmond - GitBook
This is a free little book about Riak, a scalable, high availability NoSQL datastore. Riak is an open-source, distributed key/value database for high availability and near-linear scalability. Riak has remarkably high uptime and grows with you.
(10588 views)
Spring Data: Modern Data Access for Enterprise Javaby Mark Pollack, et al. - O'Reilly Media
This book shows you how Spring's data access framework can help you connect to either non-relational or relational databases. You'll learn how Spring Data's model reduces the learning curve for applications with newer data access technologies.
(16956 views)
Data Mining for the Massesby Matthew North - Global Text Project
In this book, professor Matt North uses simple examples, clear explanations and free, powerful, easy-to-use software to teach you the basics of data mining; techniques that can help you answer some of your toughest business questions.
(16864 views)