Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books
![Book cover: Developing PHP Applications for IBM Data Servers](images/3765.jpg)
- IBM Redbooks
The book provides extensive information for developers, including code samples for creating PHP applications with DB2, Informix Dynamic Server, and Cloudscape. We use the latest PHP data access extensions, including PHP Data Objects and ibm_db2.
(15174 views)
![Book cover: The Little Redis Book](images/7274.jpg)
by Karl Seguin - openmymind.net
Redis represents a simplification in the way we deal with data. It peels away much of the complexity and abstraction available in other systems. The goal of this book is to build the foundation you'll need to master Redis.
(9579 views)
![Book cover: Data Mining for the Masses](images/10114.jpg)
by Matthew North - Global Text Project
In this book, Professor Matt North uses simple examples, clear explanations, and free, powerful, easy-to-use software to teach you the basics of data mining: techniques that can help you answer some of your toughest business questions.
(14468 views)
![Book cover: Text Mining with R: A Tidy Approach](images/12304.jpg)
by Julia Silge, David Robinson - O'Reilly Media
With this practical book, you'll explore text-mining techniques with tidytext, a package that the authors developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext can make text analysis easy and effective.
(5583 views)