Logo

Programming Pig by Alan F Gates

Small book cover: Programming Pig

Programming Pig
by

Publisher: O'Reilly Media
Number of pages: 222

Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets.

Download or read it online for free here:
Download link
(6.4MB, PDF)

Similar books

Book cover: Mastering Apache Spark 2.0Mastering Apache Spark 2.0
by - GitBook
This collections of notes (what some may rashly call a 'book') serves as the ultimate place of mine to collect all the nuts and bolts of using Apache Spark. The notes aim to help me designing and developing better products with Apache Spark.
(4205 views)
Book cover: Data Wrangling HandbookData Wrangling Handbook
by - School of Data
The Data Wrangling Handbook is a companion text to the School of Data. Its function is something like a traditional textbook -- it will provide the detail and background theory to support the School of Data courses and challenges.
(6236 views)
Book cover: HBase: The Definitive GuideHBase: The Definitive Guide
by - O'Reilly Media
If you are looking for a solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. HBase scales to billions of rows and columns, while ensuring that performance remain constant.
(7751 views)
Book cover: Text Mining with R: A Tidy ApproachText Mining with R: A Tidy Approach
by - O'Reilly Media
With this practical book, you'll explore text-mining techniques with tidytext, a package that authors developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext can make text analysis easy and effective.
(1655 views)