
Mastering Apache Spark 2.0
by Jacek Laskowski
Publisher: GitBook 2016
Number of pages: 1621
Description:
This collection of notes (what some may rashly call a 'book') serves as my ultimate place to collect all the nuts and bolts of using Apache Spark. The notes aim to help me design and develop better products with Apache Spark.
Download or read it online for free: available online (HTML).
Similar books
Graph Databases by Ian Robinson, Jim Webber, Emil Eifrem - O'Reilly Media
Graph Databases, published by O'Reilly Media, discusses the problems that are well aligned with graph databases, with examples drawn from practical, real-world use cases. This book also looks at the ecosystem of complementary technologies.
(12318 views)
Programming Pig by Alan F Gates - O'Reilly Media
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs. The structure of Pig programs is amenable to parallelization, which enables them to handle very large data sets.
(12783 views)
The Little Redis Book by Karl Seguin - openmymind.net
Redis represents a simplification in the way we deal with data. It peels away much of the complexity and abstraction available in other systems. The goal of this book is to build the foundation you'll need to master Redis.
(11056 views)
Data Wrangling Handbook by Open Knowledge Foundation - School of Data
The Data Wrangling Handbook is a companion text to the School of Data. It functions much like a traditional textbook: it provides the detail and background theory to support the School of Data courses and challenges.
(10603 views)