Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
Description:
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Download or read it online for free (2MB, PDF).
Similar books

by Arno Jan Knobbe - IOS Press
This thesis is concerned with Data Mining: extracting useful insights from large collections of data. As companies and institutions gain more ways to gather data, the subject has become increasingly important.
(16511 views)

by E. F. Codd - Addison-Wesley
Written by the originator of the relational model, the book covers the practical aspects of the design of relational databases. The author defines twelve rules that database management systems need to follow in order to be described as relational.
(21152 views)

by Clinton Gormley, Zachary Tong - O'Reilly
Whether you need full-text search or real-time analytics of data, this book introduces you to the fundamental concepts required to start working with Elasticsearch. With these foundations laid, it will move on to more-advanced search techniques.
(9098 views)

by Chuck Ballard, et al. - IBM Redbooks
This book covers data modeling techniques for data warehousing, within the context of the overall data warehouse development process. It discusses the process of data warehouse modeling, including the steps required before and after the actual modeling.
(23382 views)