Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
Description:
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Download or read it online for free here:
Download link
(2MB, PDF)
Similar books

by Tom Jewett
This text is a teaching resource for an introductory database class at California State University Long Beach, Department of Computer Engineering and Computer Science. It is also designed to be used as an individual self-study tutorial.
(21057 views)

by Chuck Ballard, et al. - IBM Redbooks
It covers data modeling techniques for data warehousing, within the context of the overall data warehouse development process. The process of data warehouse modeling, including the steps required before and after the actual modeling, is discussed.
(23392 views)

by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
(24790 views)

by Tony Gill, at al. - Getty Publications
This book provides an overview of metadata, its types, roles, and characteristics; a discussion of metadata as it relates to resources on the Web; a description of methods, tools, standards, and protocols used to publish digital collections; etc.
(17380 views)