Mining of Massive Datasets by Anand Rajaraman, Jeffrey D. Ullman

Small book cover: Mining of Massive Datasets

Mining of Massive Datasets

Publisher: Stanford University
Number of pages: 340

At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.

Home page url

Download or read it online for free here:
Download link
(2MB, PDF)

Similar books

Book cover: Storage Basics: An Introduction to the Fundamentals of Storage TechnologyStorage Basics: An Introduction to the Fundamentals of Storage Technology
- Fujitsu Siemens Computers
This book is an introduction to storage technologies and storage networks. It also provides an overview of the storage product portfolio of Fujitsu Siemens Computers which is the basis for solutions that help you manage the growing flood of data.
Book cover: Data Modeling Techniques for Data WarehousingData Modeling Techniques for Data Warehousing
by - IBM Redbooks
It covers data modeling techniques for data warehousing, within the context of the overall data warehouse development process. The process of data warehouse modeling, including the steps required before and after the actual modeling, is discussed.
Book cover: Forensic Analysis of Database TamperingForensic Analysis of Database Tampering
by - University of Arizona
The text on detection via cryptographic hashing. The authors show how to determine when the tampering occurred, what data was tampered, and who did the tampering. Four successively more sophisticated forensic analysis algorithms are presented.
Book cover: Concurrency Control and Recovery in Database SystemsConcurrency Control and Recovery in Database Systems
by - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.