Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
Description:
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Download or read it online for free here:
Download link
(2MB, PDF)
Similar books
Data Mining and Knowledge Discovery in Real Life Applications
by Julio Ponce, Adem Karahoca - InTech
This book presents theoretical and practical advances in data mining, along with applications in a range of promising areas. It is intended as a data mining reference that points students, researchers, and practitioners in the right direction.
(16868 views)
Data Mining Algorithms In R
- Wikibooks
Data mining comprises techniques and algorithms for finding interesting patterns in large datasets. There are currently hundreds of algorithms that perform tasks such as frequent pattern mining, clustering, and classification, among others.
(18539 views)
Concurrency Control and Recovery in Database Systems
by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
(22893 views)
Database design with UML and SQL
by Tom Jewett
This text is a teaching resource for an introductory database class at California State University Long Beach, Department of Computer Engineering and Computer Science. It is also designed to be used as an individual self-study tutorial.
(19637 views)