Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model as well.
Home page url
Download or read it online for free here:
by Anand Rajaraman, Jeffrey D. Ullman - Stanford University
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
by Clinton Gormley, Zachary Tong - O'Reilly
Whether you need full-text search or real-time analytics of data, this book introduces you to the fundamental concepts required to start working with Elasticsearch. With these foundations laid, it will move on to more-advanced search techniques.
by Mohammed J. Zaki, Wagner Meira, Jr. - Cambridge University Press
This textbook provides a broad yet in-depth overview of data mining, integrating related concepts from machine learning and statistics. The main parts of the book include exploratory data analysis, pattern mining, clustering, and classification.
by Neeraj Sharma, at al. - IBM Corporation
This free e-book teaches you the fundamentals of databases, including relational database theory, logical and physical database design, and the SQL language. Advanced topics include using functions, stored procedures and XML.