Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
ISBN/ASIN: 1608453421
ISBN-13: 9781608453429
Number of pages: 175
Description:
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model as well.
Download or read it online for free here:
Download link
(1.7MB, PDF)
Similar books

- National Academies Press
Using big data analytics to identify complex patterns hidden inside volumes of data that have never been combined could accelerate the rate of scientific discovery and lead to the development of beneficial technologies and products.
(6245 views)

by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
(22369 views)

by Serge Abiteboul, Richard Hull, Victor Vianu - Addison Wesley
This book provides in-depth coverage of the theory concerning the logical level of database management systems, including both classical and advanced topics. It includes detailed proofs and numerous examples and exercises.
(17057 views)

by Julio Ponce, Adem Karahoca - InTech
This book presents different ways of theoretical and practical advances and applications of data mining in different promising areas. The book will serve as a Data Mining bible to show a right way for the students, researchers and practitioners.
(16558 views)