Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
ISBN/ASIN: 1608453421
ISBN-13: 9781608453429
Number of pages: 175
Description:
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model.
Download or read it online for free here:
Download link
(1.7MB, PDF)
Similar books

by I. Androutsopoulos, G. D. Ritchie, P. Thanisch - arXiv
This paper is an introduction to natural language interfaces to databases (NLIDBs). Some advantages and disadvantages of NLIDBs are then discussed, comparing NLIDBs to formal query languages, form-based interfaces, and graphical interfaces.

by Arno Jan Knobbe - IOS Press
This thesis is concerned with Data Mining: extracting useful insights from large collections of data. As companies and institutions gain more opportunities to gather data, this subject has become increasingly important.

by Tom Jewett
This text is a teaching resource for an introductory database class at California State University Long Beach, Department of Computer Engineering and Computer Science. It is also designed to be used as an individual self-study tutorial.

by Neeraj Sharma, et al. - IBM Corporation
This free e-book teaches you the fundamentals of databases, including relational database theory, logical and physical database design, and the SQL language. Advanced topics include using functions, stored procedures and XML.