Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only aims to help the reader 'think in MapReduce', but also discusses the limitations of the programming model.
Data mining comprises techniques and algorithms for discovering interesting patterns in large datasets. There are currently hundreds of algorithms that perform tasks such as frequent pattern mining, clustering, and classification, among others.
by Tony Gill, et al. - Getty Publications
This book provides an overview of metadata, its types, roles, and characteristics; a discussion of metadata as it relates to resources on the Web; a description of methods, tools, standards, and protocols used to publish digital collections; etc.
by C.J. Date, Hugh Darwen
The database field is full of important problems still to be solved and interesting issues still to be examined -- and some of those problems and issues are explored in this book. It reports on some of our most recent investigations in this field.
by Arno Jan Knobbe - IOS Press
This thesis is concerned with Data Mining: extracting useful insights from large collections of data. With the increased possibilities in modern society for companies and institutions to gather data, this subject has become of increasing importance.