Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only aims to help the reader 'think in MapReduce' but also discusses the limitations of the programming model.
by Anand Rajaraman, Jeffrey D. Ullman - Stanford University
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
by Tony Gill, et al. - Getty Publications
This book provides an overview of metadata, its types, roles, and characteristics; a discussion of metadata as it relates to resources on the Web; a description of methods, tools, standards, and protocols used to publish digital collections; etc.
by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
by Paris C. Kanellakis - Brown University Providence
The goal of this paper is to provide a systematic and unifying introduction to relational database theory, including some of the recent developments in database logic programming. The exposition closes with the problems of complex objects...