Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model as well.
Home page url
Download or read it online for free here:
by Chuck Ballard, et al. - IBM Redbooks
It covers data modeling techniques for data warehousing, within the context of the overall data warehouse development process. The process of data warehouse modeling, including the steps required before and after the actual modeling, is discussed.
by David Maier - Computer Science Press
The book is intended for a second course in databases and a reference for researchers in the field. The material covered includes relational algebra, functional dependencies, multivalued and join dependencies, normal forms, representation theory...
by C.J. Date, Hugh Darwen - Addison Wesley
This is a book on database management based on an earlier book by the same authors. It can be seen as an abstract blueprint for the design of a DBMS and the language interface to such a DBMS. It serves as a basis for a model of type inheritance.
by Kyriacos E. Pavlou, Richard T. Snodgrass - University of Arizona
The text on detection via cryptographic hashing. The authors show how to determine when the tampering occurred, what data was tampered, and who did the tampering. Four successively more sophisticated forensic analysis algorithms are presented.