Introducing Hadoop in 20 pages by Prashant Sharma.
Getting started in hadoop for a newbie is a non trivial task, with amount of knowledge base available a significant amount of effort is gone in figuring out, where and how should one start exploring this field. Introducing hadoop in 20 pages is a concise document to briefly introduce just the right information in right amount, before starting out in-depth in this field. This document is intended to be used as a first and shortest guide to both understand and use Map-Reduce for building distributed data processing applications.
Well, counting the annexes, it's 35 pages, but it is still useful. It could use some copy editing.
Disappointing, though, because an introduction to the entire Hadoop ecosystem that carried a single example throughout, even just an inverted index of a text, would be a better exercise at this point in Hadoop development. Ideally it would come in two versions: one with the code examples at the end, for people who want a high-level view, and one with the code inline and commented, for people who want to code along.
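To make the inverted index suggestion concrete, here is a minimal sketch of what such a running example might look like. It is plain Python that imitates the map, shuffle, and reduce steps in memory; it is not Hadoop code and not taken from Sharma's document. The function names and the two sample documents are my own assumptions for illustration.

```python
import re
from collections import defaultdict


def map_phase(doc_id, text):
    """Map step: emit one (word, doc_id) pair per word in a document."""
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        yield word, doc_id


def shuffle(pairs):
    """Shuffle step: group all emitted values by key (the word)."""
    grouped = defaultdict(list)
    for word, doc_id in pairs:
        grouped[word].append(doc_id)
    return grouped


def reduce_phase(word, doc_ids):
    """Reduce step: collapse duplicates into a sorted posting list."""
    return word, sorted(set(doc_ids))


def build_inverted_index(documents):
    """Run map, shuffle, and reduce over a dict of {doc_id: text}."""
    pairs = (
        pair
        for doc_id, text in documents.items()
        for pair in map_phase(doc_id, text)
    )
    grouped = shuffle(pairs)
    return dict(reduce_phase(word, ids) for word, ids in grouped.items())


# Hypothetical sample input, invented for this sketch.
docs = {
    "doc1": "Hadoop stores data",
    "doc2": "Hadoop processes data in parallel",
}
index = build_inverted_index(docs)
```

In a real Hadoop job the shuffle is done by the framework between the mapper and the reducer; here it is an ordinary dictionary so the whole flow stays visible in one file.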