MapReduce Algorithms by Bill Bejeck.
Bill is writing a series of posts on implementing the algorithms given in pseudo-code in: Data-Intensive Text Processing with MapReduce.
- Working Through Data-Intensive Text Processing with MapReduce
- Working Through Data-Intensive Text Processing with MapReduce – Local Aggregation Part II
- Calculating A Co-Occurrence Matrix with Hadoop
- MapReduce Algorithms – Order Inversion
- Secondary Sorting
Another resource to try with your Hadoop Sandbox install!
I first saw this at Alex Popescu’s 3 MapReduce and Hadoop Links: Secondary Sorting, Hadoop-Based Letterpress, and Hadoop Vaidya.