Data-Intensive Text Processing with MapReduce will help answer the question: What subjects are available in a given torrent of information?
Or, perhaps the more interesting question, What subjects did you find in a given torrent of information?
Not exactly the same question is it?
The first presumes that we are going to find the same subjects and the second does not.
Download the Final Manuscript Support the authors by buying a copy as well: publisher’s site.
Authored by Jimmy Lin and Chris Dyer.
Very interested in hearing from anyone using MapReduce to mine texts for use in topic map construction.
*****
Updated to insert the authors. Opps! 20 April 2011
[…] Data-Intensive Text Processing with MapReduce by Lin and Dyer for more details on […]
Pingback by Cloud9: a MapReduce library for Hadoop « Another Word For It — November 29, 2010 @ 1:40 pm