Indexing Files via Solr and Java MapReduce by Adam Smieszny.
From the post:
Several weeks ago, I set about to demonstrate the ease with which Solr and Map/Reduce can be integrated. I was unable to find a simple, yet comprehensive, primer on integrating the two technologies. So I set about to write one.
What follows is my bare-bones tutorial on getting Solr up and running to index each word of the complete works of Shakespeare. Note: Special thanks to Sematext for looking over the Solr bits and making sure they are sane. Check them out if you’re going to be doing a lot of work with Solr, ElasticSearch, or search in general and want to bring in the experts.
Looks like a nice weekend (if you are married, long night if not) project!
If you have the time, look over this post and report back on your experiences.
Particularly if you learn something new or see something others need to know about (such as other resources).