Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

January 4, 2012

Hadoop for Archiving Email – Part 2

Filed under: Hadoop,Indexing,Lucene,Solr — Patrick Durusau @ 9:40 am

Hadoop for Archiving Email – Part 2 by Sunil Sitaula.

From the post:

Part 1 of this post covered how to convert and store email messages for archival purposes using Apache Hadoop, and outlined how to perform a rudimentary search through those archives. But, let’s face it: for search to be of any real value, you need robust features and a fast response time. To accomplish this we use Solr/Lucene-type indexing capabilities on top of HDFS and MapReduce.

Before getting into indexing within Hadoop, let us review the features of Lucene and Solr:

It continues Part 1 (my blog post) and mentions several applications and libraries that will be useful for indexing email.

