HBase and Lucene for realtime search
From the post that starts this exciting thread:
I’m curious as to what a ‘good’ approach would be for implementing search in HBase (using Lucene) with the end goal being the integration of realtime search into HBase. I think the use case makes sense as HBase is realtime and has a write-ahead log, performs automatic partitioning, splitting of data, failover, redundancy, etc. These are all things Lucene does not have out of the box, that we’d essentially get for ‘free’.
For starters: Where would be the right place to store Lucene segments or postings? Eg, we need to be able to efficiently perform a linear iteration of the per-term posting list(s).
Thanks!
Jason Rutherglen
This could definitely have legs for exploring data sets, authoring topic maps or assuming a dynamic synonyms table, composed of conditions for synonymy, even acting as a topic map engine.
Will keep a close eye on this activity.