Solr 4 preview: SolrCloud, NoSQL, and more
From the post:
The first alpha release of Solr 4 is quickly approaching, bringing powerful new features to enhance existing Solr powered applications, as well as enabling new applications by further blurring the lines between full-text search and NoSQL.
The largest set of features goes by the development code-name “Solr Cloud” and involves bringing easy scalability to Solr. Distributed indexing with no single points of failure has been designed from the ground up for near real-time (NRT), and NoSQL features such as realtime-get, optimistic locking, and durable updates.
We’ve incorporated Apache ZooKeeper, the rock-solid distributed coordination project that is immune to issues like split-brain syndrome that tend to plague other hand-rolled solutions. ZooKeeper holds the Solr configuration, and contains the cluster meta-data such as hosts, collections, shards, and replicas, which are core to providing an elastic search capability.
When a new node is brought up, it will automatically be assigned a role such as becoming an additional replica for a shard. A bounced node can do a quick “peer sync” by exchanging updates with its peers in order to bring itself back up to date. New nodes, or those that have been down too long, recover by replicating the whole index of a peer while concurrently buffering any new updates.
Run, don’t walk, to learn about the new features for Solr 4.
You won’t be disappointed.
Interested to see the “….blurriing [of] the lines between full-text search and NoSQL.”
Would be even more interested to see the “…blurring of indexing and data/data formats.”
That is to say that data, along with its format, is always indexed in digital media.
So why can’t I see the data as a table, as a graph, as a …., depending upon my requirements?
No ETL, JVD – Just View Differently.
Suspect I will have to wait a while for that, but in the mean time, enjoy Solr 4 alpha.