Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

November 20, 2013

R and Solr Integration…

Filed under: R,Solr — Patrick Durusau @ 5:41 pm

R and Solr Integration Using Solr’s REST APIs by Jitender Aswani.

From the post:

Solr is the most popular, fast and reliable open source enterprise search platform from the Apache Luene project. Among many other features, we love its powerful full-text search, hit highlighting, faceted search, and near real-time indexing. Solr powers the search and navigation features of many of the world’s largest internet sites. Solr, written in Java, uses the Lucene Java search library for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language including R.

We invested significant amount of time integrating our R-based data-management platform with Solr using HTTP/JSON based REST interface. This integration allowed us to index millions of data-sets in solr in real-time as these data-sets get processed by R. It took us few days to stabilize and optimize this approach and we are very proud to share this approach and source code with you. The full source code can be found and downloaded from datadolph.in’s git repository.

The script has R functions for:

  • querying Solr and returning matching docs
  • posting a document to solr (taking a list and converting it to JSON before posting it)
  • deleting all indexes, deleting indexes for a certain document type and for a certain category within document type

Integration across systems is the lifeblood of enterprise IT systems.

I was extolling the virtues of reaching across silos earlier today.

A silo may provide comfort but it doesn’t offer much room for growth.

Or to put it another way, semantic integration doesn’t have one path, one process or one technology.

Once you’re past that, the rest is a question of requirements, resources and understanding identity in your domain (and/or across domains).

No Comments

No comments yet.

RSS feed for comments on this post.

Sorry, the comment form is closed at this time.

Powered by WordPress