Archive for the ‘IndexTank’ Category

8 Best Open Source Search Engines built on top of Lucene

Tuesday, February 7th, 2012

8 Best Open Source Search Engines built on top of Lucene

By my count, only five (5) of the eight are based on Lucene. See what you think.

Lucene base:

  • Apache Solr
  • Compass
  • Constellio
  • ElasticSearch
  • Katta

No Lucene base:

  • Bobo Search
  • IndexTank
  • Summa

The post has short summaries of the search engines and links to their sites.

Do you think the terminology around search engines is as confused as that around NoSQL databases?

Any cross-terminology comparisons you would recommend to CIOs or even users?

IndexTank is now open source!

Saturday, December 24th, 2011

IndexTank is now open source! by Diego Basch, Director of Engineering, LinkedIn.

From the post:

We are proud to announce that the technology behind IndexTank has just been released as open-source software under the Apache 2.0 License! We promised to do this when LinkedIn acquired IndexTank, so here we go:

indextank-engine: Indexing engine

indextank-service: API, BackOffice, Storefront, and Nebulizer

We know that many of our users and other interested parties have been patiently waiting for this release. We want to thank you for your patience, for your kind emails, and for your continued support. We are looking forward to seeing IndexTank thrive as an open-source project. Of course we’ll do our part; our team is hard at work building search infrastructure at LinkedIn. We are part of a larger team that has built and released search technologies such as Zoie, Bobo, and just this past Monday, Cleo. We are excited to add IndexTank to this array of powerful open source tools.

From the homepage:


  • Truly real-time: instant updates without reindexing
  • Geo & Social aware: use location, votes, ratings or comments
  • Works with Ruby, Rails, Python, Java, PHP, .NET & more!

  • You control how to sort and score results
  • “Fuzzy”, Autocomplete, Facets for how users really search
  • Highlights & Snippets quickly shows search results relevance

  • Scalable from a personal blog to hundreds of millions of documents! (try Reddit)
  • Free up to 100K documents
  • Easier than SQL, SOLR/Lucene & Sphinx.

If you are looking for documentation, you had best look here rather than on GitHub.

So far, I haven’t seen anything out of the ordinary for a search engine. I mention it in case some people prefer it over others.

Do you see anything out of the ordinary?