Luke « Another Word For It

Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

November 16, 2013

CLue

Filed under: Indexing,Lucene,Luke — Patrick Durusau @ 7:24 pm

CLue – Command Line tool for Apache Lucene by John Wang.

From the webpage:

When working with Lucene, it is often useful to inspect an index.

Luke is awesome, but often times it is not feasible to inspect an index on a remote machine using a GUI. That’s where Clue comes in. You can ssh into your production box and inspect your index using your favorite shell.

Another important feature for Clue is the ability to interact with other Unix commands via piping, e.g. grep, more etc.

…

[New in 0.0.4 Release]

Add ability to investigate indexes on HDFS

Add command to dump the index

Add command to import from a dumped index

Add configuration support, now you can configure Clue to run your own custom code

Add index trimming functionlity: sometimes you want a smaller index to work with

lucene 4.5.1 upgrade

Definitely a tool to investigate for adding to your tool belt!

Comments Off

July 25, 2012

Using Luke the Lucene Index Browser to develop Search Queries

Filed under: Lucene,Luke — Patrick Durusau @ 3:27 pm

Using Luke the Lucene Index Browser to develop Search Queries

From the post:

Luke is a GUI tool written in Java that allows you to browse the contents of a Lucene index, examine individual documents, and run queries over the index. Whether you’re developing with PyLucene, Lucene.NET, or Lucene Core, Luke is your friend.

Which also covers:

Downloading, running Luke ….

Exploring Document Indexing ….

Exploring Search ….

Using the Lucene XML Query Parser ….

Nothing surprising but a well written introduction to Luke.

Comments Off

December 28, 2011

Luke – The Lucene Index Toolbox v. 3.5.0

Filed under: Lucene,Luke — Patrick Durusau @ 9:33 pm

Luke – The Lucene Index Toolbox v. 3.5.0

Andrzej Bialecki writes:

I’m happy to announce the release of Luke – The Lucene Index Toolbox, version 3.5.0. This release includes Lucene 3.5.0 libraries, and you can download it from:

http://code.google.com/p/luke

Changes in version 3.5.0 (released on 2011.12.28):
* Update to Lucene 3.5.0 and fix some deprecated API usage.
* Issue 49 : fix faulty logic that prevented opening indexes in
read-only mode (MarkHarwood).
* Issue 43 : fix left-over references to Field (merkertr).
* Issue 42 : Luke should indicate if a field is a numeric field (merkertr).

Enjoy!

PS. Merry Christmas and a happy New Year to you all!

About Luke (from its homepage):

Lucene is an Open Source, mature and high-performance Java search engine. It is highly flexible, and scalable from hundreds to millions of documents.

Luke is a handy development and diagnostic tool, which accesses already existing Lucene indexes and allows you to display and modify their content in several ways:

browse by document number, or by term

view documents / copy to clipboard

retrieve a ranked list of most frequent terms

execute a search, and browse the results

analyze search results

selectively delete documents from the index

reconstruct the original document fields, edit them and re-insert to the index

optimize indexes

open indexes consisting of multiple parts, and located on Hadoop filesystem

and much more…

Current stable release of Luke is 3.5.0 and it includes Lucene 3.5.0 and Hadoop 0.20.2. Available is also Luke 1.0.1 (using Lucene 3.0.1), 0.9.9.1 based on Lucene 2.9.1, and other versions as well – please see the Downloads section.

Luke releases are numbered the same as the version of Lucene libraries that they use (plus a minor number in case of bugfix releases).

Below is a screenshot of the application showing the Overview section, which displays the details of the index format and some overall statistics.

Luke Overview tab

Comments Off

May 15, 2011

Luke 3.1

Filed under: Lucene,Luke — Patrick Durusau @ 5:55 pm

Luke 3.1

Luke is a development and diagnostic tool for use with Lucene.

Luke is now being numbered consistently with Lucene.

See my prior blog post on Luke.

Comments Off