FreeDiscovery: Open Source e-Discovery and Information Retrieval Engine
From the webpage:
FreeDiscovery is built on top of existing machine learning libraries (scikit-learn) and provides a REST API for information retrieval applications. It aims to benefit existing e-Discovery and information retrieval platforms with a focus on text categorization, semantic search, document clustering, duplicates detection and e-mail threading.
In addition, FreeDiscovery can be used as Python package and exposes several estimators with a scikit-learn compatible API.
…
Python 3.5+ required.
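To give a feel for one of the tasks listed above, duplicate detection, here is a minimal sketch in plain Python (standard library only). It is not FreeDiscovery's API or its implementation, and it does not use scikit-learn. The function names, the 0.8 threshold, and the sample documents are all made up for illustration. The idea is simply to turn each document into a TF-IDF vector and flag pairs whose cosine similarity is high.

```python
import math
from collections import Counter

PUNCT = ".,;:!?()\"'"


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    words = (w.strip(PUNCT).lower() for w in text.split())
    return [w for w in words if w]


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF, so a term found in every document keeps a non-zero weight.
        vectors.append(
            {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def near_duplicates(docs, threshold=0.8):
    """Return index pairs (i, j), i < j, whose similarity meets the threshold."""
    vectors = tfidf_vectors(docs)
    pairs = []
    for i in range(len(docs)):
        for j in range(i + 1, len(docs)):
            if cosine(vectors[i], vectors[j]) >= threshold:
                pairs.append((i, j))
    return pairs


# Made-up sample documents (hypothetical, not from the TREC collection).
DOCS = [
    "Please review the attached contract before Friday.",
    "Please review the attached contract before Friday!",
    "Lunch menu for the cafeteria this week.",
]

if __name__ == "__main__":
    print(near_duplicates(DOCS))
```

The all-pairs comparison is quadratic in the number of documents, which is fine for a toy example but not for a real collection; that scaling problem is one reason to reach for a dedicated engine instead.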
The homepage has command-line examples and points to http://freediscovery.io/doc/stable/examples/ for more.
The additional examples use a subset of the TREC 2009 legal collection. Cool!
I saw this in a tweet by Lynn Cherny today.
Enjoy!