Shrinking the Haystack with Solr and NLP by Wes Caldwell.
A very high level view of using Solr and NLP to shrink a data haystack but a useful one none the less.
If you think of this in the context of Chuck Hollis’ “modest data,” you begin to realize that the inputs may be “big data” but to be useful to a human analyst, it needs to be pared down to “modest data.”
Or even further to “actionable data.”
There’s an interesting contrast: Big data vs. Actionable data.
Ask your analyst if they prefer five terabytes of raw data or five pages of actionable data?
Adjust your deliverables accordingly.