From Text to Truth: Real-World Facets for Multilingual Search by Benson Margulies.
Description:
Solr’s ability to facet search results gives end-users a valuable way to drill down to what they want. But for unstructured documents, deriving facets such as the persons mentioned requires advanced analytics. Even if names can be extracted from documents, the user doesn’t want a “George Bush” facet that intermingles documents mentioning either the 41st and 43rd U.S. Presidents, nor does she want separate facets for “George W. Bush” or even “乔治·沃克·布什” (a Chinese translation) that are limited to just one string. We’ll explore the benefits and challenges of empowering Solr users with real-world facets.
One of the better conference presentations I have seen in quite some time.
This is likely to change your mind about how you think about facets. Or at least how to construct them.
If you think of facets as the decoration you see at ecommerce sites, think again.
Enjoy!