SAXually Explicit Images: Data Mining Large Shape Databases by Eamonn Keogh.
ABSTRACT
The problem of indexing large collections of time series and images has received much attention in the last decade; however, we argue that there is potentially great untapped utility in data mining such collections. Consider the following two concrete examples of problems in data mining.
Motif Discovery (duplication detection): Given a large repository of time series or images, find approximately repeated patterns/images.
Discord Discovery: Given a large repository of time series or images, find the most unusual time series/image.
As we will show, both these problems have applications in fields as diverse as anthropology, crime…
Ancient history in the view of some: this is a Google talk from 2006!
But it is quite well done, and I enjoyed the unexpected application of time series representations to shape data for the purpose of evaluating matches. It is one of those insights that will stay with you and that seems obvious once someone has said it.
I think topic map authors (semantic investigators generally) need to report such insights for the benefit of others.
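For anyone who wants to play with the idea before watching the talk, here is a minimal sketch in Python of how a shape can be treated as a time series and then mined for a motif and a discord. It rests on my own assumptions, not on the talk's algorithms: the shape is unrolled into the distance from its centroid to each contour point, shapes are assumed to be already aligned (there is no rotation handling), comparison is plain Euclidean distance, and the search is brute force with no SAX at all. The helper names (`polar_contour`, `shape_to_series`, `motif_and_discord`) and the toy shapes are hypothetical.

```python
import math


def polar_contour(radius_fn, n_points=64):
    """Hypothetical helper: sample a closed contour from a polar radius function."""
    points = []
    for i in range(n_points):
        theta = 2 * math.pi * i / n_points
        r = radius_fn(theta)
        points.append((r * math.cos(theta), r * math.sin(theta)))
    return points


def shape_to_series(contour, n_samples=32):
    """Unroll a contour into centroid distances, divided by their mean for scale invariance."""
    cx = sum(x for x, _ in contour) / len(contour)
    cy = sum(y for _, y in contour) / len(contour)
    dists = [math.hypot(x - cx, y - cy) for x, y in contour]
    # Resample by taking evenly spaced contour points (assumes roughly even spacing).
    series = [dists[(i * len(dists)) // n_samples] for i in range(n_samples)]
    mean = sum(series) / n_samples
    return [d / mean for d in series]


def euclidean(a, b):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def motif_and_discord(series_list):
    """Brute force, O(n^2) comparisons.

    Motif: the closest pair of series.
    Discord: the series whose nearest neighbour is farthest away.
    """
    n = len(series_list)
    nearest = [math.inf] * n
    best_pair, best_dist = None, math.inf
    for i in range(n):
        for j in range(i + 1, n):
            d = euclidean(series_list[i], series_list[j])
            nearest[i] = min(nearest[i], d)
            nearest[j] = min(nearest[j], d)
            if d < best_dist:
                best_dist, best_pair = d, (i, j)
    discord = max(range(n), key=lambda k: nearest[k])
    return best_pair, discord


# Toy data: two near-duplicate two-lobed shapes (one drawn at double size),
# two three-lobed shapes, and one seven-lobed oddball.
shapes = [
    polar_contour(lambda t: 1 + 0.30 * math.cos(2 * t)),
    polar_contour(lambda t: 2 * (1 + 0.32 * math.cos(2 * t))),
    polar_contour(lambda t: 1 + 0.30 * math.cos(3 * t)),
    polar_contour(lambda t: 1 + 0.25 * math.cos(3 * t)),
    polar_contour(lambda t: 1 + 0.60 * math.cos(7 * t)),
]
series = [shape_to_series(s) for s in shapes]
motif, discord = motif_and_discord(series)
print("motif pair:", motif)  # motif pair: (0, 1)
print("discord:", discord)   # discord: 4
```

The brute-force search compares every pair, which is fine for five shapes and hopeless for a large repository. That gap is exactly why the indexing and approximation techniques in the talk matter.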