This entry was posted
on Thursday, January 13th, 2011 at 5:42 am and is filed under Data Mining, Similarity.
You can follow any responses to this entry through the RSS 2.0 feed.
Both comments and pings are currently closed.
One Response to “Scaling Jaccard Distance for Document Deduplication: Shingling, MinHash and Locality-Sensitive Hashing – Post”