Location Sensitive Hashing in Map Reduce by Ricky Ho.
From the post:
Inspired by Dr. Gautam Shroff who teaches the class: Web Intelligence and Big data in coursera.org, there are many scenarios where we want to compute similarity between large amount of items (e.g. photos, products, persons, resumes … etc). I want to add another algorithm to my Map/Reduce algorithm catalog.
For the background of Map/Reduce implementation on Hadoop. I have a previous post that covers the details.
“Location” here is not used in the geographic sense but as a general measure of distance. Could be geographic, but could be some other measure of location as well.