Jeff Dalton, of Jeff’s Search Engine Caffè, reports a new data mining book by Anand Rajaraman and Jeffrey D. Ullman (yes, that Jeffrey D. Ullman, think “dragon book”).
A free eBook, no less.
Read Jeff’s post on your way to get a copy.
Look for more comments as I read through it.
Has anyone written a comparison of the recent search engine titles? Just curious.
Update: A new version is out in hard copy, and the e-book remains available. See: Mining Massive Data Sets – Update
[…] Carpenter of Ling-Pipe Blog points out the treatment of Jaccard distance in Mining Massive Datasets by Anand Rajaraman and Jeffrey D. […]
Pingback by Scaling Jaccard Distance for Document Deduplication: Shingling, MinHash and Locality-Sensitive Hashing – Post « Another Word For It — January 13, 2011 @ 5:42 am