Introduction to Information Retrieval by Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze.
A bit dated now (2008) but the underlying principles of information retrieval remain the same.
I have a hard copy but the additional materials and ability to cut-n-paste will make this a welcome resource!
We’d be pleased to get feedback about how this book works out as a textbook, what is missing, or covered in too much detail, or what is simply wrong. Please send any feedback or comments to: informationretrieval (at) yahoogroups (dot) com
Online resources
Apart from small differences (mainly concerning copy editing and figures), the online editions should have the same content as the print edition.
The following materials are available online. The date of last update is given in parentheses.
- HTML edition (2009.04.07)
- PDF of the book for online viewing (with nice hyperlink features, 2009.04.01)
- PDF of the book for printing (2009.04.01)
- PDFs of individual chapters (2009.04.01)
- Stanford slides and assignments (2013.09.13)
- University of Munich slides and assignments (2013.09.13)
- errata (2009.03.31)
- 8th European Summer School on information Retrieval (2011.08.28)
Information retrieval resources
A list of information retrieval resources is also available.
Introduction to Information Retrieval: Table of Contents
Front matter (incl. table of notations) pdf
02 The term vocabulary & postings lists pdf html
03 Dictionaries and tolerant retrieval pdf html
04 Index construction pdf html
06 Scoring, term weighting & the vector space model pdf html
07 Computing scores in a complete search system pdf html
08 Evaluation in information retrieval pdf html
09 Relevance feedback & query expansion pdf html
11 Probabilistic information retrieval pdf html
12 Language models for information retrieval pdf html
13 Text classification & Naive Bayes pdf html
14 Vector space classification pdf html
15 Support vector machines & machine learning on documents pdf html
16 Flat clustering pdf html Resources.
17 Hierarchical clustering pdf html
18 Matrix decompositions & latent semantic indexing pdf html
20 Web crawling and indexes pdf html
Bibliography & Index pdf
bibtex file bib