Manning, C.D.; Raghavan, P.; Schütze, H.: Introduction to information retrieval (2008)
0.01
0.013738594 = sum of:
0.012180208 = product of:
0.048720833 = sum of:
0.048720833 = weight(_text_:authors in 4041) [ClassicSimilarity], result of:
0.048720833 = score(doc=4041,freq=2.0), product of:
0.24182312 = queryWeight, product of:
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.053045183 = queryNorm
0.20147301 = fieldWeight in 4041, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.03125 = fieldNorm(doc=4041)
0.25 = coord(1/4)
0.0015583857 = product of:
0.0031167713 = sum of:
0.0031167713 = weight(_text_:a in 4041) [ClassicSimilarity], result of:
0.0031167713 = score(doc=4041,freq=2.0), product of:
0.06116359 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.053045183 = queryNorm
0.050957955 = fieldWeight in 4041, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.03125 = fieldNorm(doc=4041)
0.5 = coord(1/2)
- Abstract
- Class-tested and coherent, this textbook teaches information retrieval, including web search, text classification, and text clustering from basic concepts. Ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students. Slides and additional exercises are available for lecturers. - This book provides what Salton and Van Rijsbergen both failed to achieve. Even more important, unlike some other books in IR, the authors appear to care about making the theory as accessible as possible to the reader, on occasion including short primers to certain topics or choosing to explain difficult concepts using simplified approaches. Its coverage [is] excellent, the quality of writing high and I was surprised how much I learned from reading it. I think the online resources are impressive.
- Content
- Inhalt: Boolean retrieval - The term vocabulary & postings lists - Dictionaries and tolerant retrieval - Index construction - Index compression - Scoring, term weighting & the vector space model - Computing scores in a complete search system - Evaluation in information retrieval - Relevance feedback & query expansion - XML retrieval - Probabilistic information retrieval - Language models for information retrieval - Text classification & Naive Bayes - Vector space classification - Support vector machines & machine learning on documents - Flat clustering - Hierarchical clustering - Matrix decompositions & latent semantic indexing - Web search basics - Web crawling and indexes - Link analysis Vgl. die digitale Fassung unter: http://nlp.stanford.edu/IR-book/pdf/irbookprint.pdf.