Document (#36043)

Author
Manning, C.D.
Raghavan, P.
Schütze, H.
Title
Introduction to information retrieval
Imprint
Cambridge : Cambridge University Press
Year
2008
Pages
XXI, 482 S
Isbn
978-0-521-86571-5
Abstract
Class-tested and coherent, this textbook teaches information retrieval, including web search, text classification, and text clustering from basic concepts. Ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students. Slides and additional exercises are available for lecturers. - This book provides what Salton and Van Rijsbergen both failed to achieve. Even more important, unlike some other books in IR, the authors appear to care about making the theory as accessible as possible to the reader, on occasion including short primers to certain topics or choosing to explain difficult concepts using simplified approaches. Its coverage [is] excellent, the quality of writing high and I was surprised how much I learned from reading it. I think the online resources are impressive.
Content
Inhalt: Boolean retrieval - The term vocabulary & postings lists - Dictionaries and tolerant retrieval - Index construction - Index compression - Scoring, term weighting & the vector space model - Computing scores in a complete search system - Evaluation in information retrieval - Relevance feedback & query expansion - XML retrieval - Probabilistic information retrieval - Language models for information retrieval - Text classification & Naive Bayes - Vector space classification - Support vector machines & machine learning on documents - Flat clustering - Hierarchical clustering - Matrix decompositions & latent semantic indexing - Web search basics - Web crawling and indexes - Link analysis Vgl. die digitale Fassung unter: http://nlp.stanford.edu/IR-book/pdf/irbookprint.pdf.
LCSH
Text processing (Computer science)
Information retrieval
Document clustering
Semantic Web
RSWK
Dokumentverarbeitung / Information Retrieval / Abfrageverarbeitung (GBV)
Information Retrieval / Einführung (BVB)
Semantic Web (BVB)
Textverarbeitung (BVB)
World Wide Web / Suchmaschine (HBZ)
BK
54.64 / Datenbanken
DDC
025.04
GHBS
AZE (PB)
TWP (PB)
LCC
QA76.9.T48
RVK
ST 306
ST 205
ST 270
ST 515

Similar documents (author)

  1. Manning, R.W.: ¬The Anglo-American Cataloguing Rules and their future (1999) 2.24
    2.244621 = sum of:
      2.244621 = product of:
        4.489242 = sum of:
          4.489242 = weight(author_txt:manning in 810) [ClassicSimilarity], result of:
            4.489242 = score(doc=810,freq=1.0), product of:
              0.7841802 = queryWeight, product of:
                1.124153 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.076157615 = queryNorm
              5.724758 = fieldWeight in 810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.625 = fieldNorm(doc=810)
        0.5 = coord(1/2)
    
  2. Manning, R.W.: ¬The Anglo American Cataloguing Rules and their future (2000) 2.24
    2.244621 = sum of:
      2.244621 = product of:
        4.489242 = sum of:
          4.489242 = weight(author_txt:manning in 1315) [ClassicSimilarity], result of:
            4.489242 = score(doc=1315,freq=1.0), product of:
              0.7841802 = queryWeight, product of:
                1.124153 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.076157615 = queryNorm
              5.724758 = fieldWeight in 1315, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.625 = fieldNorm(doc=1315)
        0.5 = coord(1/2)
    
  3. Manning, C.D.: Part-of-Speech Tagging from 97% to 100% : is it time for some linguistics? (2011) 2.24
    2.244621 = sum of:
      2.244621 = product of:
        4.489242 = sum of:
          4.489242 = weight(author_txt:manning in 3122) [ClassicSimilarity], result of:
            4.489242 = score(doc=3122,freq=1.0), product of:
              0.7841802 = queryWeight, product of:
                1.124153 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.076157615 = queryNorm
              5.724758 = fieldWeight in 3122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.625 = fieldNorm(doc=3122)
        0.5 = coord(1/2)
    
  4. Mallett, J.; Manning, C.: Multimedia and database design : a discussion of database technology and its use in multimedia (1993) 1.80
    1.7956967 = sum of:
      1.7956967 = product of:
        3.5913935 = sum of:
          3.5913935 = weight(author_txt:manning in 6277) [ClassicSimilarity], result of:
            3.5913935 = score(doc=6277,freq=1.0), product of:
              0.7841802 = queryWeight, product of:
                1.124153 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.076157615 = queryNorm
              4.5798063 = fieldWeight in 6277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.5 = fieldNorm(doc=6277)
        0.5 = coord(1/2)
    
  5. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy Part-of-Speech Tagger (2000) 1.80
    1.7956967 = sum of:
      1.7956967 = product of:
        3.5913935 = sum of:
          3.5913935 = weight(author_txt:manning in 3061) [ClassicSimilarity], result of:
            3.5913935 = score(doc=3061,freq=1.0), product of:
              0.7841802 = queryWeight, product of:
                1.124153 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.076157615 = queryNorm
              4.5798063 = fieldWeight in 3061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.5 = fieldNorm(doc=3061)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Kuropka, D.: Modelle zur Repräsentation natürlichsprachlicher Dokumente : Ontologie-basiertes Information-Filtering und -Retrieval mit relationalen Datenbanken (2004) 0.15
    0.15255025 = sum of:
      0.15255025 = product of:
        0.9534391 = sum of:
          0.8294125 = weight(subject_txt:dokumentverarbeitung in 1326) [ClassicSimilarity], result of:
            0.8294125 = score(doc=1326,freq=2.0), product of:
              0.2764382 = queryWeight, product of:
                1.3036069 = boost
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.021864621 = queryNorm
              3.000354 = fieldWeight in 1326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.21875 = fieldNorm(doc=1326)
          0.036759924 = weight(abstract_txt:text in 1326) [ClassicSimilarity], result of:
            0.036759924 = score(doc=1326,freq=1.0), product of:
              0.14502065 = queryWeight, product of:
                1.6353968 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021864621 = queryNorm
              0.2534806 = fieldWeight in 1326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=1326)
          0.02262966 = weight(abstract_txt:information in 1326) [ClassicSimilarity], result of:
            0.02262966 = score(doc=1326,freq=2.0), product of:
              0.1049459 = queryWeight, product of:
                1.9674603 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.021864621 = queryNorm
              0.21563168 = fieldWeight in 1326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=1326)
          0.06463703 = weight(abstract_txt:retrieval in 1326) [ClassicSimilarity], result of:
            0.06463703 = score(doc=1326,freq=2.0), product of:
              0.2112683 = queryWeight, product of:
                2.7915177 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021864621 = queryNorm
              0.30594757 = fieldWeight in 1326, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=1326)
        0.16 = coord(4/25)
    
  2. Kuropka, D.: Modelle zur Repräsentation natürlichsprachlicher Dokumente : Ontologie-basiertes Information-Filtering und -Retrieval mit relationalen Datenbanken (2004) 0.15
    0.15255025 = sum of:
      0.15255025 = product of:
        0.9534391 = sum of:
          0.8294125 = weight(subject_txt:dokumentverarbeitung in 1386) [ClassicSimilarity], result of:
            0.8294125 = score(doc=1386,freq=2.0), product of:
              0.2764382 = queryWeight, product of:
                1.3036069 = boost
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.021864621 = queryNorm
              3.000354 = fieldWeight in 1386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.21875 = fieldNorm(doc=1386)
          0.036759924 = weight(abstract_txt:text in 1386) [ClassicSimilarity], result of:
            0.036759924 = score(doc=1386,freq=1.0), product of:
              0.14502065 = queryWeight, product of:
                1.6353968 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.021864621 = queryNorm
              0.2534806 = fieldWeight in 1386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=1386)
          0.02262966 = weight(abstract_txt:information in 1386) [ClassicSimilarity], result of:
            0.02262966 = score(doc=1386,freq=2.0), product of:
              0.1049459 = queryWeight, product of:
                1.9674603 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.021864621 = queryNorm
              0.21563168 = fieldWeight in 1386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=1386)
          0.06463703 = weight(abstract_txt:retrieval in 1386) [ClassicSimilarity], result of:
            0.06463703 = score(doc=1386,freq=2.0), product of:
              0.2112683 = queryWeight, product of:
                2.7915177 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021864621 = queryNorm
              0.30594757 = fieldWeight in 1386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.0625 = fieldNorm(doc=1386)
        0.16 = coord(4/25)
    
  3. Salton, G.; Rijsbergen, C.J. van; Maron, M.E.: Panel on key issues in information retrieval (1983) 0.12
    0.115998685 = sum of:
      0.115998685 = product of:
        0.7249918 = sum of:
          0.22940087 = weight(abstract_txt:salton in 7410) [ClassicSimilarity], result of:
            0.22940087 = score(doc=7410,freq=1.0), product of:
              0.23469889 = queryWeight, product of:
                1.2011663 = boost
                8.936469 = idf(docFreq=14, maxDocs=41962)
                0.021864621 = queryNorm
              0.9774263 = fieldWeight in 7410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.936469 = idf(docFreq=14, maxDocs=41962)
                0.109375 = fieldNorm(doc=7410)
          0.25412464 = weight(abstract_txt:rijsbergen in 7410) [ClassicSimilarity], result of:
            0.25412464 = score(doc=7410,freq=1.0), product of:
              0.25127283 = queryWeight, product of:
                1.2428547 = boost
                9.246624 = idf(docFreq=10, maxDocs=41962)
                0.021864621 = queryNorm
              1.0113494 = fieldWeight in 7410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.246624 = idf(docFreq=10, maxDocs=41962)
                0.109375 = fieldNorm(doc=7410)
          0.06261612 = weight(abstract_txt:information in 7410) [ClassicSimilarity], result of:
            0.06261612 = score(doc=7410,freq=5.0), product of:
              0.1049459 = queryWeight, product of:
                1.9674603 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.021864621 = queryNorm
              0.5966514 = fieldWeight in 7410, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.109375 = fieldNorm(doc=7410)
          0.1788502 = weight(abstract_txt:retrieval in 7410) [ClassicSimilarity], result of:
            0.1788502 = score(doc=7410,freq=5.0), product of:
              0.2112683 = queryWeight, product of:
                2.7915177 = boost
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.021864621 = queryNorm
              0.8465548 = fieldWeight in 7410, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4614017 = idf(docFreq=3579, maxDocs=41962)
                0.109375 = fieldNorm(doc=7410)
        0.16 = coord(4/25)
    
  4. Kommers, P.A.M.; Ferreira, A.; Kwak, A.K.: Document management for hypermedia design (1997) 0.12
    0.11585722 = sum of:
      0.11585722 = product of:
        0.9654769 = sum of:
          0.83783317 = weight(subject_txt:dokumentverarbeitung in 585) [ClassicSimilarity], result of:
            0.83783317 = score(doc=585,freq=1.0), product of:
              0.2764382 = queryWeight, product of:
                1.3036069 = boost
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.021864621 = queryNorm
              3.0308154 = fieldWeight in 585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.3125 = fieldNorm(doc=585)
          0.09564051 = weight(abstract_txt:making in 585) [ClassicSimilarity], result of:
            0.09564051 = score(doc=585,freq=1.0), product of:
              0.1509701 = queryWeight, product of:
                1.3624108 = boost
                5.0680504 = idf(docFreq=717, maxDocs=41962)
                0.021864621 = queryNorm
              0.6335063 = fieldWeight in 585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0680504 = idf(docFreq=717, maxDocs=41962)
                0.125 = fieldNorm(doc=585)
          0.03200317 = weight(abstract_txt:information in 585) [ClassicSimilarity], result of:
            0.03200317 = score(doc=585,freq=1.0), product of:
              0.1049459 = queryWeight, product of:
                1.9674603 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.021864621 = queryNorm
              0.30494925 = fieldWeight in 585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.125 = fieldNorm(doc=585)
        0.12 = coord(3/25)
    
  5. Colomb, R.M.: Information spaces : the architecture of cyberspace (2002) 0.11
    0.112782836 = sum of:
      0.112782836 = product of:
        0.70489275 = sum of:
          0.03902902 = weight(abstract_txt:including in 2263) [ClassicSimilarity], result of:
            0.03902902 = score(doc=2263,freq=1.0), product of:
              0.11362349 = queryWeight, product of:
                1.1819434 = boost
                4.396727 = idf(docFreq=1404, maxDocs=41962)
                0.021864621 = queryNorm
              0.3434943 = fieldWeight in 2263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.396727 = idf(docFreq=1404, maxDocs=41962)
                0.078125 = fieldNorm(doc=2263)
          0.04473606 = weight(abstract_txt:concepts in 2263) [ClassicSimilarity], result of:
            0.04473606 = score(doc=2263,freq=1.0), product of:
              0.12444616 = queryWeight, product of:
                1.2369535 = boost
                4.60136 = idf(docFreq=1144, maxDocs=41962)
                0.021864621 = queryNorm
              0.35948125 = fieldWeight in 2263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.60136 = idf(docFreq=1144, maxDocs=41962)
                0.078125 = fieldNorm(doc=2263)
          0.58648324 = weight(subject_txt:dokumentverarbeitung in 2263) [ClassicSimilarity], result of:
            0.58648324 = score(doc=2263,freq=1.0), product of:
              0.2764382 = queryWeight, product of:
                1.3036069 = boost
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.021864621 = queryNorm
              2.1215708 = fieldWeight in 2263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.698609 = idf(docFreq=6, maxDocs=41962)
                0.21875 = fieldNorm(doc=2263)
          0.034644447 = weight(abstract_txt:information in 2263) [ClassicSimilarity], result of:
            0.034644447 = score(doc=2263,freq=3.0), product of:
              0.1049459 = queryWeight, product of:
                1.9674603 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.021864621 = queryNorm
              0.33011723 = fieldWeight in 2263, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.078125 = fieldNorm(doc=2263)
        0.16 = coord(4/25)