Document (#35308)

Author
Lavrenko, V.
Title
¬A generative theory of relevance
Imprint
Berlin : Springer
Year
2009
Pages
XX, 197 S
Isbn
978-3-540-89363-9
Series
The information retrieval series ; 26
Abstract
A modern information retrieval system must have the capability to find, organize and present very different manifestations of information - such as text, pictures, videos or database records - any of which may be of relevance to the user. However, the concept of relevance, while seemingly intuitive, is actually hard to define, and it's even harder to model in a formal way. Lavrenko does not attempt to bring forth a new definition of relevance, nor provide arguments as to why any particular definition might be theoretically superior or more complete. Instead, he takes a widely accepted, albeit somewhat conservative definition, makes several assumptions, and from them develops a new probabilistic model that explicitly captures that notion of relevance. With this book, he makes two major contributions to the field of information retrieval: first, a new way to look at topical relevance, complementing the two dominant models, i.e., the classical probabilistic model and the language modeling approach, and which explicitly combines documents, queries, and relevance in a single formalism; second, a new method for modeling exchangeable sequences of discrete random variables which does not make any structural assumptions about the data and which can also handle rare events. Thus his book is of major interest to researchers and graduate students in information retrieval who specialize in relevance modeling, ranking algorithms, and language modeling.
Content
Vgl. auch: http://dx.doi.org/10.1007/978-3-540-89364-6
Footnote
Rez. in: JASIST 60(2009) no.12, S.2587-2588 (R. Luk)
Theme
Retrievalalgorithmen
RSWK
Relevanz-Feedback / Information Retrieval
BK
06.74 / Informationssysteme
54.64 / Datenbanken
DDC
025.04 / DDC22ger

Similar documents (content)

  1. Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.32
    0.3158692 = sum of:
      0.3158692 = product of:
        0.9870913 = sum of:
          0.05473925 = weight(abstract_txt:language in 3023) [ClassicSimilarity], result of:
            0.05473925 = score(doc=3023,freq=6.0), product of:
              0.085173495 = queryWeight, product of:
                1.05092 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.019306168 = queryNorm
              0.6426794 = fieldWeight in 3023, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.021688187 = weight(abstract_txt:which in 3023) [ClassicSimilarity], result of:
            0.021688187 = score(doc=3023,freq=2.0), product of:
              0.08349065 = queryWeight, product of:
                1.4714698 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.019306168 = queryNorm
              0.25976786 = fieldWeight in 3023, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.029743163 = weight(abstract_txt:model in 3023) [ClassicSimilarity], result of:
            0.029743163 = score(doc=3023,freq=1.0), product of:
              0.117971614 = queryWeight, product of:
                1.5147879 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.019306168 = queryNorm
              0.25212136 = fieldWeight in 3023, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.010914571 = weight(abstract_txt:information in 3023) [ClassicSimilarity], result of:
            0.010914571 = score(doc=3023,freq=1.0), product of:
              0.07169245 = queryWeight, product of:
                1.5244884 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.019306168 = queryNorm
              0.15224156 = fieldWeight in 3023, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.16288222 = weight(abstract_txt:probabilistic in 3023) [ClassicSimilarity], result of:
            0.16288222 = score(doc=3023,freq=3.0), product of:
              0.22200581 = queryWeight, product of:
                1.6966786 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.019306168 = queryNorm
              0.7336845 = fieldWeight in 3023, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.04344175 = weight(abstract_txt:retrieval in 3023) [ClassicSimilarity], result of:
            0.04344175 = score(doc=3023,freq=3.0), product of:
              0.11589467 = queryWeight, product of:
                1.7336608 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.019306168 = queryNorm
              0.3748382 = fieldWeight in 3023, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.33778408 = weight(abstract_txt:modeling in 3023) [ClassicSimilarity], result of:
            0.33778408 = score(doc=3023,freq=6.0), product of:
              0.36102837 = queryWeight, product of:
                3.0598707 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.019306168 = queryNorm
              0.93561643 = fieldWeight in 3023, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
          0.32589802 = weight(abstract_txt:relevance in 3023) [ClassicSimilarity], result of:
            0.32589802 = score(doc=3023,freq=5.0), product of:
              0.47196174 = queryWeight, product of:
                4.947671 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.019306168 = queryNorm
              0.6905179 = fieldWeight in 3023, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.0625 = fieldNorm(doc=3023)
        0.32 = coord(8/25)
    
  2. Bodoff, D.; Wong, S.P.-S.: Documents and queries as random variables : history and implications (2006) 0.27
    0.27067852 = sum of:
      0.27067852 = product of:
        0.84587044 = sum of:
          0.039504647 = weight(abstract_txt:language in 1319) [ClassicSimilarity], result of:
            0.039504647 = score(doc=1319,freq=2.0), product of:
              0.085173495 = queryWeight, product of:
                1.05092 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.019306168 = queryNorm
              0.46381387 = fieldWeight in 1319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.01916983 = weight(abstract_txt:which in 1319) [ClassicSimilarity], result of:
            0.01916983 = score(doc=1319,freq=1.0), product of:
              0.08349065 = queryWeight, product of:
                1.4714698 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.019306168 = queryNorm
              0.22960453 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.037178952 = weight(abstract_txt:model in 1319) [ClassicSimilarity], result of:
            0.037178952 = score(doc=1319,freq=1.0), product of:
              0.117971614 = queryWeight, product of:
                1.5147879 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.019306168 = queryNorm
              0.3151517 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.013643214 = weight(abstract_txt:information in 1319) [ClassicSimilarity], result of:
            0.013643214 = score(doc=1319,freq=1.0), product of:
              0.07169245 = queryWeight, product of:
                1.5244884 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.019306168 = queryNorm
              0.19030195 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.20360278 = weight(abstract_txt:probabilistic in 1319) [ClassicSimilarity], result of:
            0.20360278 = score(doc=1319,freq=3.0), product of:
              0.22200581 = queryWeight, product of:
                1.6966786 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.019306168 = queryNorm
              0.9171056 = fieldWeight in 1319, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.031351384 = weight(abstract_txt:retrieval in 1319) [ClassicSimilarity], result of:
            0.031351384 = score(doc=1319,freq=1.0), product of:
              0.11589467 = queryWeight, product of:
                1.7336608 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.019306168 = queryNorm
              0.2705162 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.24377464 = weight(abstract_txt:modeling in 1319) [ClassicSimilarity], result of:
            0.24377464 = score(doc=1319,freq=2.0), product of:
              0.36102837 = queryWeight, product of:
                3.0598707 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.019306168 = queryNorm
              0.67522293 = fieldWeight in 1319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
          0.257645 = weight(abstract_txt:relevance in 1319) [ClassicSimilarity], result of:
            0.257645 = score(doc=1319,freq=2.0), product of:
              0.47196174 = queryWeight, product of:
                4.947671 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.019306168 = queryNorm
              0.5459023 = fieldWeight in 1319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.078125 = fieldNorm(doc=1319)
        0.32 = coord(8/25)
    
  3. Dominich, S.: ¬A unified mathematical definition of classical information retrieval (2000) 0.21
    0.21172474 = sum of:
      0.21172474 = product of:
        1.0586237 = sum of:
          0.03274371 = weight(abstract_txt:information in 5769) [ClassicSimilarity], result of:
            0.03274371 = score(doc=5769,freq=1.0), product of:
              0.07169245 = queryWeight, product of:
                1.5244884 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.019306168 = queryNorm
              0.45672467 = fieldWeight in 5769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.1875 = fieldNorm(doc=5769)
          0.2821203 = weight(abstract_txt:probabilistic in 5769) [ClassicSimilarity], result of:
            0.2821203 = score(doc=5769,freq=1.0), product of:
              0.22200581 = queryWeight, product of:
                1.6966786 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.019306168 = queryNorm
              1.2707788 = fieldWeight in 5769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.1875 = fieldNorm(doc=5769)
          0.07524332 = weight(abstract_txt:retrieval in 5769) [ClassicSimilarity], result of:
            0.07524332 = score(doc=5769,freq=1.0), product of:
              0.11589467 = queryWeight, product of:
                1.7336608 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.019306168 = queryNorm
              0.6492388 = fieldWeight in 5769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.1875 = fieldNorm(doc=5769)
          0.23127832 = weight(abstract_txt:definition in 5769) [ClassicSimilarity], result of:
            0.23127832 = score(doc=5769,freq=1.0), product of:
              0.22260173 = queryWeight, product of:
                2.0807855 = boost
                5.541217 = idf(docFreq=450, maxDocs=42306)
                0.019306168 = queryNorm
              1.0389781 = fieldWeight in 5769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.541217 = idf(docFreq=450, maxDocs=42306)
                0.1875 = fieldNorm(doc=5769)
          0.43723807 = weight(abstract_txt:relevance in 5769) [ClassicSimilarity], result of:
            0.43723807 = score(doc=5769,freq=1.0), product of:
              0.47196174 = queryWeight, product of:
                4.947671 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.019306168 = queryNorm
              0.92642695 = fieldWeight in 5769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.1875 = fieldNorm(doc=5769)
        0.2 = coord(5/25)
    
  4. Meij, E.; Trieschnigg, D.; Rijke, M. de; Kraaij, W.: Conceptual language models for domain-specific retrieval (2010) 0.21
    0.21060586 = sum of:
      0.21060586 = product of:
        0.75216377 = sum of:
          0.038706493 = weight(abstract_txt:language in 1239) [ClassicSimilarity], result of:
            0.038706493 = score(doc=1239,freq=3.0), product of:
              0.085173495 = queryWeight, product of:
                1.05092 = boost
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.019306168 = queryNorm
              0.45444295 = fieldWeight in 1239, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.197964 = idf(docFreq=1727, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.09331813 = weight(abstract_txt:generative in 1239) [ClassicSimilarity], result of:
            0.09331813 = score(doc=1239,freq=1.0), product of:
              0.17530313 = queryWeight, product of:
                1.0660983 = boost
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.019306168 = queryNorm
              0.5323244 = fieldWeight in 1239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.021688187 = weight(abstract_txt:which in 1239) [ClassicSimilarity], result of:
            0.021688187 = score(doc=1239,freq=2.0), product of:
              0.08349065 = queryWeight, product of:
                1.4714698 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.019306168 = queryNorm
              0.25976786 = fieldWeight in 1239, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.042063184 = weight(abstract_txt:model in 1239) [ClassicSimilarity], result of:
            0.042063184 = score(doc=1239,freq=2.0), product of:
              0.117971614 = queryWeight, product of:
                1.5147879 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.019306168 = queryNorm
              0.35655344 = fieldWeight in 1239, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.03547004 = weight(abstract_txt:retrieval in 1239) [ClassicSimilarity], result of:
            0.03547004 = score(doc=1239,freq=2.0), product of:
              0.11589467 = queryWeight, product of:
                1.7336608 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.019306168 = queryNorm
              0.30605412 = fieldWeight in 1239, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.1950197 = weight(abstract_txt:modeling in 1239) [ClassicSimilarity], result of:
            0.1950197 = score(doc=1239,freq=2.0), product of:
              0.36102837 = queryWeight, product of:
                3.0598707 = boost
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.019306168 = queryNorm
              0.54017836 = fieldWeight in 1239, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1114206 = idf(docFreq=254, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
          0.32589802 = weight(abstract_txt:relevance in 1239) [ClassicSimilarity], result of:
            0.32589802 = score(doc=1239,freq=5.0), product of:
              0.47196174 = queryWeight, product of:
                4.947671 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.019306168 = queryNorm
              0.6905179 = fieldWeight in 1239, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.0625 = fieldNorm(doc=1239)
        0.28 = coord(7/25)
    
  5. Bruza, P.D.; Huibers, T.W.C.: ¬A study of aboutness in information retrieval (1996) 0.17
    0.17052953 = sum of:
      0.17052953 = product of:
        0.7105397 = sum of:
          0.06309478 = weight(abstract_txt:model in 775) [ClassicSimilarity], result of:
            0.06309478 = score(doc=775,freq=2.0), product of:
              0.117971614 = queryWeight, product of:
                1.5147879 = boost
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.019306168 = queryNorm
              0.53483015 = fieldWeight in 775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0339417 = idf(docFreq=2035, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
          0.040102694 = weight(abstract_txt:information in 775) [ClassicSimilarity], result of:
            0.040102694 = score(doc=775,freq=6.0), product of:
              0.07169245 = queryWeight, product of:
                1.5244884 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.019306168 = queryNorm
              0.55937123 = fieldWeight in 775, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
          0.14106014 = weight(abstract_txt:probabilistic in 775) [ClassicSimilarity], result of:
            0.14106014 = score(doc=775,freq=1.0), product of:
              0.22200581 = queryWeight, product of:
                1.6966786 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.019306168 = queryNorm
              0.6353894 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
          0.08412459 = weight(abstract_txt:retrieval in 775) [ClassicSimilarity], result of:
            0.08412459 = score(doc=775,freq=5.0), product of:
              0.11589467 = queryWeight, product of:
                1.7336608 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.019306168 = queryNorm
              0.7258711 = fieldWeight in 775, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
          0.16353847 = weight(abstract_txt:definition in 775) [ClassicSimilarity], result of:
            0.16353847 = score(doc=775,freq=2.0), product of:
              0.22260173 = queryWeight, product of:
                2.0807855 = boost
                5.541217 = idf(docFreq=450, maxDocs=42306)
                0.019306168 = queryNorm
              0.7346685 = fieldWeight in 775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.541217 = idf(docFreq=450, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
          0.21861903 = weight(abstract_txt:relevance in 775) [ClassicSimilarity], result of:
            0.21861903 = score(doc=775,freq=1.0), product of:
              0.47196174 = queryWeight, product of:
                4.947671 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.019306168 = queryNorm
              0.46321347 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.09375 = fieldNorm(doc=775)
        0.24 = coord(6/25)