Document (#35307)

Author
Lavrenko, V.
Title
¬A generative theory of relevance
Imprint
Berlin : Springer
Year
2009
Pages
XX, 197 p.
Isbn
978-3-540-89363-9
Series
The information retrieval series ; 26
Abstract
A modern information retrieval system must be able to find, organize and present very different manifestations of information - such as text, pictures, videos or database records - any of which may be of relevance to the user. However, the concept of relevance, while seemingly intuitive, is actually hard to define, and it is even harder to model in a formal way. Lavrenko does not attempt to bring forth a new definition of relevance, nor does he provide arguments as to why any particular definition might be theoretically superior or more complete. Instead, he takes a widely accepted, albeit somewhat conservative, definition, makes several assumptions, and from them develops a new probabilistic model that explicitly captures that notion of relevance. With this book, he makes two major contributions to the field of information retrieval: first, a new way to look at topical relevance, which complements the two dominant models, i.e., the classical probabilistic model and the language modeling approach, and explicitly combines documents, queries, and relevance in a single formalism; second, a new method for modeling exchangeable sequences of discrete random variables which does not make any structural assumptions about the data and which can also handle rare events. Thus his book is of major interest to researchers and graduate students in information retrieval who specialize in relevance modeling, ranking algorithms, and language modeling.
Content
See also: http://dx.doi.org/10.1007/978-3-540-89364-6
Footnote
Reviewed in: JASIST 60(2009) no.12, pp. 2587-2588 (R. Luk)
Theme
Retrieval algorithms
RSWK
Relevance feedback / Information retrieval
BK
06.74 / Information systems
54.64 / Databases
DDC
025.04 / DDC22ger

Similar documents (content)

  1. Larkey, L.S.; Connell, M.E.: Structured queries, language modelling, and relevance modelling in cross-language information retrieval (2005) 0.31
    0.31369412 = sum of:
      0.31369412 = product of:
        0.9802941 = sum of:
          0.05484366 = weight(abstract_txt:language in 1022) [ClassicSimilarity], result of:
            0.05484366 = score(doc=1022,freq=6.0), product of:
              0.08565992 = queryWeight, product of:
                1.053995 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.019433275 = queryNorm
              0.6402488 = fieldWeight in 1022, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.021483166 = weight(abstract_txt:which in 1022) [ClassicSimilarity], result of:
            0.021483166 = score(doc=1022,freq=2.0), product of:
              0.083331525 = queryWeight, product of:
                1.4701762 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.019433275 = queryNorm
              0.2578036 = fieldWeight in 1022, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.029083796 = weight(abstract_txt:model in 1022) [ClassicSimilarity], result of:
            0.029083796 = score(doc=1022,freq=1.0), product of:
              0.11673693 = queryWeight, product of:
                1.5069523 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.019433275 = queryNorm
              0.24913962 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.010858428 = weight(abstract_txt:information in 1022) [ClassicSimilarity], result of:
            0.010858428 = score(doc=1022,freq=1.0), product of:
              0.071763195 = queryWeight, product of:
                1.5253539 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019433275 = queryNorm
              0.15130915 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.1650162 = weight(abstract_txt:probabilistic in 1022) [ClassicSimilarity], result of:
            0.1650162 = score(doc=1022,freq=3.0), product of:
              0.22493367 = queryWeight, product of:
                1.7079571 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.019433275 = queryNorm
              0.73362166 = fieldWeight in 1022, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.04450192 = weight(abstract_txt:retrieval in 1022) [ClassicSimilarity], result of:
            0.04450192 = score(doc=1022,freq=3.0), product of:
              0.118294865 = queryWeight, product of:
                1.7516514 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019433275 = queryNorm
              0.37619486 = fieldWeight in 1022, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.3260749 = weight(abstract_txt:modeling in 1022) [ClassicSimilarity], result of:
            0.3260749 = score(doc=1022,freq=6.0), product of:
              0.35419977 = queryWeight, product of:
                3.0310204 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.019433275 = queryNorm
              0.920596 = fieldWeight in 1022, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
          0.32843205 = weight(abstract_txt:relevance in 1022) [ClassicSimilarity], result of:
            0.32843205 = score(doc=1022,freq=5.0), product of:
              0.47650868 = queryWeight, product of:
                4.971817 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019433275 = queryNorm
              0.6892467 = fieldWeight in 1022, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=1022)
        0.32 = coord(8/25)
    
  2. Bodoff, D.; Wong, S.P.-S.: Documents and queries as random variables : history and implications (2006) 0.27
    0.26939383 = sum of:
      0.26939383 = product of:
        0.84185576 = sum of:
          0.039580002 = weight(abstract_txt:language in 193) [ClassicSimilarity], result of:
            0.039580002 = score(doc=193,freq=2.0), product of:
              0.08565992 = queryWeight, product of:
                1.053995 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.019433275 = queryNorm
              0.46205974 = fieldWeight in 193, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.018988615 = weight(abstract_txt:which in 193) [ClassicSimilarity], result of:
            0.018988615 = score(doc=193,freq=1.0), product of:
              0.083331525 = queryWeight, product of:
                1.4701762 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.019433275 = queryNorm
              0.22786833 = fieldWeight in 193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.036354743 = weight(abstract_txt:model in 193) [ClassicSimilarity], result of:
            0.036354743 = score(doc=193,freq=1.0), product of:
              0.11673693 = queryWeight, product of:
                1.5069523 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.019433275 = queryNorm
              0.31142452 = fieldWeight in 193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.013573035 = weight(abstract_txt:information in 193) [ClassicSimilarity], result of:
            0.013573035 = score(doc=193,freq=1.0), product of:
              0.071763195 = queryWeight, product of:
                1.5253539 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019433275 = queryNorm
              0.18913643 = fieldWeight in 193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.20627026 = weight(abstract_txt:probabilistic in 193) [ClassicSimilarity], result of:
            0.20627026 = score(doc=193,freq=3.0), product of:
              0.22493367 = queryWeight, product of:
                1.7079571 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.019433275 = queryNorm
              0.91702706 = fieldWeight in 193, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.032116495 = weight(abstract_txt:retrieval in 193) [ClassicSimilarity], result of:
            0.032116495 = score(doc=193,freq=1.0), product of:
              0.118294865 = queryWeight, product of:
                1.7516514 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019433275 = queryNorm
              0.27149525 = fieldWeight in 193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.23532426 = weight(abstract_txt:modeling in 193) [ClassicSimilarity], result of:
            0.23532426 = score(doc=193,freq=2.0), product of:
              0.35419977 = queryWeight, product of:
                3.0310204 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.019433275 = queryNorm
              0.6643829 = fieldWeight in 193, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
          0.25964832 = weight(abstract_txt:relevance in 193) [ClassicSimilarity], result of:
            0.25964832 = score(doc=193,freq=2.0), product of:
              0.47650868 = queryWeight, product of:
                4.971817 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019433275 = queryNorm
              0.5448974 = fieldWeight in 193, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=193)
        0.32 = coord(8/25)
    
  3. Dominich, S.: ¬A unified mathematical definition of classical information retrieval (2000) 0.21
    0.21337655 = sum of:
      0.21337655 = product of:
        1.0668827 = sum of:
          0.032575283 = weight(abstract_txt:information in 4768) [ClassicSimilarity], result of:
            0.032575283 = score(doc=4768,freq=1.0), product of:
              0.071763195 = queryWeight, product of:
                1.5253539 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019433275 = queryNorm
              0.45392746 = fieldWeight in 4768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.1875 = fieldNorm(doc=4768)
          0.28581646 = weight(abstract_txt:probabilistic in 4768) [ClassicSimilarity], result of:
            0.28581646 = score(doc=4768,freq=1.0), product of:
              0.22493367 = queryWeight, product of:
                1.7079571 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.019433275 = queryNorm
              1.2706699 = fieldWeight in 4768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.1875 = fieldNorm(doc=4768)
          0.07707959 = weight(abstract_txt:retrieval in 4768) [ClassicSimilarity], result of:
            0.07707959 = score(doc=4768,freq=1.0), product of:
              0.118294865 = queryWeight, product of:
                1.7516514 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019433275 = queryNorm
              0.6515886 = fieldWeight in 4768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.1875 = fieldNorm(doc=4768)
          0.2307736 = weight(abstract_txt:definition in 4768) [ClassicSimilarity], result of:
            0.2307736 = score(doc=4768,freq=1.0), product of:
              0.22326335 = queryWeight, product of:
                2.0840306 = boost
                5.512738 = idf(docFreq=484, maxDocs=44218)
                0.019433275 = queryNorm
              1.0336385 = fieldWeight in 4768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.512738 = idf(docFreq=484, maxDocs=44218)
                0.1875 = fieldNorm(doc=4768)
          0.4406378 = weight(abstract_txt:relevance in 4768) [ClassicSimilarity], result of:
            0.4406378 = score(doc=4768,freq=1.0), product of:
              0.47650868 = queryWeight, product of:
                4.971817 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019433275 = queryNorm
              0.9247215 = fieldWeight in 4768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.1875 = fieldNorm(doc=4768)
        0.2 = coord(5/25)
    
  4. Meij, E.; Trieschnigg, D.; Rijke, M. de; Kraaij, W.: Conceptual language models for domain-specific retrieval (2010) 0.21
    0.20713705 = sum of:
      0.20713705 = product of:
        0.7397752 = sum of:
          0.08535388 = weight(abstract_txt:generative in 4238) [ClassicSimilarity], result of:
            0.08535388 = score(doc=4238,freq=1.0), product of:
              0.16591385 = queryWeight, product of:
                1.0372324 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.019433275 = queryNorm
              0.514447 = fieldWeight in 4238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.038780324 = weight(abstract_txt:language in 4238) [ClassicSimilarity], result of:
            0.038780324 = score(doc=4238,freq=3.0), product of:
              0.08565992 = queryWeight, product of:
                1.053995 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.019433275 = queryNorm
              0.45272425 = fieldWeight in 4238, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.021483166 = weight(abstract_txt:which in 4238) [ClassicSimilarity], result of:
            0.021483166 = score(doc=4238,freq=2.0), product of:
              0.083331525 = queryWeight, product of:
                1.4701762 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.019433275 = queryNorm
              0.2578036 = fieldWeight in 4238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.041130695 = weight(abstract_txt:model in 4238) [ClassicSimilarity], result of:
            0.041130695 = score(doc=4238,freq=2.0), product of:
              0.11673693 = queryWeight, product of:
                1.5069523 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.019433275 = queryNorm
              0.35233662 = fieldWeight in 4238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.036335666 = weight(abstract_txt:retrieval in 4238) [ClassicSimilarity], result of:
            0.036335666 = score(doc=4238,freq=2.0), product of:
              0.118294865 = queryWeight, product of:
                1.7516514 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019433275 = queryNorm
              0.3071618 = fieldWeight in 4238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.18825941 = weight(abstract_txt:modeling in 4238) [ClassicSimilarity], result of:
            0.18825941 = score(doc=4238,freq=2.0), product of:
              0.35419977 = queryWeight, product of:
                3.0310204 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.019433275 = queryNorm
              0.5315063 = fieldWeight in 4238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
          0.32843205 = weight(abstract_txt:relevance in 4238) [ClassicSimilarity], result of:
            0.32843205 = score(doc=4238,freq=5.0), product of:
              0.47650868 = queryWeight, product of:
                4.971817 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019433275 = queryNorm
              0.6892467 = fieldWeight in 4238, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=4238)
        0.28 = coord(7/25)
    
  5. Bruza, P.D.; Huibers, T.W.C.: ¬A study of aboutness in information retrieval (1996) 0.17
    0.1714029 = sum of:
      0.1714029 = product of:
        0.71417874 = sum of:
          0.06169604 = weight(abstract_txt:model in 7705) [ClassicSimilarity], result of:
            0.06169604 = score(doc=7705,freq=2.0), product of:
              0.11673693 = queryWeight, product of:
                1.5069523 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.019433275 = queryNorm
              0.5285049 = fieldWeight in 7705, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
          0.03989641 = weight(abstract_txt:information in 7705) [ClassicSimilarity], result of:
            0.03989641 = score(doc=7705,freq=6.0), product of:
              0.071763195 = queryWeight, product of:
                1.5253539 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019433275 = queryNorm
              0.5559453 = fieldWeight in 7705, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
          0.14290823 = weight(abstract_txt:probabilistic in 7705) [ClassicSimilarity], result of:
            0.14290823 = score(doc=7705,freq=1.0), product of:
              0.22493367 = queryWeight, product of:
                1.7079571 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.019433275 = queryNorm
              0.63533497 = fieldWeight in 7705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
          0.086177595 = weight(abstract_txt:retrieval in 7705) [ClassicSimilarity], result of:
            0.086177595 = score(doc=7705,freq=5.0), product of:
              0.118294865 = queryWeight, product of:
                1.7516514 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019433275 = queryNorm
              0.7284982 = fieldWeight in 7705, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
          0.16318156 = weight(abstract_txt:definition in 7705) [ClassicSimilarity], result of:
            0.16318156 = score(doc=7705,freq=2.0), product of:
              0.22326335 = queryWeight, product of:
                2.0840306 = boost
                5.512738 = idf(docFreq=484, maxDocs=44218)
                0.019433275 = queryNorm
              0.7308927 = fieldWeight in 7705, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.512738 = idf(docFreq=484, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
          0.2203189 = weight(abstract_txt:relevance in 7705) [ClassicSimilarity], result of:
            0.2203189 = score(doc=7705,freq=1.0), product of:
              0.47650868 = queryWeight, product of:
                4.971817 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.019433275 = queryNorm
              0.46236074 = fieldWeight in 7705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=7705)
        0.24 = coord(6/25)
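
The score breakdowns above follow the pattern of Lucene's ClassicSimilarity (TF-IDF) explanations. As a rough sketch of how the numbers fit together, the Python below recomputes one term weight and the total score for the first entry (doc=1022) from the values printed in its breakdown. The function names are hypothetical and chosen for illustration only; the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumed from ClassicSimilarity's documented defaults and agree with the figures shown here. Lucene works in 32-bit floats, so results may differ in the last digits.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one term = queryWeight * fieldWeight, as in the breakdown."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # query-side factor
    field_weight = classic_tf(freq) * idf * field_norm  # document-side factor
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    """Sum of term scores scaled by coord = matched_terms / query_terms."""
    return sum(term_scores) * (matched_terms / query_terms)


# Values copied from the breakdown of "language" in doc 1022.
language_1022 = term_score(
    freq=6.0, doc_freq=1834, max_docs=44218,
    boost=1.053995, query_norm=0.019433275, field_norm=0.0625,
)

# The eight per-term scores listed for doc 1022, combined with coord(8/25).
scores_1022 = [
    0.05484366, 0.021483166, 0.029083796, 0.010858428,
    0.1650162, 0.04450192, 0.3260749, 0.32843205,
]
total_1022 = document_score(scores_1022, matched_terms=8, query_terms=25)

print(language_1022, total_1022)
```

Up to floating-point rounding, the two printed values should agree with the 0.05484366 term weight and the 0.31369412 total shown for the first entry. The coord factor explains why a document matching only 5 of the 25 query terms (entry 3) ranks below entries that match 8, even though its raw sum of term weights is larger.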