Document (#34861)

Author
Bache, R.
Baillie, M.
Crestani, F.
Title
Measuring the likelihood property of scoring functions in general retrieval models
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.6, pp. 1294-1297
Year
2009
Series
Brief Communications
Abstract
Although retrieval systems based on probabilistic models will rank the objects (e.g., documents) being retrieved according to the probability of some matching criterion (e.g., relevance), they rarely yield an actual probability, and the scoring function is interpreted to be purely ordinal within a given retrieval task. In this brief communication, it is shown that some scoring functions possess the likelihood property: the score indicates the likelihood of matching when compared to other retrieval tasks. This is potentially more useful than pure ranking, although the score cannot be interpreted as an actual probability. The property can be detected by using two modified effectiveness measures: entire precision and entire recall.

Similar documents (author)

  1. Crestani, F.: Combination of similarity measures for effective spoken document retrieval (2003) 1.69
    1.693766 = sum of:
      1.693766 = product of:
        3.387532 = sum of:
          3.387532 = weight(author_txt:crestani in 4690) [ClassicSimilarity], result of:
            3.387532 = score(doc=4690,freq=1.0), product of:
              0.6229117 = queryWeight, product of:
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.071589544 = queryNorm
              5.438222 = fieldWeight in 4690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.625 = fieldNorm(doc=4690)
        0.5 = coord(1/2)
    
  2. Lee, M.; Baillie, S.; Dell'Oro, J.: TML: a Thesaural Markup Language (200?) 1.43
    1.4302714 = sum of:
      1.4302714 = product of:
        2.8605428 = sum of:
          2.8605428 = weight(author_txt:baillie in 1622) [ClassicSimilarity], result of:
            2.8605428 = score(doc=1622,freq=1.0), product of:
              0.7822922 = queryWeight, product of:
                1.1206533 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.071589544 = queryNorm
              3.6566167 = fieldWeight in 1622, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=1622)
        0.5 = coord(1/2)
    
  3. Ruthven, I.; Baillie, M.; Elsweiler, D.: The relative effects of knowledge, interest and confidence in assessing relevance (2007) 1.43
    1.4302714 = sum of:
      1.4302714 = product of:
        2.8605428 = sum of:
          2.8605428 = weight(author_txt:baillie in 835) [ClassicSimilarity], result of:
            2.8605428 = score(doc=835,freq=1.0), product of:
              0.7822922 = queryWeight, product of:
                1.1206533 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.071589544 = queryNorm
              3.6566167 = fieldWeight in 835, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=835)
        0.5 = coord(1/2)
    
  4. Baillie, M.; Azzopardi, L.; Ruthven, I.: Evaluating epistemic uncertainty under incomplete assessments (2008) 1.43
    1.4302714 = sum of:
      1.4302714 = product of:
        2.8605428 = sum of:
          2.8605428 = weight(author_txt:baillie in 2065) [ClassicSimilarity], result of:
            2.8605428 = score(doc=2065,freq=1.0), product of:
              0.7822922 = queryWeight, product of:
                1.1206533 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.071589544 = queryNorm
              3.6566167 = fieldWeight in 2065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=2065)
        0.5 = coord(1/2)
    
  5. Htun, N.N.; Halvey, M.; Baillie, L.: Beyond traditional collaborative search : understanding the effect of awareness on multi-level collaborative information retrieval (2018) 1.43
    1.4302714 = sum of:
      1.4302714 = product of:
        2.8605428 = sum of:
          2.8605428 = weight(author_txt:baillie in 5094) [ClassicSimilarity], result of:
            2.8605428 = score(doc=5094,freq=1.0), product of:
              0.7822922 = queryWeight, product of:
                1.1206533 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.071589544 = queryNorm
              3.6566167 = fieldWeight in 5094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=5094)
        0.5 = coord(1/2)
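
The score trees in this list have the shape of Lucene's ClassicSimilarity (TF-IDF) explanation output. As a rough illustration, the following Python sketch recomputes the first entry (doc=4690) from the values printed in its tree. It assumes the usual ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are hypothetical and not part of Lucene. Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Assumed ClassicSimilarity term frequency: square root of the raw count.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the explanation of entry 1 (author_txt:crestani in 4690).
coord = 1 / 2  # one of the two query clauses matched
score = coord * term_score(
    freq=1.0,
    doc_freq=19,
    max_docs=44218,
    query_norm=0.071589544,
    field_norm=0.625,
)
print(f"{score:.6f}")  # close to the 1.693766 shown in the tree
```

The entries for "baillie" differ only in that a boost factor (1.1206533) enters the queryWeight product, which the `boost` parameter covers.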
    

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 0.15
    0.14932768 = sum of:
      0.14932768 = product of:
        0.6221987 = sum of:
          0.06265768 = weight(abstract_txt:probabilistic in 638) [ClassicSimilarity], result of:
            0.06265768 = score(doc=638,freq=1.0), product of:
              0.09862149 = queryWeight, product of:
                1.0282152 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.014153246 = queryNorm
              0.63533497 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
          0.02003185 = weight(abstract_txt:some in 638) [ClassicSimilarity], result of:
            0.02003185 = score(doc=638,freq=1.0), product of:
              0.058095977 = queryWeight, product of:
                1.1160567 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.014153246 = queryNorm
              0.34480616 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
          0.080525935 = weight(abstract_txt:models in 638) [ClassicSimilarity], result of:
            0.080525935 = score(doc=638,freq=4.0), product of:
              0.09252715 = queryWeight, product of:
                1.4084707 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.014153246 = queryNorm
              0.87029517 = fieldWeight in 638, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
          0.0698258 = weight(abstract_txt:function in 638) [ClassicSimilarity], result of:
            0.0698258 = score(doc=638,freq=1.0), product of:
              0.13355985 = queryWeight, product of:
                1.692198 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.014153246 = queryNorm
              0.5228053 = fieldWeight in 638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
          0.047793794 = weight(abstract_txt:retrieval in 638) [ClassicSimilarity], result of:
            0.047793794 = score(doc=638,freq=2.0), product of:
              0.103732064 = queryWeight, product of:
                2.1090395 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014153246 = queryNorm
              0.4607427 = fieldWeight in 638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
          0.34136364 = weight(abstract_txt:probability in 638) [ClassicSimilarity], result of:
            0.34136364 = score(doc=638,freq=3.0), product of:
              0.3053516 = queryWeight, product of:
                3.133711 = boost
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.014153246 = queryNorm
              1.1179364 = fieldWeight in 638, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.09375 = fieldNorm(doc=638)
        0.24 = coord(6/25)
    
  2. Fuhr, N.: Probabilistic datalog : implementing logical information retrieval for advanced applications (2000) 0.11
    0.11136711 = sum of:
      0.11136711 = product of:
        0.55683553 = sum of:
          0.08354358 = weight(abstract_txt:probabilistic in 4380) [ClassicSimilarity], result of:
            0.08354358 = score(doc=4380,freq=1.0), product of:
              0.09862149 = queryWeight, product of:
                1.0282152 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.014153246 = queryNorm
              0.8471133 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.125 = fieldNorm(doc=4380)
          0.053683955 = weight(abstract_txt:models in 4380) [ClassicSimilarity], result of:
            0.053683955 = score(doc=4380,freq=1.0), product of:
              0.09252715 = queryWeight, product of:
                1.4084707 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.014153246 = queryNorm
              0.5801968 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.125 = fieldNorm(doc=4380)
          0.09310106 = weight(abstract_txt:function in 4380) [ClassicSimilarity], result of:
            0.09310106 = score(doc=4380,freq=1.0), product of:
              0.13355985 = queryWeight, product of:
                1.692198 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.014153246 = queryNorm
              0.6970737 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.125 = fieldNorm(doc=4380)
          0.063725054 = weight(abstract_txt:retrieval in 4380) [ClassicSimilarity], result of:
            0.063725054 = score(doc=4380,freq=2.0), product of:
              0.103732064 = queryWeight, product of:
                2.1090395 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014153246 = queryNorm
              0.6143236 = fieldWeight in 4380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=4380)
          0.26278186 = weight(abstract_txt:probability in 4380) [ClassicSimilarity], result of:
            0.26278186 = score(doc=4380,freq=1.0), product of:
              0.3053516 = queryWeight, product of:
                3.133711 = boost
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.014153246 = queryNorm
              0.86058784 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.125 = fieldNorm(doc=4380)
        0.2 = coord(5/25)
    
  3. Liu, X.; Croft, W.B.: Statistical language modeling for information retrieval (2004) 0.10
    0.09852219 = sum of:
      0.09852219 = product of:
        0.4105091 = sum of:
          0.036550317 = weight(abstract_txt:probabilistic in 4277) [ClassicSimilarity], result of:
            0.036550317 = score(doc=4277,freq=1.0), product of:
              0.09862149 = queryWeight, product of:
                1.0282152 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.014153246 = queryNorm
              0.37061208 = fieldWeight in 4277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
          0.033215255 = weight(abstract_txt:models in 4277) [ClassicSimilarity], result of:
            0.033215255 = score(doc=4277,freq=2.0), product of:
              0.09252715 = queryWeight, product of:
                1.4084707 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.014153246 = queryNorm
              0.35897845 = fieldWeight in 4277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
          0.040731713 = weight(abstract_txt:function in 4277) [ClassicSimilarity], result of:
            0.040731713 = score(doc=4277,freq=1.0), product of:
              0.13355985 = queryWeight, product of:
                1.692198 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.014153246 = queryNorm
              0.30496973 = fieldWeight in 4277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
          0.034145534 = weight(abstract_txt:retrieval in 4277) [ClassicSimilarity], result of:
            0.034145534 = score(doc=4277,freq=3.0), product of:
              0.103732064 = queryWeight, product of:
                2.1090395 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014153246 = queryNorm
              0.3291705 = fieldWeight in 4277, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
          0.114967056 = weight(abstract_txt:probability in 4277) [ClassicSimilarity], result of:
            0.114967056 = score(doc=4277,freq=1.0), product of:
              0.3053516 = queryWeight, product of:
                3.133711 = boost
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.014153246 = queryNorm
              0.37650716 = fieldWeight in 4277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
          0.15089922 = weight(abstract_txt:likelihood in 4277) [ClassicSimilarity], result of:
            0.15089922 = score(doc=4277,freq=1.0), product of:
              0.3660518 = queryWeight, product of:
                3.4310744 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.014153246 = queryNorm
              0.4122346 = fieldWeight in 4277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4277)
        0.24 = coord(6/25)
    
  4. Robertson, S.E.; Sparck Jones, K.: Simple, proven approaches to text retrieval (1997) 0.09
    0.094943024 = sum of:
      0.094943024 = product of:
        0.4747151 = sum of:
          0.013354568 = weight(abstract_txt:some in 4532) [ClassicSimilarity], result of:
            0.013354568 = score(doc=4532,freq=1.0), product of:
              0.058095977 = queryWeight, product of:
                1.1160567 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.014153246 = queryNorm
              0.22987078 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.059379317 = weight(abstract_txt:matching in 4532) [ClassicSimilarity], result of:
            0.059379317 = score(doc=4532,freq=1.0), product of:
              0.1570904 = queryWeight, product of:
                1.8352196 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.014153246 = queryNorm
              0.37799457 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.060880788 = weight(abstract_txt:actual in 4532) [ClassicSimilarity], result of:
            0.060880788 = score(doc=4532,freq=1.0), product of:
              0.1597275 = queryWeight, product of:
                1.8505596 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.014153246 = queryNorm
              0.3811541 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.045060422 = weight(abstract_txt:retrieval in 4532) [ClassicSimilarity], result of:
            0.045060422 = score(doc=4532,freq=4.0), product of:
              0.103732064 = queryWeight, product of:
                2.1090395 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014153246 = queryNorm
              0.43439242 = fieldWeight in 4532, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
          0.29604 = weight(abstract_txt:scoring in 4532) [ClassicSimilarity], result of:
            0.29604 = score(doc=4532,freq=1.0), product of:
              0.5776123 = queryWeight, product of:
                4.976757 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.014153246 = queryNorm
              0.5125237 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=4532)
        0.2 = coord(5/25)
    
  5. López-Pujalte, C.; Guerrero-Bote, V.P.; Moya-Anegón, F. de: Order-based fitness functions for genetic algorithms applied to relevance feedback (2003) 0.09
    0.093640864 = sum of:
      0.093640864 = product of:
        0.46820432 = sum of:
          0.07739984 = weight(abstract_txt:functions in 5154) [ClassicSimilarity], result of:
            0.07739984 = score(doc=5154,freq=4.0), product of:
              0.12908056 = queryWeight, product of:
                1.66358 = boost
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.014153246 = queryNorm
              0.5996243 = fieldWeight in 5154, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5154)
          0.05760334 = weight(abstract_txt:function in 5154) [ClassicSimilarity], result of:
            0.05760334 = score(doc=5154,freq=2.0), product of:
              0.13355985 = queryWeight, product of:
                1.692198 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.014153246 = queryNorm
              0.43129233 = fieldWeight in 5154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5154)
          0.019713935 = weight(abstract_txt:retrieval in 5154) [ClassicSimilarity], result of:
            0.019713935 = score(doc=5154,freq=1.0), product of:
              0.103732064 = queryWeight, product of:
                2.1090395 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014153246 = queryNorm
              0.19004668 = fieldWeight in 5154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5154)
          0.16258797 = weight(abstract_txt:probability in 5154) [ClassicSimilarity], result of:
            0.16258797 = score(doc=5154,freq=2.0), product of:
              0.3053516 = queryWeight, product of:
                3.133711 = boost
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.014153246 = queryNorm
              0.5324615 = fieldWeight in 5154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5154)
          0.15089922 = weight(abstract_txt:likelihood in 5154) [ClassicSimilarity], result of:
            0.15089922 = score(doc=5154,freq=1.0), product of:
              0.3660518 = queryWeight, product of:
                3.4310744 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.014153246 = queryNorm
              0.4122346 = fieldWeight in 5154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5154)
        0.2 = coord(5/25)
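
For the content-based list, each final score is the sum of the per-term weights multiplied by a coordination factor coord(matched/total). The Python sketch below recomputes entry 2 (doc=4380) from the parameters printed in its tree, as a plausibility check rather than a reference implementation. It assumes the ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the helper name and the tuple layout are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.014153246
FIELD_NORM = 0.125  # fieldNorm(doc=4380)

def term_score(freq, doc_freq, boost):
    # Assumed ClassicSimilarity: queryWeight * fieldWeight for one term.
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight

# (term, freq, docFreq, boost) copied from the explanation of doc 4380.
matched_terms = [
    ("probabilistic", 1.0, 136, 1.0282152),
    ("models", 1.0, 1158, 1.4084707),
    ("function", 1.0, 454, 1.692198),
    ("retrieval", 2.0, 3720, 2.1090395),
    ("probability", 1.0, 122, 3.133711),
]

term_sum = sum(term_score(freq, df, boost) for _, freq, df, boost in matched_terms)
coord = len(matched_terms) / 25  # 5 of the 25 query terms matched
total = coord * term_sum
print(f"{total:.6f}")  # close to the 0.11136711 shown in the tree
```

Because coord scales with the number of matched query terms, a document matching six terms (coord 6/25 = 0.24) can outrank one with larger individual term weights but fewer matches.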