Search (1 results, page 1 of 1)

  • × author_ss:"Aizawa, A."
  • × theme_ss:"Retrievalalgorithmen"
  1. Aizawa, A.: ¬An information-theoretic perspective of tf-idf measures (2003) 0.01
    0.013508371 = product of:
      0.06078767 = sum of:
        0.02808394 = weight(_text_:of in 4155) [ClassicSimilarity], result of:
          0.02808394 = score(doc=4155,freq=22.0), product of:
            0.061262865 = queryWeight, product of:
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.03917671 = queryNorm
            0.458417 = fieldWeight in 4155, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              1.5637573 = idf(docFreq=25162, maxDocs=44218)
              0.0625 = fieldNorm(doc=4155)
        0.03270373 = weight(_text_:systems in 4155) [ClassicSimilarity], result of:
          0.03270373 = score(doc=4155,freq=2.0), product of:
            0.12039685 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.03917671 = queryNorm
            0.2716328 = fieldWeight in 4155, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0625 = fieldNorm(doc=4155)
      0.22222222 = coord(2/9)
    
    Abstract
    This paper presents a mathematical definition of the "probability-weighted amount of information" (PWI), a measure of specificity of terms in documents that is based on an information-theoretic view of retrieval events. The proposed PWI is expressed as a product of the occurrence probabilities of terms and their amounts of information, and corresponds well with the conventional term frequency - inverse document frequency measures that are commonly used in today's information retrieval systems. The mathematical definition of the PWI is shown, together with some illustrative examples of the calculation.