Search (1 results, page 1 of 1)

  • × author_ss:"Sparck Jones, K."
  • × theme_ss:"Retrievalalgorithmen"
  1. Sparck Jones, K.: ¬A statistical interpretation of term specificity and its application in retrieval (2004) 0.02
    0.015214371 = product of:
      0.07607185 = sum of:
        0.07607185 = weight(_text_:index in 4420) [ClassicSimilarity], result of:
          0.07607185 = score(doc=4420,freq=2.0), product of:
            0.2250935 = queryWeight, product of:
              4.369764 = idf(docFreq=1520, maxDocs=44218)
              0.051511593 = queryNorm
            0.33795667 = fieldWeight in 4420, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.369764 = idf(docFreq=1520, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4420)
      0.2 = coord(1/5)
    
    Abstract
    The exhaustivity of document descriptions and the specificity of index terms are usually regarded as independent. It is suggested that specificity should be interpreted statistically, as a function of term use rather than of term meaning. The effects on retrieval of variations in term specificity are examined, experiments with three test collections showing, in particular, that frequently-occurring terms are required for good overall performance. It is argued that terms should be weighted according to collection frequency, so that matches on less frequent, more specific, terms are of greater value than matches on frequent terms. Results for the test collections show that considerable improvements in performance are obtained with this very simple procedure.