Document (#29424)

Author
Sparck Jones, K.
Title
IDF term weighting and IR research lessons
Source
Journal of documentation. 60(2004) no.5, S.521-523
Year
2004
Abstract
Robertson comments on the theoretical status of IDF term weighting. Its history illustrates how ideas develop in a specific research context, in theory/experiment interaction, and in operational practice.
Footnote
Vgl. auch unter:http://www.emeraldinsight.com/10.1108/00220410410560591.
Theme
Retrievalalgorithmen
Object
IDF

Similar documents (author)

  1. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 5.30
    5.3017416 = sum of:
      5.3017416 = sum of:
        2.0809913 = weight(author_txt:jones in 817) [ClassicSimilarity], result of:
          2.0809913 = score(doc=817,freq=1.0), product of:
            0.5986566 = queryWeight, product of:
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.08611033 = queryNorm
            3.4761019 = fieldWeight in 817, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.5 = fieldNorm(doc=817)
        3.2207506 = weight(author_txt:sparck in 817) [ClassicSimilarity], result of:
          3.2207506 = score(doc=817,freq=1.0), product of:
            0.8010058 = queryWeight, product of:
              1.1567218 = boost
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.08611033 = queryNorm
            4.020883 = fieldWeight in 817, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.5 = fieldNorm(doc=817)
    
  2. Sparck Jones, K.: Automatic classification (1976) 5.30
    5.3017416 = sum of:
      5.3017416 = sum of:
        2.0809913 = weight(author_txt:jones in 2908) [ClassicSimilarity], result of:
          2.0809913 = score(doc=2908,freq=1.0), product of:
            0.5986566 = queryWeight, product of:
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.08611033 = queryNorm
            3.4761019 = fieldWeight in 2908, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.5 = fieldNorm(doc=2908)
        3.2207506 = weight(author_txt:sparck in 2908) [ClassicSimilarity], result of:
          3.2207506 = score(doc=2908,freq=1.0), product of:
            0.8010058 = queryWeight, product of:
              1.1567218 = boost
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.08611033 = queryNorm
            4.020883 = fieldWeight in 2908, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.5 = fieldNorm(doc=2908)
    
  3. Sparck Jones, K.: ¬The role of artificial intelligence in information retrieval (1991) 5.30
    5.3017416 = sum of:
      5.3017416 = sum of:
        2.0809913 = weight(author_txt:jones in 4811) [ClassicSimilarity], result of:
          2.0809913 = score(doc=4811,freq=1.0), product of:
            0.5986566 = queryWeight, product of:
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.08611033 = queryNorm
            3.4761019 = fieldWeight in 4811, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.5 = fieldNorm(doc=4811)
        3.2207506 = weight(author_txt:sparck in 4811) [ClassicSimilarity], result of:
          3.2207506 = score(doc=4811,freq=1.0), product of:
            0.8010058 = queryWeight, product of:
              1.1567218 = boost
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.08611033 = queryNorm
            4.020883 = fieldWeight in 4811, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.5 = fieldNorm(doc=4811)
    
  4. Sparck Jones, K.: Automatic keyword classification for information retrieval (1971) 5.30
    5.3017416 = sum of:
      5.3017416 = sum of:
        2.0809913 = weight(author_txt:jones in 5176) [ClassicSimilarity], result of:
          2.0809913 = score(doc=5176,freq=1.0), product of:
            0.5986566 = queryWeight, product of:
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.08611033 = queryNorm
            3.4761019 = fieldWeight in 5176, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.5 = fieldNorm(doc=5176)
        3.2207506 = weight(author_txt:sparck in 5176) [ClassicSimilarity], result of:
          3.2207506 = score(doc=5176,freq=1.0), product of:
            0.8010058 = queryWeight, product of:
              1.1567218 = boost
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.08611033 = queryNorm
            4.020883 = fieldWeight in 5176, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.5 = fieldNorm(doc=5176)
    
  5. Sparck Jones, K.: ¬A statistical interpretation of term specifity and its application in retrieval (1972) 5.30
    5.3017416 = sum of:
      5.3017416 = sum of:
        2.0809913 = weight(author_txt:jones in 5187) [ClassicSimilarity], result of:
          2.0809913 = score(doc=5187,freq=1.0), product of:
            0.5986566 = queryWeight, product of:
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.08611033 = queryNorm
            3.4761019 = fieldWeight in 5187, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9522038 = idf(docFreq=109, maxDocs=42306)
              0.5 = fieldNorm(doc=5187)
        3.2207506 = weight(author_txt:sparck in 5187) [ClassicSimilarity], result of:
          3.2207506 = score(doc=5187,freq=1.0), product of:
            0.8010058 = queryWeight, product of:
              1.1567218 = boost
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.08611033 = queryNorm
            4.020883 = fieldWeight in 5187, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.041766 = idf(docFreq=36, maxDocs=42306)
              0.5 = fieldNorm(doc=5187)
    

Similar documents (content)

  1. Wong, S.K.M.: On modelling information retrieval with probabilistic inference (1995) 0.22
    0.22330362 = sum of:
      0.22330362 = product of:
        0.70712817 = sum of:
          0.043339536 = weight(abstract_txt:context in 2007) [ClassicSimilarity], result of:
            0.043339536 = score(doc=2007,freq=1.0), product of:
              0.10513501 = queryWeight, product of:
                1.0143685 = boost
                4.397093 = idf(docFreq=1415, maxDocs=42306)
                0.02357143 = queryNorm
              0.41222745 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.397093 = idf(docFreq=1415, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
          0.049368735 = weight(abstract_txt:theory in 2007) [ClassicSimilarity], result of:
            0.049368735 = score(doc=2007,freq=1.0), product of:
              0.114672475 = queryWeight, product of:
                1.0593798 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.02357143 = queryNorm
              0.4305195 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
          0.06117573 = weight(abstract_txt:theoretical in 2007) [ClassicSimilarity], result of:
            0.06117573 = score(doc=2007,freq=1.0), product of:
              0.13229516 = queryWeight, product of:
                1.1378738 = boost
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.02357143 = queryNorm
              0.4624185 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
          0.09261757 = weight(abstract_txt:ideas in 2007) [ClassicSimilarity], result of:
            0.09261757 = score(doc=2007,freq=1.0), product of:
              0.17442957 = queryWeight, product of:
                1.3065684 = boost
                5.663723 = idf(docFreq=398, maxDocs=42306)
                0.02357143 = queryNorm
              0.53097403 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.663723 = idf(docFreq=398, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
          0.115308754 = weight(abstract_txt:term in 2007) [ClassicSimilarity], result of:
            0.115308754 = score(doc=2007,freq=1.0), product of:
              0.2543369 = queryWeight, product of:
                2.231217 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.02357143 = queryNorm
              0.45337015 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
          0.34531787 = weight(abstract_txt:weighting in 2007) [ClassicSimilarity], result of:
            0.34531787 = score(doc=2007,freq=1.0), product of:
              0.5284216 = queryWeight, product of:
                3.216084 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.02357143 = queryNorm
              0.65348935 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.09375 = fieldNorm(doc=2007)
        0.31578946 = coord(6/19)
    
  2. Robertson, S.E.: OKAPI at TREC-1 (1994) 0.20
    0.20223978 = sum of:
      0.20223978 = product of:
        1.280852 = sum of:
          0.47596934 = weight(abstract_txt:robertson in 7953) [ClassicSimilarity], result of:
            0.47596934 = score(doc=7953,freq=1.0), product of:
              0.4287966 = queryWeight, product of:
                2.0485556 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.02357143 = queryNorm
              1.1100119 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.125 = fieldNorm(doc=7953)
          0.15374501 = weight(abstract_txt:term in 7953) [ClassicSimilarity], result of:
            0.15374501 = score(doc=7953,freq=1.0), product of:
              0.2543369 = queryWeight, product of:
                2.231217 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.02357143 = queryNorm
              0.60449356 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.125 = fieldNorm(doc=7953)
          0.6511376 = weight(abstract_txt:weighting in 7953) [ClassicSimilarity], result of:
            0.6511376 = score(doc=7953,freq=2.0), product of:
              0.5284216 = queryWeight, product of:
                3.216084 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.02357143 = queryNorm
              1.2322313 = fieldWeight in 7953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.125 = fieldNorm(doc=7953)
        0.15789473 = coord(3/19)
    
  3. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 0.18
    0.18425384 = sum of:
      0.18425384 = product of:
        0.87520576 = sum of:
          0.048444413 = weight(abstract_txt:specific in 140) [ClassicSimilarity], result of:
            0.048444413 = score(doc=140,freq=1.0), product of:
              0.10217762 = queryWeight, product of:
                4.334808 = idf(docFreq=1506, maxDocs=42306)
                0.02357143 = queryNorm
              0.4741196 = fieldWeight in 140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.334808 = idf(docFreq=1506, maxDocs=42306)
                0.109375 = fieldNorm(doc=140)
          0.057596855 = weight(abstract_txt:theory in 140) [ClassicSimilarity], result of:
            0.057596855 = score(doc=140,freq=1.0), product of:
              0.114672475 = queryWeight, product of:
                1.0593798 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.02357143 = queryNorm
              0.5022727 = fieldWeight in 140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.109375 = fieldNorm(doc=140)
          0.07137169 = weight(abstract_txt:theoretical in 140) [ClassicSimilarity], result of:
            0.07137169 = score(doc=140,freq=1.0), product of:
              0.13229516 = queryWeight, product of:
                1.1378738 = boost
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.02357143 = queryNorm
              0.53948826 = fieldWeight in 140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.109375 = fieldNorm(doc=140)
          0.6977928 = weight(abstract_txt:weighting in 140) [ClassicSimilarity], result of:
            0.6977928 = score(doc=140,freq=3.0), product of:
              0.5284216 = queryWeight, product of:
                3.216084 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.02357143 = queryNorm
              1.3205229 = fieldWeight in 140, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.109375 = fieldNorm(doc=140)
        0.21052632 = coord(4/19)
    
  4. Dominich, S.; Góth, J.; Kiezer, T.; Szlávik, Z.: ¬An entropy-based interpretation of retrieval status value-based retrieval, and its application to the computation of term and query discrimination value (2004) 0.17
    0.16680118 = sum of:
      0.16680118 = product of:
        0.6338445 = sum of:
          0.034795795 = weight(abstract_txt:practice in 3238) [ClassicSimilarity], result of:
            0.034795795 = score(doc=3238,freq=1.0), product of:
              0.13008618 = queryWeight, product of:
                1.128334 = boost
                4.8911114 = idf(docFreq=863, maxDocs=42306)
                0.02357143 = queryNorm
              0.26748264 = fieldWeight in 3238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8911114 = idf(docFreq=863, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3238)
          0.0504674 = weight(abstract_txt:theoretical in 3238) [ClassicSimilarity], result of:
            0.0504674 = score(doc=3238,freq=2.0), product of:
              0.13229516 = queryWeight, product of:
                1.1378738 = boost
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.02357143 = queryNorm
              0.3814758 = fieldWeight in 3238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3238)
          0.065158024 = weight(abstract_txt:status in 3238) [ClassicSimilarity], result of:
            0.065158024 = score(doc=3238,freq=1.0), product of:
              0.1976326 = queryWeight, product of:
                1.3907574 = boost
                6.0286665 = idf(docFreq=276, maxDocs=42306)
                0.02357143 = queryNorm
              0.3296927 = fieldWeight in 3238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0286665 = idf(docFreq=276, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3238)
          0.13452688 = weight(abstract_txt:term in 3238) [ClassicSimilarity], result of:
            0.13452688 = score(doc=3238,freq=4.0), product of:
              0.2543369 = queryWeight, product of:
                2.231217 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.02357143 = queryNorm
              0.52893186 = fieldWeight in 3238, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3238)
          0.3488964 = weight(abstract_txt:weighting in 3238) [ClassicSimilarity], result of:
            0.3488964 = score(doc=3238,freq=3.0), product of:
              0.5284216 = queryWeight, product of:
                3.216084 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.02357143 = queryNorm
              0.66026145 = fieldWeight in 3238, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0546875 = fieldNorm(doc=3238)
        0.2631579 = coord(5/19)
    
  5. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.16
    0.16190319 = sum of:
      0.16190319 = product of:
        0.7690401 = sum of:
          0.027682522 = weight(abstract_txt:specific in 1964) [ClassicSimilarity], result of:
            0.027682522 = score(doc=1964,freq=1.0), product of:
              0.10217762 = queryWeight, product of:
                4.334808 = idf(docFreq=1506, maxDocs=42306)
                0.02357143 = queryNorm
              0.2709255 = fieldWeight in 1964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.334808 = idf(docFreq=1506, maxDocs=42306)
                0.0625 = fieldNorm(doc=1964)
          0.044308733 = weight(abstract_txt:develop in 1964) [ClassicSimilarity], result of:
            0.044308733 = score(doc=1964,freq=1.0), product of:
              0.13981214 = queryWeight, product of:
                1.169754 = boost
                5.070659 = idf(docFreq=721, maxDocs=42306)
                0.02357143 = queryNorm
              0.3169162 = fieldWeight in 1964, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.070659 = idf(docFreq=721, maxDocs=42306)
                0.0625 = fieldNorm(doc=1964)
          0.13314709 = weight(abstract_txt:term in 1964) [ClassicSimilarity], result of:
            0.13314709 = score(doc=1964,freq=3.0), product of:
              0.2543369 = queryWeight, product of:
                2.231217 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.02357143 = queryNorm
              0.52350676 = fieldWeight in 1964, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0625 = fieldNorm(doc=1964)
          0.5639017 = weight(abstract_txt:weighting in 1964) [ClassicSimilarity], result of:
            0.5639017 = score(doc=1964,freq=6.0), product of:
              0.5284216 = queryWeight, product of:
                3.216084 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.02357143 = queryNorm
              1.0671437 = fieldWeight in 1964, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0625 = fieldNorm(doc=1964)
        0.21052632 = coord(4/19)