Document (#29423)

Author
Sparck Jones, K.
Title
IDF term weighting and IR research lessons
Source
Journal of documentation. 60(2004) no.5, S.521-523
Year
2004
Abstract
Robertson comments on the theoretical status of IDF term weighting. Its history illustrates how ideas develop in a specific research context, in theory/experiment interaction, and in operational practice.
Footnote
Vgl. auch unter:http://www.emeraldinsight.com/10.1108/00220410410560591.
Theme
Retrievalalgorithmen
Object
IDF

Similar documents (author)

  1. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 5.31
    5.3113213 = sum of:
      5.3113213 = sum of:
        2.0544336 = weight(author_txt:jones in 817) [ClassicSimilarity], result of:
          2.0544336 = score(doc=817,freq=1.0), product of:
            0.5925071 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.085440755 = queryNorm
            3.4673567 = fieldWeight in 817, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.5 = fieldNorm(doc=817)
        3.2568877 = weight(author_txt:sparck in 817) [ClassicSimilarity], result of:
          3.2568877 = score(doc=817,freq=1.0), product of:
            0.80556524 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.085440755 = queryNorm
            4.0429845 = fieldWeight in 817, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.5 = fieldNorm(doc=817)
    
  2. Sparck Jones, K.: Automatic classification (1976) 5.31
    5.3113213 = sum of:
      5.3113213 = sum of:
        2.0544336 = weight(author_txt:jones in 2908) [ClassicSimilarity], result of:
          2.0544336 = score(doc=2908,freq=1.0), product of:
            0.5925071 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.085440755 = queryNorm
            3.4673567 = fieldWeight in 2908, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.5 = fieldNorm(doc=2908)
        3.2568877 = weight(author_txt:sparck in 2908) [ClassicSimilarity], result of:
          3.2568877 = score(doc=2908,freq=1.0), product of:
            0.80556524 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.085440755 = queryNorm
            4.0429845 = fieldWeight in 2908, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.5 = fieldNorm(doc=2908)
    
  3. Sparck Jones, K.: ¬The role of artificial intelligence in information retrieval (1991) 5.31
    5.3113213 = sum of:
      5.3113213 = sum of:
        2.0544336 = weight(author_txt:jones in 4811) [ClassicSimilarity], result of:
          2.0544336 = score(doc=4811,freq=1.0), product of:
            0.5925071 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.085440755 = queryNorm
            3.4673567 = fieldWeight in 4811, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.5 = fieldNorm(doc=4811)
        3.2568877 = weight(author_txt:sparck in 4811) [ClassicSimilarity], result of:
          3.2568877 = score(doc=4811,freq=1.0), product of:
            0.80556524 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.085440755 = queryNorm
            4.0429845 = fieldWeight in 4811, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.5 = fieldNorm(doc=4811)
    
  4. Sparck Jones, K.: Automatic keyword classification for information retrieval (1971) 5.31
    5.3113213 = sum of:
      5.3113213 = sum of:
        2.0544336 = weight(author_txt:jones in 5176) [ClassicSimilarity], result of:
          2.0544336 = score(doc=5176,freq=1.0), product of:
            0.5925071 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.085440755 = queryNorm
            3.4673567 = fieldWeight in 5176, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.5 = fieldNorm(doc=5176)
        3.2568877 = weight(author_txt:sparck in 5176) [ClassicSimilarity], result of:
          3.2568877 = score(doc=5176,freq=1.0), product of:
            0.80556524 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.085440755 = queryNorm
            4.0429845 = fieldWeight in 5176, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.5 = fieldNorm(doc=5176)
    
  5. Sparck Jones, K.: ¬A statistical interpretation of term specifity and its application in retrieval (1972) 5.31
    5.3113213 = sum of:
      5.3113213 = sum of:
        2.0544336 = weight(author_txt:jones in 5187) [ClassicSimilarity], result of:
          2.0544336 = score(doc=5187,freq=1.0), product of:
            0.5925071 = queryWeight, product of:
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.085440755 = queryNorm
            3.4673567 = fieldWeight in 5187, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.9347134 = idf(docFreq=116, maxDocs=44218)
              0.5 = fieldNorm(doc=5187)
        3.2568877 = weight(author_txt:sparck in 5187) [ClassicSimilarity], result of:
          3.2568877 = score(doc=5187,freq=1.0), product of:
            0.80556524 = queryWeight, product of:
              1.1660135 = boost
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.085440755 = queryNorm
            4.0429845 = fieldWeight in 5187, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.085969 = idf(docFreq=36, maxDocs=44218)
              0.5 = fieldNorm(doc=5187)
    

Similar documents (content)

  1. Wong, S.K.M.: On modelling information retrieval with probabilistic inference (1995) 0.22
    0.2224213 = sum of:
      0.2224213 = product of:
        0.70433414 = sum of:
          0.042139694 = weight(abstract_txt:context in 1938) [ClassicSimilarity], result of:
            0.042139694 = score(doc=1938,freq=1.0), product of:
              0.10356988 = queryWeight, product of:
                1.0059869 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.023722176 = queryNorm
              0.4068721 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
          0.048113003 = weight(abstract_txt:theory in 1938) [ClassicSimilarity], result of:
            0.048113003 = score(doc=1938,freq=1.0), product of:
              0.11313948 = queryWeight, product of:
                1.0514356 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.023722176 = queryNorm
              0.42525387 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
          0.058693007 = weight(abstract_txt:theoretical in 1938) [ClassicSimilarity], result of:
            0.058693007 = score(doc=1938,freq=1.0), product of:
              0.12917054 = queryWeight, product of:
                1.1234592 = boost
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.023722176 = queryNorm
              0.45438385 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
          0.090911515 = weight(abstract_txt:ideas in 1938) [ClassicSimilarity], result of:
            0.090911515 = score(doc=1938,freq=1.0), product of:
              0.17292261 = queryWeight, product of:
                1.2998747 = boost
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.023722176 = queryNorm
              0.52573526 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
          0.11410696 = weight(abstract_txt:term in 1938) [ClassicSimilarity], result of:
            0.11410696 = score(doc=1938,freq=1.0), product of:
              0.2535074 = queryWeight, product of:
                2.2257988 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.023722176 = queryNorm
              0.45011294 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
          0.35036996 = weight(abstract_txt:weighting in 1938) [ClassicSimilarity], result of:
            0.35036996 = score(doc=1938,freq=1.0), product of:
              0.53555053 = queryWeight, product of:
                3.2351232 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.023722176 = queryNorm
              0.6542239 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.09375 = fieldNorm(doc=1938)
        0.31578946 = coord(6/19)
    
  2. Robertson, S.E.: OKAPI at TREC-1 (1994) 0.20
    0.20110352 = sum of:
      0.20110352 = product of:
        1.2736557 = sum of:
          0.46084917 = weight(abstract_txt:robertson in 7953) [ClassicSimilarity], result of:
            0.46084917 = score(doc=7953,freq=1.0), product of:
              0.42122996 = queryWeight, product of:
                2.0287812 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.023722176 = queryNorm
              1.094056 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.125 = fieldNorm(doc=7953)
          0.15214261 = weight(abstract_txt:term in 7953) [ClassicSimilarity], result of:
            0.15214261 = score(doc=7953,freq=1.0), product of:
              0.2535074 = queryWeight, product of:
                2.2257988 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.023722176 = queryNorm
              0.6001506 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.125 = fieldNorm(doc=7953)
          0.6606639 = weight(abstract_txt:weighting in 7953) [ClassicSimilarity], result of:
            0.6606639 = score(doc=7953,freq=2.0), product of:
              0.53555053 = queryWeight, product of:
                3.2351232 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.023722176 = queryNorm
              1.2336164 = fieldWeight in 7953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.125 = fieldNorm(doc=7953)
        0.15789473 = coord(3/19)
    
  3. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 0.19
    0.18545245 = sum of:
      0.18545245 = product of:
        0.88089913 = sum of:
          0.048290446 = weight(abstract_txt:specific in 71) [ClassicSimilarity], result of:
            0.048290446 = score(doc=71,freq=1.0), product of:
              0.1023408 = queryWeight, product of:
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.023722176 = queryNorm
              0.47185916 = fieldWeight in 71, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.109375 = fieldNorm(doc=71)
          0.056131836 = weight(abstract_txt:theory in 71) [ClassicSimilarity], result of:
            0.056131836 = score(doc=71,freq=1.0), product of:
              0.11313948 = queryWeight, product of:
                1.0514356 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.023722176 = queryNorm
              0.4961295 = fieldWeight in 71, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.109375 = fieldNorm(doc=71)
          0.06847518 = weight(abstract_txt:theoretical in 71) [ClassicSimilarity], result of:
            0.06847518 = score(doc=71,freq=1.0), product of:
              0.12917054 = queryWeight, product of:
                1.1234592 = boost
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.023722176 = queryNorm
              0.53011453 = fieldWeight in 71, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.109375 = fieldNorm(doc=71)
          0.7080017 = weight(abstract_txt:weighting in 71) [ClassicSimilarity], result of:
            0.7080017 = score(doc=71,freq=3.0), product of:
              0.53555053 = queryWeight, product of:
                3.2351232 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.023722176 = queryNorm
              1.3220072 = fieldWeight in 71, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.109375 = fieldNorm(doc=71)
        0.21052632 = coord(4/19)
    
  4. Dominich, S.; Góth, J.; Kiezer, T.; Szlávik, Z.: ¬An entropy-based interpretation of retrieval status value-based retrieval, and its application to the computation of term and query discrimination value (2004) 0.17
    0.16679879 = sum of:
      0.16679879 = product of:
        0.6338354 = sum of:
          0.033882644 = weight(abstract_txt:practice in 2237) [ClassicSimilarity], result of:
            0.033882644 = score(doc=2237,freq=1.0), product of:
              0.12827624 = queryWeight, product of:
                1.1195635 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.023722176 = queryNorm
              0.2641381 = fieldWeight in 2237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2237)
          0.04841926 = weight(abstract_txt:theoretical in 2237) [ClassicSimilarity], result of:
            0.04841926 = score(doc=2237,freq=2.0), product of:
              0.12917054 = queryWeight, product of:
                1.1234592 = boost
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.023722176 = queryNorm
              0.37484756 = fieldWeight in 2237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.846761 = idf(docFreq=943, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2237)
          0.064407855 = weight(abstract_txt:status in 2237) [ClassicSimilarity], result of:
            0.064407855 = score(doc=2237,freq=1.0), product of:
              0.19684327 = queryWeight, product of:
                1.3868704 = boost
                5.9831543 = idf(docFreq=302, maxDocs=44218)
                0.023722176 = queryNorm
              0.32720375 = fieldWeight in 2237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9831543 = idf(docFreq=302, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2237)
          0.13312478 = weight(abstract_txt:term in 2237) [ClassicSimilarity], result of:
            0.13312478 = score(doc=2237,freq=4.0), product of:
              0.2535074 = queryWeight, product of:
                2.2257988 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.023722176 = queryNorm
              0.52513176 = fieldWeight in 2237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2237)
          0.35400084 = weight(abstract_txt:weighting in 2237) [ClassicSimilarity], result of:
            0.35400084 = score(doc=2237,freq=3.0), product of:
              0.53555053 = queryWeight, product of:
                3.2351232 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.023722176 = queryNorm
              0.6610036 = fieldWeight in 2237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2237)
        0.2631579 = coord(5/19)
    
  5. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.16
    0.16325773 = sum of:
      0.16325773 = product of:
        0.7754742 = sum of:
          0.02759454 = weight(abstract_txt:specific in 5045) [ClassicSimilarity], result of:
            0.02759454 = score(doc=5045,freq=1.0), product of:
              0.1023408 = queryWeight, product of:
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.023722176 = queryNorm
              0.2696338 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.04396846 = weight(abstract_txt:develop in 5045) [ClassicSimilarity], result of:
            0.04396846 = score(doc=5045,freq=1.0), product of:
              0.13961355 = queryWeight, product of:
                1.1679907 = boost
                5.038876 = idf(docFreq=778, maxDocs=44218)
                0.023722176 = queryNorm
              0.31492975 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.038876 = idf(docFreq=778, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.13175938 = weight(abstract_txt:term in 5045) [ClassicSimilarity], result of:
            0.13175938 = score(doc=5045,freq=3.0), product of:
              0.2535074 = queryWeight, product of:
                2.2257988 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.023722176 = queryNorm
              0.51974565 = fieldWeight in 5045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
          0.5721518 = weight(abstract_txt:weighting in 5045) [ClassicSimilarity], result of:
            0.5721518 = score(doc=5045,freq=6.0), product of:
              0.53555053 = queryWeight, product of:
                3.2351232 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.023722176 = queryNorm
              1.0683432 = fieldWeight in 5045, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=5045)
        0.21052632 = coord(4/19)