Document (#32027)

Author
Dubin, D.
Title
¬The most influential paper Gerard Salton never wrote
Source
Library trends. 52(2004) no.4, S.748-764
Year
2004
Abstract
Gerard Salton is often credited with developing the vector space model (VSM) for information retrieval (IR). Citations to Salton give the impression that the VSM must have been articulated as an IR model sometime between 1970 and 1975. However, the VSM as it is understood today evolved over a longer time period than is usually acknowledged, and an articulation of the model and its assumptions did not appear in print until several years after those assumptions had been criticized and alternative models proposed. An often cited overview paper titled "A Vector Space Model for Information Retrieval" (alleged to have been published in 1975) does not exist, and citations to it represent a confusion of two 1975 articles, neither of which were overviews of the VSM as a model of information retrieval. Until the late 1970s, Salton did not present vector spaces as models of IR generally but rather as models of specific computations. Citations to the phantom paper reflect an apparently widely held misconception that the operational features and explanatory devices now associated with the VSM must have been introduced at the same time it was first proposed as an IR model.
Footnote
Beitrag in einem Themenheft: Pioneers in library and information science
Theme
Biographische Darstellungen

Similar documents (content)

  1. Lopez-Pujalte, C.; Guerrero Bote, V.P.; Moya-Anegón, F. de: Evaluation of the application of genetic algorithms to relevance feedback (2003) 0.14
    0.14171521 = sum of:
      0.14171521 = product of:
        0.7085761 = sum of:
          0.018925235 = weight(abstract_txt:retrieval in 2756) [ClassicSimilarity], result of:
            0.018925235 = score(doc=2756,freq=1.0), product of:
              0.06970742 = queryWeight, product of:
                1.2297009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016312003 = queryNorm
              0.27149525 = fieldWeight in 2756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2756)
          0.047139928 = weight(abstract_txt:space in 2756) [ClassicSimilarity], result of:
            0.047139928 = score(doc=2756,freq=1.0), product of:
              0.11189578 = queryWeight, product of:
                1.2720999 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016312003 = queryNorm
              0.42128426 = fieldWeight in 2756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.078125 = fieldNorm(doc=2756)
          0.12503016 = weight(abstract_txt:vector in 2756) [ClassicSimilarity], result of:
            0.12503016 = score(doc=2756,freq=1.0), product of:
              0.24543022 = queryWeight, product of:
                2.307406 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.016312003 = queryNorm
              0.5094326 = fieldWeight in 2756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.078125 = fieldNorm(doc=2756)
          0.08079006 = weight(abstract_txt:model in 2756) [ClassicSimilarity], result of:
            0.08079006 = score(doc=2756,freq=2.0), product of:
              0.18343835 = queryWeight, product of:
                2.8211102 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.016312003 = queryNorm
              0.44042078 = fieldWeight in 2756, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.078125 = fieldNorm(doc=2756)
          0.4366907 = weight(abstract_txt:salton in 2756) [ClassicSimilarity], result of:
            0.4366907 = score(doc=2756,freq=1.0), product of:
              0.62184244 = queryWeight, product of:
                4.241012 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016312003 = queryNorm
              0.7022529 = fieldWeight in 2756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=2756)
        0.2 = coord(5/25)
    
  2. Crouch, C.J.: ¬An approach to the automatic construction of global thesauri (1990) 0.14
    0.13816291 = sum of:
      0.13816291 = product of:
        0.69081455 = sum of:
          0.01780853 = weight(abstract_txt:have in 4042) [ClassicSimilarity], result of:
            0.01780853 = score(doc=4042,freq=1.0), product of:
              0.05927652 = queryWeight, product of:
                1.1339694 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016312003 = queryNorm
              0.30043143 = fieldWeight in 4042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.09375 = fieldNorm(doc=4042)
          0.032117188 = weight(abstract_txt:retrieval in 4042) [ClassicSimilarity], result of:
            0.032117188 = score(doc=4042,freq=2.0), product of:
              0.06970742 = queryWeight, product of:
                1.2297009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016312003 = queryNorm
              0.4607427 = fieldWeight in 4042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=4042)
          0.0483074 = weight(abstract_txt:been in 4042) [ClassicSimilarity], result of:
            0.0483074 = score(doc=4042,freq=2.0), product of:
              0.10071853 = queryWeight, product of:
                1.7068055 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016312003 = queryNorm
              0.47962773 = fieldWeight in 4042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.09375 = fieldNorm(doc=4042)
          0.06855264 = weight(abstract_txt:model in 4042) [ClassicSimilarity], result of:
            0.06855264 = score(doc=4042,freq=1.0), product of:
              0.18343835 = queryWeight, product of:
                2.8211102 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.016312003 = queryNorm
              0.37370944 = fieldWeight in 4042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.09375 = fieldNorm(doc=4042)
          0.5240288 = weight(abstract_txt:salton in 4042) [ClassicSimilarity], result of:
            0.5240288 = score(doc=4042,freq=1.0), product of:
              0.62184244 = queryWeight, product of:
                4.241012 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016312003 = queryNorm
              0.84270346 = fieldWeight in 4042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.09375 = fieldNorm(doc=4042)
        0.2 = coord(5/25)
    
  3. Song, D.; Bruza, P.D.: Towards context sensitive information inference (2003) 0.13
    0.13322584 = sum of:
      0.13322584 = product of:
        0.47580656 = sum of:
          0.12351477 = weight(abstract_txt:computations in 1428) [ClassicSimilarity], result of:
            0.12351477 = score(doc=1428,freq=2.0), product of:
              0.15546061 = queryWeight, product of:
                1.060253 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016312003 = queryNorm
              0.79450846 = fieldWeight in 1428, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.023552231 = weight(abstract_txt:proposed in 1428) [ClassicSimilarity], result of:
            0.023552231 = score(doc=1428,freq=1.0), product of:
              0.08175528 = queryWeight, product of:
                1.0873573 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016312003 = queryNorm
              0.2880821 = fieldWeight in 1428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.015140188 = weight(abstract_txt:retrieval in 1428) [ClassicSimilarity], result of:
            0.015140188 = score(doc=1428,freq=1.0), product of:
              0.06970742 = queryWeight, product of:
                1.2297009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016312003 = queryNorm
              0.21719621 = fieldWeight in 1428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.07542389 = weight(abstract_txt:space in 1428) [ClassicSimilarity], result of:
            0.07542389 = score(doc=1428,freq=4.0), product of:
              0.11189578 = queryWeight, product of:
                1.2720999 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016312003 = queryNorm
              0.6740548 = fieldWeight in 1428, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.05101824 = weight(abstract_txt:models in 1428) [ClassicSimilarity], result of:
            0.05101824 = score(doc=1428,freq=2.0), product of:
              0.12435555 = queryWeight, product of:
                1.6424515 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.016312003 = queryNorm
              0.4102611 = fieldWeight in 1428, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.14145549 = weight(abstract_txt:vector in 1428) [ClassicSimilarity], result of:
            0.14145549 = score(doc=1428,freq=2.0), product of:
              0.24543022 = queryWeight, product of:
                2.307406 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.016312003 = queryNorm
              0.57635725 = fieldWeight in 1428, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
          0.04570176 = weight(abstract_txt:model in 1428) [ClassicSimilarity], result of:
            0.04570176 = score(doc=1428,freq=1.0), product of:
              0.18343835 = queryWeight, product of:
                2.8211102 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.016312003 = queryNorm
              0.24913962 = fieldWeight in 1428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0625 = fieldNorm(doc=1428)
        0.28 = coord(7/25)
    
  4. Malesios, C.: Some variations on the standard theoretical models for the h-index : a comparative analysis (2015) 0.13
    0.13088983 = sum of:
      0.13088983 = product of:
        0.46746367 = sum of:
          0.035328347 = weight(abstract_txt:proposed in 2267) [ClassicSimilarity], result of:
            0.035328347 = score(doc=2267,freq=1.0), product of:
              0.08175528 = queryWeight, product of:
                1.0873573 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.016312003 = queryNorm
              0.43212312 = fieldWeight in 2267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.025185062 = weight(abstract_txt:have in 2267) [ClassicSimilarity], result of:
            0.025185062 = score(doc=2267,freq=2.0), product of:
              0.05927652 = queryWeight, product of:
                1.1339694 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.016312003 = queryNorm
              0.42487416 = fieldWeight in 2267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.09950794 = weight(abstract_txt:assumptions in 2267) [ClassicSimilarity], result of:
            0.09950794 = score(doc=2267,freq=1.0), product of:
              0.16305675 = queryWeight, product of:
                1.53562 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.016312003 = queryNorm
              0.61026573 = fieldWeight in 2267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.10822603 = weight(abstract_txt:models in 2267) [ClassicSimilarity], result of:
            0.10822603 = score(doc=2267,freq=4.0), product of:
              0.12435555 = queryWeight, product of:
                1.6424515 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.016312003 = queryNorm
              0.87029517 = fieldWeight in 2267, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.0483074 = weight(abstract_txt:been in 2267) [ClassicSimilarity], result of:
            0.0483074 = score(doc=2267,freq=2.0), product of:
              0.10071853 = queryWeight, product of:
                1.7068055 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016312003 = queryNorm
              0.47962773 = fieldWeight in 2267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.08235627 = weight(abstract_txt:citations in 2267) [ClassicSimilarity], result of:
            0.08235627 = score(doc=2267,freq=1.0), product of:
              0.16453631 = queryWeight, product of:
                1.8892562 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.016312003 = queryNorm
              0.5005355 = fieldWeight in 2267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
          0.06855264 = weight(abstract_txt:model in 2267) [ClassicSimilarity], result of:
            0.06855264 = score(doc=2267,freq=1.0), product of:
              0.18343835 = queryWeight, product of:
                2.8211102 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.016312003 = queryNorm
              0.37370944 = fieldWeight in 2267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.09375 = fieldNorm(doc=2267)
        0.28 = coord(7/25)
    
  5. Dominich, S.; Kiezer, T.: ¬A measure theoretic approach to information retrieval (2007) 0.13
    0.13001773 = sum of:
      0.13001773 = product of:
        0.54174054 = sum of:
          0.039742995 = weight(abstract_txt:retrieval in 445) [ClassicSimilarity], result of:
            0.039742995 = score(doc=445,freq=9.0), product of:
              0.06970742 = queryWeight, product of:
                1.2297009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016312003 = queryNorm
              0.57014006 = fieldWeight in 445, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
          0.0933323 = weight(abstract_txt:space in 445) [ClassicSimilarity], result of:
            0.0933323 = score(doc=445,freq=8.0), product of:
              0.11189578 = queryWeight, product of:
                1.2720999 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016312003 = queryNorm
              0.83410025 = fieldWeight in 445, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
          0.044640962 = weight(abstract_txt:models in 445) [ClassicSimilarity], result of:
            0.044640962 = score(doc=445,freq=2.0), product of:
              0.12435555 = queryWeight, product of:
                1.6424515 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.016312003 = queryNorm
              0.35897845 = fieldWeight in 445, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
          0.03451247 = weight(abstract_txt:been in 445) [ClassicSimilarity], result of:
            0.03451247 = score(doc=445,freq=3.0), product of:
              0.10071853 = queryWeight, product of:
                1.7068055 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016312003 = queryNorm
              0.3426626 = fieldWeight in 445, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
          0.23155907 = weight(abstract_txt:vector in 445) [ClassicSimilarity], result of:
            0.23155907 = score(doc=445,freq=7.0), product of:
              0.24543022 = queryWeight, product of:
                2.307406 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.016312003 = queryNorm
              0.94348234 = fieldWeight in 445, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
          0.097952746 = weight(abstract_txt:model in 445) [ClassicSimilarity], result of:
            0.097952746 = score(doc=445,freq=6.0), product of:
              0.18343835 = queryWeight, product of:
                2.8211102 = boost
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.016312003 = queryNorm
              0.53398186 = fieldWeight in 445, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.986234 = idf(docFreq=2231, maxDocs=44218)
                0.0546875 = fieldNorm(doc=445)
        0.24 = coord(6/25)