Document (#28646)

Author
Needham, R.M.
Sparck Jones, K.
Title
Keywords and clumps
Source
Theory of subject analysis: a sourcebook. Ed.: L.M. Chan, et al.
Imprint
Littleton, CO : Libraries Unlimited
Year
1985
Pages
pp. 262-272
Abstract
The selection that follows was chosen as it represents "a very early paper on the possibilities allowed by computers in documentation." In the early 1960s computers were being used to provide simple automatic indexing systems wherein keywords were extracted from documents. The problem with such systems was that they lacked vocabulary control; thus documents related in subject matter were not always collocated in retrieval. To improve retrieval by improving recall is the raison d'être of vocabulary control tools such as classifications and thesauri. The question arose whether it was possible by automatic means to construct classes of terms which, when substituted one for another, could be used to improve retrieval performance. One of the first theoretical approaches to this question was initiated by R. M. Needham and Karen Sparck Jones at the Cambridge Language Research Institute in England. The question was later pursued using experimental methodologies by Sparck Jones, who, as a Senior Research Associate in the Computer Laboratory at the University of Cambridge, has devoted her life's work to research in information retrieval and automatic natural language processing. Based on the principles of numerical taxonomy, automatic classification techniques start from the premise that two objects are similar to the degree that they share attributes in common. When these two objects are keywords, their similarity is measured in terms of the number of documents they index in common. Step 1 in automatic classification is to compute mathematically the degree to which two terms are similar. Step 2 is to group together those terms that are "most similar" to each other, forming equivalence classes of intersubstitutable terms. The technique for forming such classes varies and is the factor that characteristically distinguishes different approaches to automatic classification.
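The two steps described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy index, the overlap coefficient (shared documents divided by documents indexed by either term), the 0.5 threshold, and the grouping rule, which simply links similar pairs and collects connected groups. It is a stand-in for Step 2 in general and is not the clumping procedure of Needham and Sparck Jones.

```python
from itertools import combinations

# Hypothetical toy index: keyword -> set of ids of the documents it indexes.
index = {
    "thesaurus": {1, 2, 3},
    "vocabulary": {2, 3, 4},
    "indexing": {1, 2, 3, 4},
    "taxonomy": {7, 8},
    "clustering": {7, 8, 9},
}


def similarity(docs_a, docs_b):
    """Step 1: documents indexed in common, divided by documents indexed by either term."""
    union = docs_a | docs_b
    return len(docs_a & docs_b) / len(union) if union else 0.0


def group_terms(index, threshold=0.5):
    """Step 2 (illustrative stand-in): link pairs at or above the threshold,
    then return the connected groups of terms as sorted lists."""
    parent = {term: term for term in index}

    def find(term):
        # Follow parent links to the group representative, compressing the path.
        while parent[term] != term:
            parent[term] = parent[parent[term]]
            term = parent[term]
        return term

    for a, b in combinations(index, 2):
        if similarity(index[a], index[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for term in index:
        groups.setdefault(find(term), set()).add(term)
    return sorted(sorted(group) for group in groups.values())


classes = group_terms(index)
print(classes)
```

With this toy data the terms fall into two classes, one around vocabulary control and one around taxonomy. Changing the threshold or the grouping rule changes the classes, which is the sense in which the class-forming technique distinguishes one approach from another.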
The technique used by Needham and Sparck Jones, that of clumping, is described in the selection that follows. Questions that must be asked are whether the use of automatically generated classes really does improve retrieval performance and whether there is a true economic advantage in substituting mechanical for manual labor. Several years after her work with clumping, Sparck Jones was to observe that while it was not wholly satisfactory in itself, it was valuable in that it stimulated research into automatic classification. To this it might be added that it was valuable in that it introduced to library/information science the methods of numerical taxonomy, thus stimulating us to think again about the fundamental nature and purpose of classification. In this connection it might be useful to review how automatically derived classes differ from those of manually constructed classifications: 1) the manner of their derivation is purely a posteriori, the ultimate operationalization of the principle of literary warrant; 2) the relationship between members forming such classes is essentially statistical; the members of a given class are similar to each other not because they possess the class-defining characteristic but by virtue of sharing a family resemblance; and finally, 3) automatically derived classes are not related meaningfully one to another, that is, they are not ordered in traditional hierarchical and precedence relationships.
Footnote
Reprint of the original article with commentary by the editors
Original in: Journal of documentation 20(1964) no.1, pp. 5-15.
Theme
Computational linguistics
Automatic indexing

Similar documents (author)

  1. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 5.31
  2. Sparck Jones, K.: Automatic classification (1976) 5.31
  3. Sparck Jones, K.: The role of artificial intelligence in information retrieval (1991) 5.31
  4. Sparck Jones, K.: Automatic keyword classification for information retrieval (1971) 5.31
  5. Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval (1972) 5.31

Similar documents (content)

  1. Borko, H.: Research in computer based classification systems (1985) 0.78
  2. Hjoerland, B.; Pedersen, K.N.: A substantive theory of classification for information retrieval (2005) 0.22
  3. Robertson, M.; Willett, P.: An upperbound to the performance of ranked output searching : optimal weighting of query terms using a genetic algorithm (1996) 0.21
  4. Robertson, S.E.: On relevance weight estimation and query expansion (1986) 0.20
  5. Sterner, B.; Sen, A.; Witteveen, J.: Consensus and scientific classification (2022) 0.17