Document (#28648)

Author
Borko, H.
Title
Research in computer based classification systems
Source
Theory of subject analysis: a sourcebook. Ed.: L.M. Chan et al.
Imprint
Littleton, CO : Libraries Unlimited
Year
1985
Pages
pp. 287-305
Abstract
The selection in this reader by R. M. Needham and K. Sparck Jones reports an early approach to automatic classification that was taken in England. The following selection reviews various approaches that were being pursued in the United States at about the same time. It then discusses a particular approach initiated in the early 1960s by Harold Borko, at that time Head of the Language Processing and Retrieval Research Staff at the System Development Corporation, Santa Monica, California, and, since 1966, a member of the faculty at the Graduate School of Library and Information Science, University of California, Los Angeles. As was described earlier, there are two steps in automatic classification: the first is to identify pairs of terms that are similar by virtue of co-occurring as index terms in the same documents, and the second is to form equivalence classes of intersubstitutable terms. To compute similarities, Borko and his associates used a standard correlation formula; to derive classification categories, where Needham and Sparck Jones used clumping, the Borko team used the statistical technique of factor analysis. The fact that documents can be classified automatically, and in any number of ways, is worthy of passing notice. Worthy of serious attention would be a demonstration that a computer-based classification system was effective in the organization and retrieval of documents. One reason for the inclusion of the following selection in the reader is that it addresses the question of evaluation. To evaluate the effectiveness of their automatically derived classification, Borko and his team asked three questions. The first was: Is the classification reliable? In other words, could the categories derived from one sample of texts be used to classify other texts? Reliability was assessed by a case-study comparison of the classes derived from three different samples of abstracts.
The not-so-surprising conclusion reached was that automatically derived classes were reliable only to the extent that the sample from which they were derived was representative of the total document collection. The second evaluation question asked whether the classification was reasonable, in the sense of adequately describing the content of the document collection. The answer was sought by comparing the automatically derived categories with categories in a related classification system that was manually constructed. Here the conclusion was that the automatic method yielded categories that fairly accurately reflected the major area of interest in the sample collection of texts; however, since there were only eleven such categories and they were quite broad, they could not be regarded as suitable for use in a university or any large general library. The third evaluation question asked whether automatic classification was accurate, in the sense of producing results similar to those obtainable by human classifiers. When using human classification as a criterion, automatic classification was found to be 50 percent accurate.
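The first of the two steps described in the abstract (finding pairs of index terms that co-occur in the same documents) can be illustrated with a small sketch. The abstract only says that a "standard correlation formula" was used; the Pearson coefficient below is an assumption, and the term names and the document-term matrix are hypothetical toy data, not taken from Borko's study. The second step (factor analysis of the correlation matrix) is not shown.

```python
from math import sqrt

# Hypothetical toy data: each row is a document, each column an index term.
# A 1 means the term was assigned to the document.
TERMS = ["computer", "program", "memory", "learning"]
DOC_TERM = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 0, 1, 0],
]


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (assumed formula)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a term used in every or no document carries no signal
    return cov / sqrt(var_x * var_y)


def term_correlations(doc_term):
    """Return the term-by-term correlation matrix of a document-term matrix."""
    columns = list(zip(*doc_term))  # one tuple of 0/1 values per term
    return [[pearson(a, b) for b in columns] for a in columns]


CORR = term_correlations(DOC_TERM)

if __name__ == "__main__":
    for term, row in zip(TERMS, CORR):
        print(term.ljust(10), " ".join(f"{value:+.2f}" for value in row))
```

In this toy matrix, "computer" and "program" are positively correlated because they are assigned to the same documents; a matrix of this kind would be the input to the category-forming step.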
Footnote
Reprint of the original article with commentary by the editors
Original in: Classification research: Proceedings of the Second International Study Conference held at Hotel Prins Hamlet, Elsinore, Denmark, 14th-18th Sept. 1964. Ed.: Pauline Atherton. Copenhagen: Munksgaard 1965. pp. 220-238.
Theme
Automatic classification

Similar documents (author)

  1. Borko, H.: Getting started in library expert systems research (1987) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:borko in 1090) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 1090, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=1090)
    
  2. Borko, H.: A note commenting on 'vocabulary control and information technology' by Derek Austin (1986) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:borko in 1352) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 1352, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=1352)
    
  3. Borko, H.: Toward a theory of indexing (1977) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:borko in 2591) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 2591, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=2591)
    
  4. Borko, H.: Determining user requirements for an information storage and retrieval system : a systems approach (1962) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:borko in 4980) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 4980, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=4980)
    
  5. Borko, H.: Informationswissenschaft : was ist das? (1968) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:borko in 4981) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 4981, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=4981)
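
The author scores above all follow the same pattern: fieldWeight = tf × idf × fieldNorm. The sketch below reproduces the value 5.504072 from the numbers shown in the listing. The tf and idf formulas are assumptions: they are the ones commonly documented for older Lucene releases of ClassicSimilarity, and they agree with the figures printed here, but the record itself does not state them. The function names are hypothetical.

```python
from math import log, sqrt


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency (assumed)."""
    return sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)) (assumed)."""
    return 1.0 + log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight as broken down in the listing: tf * idf * fieldNorm."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing for the query term author_txt:borko.
author_score = field_weight(freq=1.0, doc_freq=17, max_docs=44218, field_norm=0.625)

if __name__ == "__main__":
    print(f"{author_score:.6f}")
```

Because every listed record has one occurrence of "borko" in a field with the same norm, all five entries receive the same score.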
    

Similar documents (content)

  1. Needham, R.M.; Sparck Jones, K.: Keywords and clumps (1985) 0.64
    0.6425725 = sum of:
      0.6425725 = product of:
        1.1474509 = sum of:
          0.018207088 = weight(abstract_txt:used in 3645) [ClassicSimilarity], result of:
            0.018207088 = score(doc=3645,freq=3.0), product of:
              0.08010712 = queryWeight, product of:
                1.0564915 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.022571294 = queryNorm
              0.22728425 = fieldWeight in 3645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.01909669 = weight(abstract_txt:being in 3645) [ClassicSimilarity], result of:
            0.01909669 = score(doc=3645,freq=1.0), product of:
              0.108362004 = queryWeight, product of:
                1.064142 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.022571294 = queryNorm
              0.17623049 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.050873205 = weight(abstract_txt:question in 3645) [ClassicSimilarity], result of:
            0.050873205 = score(doc=3645,freq=3.0), product of:
              0.14438564 = queryWeight, product of:
                1.2283527 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.022571294 = queryNorm
              0.35234258 = fieldWeight in 3645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.13396816 = weight(abstract_txt:jones in 3645) [ClassicSimilarity], result of:
            0.13396816 = score(doc=3645,freq=4.0), product of:
              0.21853566 = queryWeight, product of:
                1.2338904 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.022571294 = queryNorm
              0.61302656 = fieldWeight in 3645, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.045745973 = weight(abstract_txt:selection in 3645) [ClassicSimilarity], result of:
            0.045745973 = score(doc=3645,freq=2.0), product of:
              0.15397973 = queryWeight, product of:
                1.268507 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.022571294 = queryNorm
              0.29709086 = fieldWeight in 3645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.108737744 = weight(abstract_txt:classes in 3645) [ClassicSimilarity], result of:
            0.108737744 = score(doc=3645,freq=7.0), product of:
              0.1806311 = queryWeight, product of:
                1.373907 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.022571294 = queryNorm
              0.60198796 = fieldWeight in 3645, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.21116535 = weight(abstract_txt:sparck in 3645) [ClassicSimilarity], result of:
            0.21116535 = score(doc=3645,freq=4.0), product of:
              0.29598498 = queryWeight, product of:
                1.4359863 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.022571294 = queryNorm
              0.71343267 = fieldWeight in 3645, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.029677358 = weight(abstract_txt:were in 3645) [ClassicSimilarity], result of:
            0.029677358 = score(doc=3645,freq=3.0), product of:
              0.119517356 = queryWeight, product of:
                1.4427828 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.022571294 = queryNorm
              0.24831003 = fieldWeight in 3645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.049362186 = weight(abstract_txt:asked in 3645) [ClassicSimilarity], result of:
            0.049362186 = score(doc=3645,freq=1.0), product of:
              0.20409605 = queryWeight, product of:
                1.460422 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.022571294 = queryNorm
              0.24185763 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.08064413 = weight(abstract_txt:automatically in 3645) [ClassicSimilarity], result of:
            0.08064413 = score(doc=3645,freq=3.0), product of:
              0.21605252 = queryWeight, product of:
                1.7350425 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022571294 = queryNorm
              0.37326172 = fieldWeight in 3645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.1286173 = weight(abstract_txt:automatic in 3645) [ClassicSimilarity], result of:
            0.1286173 = score(doc=3645,freq=7.0), product of:
              0.2395272 = queryWeight, product of:
                2.0425038 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.022571294 = queryNorm
              0.5369632 = fieldWeight in 3645, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.041407168 = weight(abstract_txt:that in 3645) [ClassicSimilarity], result of:
            0.041407168 = score(doc=3645,freq=14.0), product of:
              0.11956369 = queryWeight, product of:
                2.2355826 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.022571294 = queryNorm
              0.34631893 = fieldWeight in 3645, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.11160727 = weight(abstract_txt:derived in 3645) [ClassicSimilarity], result of:
            0.11160727 = score(doc=3645,freq=2.0), product of:
              0.3515874 = queryWeight, product of:
                2.7107704 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.022571294 = queryNorm
              0.3174382 = fieldWeight in 3645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
          0.11834127 = weight(abstract_txt:classification in 3645) [ClassicSimilarity], result of:
            0.11834127 = score(doc=3645,freq=5.0), product of:
              0.33938485 = queryWeight, product of:
                3.766494 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022571294 = queryNorm
              0.34869343 = fieldWeight in 3645, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3645)
        0.56 = coord(14/25)
    
  2. Adamson, G.W.; Boreham, J.: The use of an association measure based on character structure to identify semantically related pairs of words and document titles (1974) 0.21
    0.21332808 = sum of:
      0.21332808 = product of:
        0.761886 = sum of:
          0.025228482 = weight(abstract_txt:used in 398) [ClassicSimilarity], result of:
            0.025228482 = score(doc=398,freq=1.0), product of:
              0.08010712 = queryWeight, product of:
                1.0564915 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.022571294 = queryNorm
              0.3149343 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.0853525 = weight(abstract_txt:sample in 398) [ClassicSimilarity], result of:
            0.0853525 = score(doc=398,freq=1.0), product of:
              0.16402435 = queryWeight, product of:
                1.3092278 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.022571294 = queryNorm
              0.5203648 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.09863761 = weight(abstract_txt:classes in 398) [ClassicSimilarity], result of:
            0.09863761 = score(doc=398,freq=1.0), product of:
              0.1806311 = queryWeight, product of:
                1.373907 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.022571294 = queryNorm
              0.5460721 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.041122153 = weight(abstract_txt:were in 398) [ClassicSimilarity], result of:
            0.041122153 = score(doc=398,freq=1.0), product of:
              0.119517356 = queryWeight, product of:
                1.4427828 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.022571294 = queryNorm
              0.34406847 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.11667065 = weight(abstract_txt:automatic in 398) [ClassicSimilarity], result of:
            0.11667065 = score(doc=398,freq=1.0), product of:
              0.2395272 = queryWeight, product of:
                2.0425038 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.022571294 = queryNorm
              0.48708728 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.26785743 = weight(abstract_txt:derived in 398) [ClassicSimilarity], result of:
            0.26785743 = score(doc=398,freq=2.0), product of:
              0.3515874 = queryWeight, product of:
                2.7107704 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.022571294 = queryNorm
              0.7618516 = fieldWeight in 398, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
          0.12701717 = weight(abstract_txt:classification in 398) [ClassicSimilarity], result of:
            0.12701717 = score(doc=398,freq=1.0), product of:
              0.33938485 = queryWeight, product of:
                3.766494 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022571294 = queryNorm
              0.37425706 = fieldWeight in 398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=398)
        0.28 = coord(7/25)
    
  3. Lewandowski, D.; Drechsler, J.; Mach, S. von: Deriving query intents from web search engine queries (2012) 0.21
    0.21083875 = sum of:
      0.21083875 = product of:
        0.6588711 = sum of:
          0.02913134 = weight(abstract_txt:used in 385) [ClassicSimilarity], result of:
            0.02913134 = score(doc=385,freq=3.0), product of:
              0.08010712 = queryWeight, product of:
                1.0564915 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.022571294 = queryNorm
              0.3636548 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.046994656 = weight(abstract_txt:question in 385) [ClassicSimilarity], result of:
            0.046994656 = score(doc=385,freq=1.0), product of:
              0.14438564 = queryWeight, product of:
                1.2283527 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.022571294 = queryNorm
              0.32548013 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.047483772 = weight(abstract_txt:were in 385) [ClassicSimilarity], result of:
            0.047483772 = score(doc=385,freq=3.0), product of:
              0.119517356 = queryWeight, product of:
                1.4427828 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.022571294 = queryNorm
              0.39729604 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.0789795 = weight(abstract_txt:asked in 385) [ClassicSimilarity], result of:
            0.0789795 = score(doc=385,freq=1.0), product of:
              0.20409605 = queryWeight, product of:
                1.460422 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.022571294 = queryNorm
              0.38697222 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.10999814 = weight(abstract_txt:automatic in 385) [ClassicSimilarity], result of:
            0.10999814 = score(doc=385,freq=2.0), product of:
              0.2395272 = queryWeight, product of:
                2.0425038 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.022571294 = queryNorm
              0.45923027 = fieldWeight in 385, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.03066847 = weight(abstract_txt:that in 385) [ClassicSimilarity], result of:
            0.03066847 = score(doc=385,freq=3.0), product of:
              0.11956369 = queryWeight, product of:
                2.2355826 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.022571294 = queryNorm
              0.2565032 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.1262692 = weight(abstract_txt:derived in 385) [ClassicSimilarity], result of:
            0.1262692 = score(doc=385,freq=1.0), product of:
              0.3515874 = queryWeight, product of:
                2.7107704 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.022571294 = queryNorm
              0.3591403 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
          0.18934603 = weight(abstract_txt:classification in 385) [ClassicSimilarity], result of:
            0.18934603 = score(doc=385,freq=5.0), product of:
              0.33938485 = queryWeight, product of:
                3.766494 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022571294 = queryNorm
              0.5579095 = fieldWeight in 385, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=385)
        0.32 = coord(8/25)
    
  4. Sebastiani, F.: Classification of text, automatic (2006) 0.19
    0.18663128 = sum of:
      0.18663128 = product of:
        0.7776303 = sum of:
          0.12759902 = weight(abstract_txt:texts in 5003) [ClassicSimilarity], result of:
            0.12759902 = score(doc=5003,freq=2.0), product of:
              0.17021024 = queryWeight, product of:
                1.333687 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.022571294 = queryNorm
              0.7496553 = fieldWeight in 5003, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
          0.09863761 = weight(abstract_txt:classes in 5003) [ClassicSimilarity], result of:
            0.09863761 = score(doc=5003,freq=1.0), product of:
              0.1806311 = queryWeight, product of:
                1.373907 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.022571294 = queryNorm
              0.5460721 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
          0.11174379 = weight(abstract_txt:automatically in 5003) [ClassicSimilarity], result of:
            0.11174379 = score(doc=5003,freq=1.0), product of:
              0.21605252 = queryWeight, product of:
                1.7350425 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022571294 = queryNorm
              0.5172066 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
          0.11667065 = weight(abstract_txt:automatic in 5003) [ClassicSimilarity], result of:
            0.11667065 = score(doc=5003,freq=1.0), product of:
              0.2395272 = queryWeight, product of:
                2.0425038 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.022571294 = queryNorm
              0.48708728 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
          0.19596209 = weight(abstract_txt:categories in 5003) [ClassicSimilarity], result of:
            0.19596209 = score(doc=5003,freq=2.0), product of:
              0.28546017 = queryWeight, product of:
                2.4425802 = boost
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.022571294 = queryNorm
              0.68647784 = fieldWeight in 5003, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
          0.12701717 = weight(abstract_txt:classification in 5003) [ClassicSimilarity], result of:
            0.12701717 = score(doc=5003,freq=1.0), product of:
              0.33938485 = queryWeight, product of:
                3.766494 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022571294 = queryNorm
              0.37425706 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=5003)
        0.24 = coord(6/25)
    
  5. Tsai, C.-F.; McGarry, K.; Tait, J.: Qualitative evaluation of automatic assignment of keywords to images (2006) 0.17
    0.17400414 = sum of:
      0.17400414 = product of:
        0.5437629 = sum of:
          0.016818987 = weight(abstract_txt:used in 963) [ClassicSimilarity], result of:
            0.016818987 = score(doc=963,freq=1.0), product of:
              0.08010712 = queryWeight, product of:
                1.0564915 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.022571294 = queryNorm
              0.2099562 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.08496847 = weight(abstract_txt:evaluation in 963) [ClassicSimilarity], result of:
            0.08496847 = score(doc=963,freq=8.0), product of:
              0.10714376 = queryWeight, product of:
                1.0581434 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.022571294 = queryNorm
              0.7930324 = fieldWeight in 963, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.0305547 = weight(abstract_txt:being in 963) [ClassicSimilarity], result of:
            0.0305547 = score(doc=963,freq=1.0), product of:
              0.108362004 = queryWeight, product of:
                1.064142 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.022571294 = queryNorm
              0.28196877 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.0789795 = weight(abstract_txt:asked in 963) [ClassicSimilarity], result of:
            0.0789795 = score(doc=963,freq=1.0), product of:
              0.20409605 = queryWeight, product of:
                1.460422 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.022571294 = queryNorm
              0.38697222 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.07449586 = weight(abstract_txt:automatically in 963) [ClassicSimilarity], result of:
            0.07449586 = score(doc=963,freq=1.0), product of:
              0.21605252 = queryWeight, product of:
                1.7350425 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022571294 = queryNorm
              0.3448044 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.15556087 = weight(abstract_txt:automatic in 963) [ClassicSimilarity], result of:
            0.15556087 = score(doc=963,freq=4.0), product of:
              0.2395272 = queryWeight, product of:
                2.0425038 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.022571294 = queryNorm
              0.6494497 = fieldWeight in 963, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.01770645 = weight(abstract_txt:that in 963) [ClassicSimilarity], result of:
            0.01770645 = score(doc=963,freq=1.0), product of:
              0.11956369 = queryWeight, product of:
                2.2355826 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.022571294 = queryNorm
              0.1480922 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
          0.08467811 = weight(abstract_txt:classification in 963) [ClassicSimilarity], result of:
            0.08467811 = score(doc=963,freq=1.0), product of:
              0.33938485 = queryWeight, product of:
                3.766494 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022571294 = queryNorm
              0.2495047 = fieldWeight in 963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=963)
        0.32 = coord(8/25)
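
The content scores above combine several terms: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is multiplied by coord (matched terms divided by the 25 query terms). The sketch below recomputes the score of item 4 (Sebastiani, doc 5003) from the boosts, document frequencies and term frequencies printed in the listing. As before, the tf and idf formulas are assumed from older Lucene ClassicSimilarity documentation, and the function and variable names are hypothetical.

```python
from math import log, sqrt

MAX_DOCS = 44218
QUERY_NORM = 0.022571294
QUERY_TERM_COUNT = 25  # denominator of coord(...) in the listing

# (term, boost, docFreq, termFreq) copied from the entry for doc 5003.
MATCHED_TERMS = [
    ("texts", 1.333687, 420, 2.0),
    ("classes", 1.373907, 354, 1.0),
    ("automatically", 1.7350425, 482, 1.0),
    ("automatic", 2.0425038, 665, 1.0),
    ("categories", 2.4425802, 677, 2.0),
    ("classification", 3.766494, 2218, 1.0),
]
FIELD_NORM = 0.09375  # fieldNorm(doc=5003)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    """queryWeight * fieldWeight for one matched term."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, query_term_count):
    """coord * sum of per-term scores, as in the listing's breakdown."""
    total = sum(term_score(b, df, f, field_norm) for _, b, df, f in terms)
    coord = len(terms) / query_term_count
    return coord * total


content_score = document_score(MATCHED_TERMS, FIELD_NORM, QUERY_TERM_COUNT)

if __name__ == "__main__":
    print(f"{content_score:.6f}")
```

The coord factor explains why a record matching only a few of the 25 query terms ranks low even when individual term weights are high.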