Document (#30899)

Author
Golub, K.
Title
Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations
Source
New review of hypermedia and multimedia. 12(2006) no.1, S.11-27
Year
2006
Abstract
The primary objective of this study was to identify and address problems of applying a controlled vocabulary in automated subject classification of textual Web pages, in the area of engineering. Web pages have special characteristics such as structural information, but are at the same time rather heterogeneous. The classification approach used comprises string-to-string matching between words in a term list extracted from the Ei (Engineering Information) thesaurus and classification scheme, and words in the text to be classified. Based on a sample of 70 Web pages, a number of problems with the term list are identified. Reasons for those problems are discussed and improvements proposed. Methods for implementing the improvements are also specified, suggesting further research.
Content
Beitrag eines Themenheftes "Knowledge organization systems and services"
Theme
Automatisches Klassifizieren
Field
Ingenieurwissenschaften

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.30
    5.296846 = sum of:
      5.296846 = weight(author_txt:golub in 601) [ClassicSimilarity], result of:
        5.296846 = fieldWeight in 601, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.474954 = idf(docFreq=24, maxDocs=44083)
          0.625 = fieldNorm(doc=601)
    
  2. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.30
    5.296846 = sum of:
      5.296846 = weight(author_txt:golub in 1135) [ClassicSimilarity], result of:
        5.296846 = fieldWeight in 1135, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.474954 = idf(docFreq=24, maxDocs=44083)
          0.625 = fieldNorm(doc=1135)
    
  3. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 5.30
    5.296846 = sum of:
      5.296846 = weight(author_txt:golub in 559) [ClassicSimilarity], result of:
        5.296846 = fieldWeight in 559, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.474954 = idf(docFreq=24, maxDocs=44083)
          0.625 = fieldNorm(doc=559)
    
  4. Golub, K.: Subject access in Swedish discovery services (2018) 5.30
    5.296846 = sum of:
      5.296846 = weight(author_txt:golub in 5380) [ClassicSimilarity], result of:
        5.296846 = fieldWeight in 5380, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.474954 = idf(docFreq=24, maxDocs=44083)
          0.625 = fieldNorm(doc=5380)
    
  5. Golub, K.: Automatic subject indexing of text (2019) 5.30
    5.296846 = sum of:
      5.296846 = weight(author_txt:golub in 269) [ClassicSimilarity], result of:
        5.296846 = fieldWeight in 269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.474954 = idf(docFreq=24, maxDocs=44083)
          0.625 = fieldNorm(doc=269)
    

Similar documents (content)

  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.45
    0.44588193 = sum of:
      0.44588193 = product of:
        1.013368 = sum of:
          0.0659883 = weight(abstract_txt:matching in 2462) [ClassicSimilarity], result of:
            0.0659883 = score(doc=2462,freq=2.0), product of:
              0.123361215 = queryWeight, product of:
                1.0334392 = boost
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.019724244 = queryNorm
              0.5349194 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.049121607 = weight(abstract_txt:extracted in 2462) [ClassicSimilarity], result of:
            0.049121607 = score(doc=2462,freq=1.0), product of:
              0.12766123 = queryWeight, product of:
                1.0512962 = boost
                6.156495 = idf(docFreq=253, maxDocs=44083)
                0.019724244 = queryNorm
              0.38478094 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.156495 = idf(docFreq=253, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.013689531 = weight(abstract_txt:based in 2462) [ClassicSimilarity], result of:
            0.013689531 = score(doc=2462,freq=1.0), product of:
              0.06862458 = queryWeight, product of:
                1.0900601 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.019724244 = queryNorm
              0.19948438 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.025202906 = weight(abstract_txt:subject in 2462) [ClassicSimilarity], result of:
            0.025202906 = score(doc=2462,freq=1.0), product of:
              0.103083156 = queryWeight, product of:
                1.3359939 = boost
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.019724244 = queryNorm
              0.24449101 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.046863705 = weight(abstract_txt:term in 2462) [ClassicSimilarity], result of:
            0.046863705 = score(doc=2462,freq=1.0), product of:
              0.15587568 = queryWeight, product of:
                1.6428571 = boost
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.019724244 = queryNorm
              0.30064794 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.13022423 = weight(abstract_txt:vocabulary in 2462) [ClassicSimilarity], result of:
            0.13022423 = score(doc=2462,freq=4.0), product of:
              0.19408643 = queryWeight, product of:
                1.8331931 = boost
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.019724244 = queryNorm
              0.67096 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.13701597 = weight(abstract_txt:controlled in 2462) [ClassicSimilarity], result of:
            0.13701597 = score(doc=2462,freq=4.0), product of:
              0.20077737 = queryWeight, product of:
                1.864524 = boost
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.019724244 = queryNorm
              0.68242735 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.10483196 = weight(abstract_txt:automated in 2462) [ClassicSimilarity], result of:
            0.10483196 = score(doc=2462,freq=2.0), product of:
              0.21161175 = queryWeight, product of:
                1.9141699 = boost
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.019724244 = queryNorm
              0.49539763 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.12649725 = weight(abstract_txt:engineering in 2462) [ClassicSimilarity], result of:
            0.12649725 = score(doc=2462,freq=2.0), product of:
              0.2398454 = queryWeight, product of:
                2.0378692 = boost
                5.966982 = idf(docFreq=306, maxDocs=44083)
                0.019724244 = queryNorm
              0.52741164 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.966982 = idf(docFreq=306, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.22066279 = weight(abstract_txt:string in 2462) [ClassicSimilarity], result of:
            0.22066279 = score(doc=2462,freq=2.0), product of:
              0.3475602 = queryWeight, product of:
                2.4531586 = boost
                7.18297 = idf(docFreq=90, maxDocs=44083)
                0.019724244 = queryNorm
              0.63489085 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.18297 = idf(docFreq=90, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
          0.093269736 = weight(abstract_txt:classification in 2462) [ClassicSimilarity], result of:
            0.093269736 = score(doc=2462,freq=3.0), product of:
              0.2154521 = queryWeight, product of:
                2.7314985 = boost
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.019724244 = queryNorm
              0.43290243 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.0625 = fieldNorm(doc=2462)
        0.44 = coord(11/25)
    
  2. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.35
    0.35075995 = sum of:
      0.35075995 = product of:
        0.87689984 = sum of:
          0.0659883 = weight(abstract_txt:matching in 559) [ClassicSimilarity], result of:
            0.0659883 = score(doc=559,freq=2.0), product of:
              0.123361215 = queryWeight, product of:
                1.0334392 = boost
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.019724244 = queryNorm
              0.5349194 = fieldWeight in 559, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.04793649 = weight(abstract_txt:classified in 559) [ClassicSimilarity], result of:
            0.04793649 = score(doc=559,freq=1.0), product of:
              0.12559956 = queryWeight, product of:
                1.0427728 = boost
                6.1065807 = idf(docFreq=266, maxDocs=44083)
                0.019724244 = queryNorm
              0.3816613 = fieldWeight in 559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1065807 = idf(docFreq=266, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.013689531 = weight(abstract_txt:based in 559) [ClassicSimilarity], result of:
            0.013689531 = score(doc=559,freq=1.0), product of:
              0.06862458 = queryWeight, product of:
                1.0900601 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.019724244 = queryNorm
              0.19948438 = fieldWeight in 559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.025202906 = weight(abstract_txt:subject in 559) [ClassicSimilarity], result of:
            0.025202906 = score(doc=559,freq=1.0), product of:
              0.103083156 = queryWeight, product of:
                1.3359939 = boost
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.019724244 = queryNorm
              0.24449101 = fieldWeight in 559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.112777494 = weight(abstract_txt:vocabulary in 559) [ClassicSimilarity], result of:
            0.112777494 = score(doc=559,freq=3.0), product of:
              0.19408643 = queryWeight, product of:
                1.8331931 = boost
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.019724244 = queryNorm
              0.5810684 = fieldWeight in 559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.11865931 = weight(abstract_txt:controlled in 559) [ClassicSimilarity], result of:
            0.11865931 = score(doc=559,freq=3.0), product of:
              0.20077737 = queryWeight, product of:
                1.864524 = boost
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.019724244 = queryNorm
              0.5909994 = fieldWeight in 559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.1283924 = weight(abstract_txt:automated in 559) [ClassicSimilarity], result of:
            0.1283924 = score(doc=559,freq=3.0), product of:
              0.21161175 = queryWeight, product of:
                1.9141699 = boost
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.019724244 = queryNorm
              0.6067357 = fieldWeight in 559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.08974132 = weight(abstract_txt:textual in 559) [ClassicSimilarity], result of:
            0.08974132 = score(doc=559,freq=1.0), product of:
              0.24037111 = queryWeight, product of:
                2.0401013 = boost
                5.973518 = idf(docFreq=304, maxDocs=44083)
                0.019724244 = queryNorm
              0.37334487 = fieldWeight in 559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.973518 = idf(docFreq=304, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.22066279 = weight(abstract_txt:string in 559) [ClassicSimilarity], result of:
            0.22066279 = score(doc=559,freq=2.0), product of:
              0.3475602 = queryWeight, product of:
                2.4531586 = boost
                7.18297 = idf(docFreq=90, maxDocs=44083)
                0.019724244 = queryNorm
              0.63489085 = fieldWeight in 559, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.18297 = idf(docFreq=90, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
          0.05384931 = weight(abstract_txt:classification in 559) [ClassicSimilarity], result of:
            0.05384931 = score(doc=559,freq=1.0), product of:
              0.2154521 = queryWeight, product of:
                2.7314985 = boost
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.019724244 = queryNorm
              0.24993634 = fieldWeight in 559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.0625 = fieldNorm(doc=559)
        0.4 = coord(10/25)
    
  3. Golub, K.; Lykke, M.: Automated classification of web pages in hierarchical browsing (2009) 0.28
    0.27633548 = sum of:
      0.27633548 = product of:
        0.6908387 = sum of:
          0.01693993 = weight(abstract_txt:based in 4615) [ClassicSimilarity], result of:
            0.01693993 = score(doc=4615,freq=2.0), product of:
              0.06862458 = queryWeight, product of:
                1.0900601 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.019724244 = queryNorm
              0.24684933 = fieldWeight in 4615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.022052541 = weight(abstract_txt:subject in 4615) [ClassicSimilarity], result of:
            0.022052541 = score(doc=4615,freq=1.0), product of:
              0.103083156 = queryWeight, product of:
                1.3359939 = boost
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.019724244 = queryNorm
              0.21392964 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.04100574 = weight(abstract_txt:term in 4615) [ClassicSimilarity], result of:
            0.04100574 = score(doc=4615,freq=1.0), product of:
              0.15587568 = queryWeight, product of:
                1.6428571 = boost
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.019724244 = queryNorm
              0.26306695 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.05652183 = weight(abstract_txt:words in 4615) [ClassicSimilarity], result of:
            0.05652183 = score(doc=4615,freq=1.0), product of:
              0.19306019 = queryWeight, product of:
                1.82834 = boost
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.019724244 = queryNorm
              0.2927679 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.059944484 = weight(abstract_txt:controlled in 4615) [ClassicSimilarity], result of:
            0.059944484 = score(doc=4615,freq=1.0), product of:
              0.20077737 = queryWeight, product of:
                1.864524 = boost
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.019724244 = queryNorm
              0.29856196 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.091727965 = weight(abstract_txt:automated in 4615) [ClassicSimilarity], result of:
            0.091727965 = score(doc=4615,freq=2.0), product of:
              0.21161175 = queryWeight, product of:
                1.9141699 = boost
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.019724244 = queryNorm
              0.43347293 = fieldWeight in 4615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.07826619 = weight(abstract_txt:engineering in 4615) [ClassicSimilarity], result of:
            0.07826619 = score(doc=4615,freq=1.0), product of:
              0.2398454 = queryWeight, product of:
                2.0378692 = boost
                5.966982 = idf(docFreq=306, maxDocs=44083)
                0.019724244 = queryNorm
              0.32631934 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.966982 = idf(docFreq=306, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.12421139 = weight(abstract_txt:improvements in 4615) [ClassicSimilarity], result of:
            0.12421139 = score(doc=4615,freq=2.0), product of:
              0.25900784 = queryWeight, product of:
                2.1177127 = boost
                6.200768 = idf(docFreq=242, maxDocs=44083)
                0.019724244 = queryNorm
              0.47956616 = fieldWeight in 4615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.200768 = idf(docFreq=242, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.043895427 = weight(abstract_txt:problems in 4615) [ClassicSimilarity], result of:
            0.043895427 = score(doc=4615,freq=1.0), product of:
              0.18672045 = queryWeight, product of:
                2.2021768 = boost
                4.298722 = idf(docFreq=1627, maxDocs=44083)
                0.019724244 = queryNorm
              0.23508635 = fieldWeight in 4615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.298722 = idf(docFreq=1627, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
          0.15627322 = weight(abstract_txt:classification in 4615) [ClassicSimilarity], result of:
            0.15627322 = score(doc=4615,freq=11.0), product of:
              0.2154521 = queryWeight, product of:
                2.7314985 = boost
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.019724244 = queryNorm
              0.72532696 = fieldWeight in 4615, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.0546875 = fieldNorm(doc=4615)
        0.4 = coord(10/25)
    
  4. Dumais, S.T.: Latent semantic analysis (2003) 0.24
    0.24038547 = sum of:
      0.24038547 = product of:
        0.54633063 = sum of:
          0.02333039 = weight(abstract_txt:matching in 3463) [ClassicSimilarity], result of:
            0.02333039 = score(doc=3463,freq=1.0), product of:
              0.123361215 = queryWeight, product of:
                1.0334392 = boost
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.019724244 = queryNorm
              0.18912257 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0519223 = idf(docFreq=281, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.009679961 = weight(abstract_txt:based in 3463) [ClassicSimilarity], result of:
            0.009679961 = score(doc=3463,freq=2.0), product of:
              0.06862458 = queryWeight, product of:
                1.0900601 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.019724244 = queryNorm
              0.14105676 = fieldWeight in 3463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.037024572 = weight(abstract_txt:specified in 3463) [ClassicSimilarity], result of:
            0.037024572 = score(doc=3463,freq=1.0), product of:
              0.16783814 = queryWeight, product of:
                1.205427 = boost
                7.0591006 = idf(docFreq=102, maxDocs=44083)
                0.019724244 = queryNorm
              0.2205969 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0591006 = idf(docFreq=102, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.021826357 = weight(abstract_txt:subject in 3463) [ClassicSimilarity], result of:
            0.021826357 = score(doc=3463,freq=3.0), product of:
              0.103083156 = queryWeight, product of:
                1.3359939 = boost
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.019724244 = queryNorm
              0.21173543 = fieldWeight in 3463, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.023431852 = weight(abstract_txt:term in 3463) [ClassicSimilarity], result of:
            0.023431852 = score(doc=3463,freq=1.0), product of:
              0.15587568 = queryWeight, product of:
                1.6428571 = boost
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.019724244 = queryNorm
              0.15032397 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.810367 = idf(docFreq=975, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.11188421 = weight(abstract_txt:words in 3463) [ClassicSimilarity], result of:
            0.11188421 = score(doc=3463,freq=12.0), product of:
              0.19306019 = queryWeight, product of:
                1.82834 = boost
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.019724244 = queryNorm
              0.5795302 = fieldWeight in 3463, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.065112114 = weight(abstract_txt:vocabulary in 3463) [ClassicSimilarity], result of:
            0.065112114 = score(doc=3463,freq=4.0), product of:
              0.19408643 = queryWeight, product of:
                1.8331931 = boost
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.019724244 = queryNorm
              0.33548 = fieldWeight in 3463, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.03295252 = weight(abstract_txt:list in 3463) [ClassicSimilarity], result of:
            0.03295252 = score(doc=3463,freq=1.0), product of:
              0.19565895 = queryWeight, product of:
                1.8406044 = boost
                5.389381 = idf(docFreq=546, maxDocs=44083)
                0.019724244 = queryNorm
              0.16841815 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389381 = idf(docFreq=546, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.096884914 = weight(abstract_txt:controlled in 3463) [ClassicSimilarity], result of:
            0.096884914 = score(doc=3463,freq=8.0), product of:
              0.20077737 = queryWeight, product of:
                1.864524 = boost
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.019724244 = queryNorm
              0.48254898 = fieldWeight in 3463, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.050166205 = weight(abstract_txt:problems in 3463) [ClassicSimilarity], result of:
            0.050166205 = score(doc=3463,freq=4.0), product of:
              0.18672045 = queryWeight, product of:
                2.2021768 = boost
                4.298722 = idf(docFreq=1627, maxDocs=44083)
                0.019724244 = queryNorm
              0.2686701 = fieldWeight in 3463, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.298722 = idf(docFreq=1627, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
          0.07403755 = weight(abstract_txt:pages in 3463) [ClassicSimilarity], result of:
            0.07403755 = score(doc=3463,freq=1.0), product of:
              0.42288148 = queryWeight, product of:
                3.8267927 = boost
                5.6025195 = idf(docFreq=441, maxDocs=44083)
                0.019724244 = queryNorm
              0.17507873 = fieldWeight in 3463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6025195 = idf(docFreq=441, maxDocs=44083)
                0.03125 = fieldNorm(doc=3463)
        0.44 = coord(11/25)
    
  5. Wang, J.: Automatic thesaurus development : term extraction from title metadata (2006) 0.22
    0.22065715 = sum of:
      0.22065715 = product of:
        0.6129365 = sum of:
          0.04318529 = weight(abstract_txt:applying in 64) [ClassicSimilarity], result of:
            0.04318529 = score(doc=64,freq=1.0), product of:
              0.11715689 = queryWeight, product of:
                1.0071161 = boost
                5.897772 = idf(docFreq=328, maxDocs=44083)
                0.019724244 = queryNorm
              0.36861074 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.897772 = idf(docFreq=328, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.049121607 = weight(abstract_txt:extracted in 64) [ClassicSimilarity], result of:
            0.049121607 = score(doc=64,freq=1.0), product of:
              0.12766123 = queryWeight, product of:
                1.0512962 = boost
                6.156495 = idf(docFreq=253, maxDocs=44083)
                0.019724244 = queryNorm
              0.38478094 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.156495 = idf(docFreq=253, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.013689531 = weight(abstract_txt:based in 64) [ClassicSimilarity], result of:
            0.013689531 = score(doc=64,freq=1.0), product of:
              0.06862458 = queryWeight, product of:
                1.0900601 = boost
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.019724244 = queryNorm
              0.19948438 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.19175 = idf(docFreq=4924, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.03564229 = weight(abstract_txt:subject in 64) [ClassicSimilarity], result of:
            0.03564229 = score(doc=64,freq=2.0), product of:
              0.103083156 = queryWeight, product of:
                1.3359939 = boost
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.019724244 = queryNorm
              0.3457625 = fieldWeight in 64, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9118562 = idf(docFreq=2396, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.11188421 = weight(abstract_txt:words in 64) [ClassicSimilarity], result of:
            0.11188421 = score(doc=64,freq=3.0), product of:
              0.19306019 = queryWeight, product of:
                1.82834 = boost
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.019724244 = queryNorm
              0.5795302 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3534703 = idf(docFreq=566, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.112777494 = weight(abstract_txt:vocabulary in 64) [ClassicSimilarity], result of:
            0.112777494 = score(doc=64,freq=3.0), product of:
              0.19408643 = queryWeight, product of:
                1.8331931 = boost
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.019724244 = queryNorm
              0.5810684 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.36768 = idf(docFreq=558, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.11865931 = weight(abstract_txt:controlled in 64) [ClassicSimilarity], result of:
            0.11865931 = score(doc=64,freq=3.0), product of:
              0.20077737 = queryWeight, product of:
                1.864524 = boost
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.019724244 = queryNorm
              0.5909994 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.459419 = idf(docFreq=509, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.07412739 = weight(abstract_txt:automated in 64) [ClassicSimilarity], result of:
            0.07412739 = score(doc=64,freq=1.0), product of:
              0.21161175 = queryWeight, product of:
                1.9141699 = boost
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.019724244 = queryNorm
              0.35029903 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6047845 = idf(docFreq=440, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
          0.05384931 = weight(abstract_txt:classification in 64) [ClassicSimilarity], result of:
            0.05384931 = score(doc=64,freq=1.0), product of:
              0.2154521 = queryWeight, product of:
                2.7314985 = boost
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.019724244 = queryNorm
              0.24993634 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9989815 = idf(docFreq=2196, maxDocs=44083)
                0.0625 = fieldNorm(doc=64)
        0.36 = coord(9/25)