Document (#30899)

Author
Golub, K.
Title
Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations
Source
New review of hypermedia and multimedia. 12(2006) no.1, S.11-27
Year
2006
Abstract
The primary objective of this study was to identify and address problems of applying a controlled vocabulary in automated subject classification of textual Web pages, in the area of engineering. Web pages have special characteristics such as structural information, but are at the same time rather heterogeneous. The classification approach used comprises string-to-string matching between words in a term list extracted from the Ei (Engineering Information) thesaurus and classification scheme, and words in the text to be classified. Based on a sample of 70 Web pages, a number of problems with the term list are identified. Reasons for those problems are discussed and improvements proposed. Methods for implementing the improvements are also specified, suggesting further research.
Content
Beitrag eines Themenheftes "Knowledge organization systems and services"
Theme
Automatisches Klassifizieren
Field
Ingenieurwissenschaften

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:golub in 601) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 601, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=601)
    
  2. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:golub in 2135) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 2135, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=2135)
    
  3. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:golub in 1559) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 1559, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=1559)
    
  4. Golub, K.: Subject access in Swedish discovery services (2018) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:golub in 380) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 380, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=380)
    
  5. Golub, K.: Automatic subject indexing of text (2019) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:golub in 1269) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 1269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=1269)
    

Similar documents (content)

  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.45
    0.4460344 = sum of:
      0.4460344 = product of:
        1.0137146 = sum of:
          0.06616206 = weight(abstract_txt:matching in 3462) [ClassicSimilarity], result of:
            0.06616206 = score(doc=3462,freq=2.0), product of:
              0.12350542 = queryWeight, product of:
                1.0281959 = boost
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.019819021 = queryNorm
              0.53570163 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.050162546 = weight(abstract_txt:extracted in 3462) [ClassicSimilarity], result of:
            0.050162546 = score(doc=3462,freq=1.0), product of:
              0.12938276 = queryWeight, product of:
                1.0523763 = boost
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.019819021 = queryNorm
              0.38770658 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.013886361 = weight(abstract_txt:based in 3462) [ClassicSimilarity], result of:
            0.013886361 = score(doc=3462,freq=1.0), product of:
              0.06924031 = queryWeight, product of:
                1.088748 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.019819021 = queryNorm
              0.20055313 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.02512536 = weight(abstract_txt:subject in 3462) [ClassicSimilarity], result of:
            0.02512536 = score(doc=3462,freq=1.0), product of:
              0.10281146 = queryWeight, product of:
                1.3266875 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.019819021 = queryNorm
              0.24438286 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.047320884 = weight(abstract_txt:term in 3462) [ClassicSimilarity], result of:
            0.047320884 = score(doc=3462,freq=1.0), product of:
              0.1567961 = queryWeight, product of:
                1.6383832 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.019819021 = queryNorm
              0.30179885 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.12945847 = weight(abstract_txt:vocabulary in 3462) [ClassicSimilarity], result of:
            0.12945847 = score(doc=3462,freq=4.0), product of:
              0.19321181 = queryWeight, product of:
                1.8187152 = boost
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.019819021 = queryNorm
              0.67003393 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.1376125 = weight(abstract_txt:controlled in 3462) [ClassicSimilarity], result of:
            0.1376125 = score(doc=3462,freq=4.0), product of:
              0.201242 = queryWeight, product of:
                1.8561248 = boost
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.019819021 = queryNorm
              0.683816 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.105517335 = weight(abstract_txt:automated in 3462) [ClassicSimilarity], result of:
            0.105517335 = score(doc=3462,freq=2.0), product of:
              0.21240884 = queryWeight, product of:
                1.9069273 = boost
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.019819021 = queryNorm
              0.4967653 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.12684354 = weight(abstract_txt:engineering in 3462) [ClassicSimilarity], result of:
            0.12684354 = score(doc=3462,freq=2.0), product of:
              0.24014246 = queryWeight, product of:
                2.0276003 = boost
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.019819021 = queryNorm
              0.5282012 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.21845132 = weight(abstract_txt:string in 3462) [ClassicSimilarity], result of:
            0.21845132 = score(doc=3462,freq=2.0), product of:
              0.34503233 = queryWeight, product of:
                2.4304 = boost
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.019819021 = queryNorm
              0.6331329 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
          0.09317414 = weight(abstract_txt:classification in 3462) [ClassicSimilarity], result of:
            0.09317414 = score(doc=3462,freq=3.0), product of:
              0.21517898 = queryWeight, product of:
                2.7143307 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.019819021 = queryNorm
              0.4330076 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0625 = fieldNorm(doc=3462)
        0.44 = coord(11/25)
    
  2. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.35
    0.35031205 = sum of:
      0.35031205 = product of:
        0.8757801 = sum of:
          0.06616206 = weight(abstract_txt:matching in 1559) [ClassicSimilarity], result of:
            0.06616206 = score(doc=1559,freq=2.0), product of:
              0.12350542 = queryWeight, product of:
                1.0281959 = boost
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.019819021 = queryNorm
              0.53570163 = fieldWeight in 1559, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.047840293 = weight(abstract_txt:classified in 1559) [ClassicSimilarity], result of:
            0.047840293 = score(doc=1559,freq=1.0), product of:
              0.12535815 = queryWeight, product of:
                1.0358793 = boost
                6.1060624 = idf(docFreq=258, maxDocs=42740)
                0.019819021 = queryNorm
              0.3816289 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1060624 = idf(docFreq=258, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.013886361 = weight(abstract_txt:based in 1559) [ClassicSimilarity], result of:
            0.013886361 = score(doc=1559,freq=1.0), product of:
              0.06924031 = queryWeight, product of:
                1.088748 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.019819021 = queryNorm
              0.20055313 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.02512536 = weight(abstract_txt:subject in 1559) [ClassicSimilarity], result of:
            0.02512536 = score(doc=1559,freq=1.0), product of:
              0.10281146 = queryWeight, product of:
                1.3266875 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.019819021 = queryNorm
              0.24438286 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.112114325 = weight(abstract_txt:vocabulary in 1559) [ClassicSimilarity], result of:
            0.112114325 = score(doc=1559,freq=3.0), product of:
              0.19321181 = queryWeight, product of:
                1.8187152 = boost
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.019819021 = queryNorm
              0.5802664 = fieldWeight in 1559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.11917592 = weight(abstract_txt:controlled in 1559) [ClassicSimilarity], result of:
            0.11917592 = score(doc=1559,freq=3.0), product of:
              0.201242 = queryWeight, product of:
                1.8561248 = boost
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.019819021 = queryNorm
              0.592202 = fieldWeight in 1559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.12923183 = weight(abstract_txt:automated in 1559) [ClassicSimilarity], result of:
            0.12923183 = score(doc=1559,freq=3.0), product of:
              0.21240884 = queryWeight, product of:
                1.9069273 = boost
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.019819021 = queryNorm
              0.6084108 = fieldWeight in 1559, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.0899986 = weight(abstract_txt:textual in 1559) [ClassicSimilarity], result of:
            0.0899986 = score(doc=1559,freq=1.0), product of:
              0.24068953 = queryWeight, product of:
                2.0299084 = boost
                5.982718 = idf(docFreq=292, maxDocs=42740)
                0.019819021 = queryNorm
              0.37391987 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.982718 = idf(docFreq=292, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.21845132 = weight(abstract_txt:string in 1559) [ClassicSimilarity], result of:
            0.21845132 = score(doc=1559,freq=2.0), product of:
              0.34503233 = queryWeight, product of:
                2.4304 = boost
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.019819021 = queryNorm
              0.6331329 = fieldWeight in 1559, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
          0.053794112 = weight(abstract_txt:classification in 1559) [ClassicSimilarity], result of:
            0.053794112 = score(doc=1559,freq=1.0), product of:
              0.21517898 = queryWeight, product of:
                2.7143307 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.019819021 = queryNorm
              0.24999705 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0625 = fieldNorm(doc=1559)
        0.4 = coord(10/25)
    
  3. Golub, K.; Lykke, M.: Automated classification of web pages in hierarchical browsing (2009) 0.28
    0.27766815 = sum of:
      0.27766815 = product of:
        0.69417036 = sum of:
          0.017183494 = weight(abstract_txt:based in 615) [ClassicSimilarity], result of:
            0.017183494 = score(doc=615,freq=2.0), product of:
              0.06924031 = queryWeight, product of:
                1.088748 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.019819021 = queryNorm
              0.24817184 = fieldWeight in 615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.021984689 = weight(abstract_txt:subject in 615) [ClassicSimilarity], result of:
            0.021984689 = score(doc=615,freq=1.0), product of:
              0.10281146 = queryWeight, product of:
                1.3266875 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.019819021 = queryNorm
              0.213835 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.04140577 = weight(abstract_txt:term in 615) [ClassicSimilarity], result of:
            0.04140577 = score(doc=615,freq=1.0), product of:
              0.1567961 = queryWeight, product of:
                1.6383832 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.019819021 = queryNorm
              0.264074 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.056580104 = weight(abstract_txt:words in 615) [ClassicSimilarity], result of:
            0.056580104 = score(doc=615,freq=1.0), product of:
              0.19307993 = queryWeight, product of:
                1.8180944 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.019819021 = queryNorm
              0.2930398 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.06020547 = weight(abstract_txt:controlled in 615) [ClassicSimilarity], result of:
            0.06020547 = score(doc=615,freq=1.0), product of:
              0.201242 = queryWeight, product of:
                1.8561248 = boost
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.019819021 = queryNorm
              0.2991695 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.09232767 = weight(abstract_txt:automated in 615) [ClassicSimilarity], result of:
            0.09232767 = score(doc=615,freq=2.0), product of:
              0.21240884 = queryWeight, product of:
                1.9069273 = boost
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.019819021 = queryNorm
              0.4346696 = fieldWeight in 615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.078480445 = weight(abstract_txt:engineering in 615) [ClassicSimilarity], result of:
            0.078480445 = score(doc=615,freq=1.0), product of:
              0.24014246 = queryWeight, product of:
                2.0276003 = boost
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.019819021 = queryNorm
              0.32680786 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.12623705 = weight(abstract_txt:improvements in 615) [ClassicSimilarity], result of:
            0.12623705 = score(doc=615,freq=2.0), product of:
              0.26166314 = queryWeight, product of:
                2.1165042 = boost
                6.2379403 = idf(docFreq=226, maxDocs=42740)
                0.019819021 = queryNorm
              0.48244107 = fieldWeight in 615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2379403 = idf(docFreq=226, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.043652598 = weight(abstract_txt:problems in 615) [ClassicSimilarity], result of:
            0.043652598 = score(doc=615,freq=1.0), product of:
              0.18592244 = queryWeight, product of:
                2.18504 = boost
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.019819021 = queryNorm
              0.23478928 = fieldWeight in 615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
          0.15611303 = weight(abstract_txt:classification in 615) [ClassicSimilarity], result of:
            0.15611303 = score(doc=615,freq=11.0), product of:
              0.21517898 = queryWeight, product of:
                2.7143307 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.019819021 = queryNorm
              0.72550315 = fieldWeight in 615, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0546875 = fieldNorm(doc=615)
        0.4 = coord(10/25)
    
  4. Dumais, S.T.: Latent semantic analysis (2003) 0.24
    0.23999143 = sum of:
      0.23999143 = product of:
        0.5454351 = sum of:
          0.023391819 = weight(abstract_txt:matching in 4463) [ClassicSimilarity], result of:
            0.023391819 = score(doc=4463,freq=1.0), product of:
              0.12350542 = queryWeight, product of:
                1.0281959 = boost
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.019819021 = queryNorm
              0.18939912 = fieldWeight in 4463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.060772 = idf(docFreq=270, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.00981914 = weight(abstract_txt:based in 4463) [ClassicSimilarity], result of:
            0.00981914 = score(doc=4463,freq=2.0), product of:
              0.06924031 = queryWeight, product of:
                1.088748 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.019819021 = queryNorm
              0.14181247 = fieldWeight in 4463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.036782 = weight(abstract_txt:specified in 4463) [ClassicSimilarity], result of:
            0.036782 = score(doc=4463,freq=1.0), product of:
              0.16700658 = queryWeight, product of:
                1.1956378 = boost
                7.04777 = idf(docFreq=100, maxDocs=42740)
                0.019819021 = queryNorm
              0.22024281 = fieldWeight in 4463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.04777 = idf(docFreq=100, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.021759199 = weight(abstract_txt:subject in 4463) [ClassicSimilarity], result of:
            0.021759199 = score(doc=4463,freq=3.0), product of:
              0.10281146 = queryWeight, product of:
                1.3266875 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.019819021 = queryNorm
              0.21164176 = fieldWeight in 4463, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.023660442 = weight(abstract_txt:term in 4463) [ClassicSimilarity], result of:
            0.023660442 = score(doc=4463,freq=1.0), product of:
              0.1567961 = queryWeight, product of:
                1.6383832 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.019819021 = queryNorm
              0.15089943 = fieldWeight in 4463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.11199956 = weight(abstract_txt:words in 4463) [ClassicSimilarity], result of:
            0.11199956 = score(doc=4463,freq=12.0), product of:
              0.19307993 = queryWeight, product of:
                1.8180944 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.019819021 = queryNorm
              0.58006835 = fieldWeight in 4463, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.064729236 = weight(abstract_txt:vocabulary in 4463) [ClassicSimilarity], result of:
            0.064729236 = score(doc=4463,freq=4.0), product of:
              0.19321181 = queryWeight, product of:
                1.8187152 = boost
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.019819021 = queryNorm
              0.33501697 = fieldWeight in 4463, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.03287183 = weight(abstract_txt:list in 4463) [ClassicSimilarity], result of:
            0.03287183 = score(doc=4463,freq=1.0), product of:
              0.19522522 = queryWeight, product of:
                1.8281668 = boost
                5.3881283 = idf(docFreq=530, maxDocs=42740)
                0.019819021 = queryNorm
              0.16837901 = fieldWeight in 4463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3881283 = idf(docFreq=530, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.097306736 = weight(abstract_txt:controlled in 4463) [ClassicSimilarity], result of:
            0.097306736 = score(doc=4463,freq=8.0), product of:
              0.201242 = queryWeight, product of:
                1.8561248 = boost
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.019819021 = queryNorm
              0.48353094 = fieldWeight in 4463, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.04988868 = weight(abstract_txt:problems in 4463) [ClassicSimilarity], result of:
            0.04988868 = score(doc=4463,freq=4.0), product of:
              0.18592244 = queryWeight, product of:
                2.18504 = boost
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.019819021 = queryNorm
              0.2683306 = fieldWeight in 4463, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
          0.07322639 = weight(abstract_txt:pages in 4463) [ClassicSimilarity], result of:
            0.07322639 = score(doc=4463,freq=1.0), product of:
              0.4195417 = queryWeight, product of:
                3.7900977 = boost
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.019819021 = queryNorm
              0.17453901 = fieldWeight in 4463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.03125 = fieldNorm(doc=4463)
        0.44 = coord(11/25)
    
  5. Wang, J.: Automatic thesaurus development : term extraction from title metadata (2006) 0.22
    0.22148342 = sum of:
      0.22148342 = product of:
        0.6152317 = sum of:
          0.043954283 = weight(abstract_txt:applying in 64) [ClassicSimilarity], result of:
            0.043954283 = score(doc=64,freq=1.0), product of:
              0.118474305 = queryWeight, product of:
                1.0070359 = boost
                5.936043 = idf(docFreq=306, maxDocs=42740)
                0.019819021 = queryNorm
              0.37100267 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.936043 = idf(docFreq=306, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.050162546 = weight(abstract_txt:extracted in 64) [ClassicSimilarity], result of:
            0.050162546 = score(doc=64,freq=1.0), product of:
              0.12938276 = queryWeight, product of:
                1.0523763 = boost
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.019819021 = queryNorm
              0.38770658 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.013886361 = weight(abstract_txt:based in 64) [ClassicSimilarity], result of:
            0.013886361 = score(doc=64,freq=1.0), product of:
              0.06924031 = queryWeight, product of:
                1.088748 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.019819021 = queryNorm
              0.20055313 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.035532624 = weight(abstract_txt:subject in 64) [ClassicSimilarity], result of:
            0.035532624 = score(doc=64,freq=2.0), product of:
              0.10281146 = queryWeight, product of:
                1.3266875 = boost
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.019819021 = queryNorm
              0.34560955 = fieldWeight in 64, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9101257 = idf(docFreq=2327, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.11199956 = weight(abstract_txt:words in 64) [ClassicSimilarity], result of:
            0.11199956 = score(doc=64,freq=3.0), product of:
              0.19307993 = queryWeight, product of:
                1.8180944 = boost
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.019819021 = queryNorm
              0.58006835 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358442 = idf(docFreq=546, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.112114325 = weight(abstract_txt:vocabulary in 64) [ClassicSimilarity], result of:
            0.112114325 = score(doc=64,freq=3.0), product of:
              0.19321181 = queryWeight, product of:
                1.8187152 = boost
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.019819021 = queryNorm
              0.5802664 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3602715 = idf(docFreq=545, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.11917592 = weight(abstract_txt:controlled in 64) [ClassicSimilarity], result of:
            0.11917592 = score(doc=64,freq=3.0), product of:
              0.201242 = queryWeight, product of:
                1.8561248 = boost
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.019819021 = queryNorm
              0.592202 = fieldWeight in 64, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.470528 = idf(docFreq=488, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.07461203 = weight(abstract_txt:automated in 64) [ClassicSimilarity], result of:
            0.07461203 = score(doc=64,freq=1.0), product of:
              0.21240884 = queryWeight, product of:
                1.9069273 = boost
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.019819021 = queryNorm
              0.35126612 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.620258 = idf(docFreq=420, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
          0.053794112 = weight(abstract_txt:classification in 64) [ClassicSimilarity], result of:
            0.053794112 = score(doc=64,freq=1.0), product of:
              0.21517898 = queryWeight, product of:
                2.7143307 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.019819021 = queryNorm
              0.24999705 = fieldWeight in 64, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0625 = fieldNorm(doc=64)
        0.36 = coord(9/25)