Document (#30898)

Author
Golub, K.
Title
Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations
Source
New review of hypermedia and multimedia. 12(2006) no.1, S.11-27
Year
2006
Abstract
The primary objective of this study was to identify and address problems of applying a controlled vocabulary in automated subject classification of textual Web pages, in the area of engineering. Web pages have special characteristics such as structural information, but are at the same time rather heterogeneous. The classification approach used comprises string-to-string matching between words in a term list extracted from the Ei (Engineering Information) thesaurus and classification scheme, and words in the text to be classified. Based on a sample of 70 Web pages, a number of problems with the term list are identified. Reasons for those problems are discussed and improvements proposed. Methods for implementing the improvements are also specified, suggesting further research.
Content
Beitrag eines Themenheftes "Knowledge organization systems and services"
Theme
Automatisches Klassifizieren
Field
Ingenieurwissenschaften

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 5600) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 5600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=5600)
    
  2. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 134) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 134, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=134)
    
  3. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 4558) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 4558, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=4558)
    
  4. Golub, K.: Subject access in Swedish discovery services (2018) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 4379) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 4379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=4379)
    
  5. Golub, K.: Automatic subject indexing of text (2019) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 5268) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 5268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=5268)
    

Similar documents (content)

  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.44
    0.44492173 = sum of:
      0.44492173 = product of:
        1.0111858 = sum of:
          0.065940045 = weight(abstract_txt:matching in 1461) [ClassicSimilarity], result of:
            0.065940045 = score(doc=1461,freq=2.0), product of:
              0.123352714 = queryWeight, product of:
                1.0347716 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.019710548 = queryNorm
              0.53456503 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.049256686 = weight(abstract_txt:extracted in 1461) [ClassicSimilarity], result of:
            0.049256686 = score(doc=1461,freq=1.0), product of:
              0.12794873 = queryWeight, product of:
                1.0538726 = boost
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.019710548 = queryNorm
              0.38497207 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.013657558 = weight(abstract_txt:based in 1461) [ClassicSimilarity], result of:
            0.013657558 = score(doc=1461,freq=1.0), product of:
              0.068546385 = queryWeight, product of:
                1.090881 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019710548 = queryNorm
              0.19924548 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.025141092 = weight(abstract_txt:subject in 1461) [ClassicSimilarity], result of:
            0.025141092 = score(doc=1461,freq=1.0), product of:
              0.10295765 = queryWeight, product of:
                1.3369477 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.019710548 = queryNorm
              0.24418867 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.046655007 = weight(abstract_txt:term in 1461) [ClassicSimilarity], result of:
            0.046655007 = score(doc=1461,freq=1.0), product of:
              0.15547767 = queryWeight, product of:
                1.6429303 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.019710548 = queryNorm
              0.3000753 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.1297051 = weight(abstract_txt:vocabulary in 1461) [ClassicSimilarity], result of:
            0.1297051 = score(doc=1461,freq=4.0), product of:
              0.19365136 = queryWeight, product of:
                1.8335611 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.019710548 = queryNorm
              0.66978663 = fieldWeight in 1461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.13639261 = weight(abstract_txt:controlled in 1461) [ClassicSimilarity], result of:
            0.13639261 = score(doc=1461,freq=4.0), product of:
              0.2002518 = queryWeight, product of:
                1.8645469 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.019710548 = queryNorm
              0.68110555 = fieldWeight in 1461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.10488135 = weight(abstract_txt:automated in 1461) [ClassicSimilarity], result of:
            0.10488135 = score(doc=1461,freq=2.0), product of:
              0.21176688 = queryWeight, product of:
                1.9174062 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.019710548 = queryNorm
              0.49526796 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.12643762 = weight(abstract_txt:engineering in 1461) [ClassicSimilarity], result of:
            0.12643762 = score(doc=1461,freq=2.0), product of:
              0.23987044 = queryWeight, product of:
                2.0406733 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.019710548 = queryNorm
              0.52710795 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.22021465 = weight(abstract_txt:string in 1461) [ClassicSimilarity], result of:
            0.22021465 = score(doc=1461,freq=2.0), product of:
              0.34723487 = queryWeight, product of:
                2.455256 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.019710548 = queryNorm
              0.6341951 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.092903994 = weight(abstract_txt:classification in 1461) [ClassicSimilarity], result of:
            0.092903994 = score(doc=1461,freq=3.0), product of:
              0.2149785 = queryWeight, product of:
                2.7321064 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019710548 = queryNorm
              0.4321548 = fieldWeight in 1461, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
        0.44 = coord(11/25)
    
  2. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.35
    0.3500679 = sum of:
      0.3500679 = product of:
        0.87516975 = sum of:
          0.065940045 = weight(abstract_txt:matching in 4558) [ClassicSimilarity], result of:
            0.065940045 = score(doc=4558,freq=2.0), product of:
              0.123352714 = queryWeight, product of:
                1.0347716 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.019710548 = queryNorm
              0.53456503 = fieldWeight in 4558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.047980722 = weight(abstract_txt:classified in 4558) [ClassicSimilarity], result of:
            0.047980722 = score(doc=4558,freq=1.0), product of:
              0.12572946 = queryWeight, product of:
                1.0446929 = boost
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.019710548 = queryNorm
              0.38161877 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1059003 = idf(docFreq=267, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.013657558 = weight(abstract_txt:based in 4558) [ClassicSimilarity], result of:
            0.013657558 = score(doc=4558,freq=1.0), product of:
              0.068546385 = queryWeight, product of:
                1.090881 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019710548 = queryNorm
              0.19924548 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.025141092 = weight(abstract_txt:subject in 4558) [ClassicSimilarity], result of:
            0.025141092 = score(doc=4558,freq=1.0), product of:
              0.10295765 = queryWeight, product of:
                1.3369477 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.019710548 = queryNorm
              0.24418867 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.11232791 = weight(abstract_txt:vocabulary in 4558) [ClassicSimilarity], result of:
            0.11232791 = score(doc=4558,freq=3.0), product of:
              0.19365136 = queryWeight, product of:
                1.8335611 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.019710548 = queryNorm
              0.58005226 = fieldWeight in 4558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.11811947 = weight(abstract_txt:controlled in 4558) [ClassicSimilarity], result of:
            0.11811947 = score(doc=4558,freq=3.0), product of:
              0.2002518 = queryWeight, product of:
                1.8645469 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.019710548 = queryNorm
              0.5898547 = fieldWeight in 4558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.1284529 = weight(abstract_txt:automated in 4558) [ClassicSimilarity], result of:
            0.1284529 = score(doc=4558,freq=3.0), product of:
              0.21176688 = queryWeight, product of:
                1.9174062 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.019710548 = queryNorm
              0.60657686 = fieldWeight in 4558, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.089697264 = weight(abstract_txt:textual in 4558) [ClassicSimilarity], result of:
            0.089697264 = score(doc=4558,freq=1.0), product of:
              0.2403931 = queryWeight, product of:
                2.0428953 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.019710548 = queryNorm
              0.37312746 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.22021465 = weight(abstract_txt:string in 4558) [ClassicSimilarity], result of:
            0.22021465 = score(doc=4558,freq=2.0), product of:
              0.34723487 = queryWeight, product of:
                2.455256 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.019710548 = queryNorm
              0.6341951 = fieldWeight in 4558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
          0.053638145 = weight(abstract_txt:classification in 4558) [ClassicSimilarity], result of:
            0.053638145 = score(doc=4558,freq=1.0), product of:
              0.2149785 = queryWeight, product of:
                2.7321064 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019710548 = queryNorm
              0.2495047 = fieldWeight in 4558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4558)
        0.4 = coord(10/25)
    
  3. Golub, K.; Lykke, M.: Automated classification of web pages in hierarchical browsing (2009) 0.28
    0.2760389 = sum of:
      0.2760389 = product of:
        0.6900972 = sum of:
          0.016900364 = weight(abstract_txt:based in 3614) [ClassicSimilarity], result of:
            0.016900364 = score(doc=3614,freq=2.0), product of:
              0.068546385 = queryWeight, product of:
                1.090881 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019710548 = queryNorm
              0.24655369 = fieldWeight in 3614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.021998456 = weight(abstract_txt:subject in 3614) [ClassicSimilarity], result of:
            0.021998456 = score(doc=3614,freq=1.0), product of:
              0.10295765 = queryWeight, product of:
                1.3369477 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.019710548 = queryNorm
              0.21366508 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.04082313 = weight(abstract_txt:term in 3614) [ClassicSimilarity], result of:
            0.04082313 = score(doc=3614,freq=1.0), product of:
              0.15547767 = queryWeight, product of:
                1.6429303 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.019710548 = queryNorm
              0.26256588 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.05657819 = weight(abstract_txt:words in 3614) [ClassicSimilarity], result of:
            0.05657819 = score(doc=3614,freq=1.0), product of:
              0.19326945 = queryWeight, product of:
                1.8317522 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.019710548 = queryNorm
              0.29274255 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.059671767 = weight(abstract_txt:controlled in 3614) [ClassicSimilarity], result of:
            0.059671767 = score(doc=3614,freq=1.0), product of:
              0.2002518 = queryWeight, product of:
                1.8645469 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.019710548 = queryNorm
              0.29798368 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.091771185 = weight(abstract_txt:automated in 3614) [ClassicSimilarity], result of:
            0.091771185 = score(doc=3614,freq=2.0), product of:
              0.21176688 = queryWeight, product of:
                1.9174062 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.019710548 = queryNorm
              0.43335947 = fieldWeight in 3614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.078229286 = weight(abstract_txt:engineering in 3614) [ClassicSimilarity], result of:
            0.078229286 = score(doc=3614,freq=1.0), product of:
              0.23987044 = queryWeight, product of:
                2.0406733 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.019710548 = queryNorm
              0.3261314 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.12455165 = weight(abstract_txt:improvements in 3614) [ClassicSimilarity], result of:
            0.12455165 = score(doc=3614,freq=2.0), product of:
              0.2595893 = queryWeight, product of:
                2.122895 = boost
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.019710548 = queryNorm
              0.47980267 = fieldWeight in 3614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.203826 = idf(docFreq=242, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.043912817 = weight(abstract_txt:problems in 3614) [ClassicSimilarity], result of:
            0.043912817 = score(doc=3614,freq=1.0), product of:
              0.186848 = queryWeight, product of:
                2.2058449 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.019710548 = queryNorm
              0.23501894 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.1556604 = weight(abstract_txt:classification in 3614) [ClassicSimilarity], result of:
            0.1556604 = score(doc=3614,freq=11.0), product of:
              0.2149785 = queryWeight, product of:
                2.7321064 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019710548 = queryNorm
              0.7240743 = fieldWeight in 3614, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
        0.4 = coord(10/25)
    
  4. Dumais, S.T.: Latent semantic analysis (2003) 0.24
    0.24003536 = sum of:
      0.24003536 = product of:
        0.5455349 = sum of:
          0.023313329 = weight(abstract_txt:matching in 2462) [ClassicSimilarity], result of:
            0.023313329 = score(doc=2462,freq=1.0), product of:
              0.123352714 = queryWeight, product of:
                1.0347716 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.019710548 = queryNorm
              0.18899728 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.009657351 = weight(abstract_txt:based in 2462) [ClassicSimilarity], result of:
            0.009657351 = score(doc=2462,freq=2.0), product of:
              0.068546385 = queryWeight, product of:
                1.090881 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019710548 = queryNorm
              0.14088783 = fieldWeight in 2462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.036816895 = weight(abstract_txt:specified in 2462) [ClassicSimilarity], result of:
            0.036816895 = score(doc=2462,freq=1.0), product of:
              0.16727997 = queryWeight, product of:
                1.205014 = boost
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.019710548 = queryNorm
              0.22009146 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.021772824 = weight(abstract_txt:subject in 2462) [ClassicSimilarity], result of:
            0.021772824 = score(doc=2462,freq=3.0), product of:
              0.10295765 = queryWeight, product of:
                1.3369477 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.019710548 = queryNorm
              0.21147358 = fieldWeight in 2462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.023327503 = weight(abstract_txt:term in 2462) [ClassicSimilarity], result of:
            0.023327503 = score(doc=2462,freq=1.0), product of:
              0.15547767 = queryWeight, product of:
                1.6429303 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.019710548 = queryNorm
              0.15003765 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.11199578 = weight(abstract_txt:words in 2462) [ClassicSimilarity], result of:
            0.11199578 = score(doc=2462,freq=12.0), product of:
              0.19326945 = queryWeight, product of:
                1.8317522 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.019710548 = queryNorm
              0.57948 = fieldWeight in 2462, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.06485255 = weight(abstract_txt:vocabulary in 2462) [ClassicSimilarity], result of:
            0.06485255 = score(doc=2462,freq=4.0), product of:
              0.19365136 = queryWeight, product of:
                1.8335611 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.019710548 = queryNorm
              0.33489332 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.03291635 = weight(abstract_txt:list in 2462) [ClassicSimilarity], result of:
            0.03291635 = score(doc=2462,freq=1.0), product of:
              0.19559765 = queryWeight, product of:
                1.8427521 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.019710548 = queryNorm
              0.16828601 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.096444145 = weight(abstract_txt:controlled in 2462) [ClassicSimilarity], result of:
            0.096444145 = score(doc=2462,freq=8.0), product of:
              0.2002518 = queryWeight, product of:
                1.8645469 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.019710548 = queryNorm
              0.48161435 = fieldWeight in 2462, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.05018608 = weight(abstract_txt:problems in 2462) [ClassicSimilarity], result of:
            0.05018608 = score(doc=2462,freq=4.0), product of:
              0.186848 = queryWeight, product of:
                2.2058449 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.019710548 = queryNorm
              0.26859307 = fieldWeight in 2462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
          0.07425209 = weight(abstract_txt:pages in 2462) [ClassicSimilarity], result of:
            0.07425209 = score(doc=2462,freq=1.0), product of:
              0.4238755 = queryWeight, product of:
                3.8363593 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.019710548 = queryNorm
              0.1751743 = fieldWeight in 2462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.03125 = fieldNorm(doc=2462)
        0.44 = coord(11/25)
    
  5. Wang, J.: Automatic thesaurus development : term extraction from title metadata (2006) 0.22
    0.22030303 = sum of:
      0.22030303 = product of:
        0.61195284 = sum of:
          0.043240093 = weight(abstract_txt:applying in 5063) [ClassicSimilarity], result of:
            0.043240093 = score(doc=5063,freq=1.0), product of:
              0.117305115 = queryWeight, product of:
                1.009087 = boost
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.019710548 = queryNorm
              0.36861217 = fieldWeight in 5063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.049256686 = weight(abstract_txt:extracted in 5063) [ClassicSimilarity], result of:
            0.049256686 = score(doc=5063,freq=1.0), product of:
              0.12794873 = queryWeight, product of:
                1.0538726 = boost
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.019710548 = queryNorm
              0.38497207 = fieldWeight in 5063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.013657558 = weight(abstract_txt:based in 5063) [ClassicSimilarity], result of:
            0.013657558 = score(doc=5063,freq=1.0), product of:
              0.068546385 = queryWeight, product of:
                1.090881 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019710548 = queryNorm
              0.19924548 = fieldWeight in 5063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.03555487 = weight(abstract_txt:subject in 5063) [ClassicSimilarity], result of:
            0.03555487 = score(doc=5063,freq=2.0), product of:
              0.10295765 = queryWeight, product of:
                1.3369477 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.019710548 = queryNorm
              0.34533492 = fieldWeight in 5063, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.11199578 = weight(abstract_txt:words in 5063) [ClassicSimilarity], result of:
            0.11199578 = score(doc=5063,freq=3.0), product of:
              0.19326945 = queryWeight, product of:
                1.8317522 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.019710548 = queryNorm
              0.57948 = fieldWeight in 5063, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.11232791 = weight(abstract_txt:vocabulary in 5063) [ClassicSimilarity], result of:
            0.11232791 = score(doc=5063,freq=3.0), product of:
              0.19365136 = queryWeight, product of:
                1.8335611 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.019710548 = queryNorm
              0.58005226 = fieldWeight in 5063, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.11811947 = weight(abstract_txt:controlled in 5063) [ClassicSimilarity], result of:
            0.11811947 = score(doc=5063,freq=3.0), product of:
              0.2002518 = queryWeight, product of:
                1.8645469 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.019710548 = queryNorm
              0.5898547 = fieldWeight in 5063, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.07416231 = weight(abstract_txt:automated in 5063) [ClassicSimilarity], result of:
            0.07416231 = score(doc=5063,freq=1.0), product of:
              0.21176688 = queryWeight, product of:
                1.9174062 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.019710548 = queryNorm
              0.35020733 = fieldWeight in 5063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
          0.053638145 = weight(abstract_txt:classification in 5063) [ClassicSimilarity], result of:
            0.053638145 = score(doc=5063,freq=1.0), product of:
              0.2149785 = queryWeight, product of:
                2.7321064 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.019710548 = queryNorm
              0.2495047 = fieldWeight in 5063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=5063)
        0.36 = coord(9/25)