Document (#36559)

Author
Golub, K.
Title
Automated subject classification of textual documents in the context of Web-based hierarchical browsing
Source
Knowledge organization. 38(2011) no.3, S.230-244
Year
2011
Abstract
While automated methods for information organization have been around for several decades now, exponential growth of the World Wide Web has put them into the forefront of research in different communities, within which several approaches can be identified: 1) machine learning (algorithms that allow computers to improve their performance based on learning from pre-existing data); 2) document clustering (algorithms for unsupervised document organization and automated topic extraction); and 3) string matching (algorithms that match given strings within larger text). Here the aim was to automatically organize textual documents into hierarchical structures for subject browsing. The string-matching approach was tested using a controlled vocabulary (containing pre-selected and pre-defined authorized terms, each corresponding to only one concept). The results imply that an appropriate controlled vocabulary, with a sufficient number of entry terms designating classes, could in itself be a solution for automated classification. Then, if the same controlled vocabulary had an appropriat hierarchical structure, it would at the same time provide a good browsing structure for the collection of automatically classified documents.
Content
Vgl.: http://www.ergon-verlag.de/isko_ko/downloads/ko_38_2011_3_d.pdf.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 5600) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 5600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=5600)
    
  2. Golub, K.: Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations (2006) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 5897) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 5897, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=5897)
    
  3. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 134) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 134, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=134)
    
  4. Golub, K.: Subject access in Swedish discovery services (2018) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 4379) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 4379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=4379)
    
  5. Golub, K.: Automatic subject indexing of text (2019) 5.30
    5.298757 = sum of:
      5.298757 = weight(author_txt:golub in 5268) [ClassicSimilarity], result of:
        5.298757 = fieldWeight in 5268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.625 = fieldNorm(doc=5268)
    

Similar documents (content)

  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.66
    0.66210544 = sum of:
      0.66210544 = product of:
        1.2732798 = sum of:
          0.021696817 = weight(abstract_txt:subject in 1461) [ClassicSimilarity], result of:
            0.021696817 = score(doc=1461,freq=1.0), product of:
              0.08885268 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.022656908 = queryNorm
              0.24418867 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.04008818 = weight(abstract_txt:classification in 1461) [ClassicSimilarity], result of:
            0.04008818 = score(doc=1461,freq=3.0), product of:
              0.092763476 = queryWeight, product of:
                1.0255991 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022656908 = queryNorm
              0.4321548 = fieldWeight in 1461, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.053794157 = weight(abstract_txt:terms in 1461) [ClassicSimilarity], result of:
            0.053794157 = score(doc=1461,freq=5.0), product of:
              0.09518603 = queryWeight, product of:
                1.0389048 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022656908 = queryNorm
              0.5651476 = fieldWeight in 1461, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.034217715 = weight(abstract_txt:several in 1461) [ClassicSimilarity], result of:
            0.034217715 = score(doc=1461,freq=1.0), product of:
              0.12038541 = queryWeight, product of:
                1.1683583 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.022656908 = queryNorm
              0.28423473 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.055168975 = weight(abstract_txt:learning in 1461) [ClassicSimilarity], result of:
            0.055168975 = score(doc=1461,freq=2.0), product of:
              0.13137916 = queryWeight, product of:
                1.220541 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.022656908 = queryNorm
              0.41992182 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.061085474 = weight(abstract_txt:automatically in 1461) [ClassicSimilarity], result of:
            0.061085474 = score(doc=1461,freq=1.0), product of:
              0.17715979 = queryWeight, product of:
                1.4173324 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022656908 = queryNorm
              0.3448044 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.11381283 = weight(abstract_txt:matching in 1461) [ClassicSimilarity], result of:
            0.11381283 = score(doc=1461,freq=2.0), product of:
              0.21290736 = queryWeight, product of:
                1.553762 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.022656908 = queryNorm
              0.53456503 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.07639842 = weight(abstract_txt:documents in 1461) [ClassicSimilarity], result of:
            0.07639842 = score(doc=1461,freq=4.0), product of:
              0.14829956 = queryWeight, product of:
                1.5881982 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022656908 = queryNorm
              0.5151628 = fieldWeight in 1461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.19004577 = weight(abstract_txt:string in 1461) [ClassicSimilarity], result of:
            0.19004577 = score(doc=1461,freq=2.0), product of:
              0.29966453 = queryWeight, product of:
                1.8433458 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.022656908 = queryNorm
              0.6341951 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.16790369 = weight(abstract_txt:vocabulary in 1461) [ClassicSimilarity], result of:
            0.16790369 = score(doc=1461,freq=4.0), product of:
              0.25068235 = queryWeight, product of:
                2.0648887 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.022656908 = queryNorm
              0.66978663 = fieldWeight in 1461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.17656071 = weight(abstract_txt:controlled in 1461) [ClassicSimilarity], result of:
            0.17656071 = score(doc=1461,freq=4.0), product of:
              0.25922665 = queryWeight, product of:
                2.099784 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.022656908 = queryNorm
              0.68110555 = fieldWeight in 1461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.101481244 = weight(abstract_txt:algorithms in 1461) [ClassicSimilarity], result of:
            0.101481244 = score(doc=1461,freq=1.0), product of:
              0.2844641 = queryWeight, product of:
                2.1996243 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022656908 = queryNorm
              0.35674536 = fieldWeight in 1461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
          0.1810257 = weight(abstract_txt:automated in 1461) [ClassicSimilarity], result of:
            0.1810257 = score(doc=1461,freq=2.0), product of:
              0.3655106 = queryWeight, product of:
                2.8790827 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.022656908 = queryNorm
              0.49526796 = fieldWeight in 1461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=1461)
        0.52 = coord(13/25)
    
  2. Golub, K.: Automatic subject indexing of text (2019) 0.35
    0.34551007 = sum of:
      0.34551007 = product of:
        0.71981263 = sum of:
          0.048515562 = weight(abstract_txt:subject in 5268) [ClassicSimilarity], result of:
            0.048515562 = score(doc=5268,freq=5.0), product of:
              0.08885268 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.022656908 = queryNorm
              0.5460225 = fieldWeight in 5268, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.04008818 = weight(abstract_txt:classification in 5268) [ClassicSimilarity], result of:
            0.04008818 = score(doc=5268,freq=3.0), product of:
              0.092763476 = queryWeight, product of:
                1.0255991 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022656908 = queryNorm
              0.4321548 = fieldWeight in 5268, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.04811495 = weight(abstract_txt:terms in 5268) [ClassicSimilarity], result of:
            0.04811495 = score(doc=5268,freq=4.0), product of:
              0.09518603 = queryWeight, product of:
                1.0389048 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022656908 = queryNorm
              0.5054833 = fieldWeight in 5268, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.057551064 = weight(abstract_txt:document in 5268) [ClassicSimilarity], result of:
            0.057551064 = score(doc=5268,freq=4.0), product of:
              0.10725612 = queryWeight, product of:
                1.1028087 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.022656908 = queryNorm
              0.53657603 = fieldWeight in 5268, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.03213122 = weight(abstract_txt:organization in 5268) [ClassicSimilarity], result of:
            0.03213122 = score(doc=5268,freq=1.0), product of:
              0.11544044 = queryWeight, product of:
                1.1441109 = boost
                4.4533744 = idf(docFreq=1398, maxDocs=44218)
                0.022656908 = queryNorm
              0.2783359 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4533744 = idf(docFreq=1398, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.038774874 = weight(abstract_txt:same in 5268) [ClassicSimilarity], result of:
            0.038774874 = score(doc=5268,freq=1.0), product of:
              0.13084991 = queryWeight, product of:
                1.2180802 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.022656908 = queryNorm
              0.2963309 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.039010357 = weight(abstract_txt:learning in 5268) [ClassicSimilarity], result of:
            0.039010357 = score(doc=5268,freq=1.0), product of:
              0.13137916 = queryWeight, product of:
                1.220541 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.022656908 = queryNorm
              0.29692957 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.061085474 = weight(abstract_txt:automatically in 5268) [ClassicSimilarity], result of:
            0.061085474 = score(doc=5268,freq=1.0), product of:
              0.17715979 = queryWeight, product of:
                1.4173324 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022656908 = queryNorm
              0.3448044 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.08047783 = weight(abstract_txt:matching in 5268) [ClassicSimilarity], result of:
            0.08047783 = score(doc=5268,freq=1.0), product of:
              0.21290736 = queryWeight, product of:
                1.553762 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.022656908 = queryNorm
              0.37799457 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.03819921 = weight(abstract_txt:documents in 5268) [ClassicSimilarity], result of:
            0.03819921 = score(doc=5268,freq=1.0), product of:
              0.14829956 = queryWeight, product of:
                1.5881982 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022656908 = queryNorm
              0.2575814 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.13438265 = weight(abstract_txt:string in 5268) [ClassicSimilarity], result of:
            0.13438265 = score(doc=5268,freq=1.0), product of:
              0.29966453 = queryWeight, product of:
                1.8433458 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.022656908 = queryNorm
              0.44844365 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
          0.101481244 = weight(abstract_txt:algorithms in 5268) [ClassicSimilarity], result of:
            0.101481244 = score(doc=5268,freq=1.0), product of:
              0.2844641 = queryWeight, product of:
                2.1996243 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022656908 = queryNorm
              0.35674536 = fieldWeight in 5268, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=5268)
        0.48 = coord(12/25)
    
  3. Golub, K.: Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations (2006) 0.34
    0.33692813 = sum of:
      0.33692813 = product of:
        0.9359114 = sum of:
          0.027121022 = weight(abstract_txt:subject in 5897) [ClassicSimilarity], result of:
            0.027121022 = score(doc=5897,freq=1.0), product of:
              0.08885268 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.022656908 = queryNorm
              0.30523583 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.05011023 = weight(abstract_txt:classification in 5897) [ClassicSimilarity], result of:
            0.05011023 = score(doc=5897,freq=3.0), product of:
              0.092763476 = queryWeight, product of:
                1.0255991 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022656908 = queryNorm
              0.5401935 = fieldWeight in 5897, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.04846859 = weight(abstract_txt:same in 5897) [ClassicSimilarity], result of:
            0.04846859 = score(doc=5897,freq=1.0), product of:
              0.13084991 = queryWeight, product of:
                1.2180802 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.022656908 = queryNorm
              0.37041363 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.09676119 = weight(abstract_txt:textual in 5897) [ClassicSimilarity], result of:
            0.09676119 = score(doc=5897,freq=1.0), product of:
              0.2074598 = queryWeight, product of:
                1.5337555 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.022656908 = queryNorm
              0.46640933 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.10059728 = weight(abstract_txt:matching in 5897) [ClassicSimilarity], result of:
            0.10059728 = score(doc=5897,freq=1.0), product of:
              0.21290736 = queryWeight, product of:
                1.553762 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.022656908 = queryNorm
              0.4724932 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.23755722 = weight(abstract_txt:string in 5897) [ClassicSimilarity], result of:
            0.23755722 = score(doc=5897,freq=2.0), product of:
              0.29966453 = queryWeight, product of:
                1.8433458 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.022656908 = queryNorm
              0.79274386 = fieldWeight in 5897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.10493981 = weight(abstract_txt:vocabulary in 5897) [ClassicSimilarity], result of:
            0.10493981 = score(doc=5897,freq=1.0), product of:
              0.25068235 = queryWeight, product of:
                2.0648887 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.022656908 = queryNorm
              0.41861665 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.110350445 = weight(abstract_txt:controlled in 5897) [ClassicSimilarity], result of:
            0.110350445 = score(doc=5897,freq=1.0), product of:
              0.25922665 = queryWeight, product of:
                2.099784 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.022656908 = queryNorm
              0.42569098 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
          0.16000561 = weight(abstract_txt:automated in 5897) [ClassicSimilarity], result of:
            0.16000561 = score(doc=5897,freq=1.0), product of:
              0.3655106 = queryWeight, product of:
                2.8790827 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.022656908 = queryNorm
              0.43775916 = fieldWeight in 5897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.078125 = fieldNorm(doc=5897)
        0.36 = coord(9/25)
    
  4. Golub, K.; Lykke, M.: Automated classification of web pages in hierarchical browsing (2009) 0.27
    0.27416638 = sum of:
      0.27416638 = product of:
        0.7615732 = sum of:
          0.018984716 = weight(abstract_txt:subject in 3614) [ClassicSimilarity], result of:
            0.018984716 = score(doc=3614,freq=1.0), product of:
              0.08885268 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.022656908 = queryNorm
              0.21366508 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.06716765 = weight(abstract_txt:classification in 3614) [ClassicSimilarity], result of:
            0.06716765 = score(doc=3614,freq=11.0), product of:
              0.092763476 = queryWeight, product of:
                1.0255991 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.022656908 = queryNorm
              0.7240743 = fieldWeight in 3614, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.021050291 = weight(abstract_txt:terms in 3614) [ClassicSimilarity], result of:
            0.021050291 = score(doc=3614,freq=1.0), product of:
              0.09518603 = queryWeight, product of:
                1.0389048 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022656908 = queryNorm
              0.22114895 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.033928014 = weight(abstract_txt:same in 3614) [ClassicSimilarity], result of:
            0.033928014 = score(doc=3614,freq=1.0), product of:
              0.13084991 = queryWeight, product of:
                1.2180802 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.022656908 = queryNorm
              0.25928953 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.07558942 = weight(abstract_txt:automatically in 3614) [ClassicSimilarity], result of:
            0.07558942 = score(doc=3614,freq=2.0), product of:
              0.17715979 = queryWeight, product of:
                1.4173324 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022656908 = queryNorm
              0.42667368 = fieldWeight in 3614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.07724531 = weight(abstract_txt:controlled in 3614) [ClassicSimilarity], result of:
            0.07724531 = score(doc=3614,freq=1.0), product of:
              0.25922665 = queryWeight, product of:
                2.099784 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.022656908 = queryNorm
              0.29798368 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.21934512 = weight(abstract_txt:browsing in 3614) [ClassicSimilarity], result of:
            0.21934512 = score(doc=3614,freq=7.0), product of:
              0.27173832 = queryWeight, product of:
                2.1498601 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.022656908 = queryNorm
              0.80719244 = fieldWeight in 3614, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.089865126 = weight(abstract_txt:hierarchical in 3614) [ClassicSimilarity], result of:
            0.089865126 = score(doc=3614,freq=1.0), product of:
              0.2867427 = queryWeight, product of:
                2.2084165 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.022656908 = queryNorm
              0.31339988 = fieldWeight in 3614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
          0.15839748 = weight(abstract_txt:automated in 3614) [ClassicSimilarity], result of:
            0.15839748 = score(doc=3614,freq=2.0), product of:
              0.3655106 = queryWeight, product of:
                2.8790827 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.022656908 = queryNorm
              0.43335947 = fieldWeight in 3614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3614)
        0.36 = coord(9/25)
    
  5. Chen, H.; Houston, A.L.; Sewell, R.R.; Schatz, B.R.: Internet browsing and searching : user evaluations of category map and concept space techniques (1998) 0.23
    0.23311643 = sum of:
      0.23311643 = product of:
        0.64754564 = sum of:
          0.036386672 = weight(abstract_txt:subject in 869) [ClassicSimilarity], result of:
            0.036386672 = score(doc=869,freq=5.0), product of:
              0.08885268 = queryWeight, product of:
                1.0037473 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.022656908 = queryNorm
              0.40951687 = fieldWeight in 869, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.047737572 = weight(abstract_txt:terms in 869) [ClassicSimilarity], result of:
            0.047737572 = score(doc=869,freq=7.0), product of:
              0.09518603 = queryWeight, product of:
                1.0389048 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.022656908 = queryNorm
              0.50151867 = fieldWeight in 869, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.024098413 = weight(abstract_txt:organization in 869) [ClassicSimilarity], result of:
            0.024098413 = score(doc=869,freq=1.0), product of:
              0.11544044 = queryWeight, product of:
                1.1441109 = boost
                4.4533744 = idf(docFreq=1398, maxDocs=44218)
                0.022656908 = queryNorm
              0.20875192 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4533744 = idf(docFreq=1398, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.06479093 = weight(abstract_txt:automatically in 869) [ClassicSimilarity], result of:
            0.06479093 = score(doc=869,freq=2.0), product of:
              0.17715979 = queryWeight, product of:
                1.4173324 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.022656908 = queryNorm
              0.3657203 = fieldWeight in 869, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.028649408 = weight(abstract_txt:documents in 869) [ClassicSimilarity], result of:
            0.028649408 = score(doc=869,freq=1.0), product of:
              0.14829956 = queryWeight, product of:
                1.5881982 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.022656908 = queryNorm
              0.19318606 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.06296388 = weight(abstract_txt:vocabulary in 869) [ClassicSimilarity], result of:
            0.06296388 = score(doc=869,freq=1.0), product of:
              0.25068235 = queryWeight, product of:
                2.0648887 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.022656908 = queryNorm
              0.25116998 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.17406353 = weight(abstract_txt:browsing in 869) [ClassicSimilarity], result of:
            0.17406353 = score(doc=869,freq=6.0), product of:
              0.27173832 = queryWeight, product of:
                2.1498601 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.022656908 = queryNorm
              0.64055574 = fieldWeight in 869, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.13182801 = weight(abstract_txt:algorithms in 869) [ClassicSimilarity], result of:
            0.13182801 = score(doc=869,freq=3.0), product of:
              0.2844641 = queryWeight, product of:
                2.1996243 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.022656908 = queryNorm
              0.46342582 = fieldWeight in 869, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
          0.077027254 = weight(abstract_txt:hierarchical in 869) [ClassicSimilarity], result of:
            0.077027254 = score(doc=869,freq=1.0), product of:
              0.2867427 = queryWeight, product of:
                2.2084165 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.022656908 = queryNorm
              0.26862848 = fieldWeight in 869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.046875 = fieldNorm(doc=869)
        0.36 = coord(9/25)