Document (#34698)

Author
Gauch, S.
Chandramouli, A.
Ranganathan, S.
Title
Training a hierarchical classifier using inter document relationships
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.1, pp. 47-58
Year
2009
Abstract
Text classifiers automatically classify documents into appropriate concepts for different applications. Most classification approaches use flat classifiers that treat each concept as independent, even when the concept space is hierarchically structured. In contrast, hierarchical text classification exploits the structural relationships between the concepts. In this article, we explore the effectiveness of hierarchical classification for a large concept hierarchy. Since the quality of the classification is dependent on the quality and quantity of the training data, we evaluate the use of documents selected from subconcepts to address the sparseness of training data for the top-level classifiers and the use of document relationships to identify the most representative training documents. By selecting training documents using structural and similarity relationships, we achieve a statistically significant improvement of 39.8% (from 54.5% to 76.2%) in the accuracy of the hierarchical classifier over that of the flat classifier for a large, three-level concept hierarchy.
Theme
Automatic classification

Similar documents (author)

  1. Gauch, S.: Intelligent information retrieval : an introduction (1992) 2.49
    2.4938753 = sum of:
      2.4938753 = product of:
        4.9877505 = sum of:
          4.9877505 = weight(author_txt:gauch in 503) [ClassicSimilarity], result of:
            4.9877505 = score(doc=503,freq=1.0), product of:
              0.81842065 = queryWeight, product of:
                1.1934332 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07032833 = queryNorm
              6.094361 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.625 = fieldNorm(doc=503)
        0.5 = coord(1/2)
    
  2. Gauch, S.; Smith, J.B.: An expert system for automatic query reformulation (1993) 2.00
    1.9951004 = sum of:
      1.9951004 = product of:
        3.9902008 = sum of:
          3.9902008 = weight(author_txt:gauch in 3693) [ClassicSimilarity], result of:
            3.9902008 = score(doc=3693,freq=1.0), product of:
              0.81842065 = queryWeight, product of:
                1.1934332 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07032833 = queryNorm
              4.8754888 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=3693)
        0.5 = coord(1/2)
    
  3. Gauch, S.; Chong, M.K.: Automatic word similarity detection for TREC 4 query expansion (1996) 2.00
    1.9951004 = sum of:
      1.9951004 = product of:
        3.9902008 = sum of:
          3.9902008 = weight(author_txt:gauch in 2991) [ClassicSimilarity], result of:
            3.9902008 = score(doc=2991,freq=1.0), product of:
              0.81842065 = queryWeight, product of:
                1.1934332 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07032833 = queryNorm
              4.8754888 = fieldWeight in 2991, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=2991)
        0.5 = coord(1/2)
    
  4. Gauch, S.; Wang, J.: Corpus analysis for TREC 5 query expansion (1997) 2.00
    1.9951004 = sum of:
      1.9951004 = product of:
        3.9902008 = sum of:
          3.9902008 = weight(author_txt:gauch in 5800) [ClassicSimilarity], result of:
            3.9902008 = score(doc=5800,freq=1.0), product of:
              0.81842065 = queryWeight, product of:
                1.1934332 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07032833 = queryNorm
              4.8754888 = fieldWeight in 5800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=5800)
        0.5 = coord(1/2)
    
  5. Haverkamp, D.S.; Gauch, S.: Intelligent information agents : review and challenges for distributed information sources (1998) 2.00
    1.9951004 = sum of:
      1.9951004 = product of:
        3.9902008 = sum of:
          3.9902008 = weight(author_txt:gauch in 2882) [ClassicSimilarity], result of:
            3.9902008 = score(doc=2882,freq=1.0), product of:
              0.81842065 = queryWeight, product of:
                1.1934332 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.07032833 = queryNorm
              4.8754888 = fieldWeight in 2882, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=2882)
        0.5 = coord(1/2)
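
The author-match explanations above all follow one pattern: a single-term score is queryWeight times fieldWeight, scaled by coord. The sketch below recomputes the first hit (doc 503) from the raw statistics in the listing. It assumes the standard Lucene ClassicSimilarity formulas, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the function names are illustrative and not part of any library API. Lucene computes these values in 32-bit floats, so the last digits may differ slightly from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(boost: float, query_norm: float, field_norm: float,
               freq: float, doc_freq: int, max_docs: int) -> float:
    """Score of one term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # 0.81842065 in the listing
    field_weight = classic_tf(freq) * idf * field_norm  # 6.094361 for doc 503
    return query_weight * field_weight


# Values copied from the explanation for doc 503 (author_txt:gauch).
raw_503 = term_score(boost=1.1934332, query_norm=0.07032833, field_norm=0.625,
                     freq=1.0, doc_freq=6, max_docs=44218)

# coord(1/2): only one of the two query clauses matched.
score_503 = raw_503 * (1 / 2)
print(round(score_503, 4))  # the listing reports 2.4938753
```

Hits 2 to 5 differ from hit 1 only in fieldNorm (0.5 instead of 0.625), which is why they all share the lower score of 1.9951004.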
    

Similar documents (content)

  1. Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001) 0.45
    0.44538987 = sum of:
      0.44538987 = product of:
        1.3918433 = sum of:
          0.01855474 = weight(abstract_txt:using in 1595) [ClassicSimilarity], result of:
            0.01855474 = score(doc=1595,freq=1.0), product of:
              0.05715 = queryWeight, product of:
                1.0153257 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.016253373 = queryNorm
              0.32466736 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.11262477 = weight(abstract_txt:exploits in 1595) [ClassicSimilarity], result of:
            0.11262477 = score(doc=1595,freq=1.0), product of:
              0.1509358 = queryWeight, product of:
                1.1667515 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.016253373 = queryNorm
              0.74617666 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.04177902 = weight(abstract_txt:text in 1595) [ClassicSimilarity], result of:
            0.04177902 = score(doc=1595,freq=2.0), product of:
              0.0779247 = queryWeight, product of:
                1.1855909 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016253373 = queryNorm
              0.53614604 = fieldWeight in 1595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.23859225 = weight(abstract_txt:flat in 1595) [ClassicSimilarity], result of:
            0.23859225 = score(doc=1595,freq=1.0), product of:
              0.31367788 = queryWeight, product of:
                2.3786974 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.016253373 = queryNorm
              0.7606282 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.25457555 = weight(abstract_txt:classifier in 1595) [ClassicSimilarity], result of:
            0.25457555 = score(doc=1595,freq=1.0), product of:
              0.37493375 = queryWeight, product of:
                3.1850786 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.016253373 = queryNorm
              0.6789881 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.28525332 = weight(abstract_txt:classifiers in 1595) [ClassicSimilarity], result of:
            0.28525332 = score(doc=1595,freq=1.0), product of:
              0.40448013 = queryWeight, product of:
                3.308198 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.016253373 = queryNorm
              0.7052344 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.29125652 = weight(abstract_txt:hierarchical in 1595) [ClassicSimilarity], result of:
            0.29125652 = score(doc=1595,freq=3.0), product of:
              0.31299183 = queryWeight, product of:
                3.3603053 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.016253373 = queryNorm
              0.9305563 = fieldWeight in 1595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
          0.14920717 = weight(abstract_txt:training in 1595) [ClassicSimilarity], result of:
            0.14920717 = score(doc=1595,freq=1.0), product of:
              0.31132892 = queryWeight, product of:
                3.7469423 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.016253373 = queryNorm
              0.47925898 = fieldWeight in 1595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.09375 = fieldNorm(doc=1595)
        0.32 = coord(8/25)
    
  2. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.33
    0.33479548 = sum of:
      0.33479548 = product of:
        0.92998743 = sum of:
          0.02186697 = weight(abstract_txt:using in 87) [ClassicSimilarity], result of:
            0.02186697 = score(doc=87,freq=2.0), product of:
              0.05715 = queryWeight, product of:
                1.0153257 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.016253373 = queryNorm
              0.38262415 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.022833934 = weight(abstract_txt:most in 87) [ClassicSimilarity], result of:
            0.022833934 = score(doc=87,freq=1.0), product of:
              0.07411185 = queryWeight, product of:
                1.1562216 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.016253373 = queryNorm
              0.308101 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.042640533 = weight(abstract_txt:text in 87) [ClassicSimilarity], result of:
            0.042640533 = score(doc=87,freq=3.0), product of:
              0.0779247 = queryWeight, product of:
                1.1855909 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016253373 = queryNorm
              0.54720175 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.05303284 = weight(abstract_txt:quality in 87) [ClassicSimilarity], result of:
            0.05303284 = score(doc=87,freq=2.0), product of:
              0.103162155 = queryWeight, product of:
                1.3641354 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.016253373 = queryNorm
              0.51407266 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.047369376 = weight(abstract_txt:classification in 87) [ClassicSimilarity], result of:
            0.047369376 = score(doc=87,freq=1.0), product of:
              0.1518829 = queryWeight, product of:
                2.340813 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016253373 = queryNorm
              0.3118809 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.11654404 = weight(abstract_txt:documents in 87) [ClassicSimilarity], result of:
            0.11654404 = score(doc=87,freq=5.0), product of:
              0.16187526 = queryWeight, product of:
                2.4165874 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016253373 = queryNorm
              0.719962 = fieldWeight in 87, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.2121463 = weight(abstract_txt:classifier in 87) [ClassicSimilarity], result of:
            0.2121463 = score(doc=87,freq=1.0), product of:
              0.37493375 = queryWeight, product of:
                3.1850786 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.016253373 = queryNorm
              0.56582344 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.23771107 = weight(abstract_txt:classifiers in 87) [ClassicSimilarity], result of:
            0.23771107 = score(doc=87,freq=1.0), product of:
              0.40448013 = queryWeight, product of:
                3.308198 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.016253373 = queryNorm
              0.5876953 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.17584234 = weight(abstract_txt:training in 87) [ClassicSimilarity], result of:
            0.17584234 = score(doc=87,freq=2.0), product of:
              0.31132892 = queryWeight, product of:
                3.7469423 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.016253373 = queryNorm
              0.5648121 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
        0.36 = coord(9/25)
    
  3. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy (2007) 0.31
    0.3140919 = sum of:
      0.3140919 = product of:
        0.7852297 = sum of:
          0.02186697 = weight(abstract_txt:using in 4797) [ClassicSimilarity], result of:
            0.02186697 = score(doc=4797,freq=2.0), product of:
              0.05715 = queryWeight, product of:
                1.0153257 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.016253373 = queryNorm
              0.38262415 = fieldWeight in 4797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.022833934 = weight(abstract_txt:most in 4797) [ClassicSimilarity], result of:
            0.022833934 = score(doc=4797,freq=1.0), product of:
              0.07411185 = queryWeight, product of:
                1.1562216 = boost
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.016253373 = queryNorm
              0.308101 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.943693 = idf(docFreq=2328, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.034815848 = weight(abstract_txt:text in 4797) [ClassicSimilarity], result of:
            0.034815848 = score(doc=4797,freq=2.0), product of:
              0.0779247 = queryWeight, product of:
                1.1855909 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016253373 = queryNorm
              0.44678837 = fieldWeight in 4797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.02944661 = weight(abstract_txt:document in 4797) [ClassicSimilarity], result of:
            0.02944661 = score(doc=4797,freq=1.0), product of:
              0.087805964 = queryWeight, product of:
                1.2585175 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016253373 = queryNorm
              0.33536002 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.032896392 = weight(abstract_txt:large in 4797) [ClassicSimilarity], result of:
            0.032896392 = score(doc=4797,freq=1.0), product of:
              0.09453645 = queryWeight, product of:
                1.3058609 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.016253373 = queryNorm
              0.34797573 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.066990405 = weight(abstract_txt:classification in 4797) [ClassicSimilarity], result of:
            0.066990405 = score(doc=4797,freq=2.0), product of:
              0.1518829 = queryWeight, product of:
                2.340813 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016253373 = queryNorm
              0.44106615 = fieldWeight in 4797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.19882688 = weight(abstract_txt:flat in 4797) [ClassicSimilarity], result of:
            0.19882688 = score(doc=4797,freq=1.0), product of:
              0.31367788 = queryWeight, product of:
                2.3786974 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.016253373 = queryNorm
              0.6338569 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.052120075 = weight(abstract_txt:documents in 4797) [ClassicSimilarity], result of:
            0.052120075 = score(doc=4797,freq=1.0), product of:
              0.16187526 = queryWeight, product of:
                2.4165874 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016253373 = queryNorm
              0.32197678 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.08271877 = weight(abstract_txt:relationships in 4797) [ClassicSimilarity], result of:
            0.08271877 = score(doc=4797,freq=1.0), product of:
              0.22024861 = queryWeight, product of:
                2.8188298 = boost
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.016253373 = queryNorm
              0.37557 = fieldWeight in 4797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.807296 = idf(docFreq=981, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
          0.24271376 = weight(abstract_txt:hierarchical in 4797) [ClassicSimilarity], result of:
            0.24271376 = score(doc=4797,freq=3.0), product of:
              0.31299183 = queryWeight, product of:
                3.3603053 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.016253373 = queryNorm
              0.7754636 = fieldWeight in 4797, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=4797)
        0.4 = coord(10/25)
    
  4. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.30
    0.2980854 = sum of:
      0.2980854 = product of:
        1.0645907 = sum of:
          0.019694818 = weight(abstract_txt:text in 1808) [ClassicSimilarity], result of:
            0.019694818 = score(doc=1808,freq=1.0), product of:
              0.0779247 = queryWeight, product of:
                1.1855909 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016253373 = queryNorm
              0.25274166 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.023557289 = weight(abstract_txt:document in 1808) [ClassicSimilarity], result of:
            0.023557289 = score(doc=1808,freq=1.0), product of:
              0.087805964 = queryWeight, product of:
                1.2585175 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016253373 = queryNorm
              0.26828802 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.09282463 = weight(abstract_txt:classification in 1808) [ClassicSimilarity], result of:
            0.09282463 = score(doc=1808,freq=6.0), product of:
              0.1518829 = queryWeight, product of:
                2.340813 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016253373 = queryNorm
              0.6111592 = fieldWeight in 1808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.058967132 = weight(abstract_txt:documents in 1808) [ClassicSimilarity], result of:
            0.058967132 = score(doc=1808,freq=2.0), product of:
              0.16187526 = queryWeight, product of:
                2.4165874 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016253373 = queryNorm
              0.36427513 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.16971704 = weight(abstract_txt:classifier in 1808) [ClassicSimilarity], result of:
            0.16971704 = score(doc=1808,freq=1.0), product of:
              0.37493375 = queryWeight, product of:
                3.1850786 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.016253373 = queryNorm
              0.45265874 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.4252305 = weight(abstract_txt:classifiers in 1808) [ClassicSimilarity], result of:
            0.4252305 = score(doc=1808,freq=5.0), product of:
              0.40448013 = queryWeight, product of:
                3.308198 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.016253373 = queryNorm
              1.0513014 = fieldWeight in 1808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.27459928 = weight(abstract_txt:hierarchical in 1808) [ClassicSimilarity], result of:
            0.27459928 = score(doc=1808,freq=6.0), product of:
              0.31299183 = queryWeight, product of:
                3.3603053 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.016253373 = queryNorm
              0.8773369 = fieldWeight in 1808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
        0.28 = coord(7/25)
    
  5. Yoon, Y.; Lee, C.; Lee, G.G.: An effective procedure for constructing a hierarchical text classification system (2006) 0.29
    0.2874947 = sum of:
      0.2874947 = product of:
        1.0267668 = sum of:
          0.024618523 = weight(abstract_txt:text in 5273) [ClassicSimilarity], result of:
            0.024618523 = score(doc=5273,freq=1.0), product of:
              0.0779247 = queryWeight, product of:
                1.1855909 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016253373 = queryNorm
              0.3159271 = fieldWeight in 5273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.04652252 = weight(abstract_txt:large in 5273) [ClassicSimilarity], result of:
            0.04652252 = score(doc=5273,freq=2.0), product of:
              0.09453645 = queryWeight, product of:
                1.3058609 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.016253373 = queryNorm
              0.49211198 = fieldWeight in 5273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.18114871 = weight(abstract_txt:hierarchy in 5273) [ClassicSimilarity], result of:
            0.18114871 = score(doc=5273,freq=3.0), product of:
              0.20440125 = queryWeight, product of:
                1.9201672 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.016253373 = queryNorm
              0.88624066 = fieldWeight in 5273, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.10592114 = weight(abstract_txt:classification in 5273) [ClassicSimilarity], result of:
            0.10592114 = score(doc=5273,freq=5.0), product of:
              0.1518829 = queryWeight, product of:
                2.340813 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.016253373 = queryNorm
              0.69738686 = fieldWeight in 5273, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.052120075 = weight(abstract_txt:documents in 5273) [ClassicSimilarity], result of:
            0.052120075 = score(doc=5273,freq=1.0), product of:
              0.16187526 = queryWeight, product of:
                2.4165874 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.016253373 = queryNorm
              0.32197678 = fieldWeight in 5273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.33617422 = weight(abstract_txt:classifiers in 5273) [ClassicSimilarity], result of:
            0.33617422 = score(doc=5273,freq=2.0), product of:
              0.40448013 = queryWeight, product of:
                3.308198 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.016253373 = queryNorm
              0.83112663 = fieldWeight in 5273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
          0.2802617 = weight(abstract_txt:hierarchical in 5273) [ClassicSimilarity], result of:
            0.2802617 = score(doc=5273,freq=4.0), product of:
              0.31299183 = queryWeight, product of:
                3.3603053 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.016253373 = queryNorm
              0.8954282 = fieldWeight in 5273, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=5273)
        0.28 = coord(7/25)
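
In the content-based list, each final score is the sum of the per-term scores multiplied by coord(matched/25), the fraction of the 25 query terms found in the abstract. The sketch below takes the per-document sums and match counts straight from the five explanations above and shows how coord changes the ranking. The variable and function names are illustrative only.

```python
QUERY_TERMS = 25  # denominator of every coord(n/25) factor in the listing

# (doc id, sum of per-term scores, number of matched query terms),
# copied from the five explanations above.
hits = [
    (1595, 1.3918433, 8),
    (87, 0.92998743, 9),
    (4797, 0.7852297, 10),
    (1808, 1.0645907, 7),
    (5273, 1.0267668, 7),
]


def final_score(raw_sum: float, matched: int, total: int = QUERY_TERMS) -> float:
    """Apply the coordination factor to the summed term scores."""
    return raw_sum * matched / total


# Ranking by the raw sum alone versus ranking after coord is applied.
by_raw = [doc for doc, raw, _ in sorted(hits, key=lambda h: h[1], reverse=True)]
by_final = [doc for doc, raw, matched in
            sorted(hits, key=lambda h: final_score(h[1], h[2]), reverse=True)]

for doc, raw, matched in hits:
    print(doc, round(final_score(raw, matched), 4))
```

Without coord, docs 1808 and 5273 would rank second and third on their raw sums. Because they match only 7 of the 25 terms, they fall behind docs 87 and 4797, which match 9 and 10 terms with lower raw sums.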