Document (#34699)

Author
Gauch, S.
Chandramouli, A.
Ranganathan, S.
Title
Training a hierarchical classifier using inter document relationships
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.1, S.47-58
Year
2009
Abstract
Text classifiers automatically classify documents into appropriate concepts for different applications. Most classification approaches use flat classifiers that treat each concept as independent, even when the concept space is hierarchically structured. In contrast, hierarchical text classification exploits the structural relationships between the concepts. In this article, we explore the effectiveness of hierarchical classification for a large concept hierarchy. Since the quality of the classification is dependent on the quality and quantity of the training data, we evaluate the use of documents selected from subconcepts to address the sparseness of training data for the top-level classifiers and the use of document relationships to identify the most representative training documents. By selecting training documents using structural and similarity relationships, we achieve a statistically significant improvement of 39.8% (from 54.5-76.2%) in the accuracy of the hierarchical classifier over that of the flat classifier for a large, three-level concept hierarchy.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Gauch, S.: Intelligent information retrieval : an introduction (1992) 2.49
    2.4862905 = sum of:
      2.4862905 = product of:
        4.972581 = sum of:
          4.972581 = weight(author_txt:gauch in 503) [ClassicSimilarity], result of:
            4.972581 = score(doc=503,freq=1.0), product of:
              0.81878626 = queryWeight, product of:
                1.1942413 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.070558146 = queryNorm
              6.0731125 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.625 = fieldNorm(doc=503)
        0.5 = coord(1/2)
    
  2. Gauch, S.; Smith, J.B.: ¬An expert system for automatic query reformation (1993) 1.99
    1.9890324 = sum of:
      1.9890324 = product of:
        3.9780648 = sum of:
          3.9780648 = weight(author_txt:gauch in 3693) [ClassicSimilarity], result of:
            3.9780648 = score(doc=3693,freq=1.0), product of:
              0.81878626 = queryWeight, product of:
                1.1942413 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.070558146 = queryNorm
              4.85849 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.5 = fieldNorm(doc=3693)
        0.5 = coord(1/2)
    
  3. Gauch, S.; Chong, M.K.: Automatic word similarity detection for TREC 4 query expansion (1996) 1.99
    1.9890324 = sum of:
      1.9890324 = product of:
        3.9780648 = sum of:
          3.9780648 = weight(author_txt:gauch in 3060) [ClassicSimilarity], result of:
            3.9780648 = score(doc=3060,freq=1.0), product of:
              0.81878626 = queryWeight, product of:
                1.1942413 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.070558146 = queryNorm
              4.85849 = fieldWeight in 3060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.5 = fieldNorm(doc=3060)
        0.5 = coord(1/2)
    
  4. Gauch, S.; Wang, J.: Corpus analysis for TREC 5 query expansion (1997) 1.99
    1.9890324 = sum of:
      1.9890324 = product of:
        3.9780648 = sum of:
          3.9780648 = weight(author_txt:gauch in 5869) [ClassicSimilarity], result of:
            3.9780648 = score(doc=5869,freq=1.0), product of:
              0.81878626 = queryWeight, product of:
                1.1942413 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.070558146 = queryNorm
              4.85849 = fieldWeight in 5869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.5 = fieldNorm(doc=5869)
        0.5 = coord(1/2)
    
  5. Haverkamp, D.S.; Gauch, S.: Intelligent information agents : review and challenges for distributed information sources (1998) 1.99
    1.9890324 = sum of:
      1.9890324 = product of:
        3.9780648 = sum of:
          3.9780648 = weight(author_txt:gauch in 3883) [ClassicSimilarity], result of:
            3.9780648 = score(doc=3883,freq=1.0), product of:
              0.81878626 = queryWeight, product of:
                1.1942413 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.070558146 = queryNorm
              4.85849 = fieldWeight in 3883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.5 = fieldNorm(doc=3883)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001) 0.45
    0.44725272 = sum of:
      0.44725272 = product of:
        1.3976648 = sum of:
          0.018681621 = weight(abstract_txt:using in 2596) [ClassicSimilarity], result of:
            0.018681621 = score(doc=2596,freq=1.0), product of:
              0.057270013 = queryWeight, product of:
                1.0193914 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0161462 = queryNorm
              0.32620248 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.11242608 = weight(abstract_txt:exploits in 2596) [ClassicSimilarity], result of:
            0.11242608 = score(doc=2596,freq=1.0), product of:
              0.15039 = queryWeight, product of:
                1.168078 = boost
                7.974011 = idf(docFreq=39, maxDocs=42740)
                0.0161462 = queryNorm
              0.74756354 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.974011 = idf(docFreq=39, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.04166445 = weight(abstract_txt:text in 2596) [ClassicSimilarity], result of:
            0.04166445 = score(doc=2596,freq=2.0), product of:
              0.077592194 = queryWeight, product of:
                1.1865509 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0161462 = queryNorm
              0.53696704 = fieldWeight in 2596, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.2415215 = weight(abstract_txt:flat in 2596) [ClassicSimilarity], result of:
            0.2415215 = score(doc=2596,freq=1.0), product of:
              0.3154676 = queryWeight, product of:
                2.3925152 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.0161462 = queryNorm
              0.7655984 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.25287285 = weight(abstract_txt:classifier in 2596) [ClassicSimilarity], result of:
            0.25287285 = score(doc=2596,freq=1.0), product of:
              0.3723484 = queryWeight, product of:
                3.1834476 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.0161462 = queryNorm
              0.6791297 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.28839993 = weight(abstract_txt:classifiers in 2596) [ClassicSimilarity], result of:
            0.28839993 = score(doc=2596,freq=1.0), product of:
              0.40645406 = queryWeight, product of:
                3.3260493 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0161462 = queryNorm
              0.70955116 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.29237154 = weight(abstract_txt:hierarchical in 2596) [ClassicSimilarity], result of:
            0.29237154 = score(doc=2596,freq=3.0), product of:
              0.31302372 = queryWeight, product of:
                3.3703961 = boost
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0161462 = queryNorm
              0.9340236 = fieldWeight in 2596, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
          0.14972691 = weight(abstract_txt:training in 2596) [ClassicSimilarity], result of:
            0.14972691 = score(doc=2596,freq=1.0), product of:
              0.31128928 = queryWeight, product of:
                3.7577634 = boost
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.0161462 = queryNorm
              0.48098963 = fieldWeight in 2596, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.09375 = fieldNorm(doc=2596)
        0.32 = coord(8/25)
    
  2. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.34
    0.33522636 = sum of:
      0.33522636 = product of:
        0.9311843 = sum of:
          0.0220165 = weight(abstract_txt:using in 2088) [ClassicSimilarity], result of:
            0.0220165 = score(doc=2088,freq=2.0), product of:
              0.057270013 = queryWeight, product of:
                1.0193914 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0161462 = queryNorm
              0.3844333 = fieldWeight in 2088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.022963047 = weight(abstract_txt:most in 2088) [ClassicSimilarity], result of:
            0.022963047 = score(doc=2088,freq=1.0), product of:
              0.07420926 = queryWeight, product of:
                1.1603965 = boost
                3.960786 = idf(docFreq=2212, maxDocs=42740)
                0.0161462 = queryNorm
              0.3094364 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.960786 = idf(docFreq=2212, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.042523604 = weight(abstract_txt:text in 2088) [ClassicSimilarity], result of:
            0.042523604 = score(doc=2088,freq=3.0), product of:
              0.077592194 = queryWeight, product of:
                1.1865509 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0161462 = queryNorm
              0.54803973 = fieldWeight in 2088, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.05366872 = weight(abstract_txt:quality in 2088) [ClassicSimilarity], result of:
            0.05366872 = score(doc=2088,freq=2.0), product of:
              0.10373116 = queryWeight, product of:
                1.37193 = boost
                4.6828146 = idf(docFreq=1074, maxDocs=42740)
                0.0161462 = queryNorm
              0.5173828 = fieldWeight in 2088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6828146 = idf(docFreq=1074, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.04730205 = weight(abstract_txt:classification in 2088) [ClassicSimilarity], result of:
            0.04730205 = score(doc=2088,freq=1.0), product of:
              0.15136835 = queryWeight, product of:
                2.3437424 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0161462 = queryNorm
              0.3124963 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.115194835 = weight(abstract_txt:documents in 2088) [ClassicSimilarity], result of:
            0.115194835 = score(doc=2088,freq=5.0), product of:
              0.1602312 = queryWeight, product of:
                2.4113812 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0161462 = queryNorm
              0.7189289 = fieldWeight in 2088, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.21072736 = weight(abstract_txt:classifier in 2088) [ClassicSimilarity], result of:
            0.21072736 = score(doc=2088,freq=1.0), product of:
              0.3723484 = queryWeight, product of:
                3.1834476 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.0161462 = queryNorm
              0.5659414 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.24033329 = weight(abstract_txt:classifiers in 2088) [ClassicSimilarity], result of:
            0.24033329 = score(doc=2088,freq=1.0), product of:
              0.40645406 = queryWeight, product of:
                3.3260493 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0161462 = queryNorm
              0.5912926 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
          0.17645487 = weight(abstract_txt:training in 2088) [ClassicSimilarity], result of:
            0.17645487 = score(doc=2088,freq=2.0), product of:
              0.31128928 = queryWeight, product of:
                3.7577634 = boost
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.0161462 = queryNorm
              0.56685174 = fieldWeight in 2088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.078125 = fieldNorm(doc=2088)
        0.36 = coord(9/25)
    
  3. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy (2007) 0.32
    0.31514934 = sum of:
      0.31514934 = product of:
        0.7878733 = sum of:
          0.0220165 = weight(abstract_txt:using in 1798) [ClassicSimilarity], result of:
            0.0220165 = score(doc=1798,freq=2.0), product of:
              0.057270013 = queryWeight, product of:
                1.0193914 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.0161462 = queryNorm
              0.3844333 = fieldWeight in 1798, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.022963047 = weight(abstract_txt:most in 1798) [ClassicSimilarity], result of:
            0.022963047 = score(doc=1798,freq=1.0), product of:
              0.07420926 = queryWeight, product of:
                1.1603965 = boost
                3.960786 = idf(docFreq=2212, maxDocs=42740)
                0.0161462 = queryNorm
              0.3094364 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.960786 = idf(docFreq=2212, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.03472038 = weight(abstract_txt:text in 1798) [ClassicSimilarity], result of:
            0.03472038 = score(doc=1798,freq=2.0), product of:
              0.077592194 = queryWeight, product of:
                1.1865509 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0161462 = queryNorm
              0.44747257 = fieldWeight in 1798, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.028990107 = weight(abstract_txt:document in 1798) [ClassicSimilarity], result of:
            0.028990107 = score(doc=1798,freq=1.0), product of:
              0.08668387 = queryWeight, product of:
                1.2541413 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.0161462 = queryNorm
              0.33443484 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.032972906 = weight(abstract_txt:large in 1798) [ClassicSimilarity], result of:
            0.032972906 = score(doc=1798,freq=1.0), product of:
              0.094451725 = queryWeight, product of:
                1.3091285 = boost
                4.468454 = idf(docFreq=1331, maxDocs=42740)
                0.0161462 = queryNorm
              0.34909797 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.468454 = idf(docFreq=1331, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.0668952 = weight(abstract_txt:classification in 1798) [ClassicSimilarity], result of:
            0.0668952 = score(doc=1798,freq=2.0), product of:
              0.15136835 = queryWeight, product of:
                2.3437424 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0161462 = queryNorm
              0.44193652 = fieldWeight in 1798, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.2012679 = weight(abstract_txt:flat in 1798) [ClassicSimilarity], result of:
            0.2012679 = score(doc=1798,freq=1.0), product of:
              0.3154676 = queryWeight, product of:
                2.3925152 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.0161462 = queryNorm
              0.63799864 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.051516697 = weight(abstract_txt:documents in 1798) [ClassicSimilarity], result of:
            0.051516697 = score(doc=1798,freq=1.0), product of:
              0.1602312 = queryWeight, product of:
                2.4113812 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0161462 = queryNorm
              0.32151476 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.08288766 = weight(abstract_txt:relationships in 1798) [ClassicSimilarity], result of:
            0.08288766 = score(doc=1798,freq=1.0), product of:
              0.22000962 = queryWeight, product of:
                2.8256161 = boost
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.0161462 = queryNorm
              0.3767456 = fieldWeight in 1798, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
          0.24364294 = weight(abstract_txt:hierarchical in 1798) [ClassicSimilarity], result of:
            0.24364294 = score(doc=1798,freq=3.0), product of:
              0.31302372 = queryWeight, product of:
                3.3703961 = boost
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0161462 = queryNorm
              0.778353 = fieldWeight in 1798, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.078125 = fieldNorm(doc=1798)
        0.4 = coord(10/25)
    
  4. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.30
    0.29902986 = sum of:
      0.29902986 = product of:
        1.0679637 = sum of:
          0.019640813 = weight(abstract_txt:text in 2809) [ClassicSimilarity], result of:
            0.019640813 = score(doc=2809,freq=1.0), product of:
              0.077592194 = queryWeight, product of:
                1.1865509 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0161462 = queryNorm
              0.2531287 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.023192085 = weight(abstract_txt:document in 2809) [ClassicSimilarity], result of:
            0.023192085 = score(doc=2809,freq=1.0), product of:
              0.08668387 = queryWeight, product of:
                1.2541413 = boost
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.0161462 = queryNorm
              0.26754788 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.280766 = idf(docFreq=1606, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.09269272 = weight(abstract_txt:classification in 2809) [ClassicSimilarity], result of:
            0.09269272 = score(doc=2809,freq=6.0), product of:
              0.15136835 = queryWeight, product of:
                2.3437424 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0161462 = queryNorm
              0.61236525 = fieldWeight in 2809, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.058284488 = weight(abstract_txt:documents in 2809) [ClassicSimilarity], result of:
            0.058284488 = score(doc=2809,freq=2.0), product of:
              0.1602312 = queryWeight, product of:
                2.4113812 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0161462 = queryNorm
              0.36375242 = fieldWeight in 2809, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.1685819 = weight(abstract_txt:classifier in 2809) [ClassicSimilarity], result of:
            0.1685819 = score(doc=2809,freq=1.0), product of:
              0.3723484 = queryWeight, product of:
                3.1834476 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.0161462 = queryNorm
              0.45275313 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.42992124 = weight(abstract_txt:classifiers in 2809) [ClassicSimilarity], result of:
            0.42992124 = score(doc=2809,freq=5.0), product of:
              0.40645406 = queryWeight, product of:
                3.3260493 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0161462 = queryNorm
              1.0577364 = fieldWeight in 2809, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
          0.27565053 = weight(abstract_txt:hierarchical in 2809) [ClassicSimilarity], result of:
            0.27565053 = score(doc=2809,freq=6.0), product of:
              0.31302372 = queryWeight, product of:
                3.3703961 = boost
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0161462 = queryNorm
              0.88060594 = fieldWeight in 2809, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0625 = fieldNorm(doc=2809)
        0.28 = coord(7/25)
    
  5. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.29
    0.2885781 = sum of:
      0.2885781 = product of:
        1.0306361 = sum of:
          0.024551015 = weight(abstract_txt:text in 274) [ClassicSimilarity], result of:
            0.024551015 = score(doc=274,freq=1.0), product of:
              0.077592194 = queryWeight, product of:
                1.1865509 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0161462 = queryNorm
              0.3164109 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.046630725 = weight(abstract_txt:large in 274) [ClassicSimilarity], result of:
            0.046630725 = score(doc=274,freq=2.0), product of:
              0.094451725 = queryWeight, product of:
                1.3091285 = boost
                4.468454 = idf(docFreq=1331, maxDocs=42740)
                0.0161462 = queryNorm
              0.49369904 = fieldWeight in 274, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.468454 = idf(docFreq=1331, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.18094975 = weight(abstract_txt:hierarchy in 274) [ClassicSimilarity], result of:
            0.18094975 = score(doc=274,freq=3.0), product of:
              0.20375268 = queryWeight, product of:
                1.9227773 = boost
                6.563024 = idf(docFreq=163, maxDocs=42740)
                0.0161462 = queryNorm
              0.88808525 = fieldWeight in 274, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.563024 = idf(docFreq=163, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.1057706 = weight(abstract_txt:classification in 274) [ClassicSimilarity], result of:
            0.1057706 = score(doc=274,freq=5.0), product of:
              0.15136835 = queryWeight, product of:
                2.3437424 = boost
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.0161462 = queryNorm
              0.698763 = fieldWeight in 274, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9999528 = idf(docFreq=2127, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.051516697 = weight(abstract_txt:documents in 274) [ClassicSimilarity], result of:
            0.051516697 = score(doc=274,freq=1.0), product of:
              0.1602312 = queryWeight, product of:
                2.4113812 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0161462 = queryNorm
              0.32151476 = fieldWeight in 274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.3398826 = weight(abstract_txt:classifiers in 274) [ClassicSimilarity], result of:
            0.3398826 = score(doc=274,freq=2.0), product of:
              0.40645406 = queryWeight, product of:
                3.3260493 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0161462 = queryNorm
              0.83621407 = fieldWeight in 274, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
          0.28133467 = weight(abstract_txt:hierarchical in 274) [ClassicSimilarity], result of:
            0.28133467 = score(doc=274,freq=4.0), product of:
              0.31302372 = queryWeight, product of:
                3.3703961 = boost
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0161462 = queryNorm
              0.89876467 = fieldWeight in 274, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.078125 = fieldNorm(doc=274)
        0.28 = coord(7/25)