Document (#26810)

Author
Sun, A.
Lim, E.-P.
Ng, W.-K.
Title
Performance measurement framework for hierarchical text classification
Source
Journal of the American Society for Information Science and technology. 54(2003) no.11, S.1014-1028
Year
2003
Abstract
Hierarchical text classification or simply hierarchical classification refers to assigning a document to one or more suitable categories from a hierarchical category space. In our literature survey, we have found that the existing hierarchical classification experiments used a variety of measures to evaluate performance. These performance measures often assume independence between categories and do not consider documents misclassified into categories that are similar or not far from the correct categories in the category tree. In this paper, we therefore propose new performance measures for hierarchicai classification. The proposed performance measures consist of category similarity measures and distance-based measures that consider the contributions of misclassified documents. Our experiments an hierarchical classification methods based an SVM classifiers and binary Naive Bayes classifiers showed that SVM classifiers perform better than Naive Bayes classifiers an Reuters-21578 collection according to the extended measures. A new classifier-centric measure called blocking measure is also defined to examine the performance of subtree classifiers in a top-down levelbased hierarchical classificatIon method.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.46
    0.46079326 = sum of:
      0.46079326 = product of:
        0.959986 = sum of:
          0.052034575 = weight(abstract_txt:classifier in 2088) [ClassicSimilarity], result of:
            0.052034575 = score(doc=2088,freq=1.0), product of:
              0.0919191 = queryWeight, product of:
                1.0296127 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.012320708 = queryNorm
              0.566091 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.05457962 = weight(abstract_txt:assume in 2088) [ClassicSimilarity], result of:
            0.05457962 = score(doc=2088,freq=1.0), product of:
              0.0948924 = queryWeight, product of:
                1.0461326 = boost
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.012320708 = queryNorm
              0.5751738 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.058671907 = weight(abstract_txt:reuters in 2088) [ClassicSimilarity], result of:
            0.058671907 = score(doc=2088,freq=1.0), product of:
              0.09957826 = queryWeight, product of:
                1.0716507 = boost
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.012320708 = queryNorm
              0.58920395 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.03151607 = weight(abstract_txt:text in 2088) [ClassicSimilarity], result of:
            0.03151607 = score(doc=2088,freq=3.0), product of:
              0.057482462 = queryWeight, product of:
                1.1514728 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.012320708 = queryNorm
              0.5482728 = fieldWeight in 2088, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.04264587 = weight(abstract_txt:documents in 2088) [ClassicSimilarity], result of:
            0.04264587 = score(doc=2088,freq=5.0), product of:
              0.05931289 = queryWeight, product of:
                1.1696625 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.012320708 = queryNorm
              0.7189984 = fieldWeight in 2088, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.010760818 = weight(abstract_txt:that in 2088) [ClassicSimilarity], result of:
            0.010760818 = score(doc=2088,freq=2.0), product of:
              0.04049965 = queryWeight, product of:
                1.3668681 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012320708 = queryNorm
              0.2657015 = fieldWeight in 2088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.13114637 = weight(abstract_txt:21578 in 2088) [ClassicSimilarity], result of:
            0.13114637 = score(doc=2088,freq=1.0), product of:
              0.1702349 = queryWeight, product of:
                1.4011844 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.012320708 = queryNorm
              0.7703847 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.059982736 = weight(abstract_txt:experiments in 2088) [ClassicSimilarity], result of:
            0.059982736 = score(doc=2088,freq=2.0), product of:
              0.10105596 = queryWeight, product of:
                1.5267466 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.012320708 = queryNorm
              0.5935596 = fieldWeight in 2088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.07730624 = weight(abstract_txt:categories in 2088) [ClassicSimilarity], result of:
            0.07730624 = score(doc=2088,freq=1.0), product of:
              0.1899798 = queryWeight, product of:
                2.9604294 = boost
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.012320708 = queryNorm
              0.4069182 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.08244266 = weight(abstract_txt:performance in 2088) [ClassicSimilarity], result of:
            0.08244266 = score(doc=2088,freq=1.0), product of:
              0.2270019 = queryWeight, product of:
                3.9633405 = boost
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.012320708 = queryNorm
              0.3631805 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6487103 = idf(docFreq=1100, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.061632253 = weight(abstract_txt:classification in 2088) [ClassicSimilarity], result of:
            0.061632253 = score(doc=2088,freq=1.0), product of:
              0.1968411 = queryWeight, product of:
                3.986373 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.012320708 = queryNorm
              0.31310663 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
          0.29726684 = weight(abstract_txt:classifiers in 2088) [ClassicSimilarity], result of:
            0.29726684 = score(doc=2088,freq=1.0), product of:
              0.5023026 = queryWeight, product of:
                5.381938 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.012320708 = queryNorm
              0.5918083 = fieldWeight in 2088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.078125 = fieldNorm(doc=2088)
        0.48 = coord(12/25)
    
  2. Ko, Y.; Park, J.; Seo, J.: Improving text categorization using the importance of sentences (2004) 0.34
    0.3357505 = sum of:
      0.3357505 = product of:
        0.8393762 = sum of:
          0.038138214 = weight(abstract_txt:assigning in 3558) [ClassicSimilarity], result of:
            0.038138214 = score(doc=3558,freq=1.0), product of:
              0.08670776 = queryWeight, product of:
                7.037564 = idf(docFreq=100, maxDocs=42306)
                0.012320708 = queryNorm
              0.43984774 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.037564 = idf(docFreq=100, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.032549657 = weight(abstract_txt:text in 3558) [ClassicSimilarity], result of:
            0.032549657 = score(doc=3558,freq=5.0), product of:
              0.057482462 = queryWeight, product of:
                1.1514728 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.012320708 = queryNorm
              0.5662537 = fieldWeight in 3558, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.021577295 = weight(abstract_txt:documents in 3558) [ClassicSimilarity], result of:
            0.021577295 = score(doc=3558,freq=2.0), product of:
              0.05931289 = queryWeight, product of:
                1.1696625 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.012320708 = queryNorm
              0.36378762 = fieldWeight in 3558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.006087238 = weight(abstract_txt:that in 3558) [ClassicSimilarity], result of:
            0.006087238 = score(doc=3558,freq=1.0), product of:
              0.04049965 = queryWeight, product of:
                1.3668681 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012320708 = queryNorm
              0.15030347 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.047986187 = weight(abstract_txt:experiments in 3558) [ClassicSimilarity], result of:
            0.047986187 = score(doc=3558,freq=2.0), product of:
              0.10105596 = queryWeight, product of:
                1.5267466 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.012320708 = queryNorm
              0.47484767 = fieldWeight in 3558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.035193235 = weight(abstract_txt:measure in 3558) [ClassicSimilarity], result of:
            0.035193235 = score(doc=3558,freq=1.0), product of:
              0.103546135 = queryWeight, product of:
                1.5454428 = boost
                5.438076 = idf(docFreq=499, maxDocs=42306)
                0.012320708 = queryNorm
              0.33987975 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.438076 = idf(docFreq=499, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.1244692 = weight(abstract_txt:naive in 3558) [ClassicSimilarity], result of:
            0.1244692 = score(doc=3558,freq=1.0), product of:
              0.24036378 = queryWeight, product of:
                2.3546183 = boost
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.012320708 = queryNorm
              0.51783675 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.285388 = idf(docFreq=28, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.1352111 = weight(abstract_txt:bayes in 3558) [ClassicSimilarity], result of:
            0.1352111 = score(doc=3558,freq=1.0), product of:
              0.25400132 = queryWeight, product of:
                2.4204938 = boost
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.012320708 = queryNorm
              0.5323244 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.06184499 = weight(abstract_txt:categories in 3558) [ClassicSimilarity], result of:
            0.06184499 = score(doc=3558,freq=1.0), product of:
              0.1899798 = queryWeight, product of:
                2.9604294 = boost
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.012320708 = queryNorm
              0.32553455 = fieldWeight in 3558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
          0.33631906 = weight(abstract_txt:classifiers in 3558) [ClassicSimilarity], result of:
            0.33631906 = score(doc=3558,freq=2.0), product of:
              0.5023026 = queryWeight, product of:
                5.381938 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.012320708 = queryNorm
              0.6695547 = fieldWeight in 3558, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.0625 = fieldNorm(doc=3558)
        0.4 = coord(10/25)
    
  3. Liu, R.-L.: Dynamic category profiling for text filtering and classification (2007) 0.33
    0.33461943 = sum of:
      0.33461943 = product of:
        1.1950694 = sum of:
          0.01819581 = weight(abstract_txt:text in 2901) [ClassicSimilarity], result of:
            0.01819581 = score(doc=2901,freq=1.0), product of:
              0.057482462 = queryWeight, product of:
                1.1514728 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.012320708 = queryNorm
              0.31654543 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.033033352 = weight(abstract_txt:documents in 2901) [ClassicSimilarity], result of:
            0.033033352 = score(doc=2901,freq=3.0), product of:
              0.05931289 = queryWeight, product of:
                1.1696625 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.012320708 = queryNorm
              0.55693376 = fieldWeight in 2901, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.015218095 = weight(abstract_txt:that in 2901) [ClassicSimilarity], result of:
            0.015218095 = score(doc=2901,freq=4.0), product of:
              0.04049965 = queryWeight, product of:
                1.3668681 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012320708 = queryNorm
              0.37575868 = fieldWeight in 2901, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.22951025 = weight(abstract_txt:category in 2901) [ClassicSimilarity], result of:
            0.22951025 = score(doc=2901,freq=5.0), product of:
              0.20851126 = queryWeight, product of:
                2.6859405 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.012320708 = queryNorm
              1.1007091 = fieldWeight in 2901, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.109327525 = weight(abstract_txt:categories in 2901) [ClassicSimilarity], result of:
            0.109327525 = score(doc=2901,freq=2.0), product of:
              0.1899798 = queryWeight, product of:
                2.9604294 = boost
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.012320708 = queryNorm
              0.5754692 = fieldWeight in 2901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.061632253 = weight(abstract_txt:classification in 2901) [ClassicSimilarity], result of:
            0.061632253 = score(doc=2901,freq=1.0), product of:
              0.1968411 = queryWeight, product of:
                3.986373 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.012320708 = queryNorm
              0.31310663 = fieldWeight in 2901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
          0.72815216 = weight(abstract_txt:classifiers in 2901) [ClassicSimilarity], result of:
            0.72815216 = score(doc=2901,freq=6.0), product of:
              0.5023026 = queryWeight, product of:
                5.381938 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.012320708 = queryNorm
              1.4496285 = fieldWeight in 2901, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.078125 = fieldNorm(doc=2901)
        0.28 = coord(7/25)
    
  4. Fagni, T.; Sebastiani, F.: Selecting negative examples for hierarchical text classification: An experimental comparison (2010) 0.31
    0.31112584 = sum of:
      0.31112584 = product of:
        0.7778146 = sum of:
          0.046937525 = weight(abstract_txt:reuters in 1102) [ClassicSimilarity], result of:
            0.046937525 = score(doc=1102,freq=1.0), product of:
              0.09957826 = queryWeight, product of:
                1.0716507 = boost
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.012320708 = queryNorm
              0.47136316 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.014556649 = weight(abstract_txt:text in 1102) [ClassicSimilarity], result of:
            0.014556649 = score(doc=1102,freq=1.0), product of:
              0.057482462 = queryWeight, product of:
                1.1514728 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.012320708 = queryNorm
              0.25323635 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.006087238 = weight(abstract_txt:that in 1102) [ClassicSimilarity], result of:
            0.006087238 = score(doc=1102,freq=1.0), product of:
              0.04049965 = queryWeight, product of:
                1.3668681 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012320708 = queryNorm
              0.15030347 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.1049171 = weight(abstract_txt:21578 in 1102) [ClassicSimilarity], result of:
            0.1049171 = score(doc=1102,freq=1.0), product of:
              0.1702349 = queryWeight, product of:
                1.4011844 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.012320708 = queryNorm
              0.6163078 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.03393136 = weight(abstract_txt:experiments in 1102) [ClassicSimilarity], result of:
            0.03393136 = score(doc=1102,freq=1.0), product of:
              0.10105596 = queryWeight, product of:
                1.5267466 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.012320708 = queryNorm
              0.335768 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.1352111 = weight(abstract_txt:bayes in 1102) [ClassicSimilarity], result of:
            0.1352111 = score(doc=1102,freq=1.0), product of:
              0.25400132 = queryWeight, product of:
                2.4204938 = boost
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.012320708 = queryNorm
              0.5323244 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.51719 = idf(docFreq=22, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.082112074 = weight(abstract_txt:category in 1102) [ClassicSimilarity], result of:
            0.082112074 = score(doc=1102,freq=1.0), product of:
              0.20851126 = queryWeight, product of:
                2.6859405 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.012320708 = queryNorm
              0.39380163 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.06184499 = weight(abstract_txt:categories in 1102) [ClassicSimilarity], result of:
            0.06184499 = score(doc=1102,freq=1.0), product of:
              0.1899798 = queryWeight, product of:
                2.9604294 = boost
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.012320708 = queryNorm
              0.32553455 = fieldWeight in 1102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.208553 = idf(docFreq=628, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.08540016 = weight(abstract_txt:classification in 1102) [ClassicSimilarity], result of:
            0.08540016 = score(doc=1102,freq=3.0), product of:
              0.1968411 = queryWeight, product of:
                3.986373 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.012320708 = queryNorm
              0.43385327 = fieldWeight in 1102, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
          0.2068164 = weight(abstract_txt:hierarchical in 1102) [ClassicSimilarity], result of:
            0.2068164 = score(doc=1102,freq=2.0), product of:
              0.40634704 = queryWeight, product of:
                5.7275457 = boost
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.012320708 = queryNorm
              0.50896496 = fieldWeight in 1102, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.0625 = fieldNorm(doc=1102)
        0.4 = coord(10/25)
    
  5. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.31
    0.30883795 = sum of:
      0.30883795 = product of:
        1.1029927 = sum of:
          0.073588006 = weight(abstract_txt:classifier in 517) [ClassicSimilarity], result of:
            0.073588006 = score(doc=517,freq=2.0), product of:
              0.0919191 = queryWeight, product of:
                1.0296127 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.012320708 = queryNorm
              0.8005736 = fieldWeight in 517, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.025732761 = weight(abstract_txt:text in 517) [ClassicSimilarity], result of:
            0.025732761 = score(doc=517,freq=2.0), product of:
              0.057482462 = queryWeight, product of:
                1.1514728 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.012320708 = queryNorm
              0.44766283 = fieldWeight in 517, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.03814363 = weight(abstract_txt:documents in 517) [ClassicSimilarity], result of:
            0.03814363 = score(doc=517,freq=4.0), product of:
              0.05931289 = queryWeight, product of:
                1.1696625 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.012320708 = queryNorm
              0.64309174 = fieldWeight in 517, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.010760818 = weight(abstract_txt:that in 517) [ClassicSimilarity], result of:
            0.010760818 = score(doc=517,freq=2.0), product of:
              0.04049965 = queryWeight, product of:
                1.3668681 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.012320708 = queryNorm
              0.2657015 = fieldWeight in 517, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.12326451 = weight(abstract_txt:classification in 517) [ClassicSimilarity], result of:
            0.12326451 = score(doc=517,freq=4.0), product of:
              0.1968411 = queryWeight, product of:
                3.986373 = boost
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.012320708 = queryNorm
              0.62621325 = fieldWeight in 517, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.007765 = idf(docFreq=2089, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.51488125 = weight(abstract_txt:classifiers in 517) [ClassicSimilarity], result of:
            0.51488125 = score(doc=517,freq=3.0), product of:
              0.5023026 = queryWeight, product of:
                5.381938 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.012320708 = queryNorm
              1.025042 = fieldWeight in 517, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
          0.31662166 = weight(abstract_txt:hierarchical in 517) [ClassicSimilarity], result of:
            0.31662166 = score(doc=517,freq=3.0), product of:
              0.40634704 = queryWeight, product of:
                5.7275457 = boost
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.012320708 = queryNorm
              0.7791903 = fieldWeight in 517, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.078125 = fieldNorm(doc=517)
        0.28 = coord(7/25)