Document (#30005)

Author
Sebastiani, F.
Title
Classification of text, automatic
Source
Encyclopedia of language and linguistics. 2nd ed. Ed.: K. Brown. Vol. 14
Imprint
Amsterdam : Elsevier Science Publishers
Year
2006
Pages
pp. 457-462
Abstract
Automatic text classification (ATC) is a discipline at the crossroads of information retrieval (IR), machine learning (ML), and computational linguistics (CL), and consists in the realization of text classifiers, i.e. software systems capable of assigning texts to one or more categories, or classes, from a predefined set. Applications range from the automated indexing of scientific articles, to e-mail routing, spam filtering, authorship attribution, and automated survey coding. This article will focus on the ML approach to ATC, whereby a software system (called the learner) automatically builds a classifier for the categories of interest by generalizing from a "training" set of pre-classified texts.
Content
See also: http://www.math.unipd.it/~fabseb60/Publications/ELL06.pdf.
Theme
Automatic classification

Similar documents (author)

  1. Sebastiani, F.: On the role of logic in information retrieval (1998) 5.99
    5.9875464 = sum of:
      5.9875464 = weight(author_txt:sebastiani in 2141) [ClassicSimilarity], result of:
        5.9875464 = fieldWeight in 2141, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.580074 = idf(docFreq=7, maxDocs=42596)
          0.625 = fieldNorm(doc=2141)
    
  2. Sebastiani, F.: Machine learning in automated text categorization (2002) 5.99
    5.9875464 = sum of:
      5.9875464 = weight(author_txt:sebastiani in 4390) [ClassicSimilarity], result of:
        5.9875464 = fieldWeight in 4390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.580074 = idf(docFreq=7, maxDocs=42596)
          0.625 = fieldNorm(doc=4390)
    
  3. Sebastiani, F.: A tutorial on automated text categorisation (1999) 5.99
    5.9875464 = sum of:
      5.9875464 = weight(author_txt:sebastiani in 4391) [ClassicSimilarity], result of:
        5.9875464 = fieldWeight in 4391, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.580074 = idf(docFreq=7, maxDocs=42596)
          0.625 = fieldNorm(doc=4391)
    
  4. Debole, F.; Sebastiani, F.: An analysis of the relative hardness of Reuters-21578 subsets (2005) 4.79
    4.790037 = sum of:
      4.790037 = weight(author_txt:sebastiani in 4457) [ClassicSimilarity], result of:
        4.790037 = fieldWeight in 4457, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.580074 = idf(docFreq=7, maxDocs=42596)
          0.5 = fieldNorm(doc=4457)
    
  5. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 4.79
    4.790037 = sum of:
      4.790037 = weight(author_txt:sebastiani in 173) [ClassicSimilarity], result of:
        4.790037 = fieldWeight in 173, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.580074 = idf(docFreq=7, maxDocs=42596)
          0.5 = fieldNorm(doc=173)
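
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative and are not part of Lucene's API, and the input numbers are copied from the listing.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency (assumed ClassicSimilarity form).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the listing above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the entries for doc 2141 (fieldNorm 0.625)
# and doc 4457 (fieldNorm 0.5).
score_2141 = field_weight(1.0, 7, 42596, 0.625)
score_4457 = field_weight(1.0, 7, 42596, 0.5)
print(round(score_2141, 2), round(score_4457, 2))
```

Under these assumptions the two results agree with the 5.99 and 4.79 shown for the single-author and two-author records. The only difference between them is the fieldNorm, which is smaller for the longer author field.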
    

Similar documents (content)

  1. Sebastiani, F.: Machine learning in automated text categorization (2002) 0.31
    0.30655545 = sum of:
      0.30655545 = product of:
        0.8515429 = sum of:
          0.09844129 = weight(abstract_txt:builds in 4390) [ClassicSimilarity], result of:
            0.09844129 = score(doc=4390,freq=1.0), product of:
              0.17599165 = queryWeight, product of:
                1.0724691 = boost
                7.159706 = idf(docFreq=89, maxDocs=42596)
                0.022919867 = queryNorm
              0.55935204 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.159706 = idf(docFreq=89, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.20363803 = weight(abstract_txt:classifier in 4390) [ClassicSimilarity], result of:
            0.20363803 = score(doc=4390,freq=4.0), product of:
              0.17999473 = queryWeight, product of:
                1.0845976 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.022919867 = queryNorm
              1.1313555 = fieldWeight in 4390, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.034316793 = weight(abstract_txt:classification in 4390) [ClassicSimilarity], result of:
            0.034316793 = score(doc=4390,freq=1.0), product of:
              0.10983017 = queryWeight, product of:
                1.1981593 = boost
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.022919867 = queryNorm
              0.31245324 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.14430085 = weight(abstract_txt:predefined in 4390) [ClassicSimilarity], result of:
            0.14430085 = score(doc=4390,freq=1.0), product of:
              0.22710139 = queryWeight, product of:
                1.2182842 = boost
                8.133155 = idf(docFreq=33, maxDocs=42596)
                0.022919867 = queryNorm
              0.63540274 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.133155 = idf(docFreq=33, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.017459061 = weight(abstract_txt:from in 4390) [ClassicSimilarity], result of:
            0.017459061 = score(doc=4390,freq=1.0), product of:
              0.080123805 = queryWeight, product of:
                1.2533726 = boost
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.022919867 = queryNorm
              0.21790105 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.10684017 = weight(abstract_txt:categories in 4390) [ClassicSimilarity], result of:
            0.10684017 = score(doc=4390,freq=2.0), product of:
              0.1858647 = queryWeight, product of:
                1.5586629 = boost
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.022919867 = queryNorm
              0.5748277 = fieldWeight in 4390, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.09518298 = weight(abstract_txt:automated in 4390) [ClassicSimilarity], result of:
            0.09518298 = score(doc=4390,freq=1.0), product of:
              0.21681537 = queryWeight, product of:
                1.6834444 = boost
                5.619261 = idf(docFreq=419, maxDocs=42596)
                0.022919867 = queryNorm
              0.43900475 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619261 = idf(docFreq=419, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.09794329 = weight(abstract_txt:texts in 4390) [ClassicSimilarity], result of:
            0.09794329 = score(doc=4390,freq=1.0), product of:
              0.22098714 = queryWeight, product of:
                1.6995629 = boost
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.022919867 = queryNorm
              0.4432081 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
          0.053420402 = weight(abstract_txt:text in 4390) [ClassicSimilarity], result of:
            0.053420402 = score(doc=4390,freq=1.0), product of:
              0.16886996 = queryWeight, product of:
                1.8195986 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.022919867 = queryNorm
              0.31634048 = fieldWeight in 4390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.078125 = fieldNorm(doc=4390)
        0.36 = coord(9/25)
    
  2. Liu, R.-L.: A passage extractor for classification of disease aspect information (2013) 0.25
    0.25014895 = sum of:
      0.25014895 = product of:
        0.7817155 = sum of:
          0.11519506 = weight(abstract_txt:classifier in 2108) [ClassicSimilarity], result of:
            0.11519506 = score(doc=2108,freq=2.0), product of:
              0.17999473 = queryWeight, product of:
                1.0845976 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.022919867 = queryNorm
              0.6399913 = fieldWeight in 2108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.22756927 = weight(abstract_txt:classifiers in 2108) [ClassicSimilarity], result of:
            0.22756927 = score(doc=2108,freq=6.0), product of:
              0.19648944 = queryWeight, product of:
                1.1332047 = boost
                7.5651712 = idf(docFreq=59, maxDocs=42596)
                0.022919867 = queryNorm
              1.1581756 = fieldWeight in 2108, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5651712 = idf(docFreq=59, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.047550738 = weight(abstract_txt:classification in 2108) [ClassicSimilarity], result of:
            0.047550738 = score(doc=2108,freq=3.0), product of:
              0.10983017 = queryWeight, product of:
                1.1981593 = boost
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.022919867 = queryNorm
              0.43294787 = fieldWeight in 2108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.01396725 = weight(abstract_txt:from in 2108) [ClassicSimilarity], result of:
            0.01396725 = score(doc=2108,freq=1.0), product of:
              0.080123805 = queryWeight, product of:
                1.2533726 = boost
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.022919867 = queryNorm
              0.17432085 = fieldWeight in 2108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.060274336 = weight(abstract_txt:automatic in 2108) [ClassicSimilarity], result of:
            0.060274336 = score(doc=2108,freq=1.0), product of:
              0.18552916 = queryWeight, product of:
                1.5572554 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.022919867 = queryNorm
              0.32487798 = fieldWeight in 2108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.08547214 = weight(abstract_txt:categories in 2108) [ClassicSimilarity], result of:
            0.08547214 = score(doc=2108,freq=2.0), product of:
              0.1858647 = queryWeight, product of:
                1.5586629 = boost
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.022919867 = queryNorm
              0.4598621 = fieldWeight in 2108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.11081018 = weight(abstract_txt:texts in 2108) [ClassicSimilarity], result of:
            0.11081018 = score(doc=2108,freq=2.0), product of:
              0.22098714 = queryWeight, product of:
                1.6995629 = boost
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.022919867 = queryNorm
              0.5014327 = fieldWeight in 2108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
          0.12087657 = weight(abstract_txt:text in 2108) [ClassicSimilarity], result of:
            0.12087657 = score(doc=2108,freq=8.0), product of:
              0.16886996 = queryWeight, product of:
                1.8195986 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.022919867 = queryNorm
              0.71579677 = fieldWeight in 2108, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.0625 = fieldNorm(doc=2108)
        0.32 = coord(8/25)
    
  3. Sebastiani, F.: A tutorial on automated text categorisation (1999) 0.24
    0.24424252 = sum of:
      0.24424252 = product of:
        0.6784514 = sum of:
          0.07875303 = weight(abstract_txt:builds in 4391) [ClassicSimilarity], result of:
            0.07875303 = score(doc=4391,freq=1.0), product of:
              0.17599165 = queryWeight, product of:
                1.0724691 = boost
                7.159706 = idf(docFreq=89, maxDocs=42596)
                0.022919867 = queryNorm
              0.44748163 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.159706 = idf(docFreq=89, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.14108457 = weight(abstract_txt:classifier in 4391) [ClassicSimilarity], result of:
            0.14108457 = score(doc=4391,freq=3.0), product of:
              0.17999473 = queryWeight, product of:
                1.0845976 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.022919867 = queryNorm
              0.78382605 = fieldWeight in 4391, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.027453434 = weight(abstract_txt:classification in 4391) [ClassicSimilarity], result of:
            0.027453434 = score(doc=4391,freq=1.0), product of:
              0.10983017 = queryWeight, product of:
                1.1981593 = boost
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.022919867 = queryNorm
              0.24996258 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.01396725 = weight(abstract_txt:from in 4391) [ClassicSimilarity], result of:
            0.01396725 = score(doc=4391,freq=1.0), product of:
              0.080123805 = queryWeight, product of:
                1.2533726 = boost
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.022919867 = queryNorm
              0.17432085 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.08524079 = weight(abstract_txt:automatic in 4391) [ClassicSimilarity], result of:
            0.08524079 = score(doc=4391,freq=2.0), product of:
              0.18552916 = queryWeight, product of:
                1.5572554 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.022919867 = queryNorm
              0.45944685 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.08547214 = weight(abstract_txt:categories in 4391) [ClassicSimilarity], result of:
            0.08547214 = score(doc=4391,freq=2.0), product of:
              0.1858647 = queryWeight, product of:
                1.5586629 = boost
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.022919867 = queryNorm
              0.4598621 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.10768724 = weight(abstract_txt:automated in 4391) [ClassicSimilarity], result of:
            0.10768724 = score(doc=4391,freq=2.0), product of:
              0.21681537 = queryWeight, product of:
                1.6834444 = boost
                5.619261 = idf(docFreq=419, maxDocs=42596)
                0.022919867 = queryNorm
              0.49667716 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619261 = idf(docFreq=419, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.078354634 = weight(abstract_txt:texts in 4391) [ClassicSimilarity], result of:
            0.078354634 = score(doc=4391,freq=1.0), product of:
              0.22098714 = queryWeight, product of:
                1.6995629 = boost
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.022919867 = queryNorm
              0.35456648 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
          0.060438287 = weight(abstract_txt:text in 4391) [ClassicSimilarity], result of:
            0.060438287 = score(doc=4391,freq=2.0), product of:
              0.16886996 = queryWeight, product of:
                1.8195986 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.022919867 = queryNorm
              0.35789838 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.0625 = fieldNorm(doc=4391)
        0.36 = coord(9/25)
    
  4. Teich, E.; Degaetano-Ortlieb, S.; Fankhauser, P.; Kermes, H.; Lapshinova-Koltunski, E.: The linguistic construal of disciplinarity : a data-mining approach using register features (2016) 0.20
    0.19958918 = sum of:
      0.19958918 = product of:
        0.6237162 = sum of:
          0.07980369 = weight(abstract_txt:linguistics in 4016) [ClassicSimilarity], result of:
            0.07980369 = score(doc=4016,freq=1.0), product of:
              0.15301095 = queryWeight, product of:
                6.675909 = idf(docFreq=145, maxDocs=42596)
                0.022919867 = queryNorm
              0.5215554 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.675909 = idf(docFreq=145, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.09298034 = weight(abstract_txt:authorship in 4016) [ClassicSimilarity], result of:
            0.09298034 = score(doc=4016,freq=1.0), product of:
              0.16942129 = queryWeight, product of:
                1.0522592 = boost
                7.0247865 = idf(docFreq=102, maxDocs=42596)
                0.022919867 = queryNorm
              0.54881144 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0247865 = idf(docFreq=102, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.1333432 = weight(abstract_txt:attribution in 4016) [ClassicSimilarity], result of:
            0.1333432 = score(doc=4016,freq=1.0), product of:
              0.21545395 = queryWeight, product of:
                1.1866318 = boost
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.022919867 = queryNorm
              0.6188942 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.034316793 = weight(abstract_txt:classification in 4016) [ClassicSimilarity], result of:
            0.034316793 = score(doc=4016,freq=1.0), product of:
              0.10983017 = queryWeight, product of:
                1.1981593 = boost
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.022919867 = queryNorm
              0.31245324 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.017459061 = weight(abstract_txt:from in 4016) [ClassicSimilarity], result of:
            0.017459061 = score(doc=4016,freq=1.0), product of:
              0.080123805 = queryWeight, product of:
                1.2533726 = boost
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.022919867 = queryNorm
              0.21790105 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.07534292 = weight(abstract_txt:automatic in 4016) [ClassicSimilarity], result of:
            0.07534292 = score(doc=4016,freq=1.0), product of:
              0.18552916 = queryWeight, product of:
                1.5572554 = boost
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.022919867 = queryNorm
              0.40609747 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1980476 = idf(docFreq=639, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.09794329 = weight(abstract_txt:texts in 4016) [ClassicSimilarity], result of:
            0.09794329 = score(doc=4016,freq=1.0), product of:
              0.22098714 = queryWeight, product of:
                1.6995629 = boost
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.022919867 = queryNorm
              0.4432081 = fieldWeight in 4016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6730638 = idf(docFreq=397, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
          0.09252685 = weight(abstract_txt:text in 4016) [ClassicSimilarity], result of:
            0.09252685 = score(doc=4016,freq=3.0), product of:
              0.16886996 = queryWeight, product of:
                1.8195986 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.022919867 = queryNorm
              0.5479178 = fieldWeight in 4016, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.078125 = fieldNorm(doc=4016)
        0.32 = coord(8/25)
    
  5. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.17
    0.17206082 = sum of:
      0.17206082 = product of:
        0.6145029 = sum of:
          0.074694626 = weight(abstract_txt:assigning in 2809) [ClassicSimilarity], result of:
            0.074694626 = score(doc=2809,freq=1.0), product of:
              0.1698922 = queryWeight, product of:
                1.0537206 = boost
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.022919867 = queryNorm
              0.43965894 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.034543 = idf(docFreq=101, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.08145521 = weight(abstract_txt:classifier in 2809) [ClassicSimilarity], result of:
            0.08145521 = score(doc=2809,freq=1.0), product of:
              0.17999473 = queryWeight, product of:
                1.0845976 = boost
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.022919867 = queryNorm
              0.4525422 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.240675 = idf(docFreq=82, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.20774136 = weight(abstract_txt:classifiers in 2809) [ClassicSimilarity], result of:
            0.20774136 = score(doc=2809,freq=5.0), product of:
              0.19648944 = queryWeight, product of:
                1.1332047 = boost
                7.5651712 = idf(docFreq=59, maxDocs=42596)
                0.022919867 = queryNorm
              1.0572648 = fieldWeight in 2809, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5651712 = idf(docFreq=59, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.06724691 = weight(abstract_txt:classification in 2809) [ClassicSimilarity], result of:
            0.06724691 = score(doc=2809,freq=6.0), product of:
              0.10983017 = queryWeight, product of:
                1.1981593 = boost
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.022919867 = queryNorm
              0.6122808 = fieldWeight in 2809, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9994013 = idf(docFreq=2121, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.019752674 = weight(abstract_txt:from in 2809) [ClassicSimilarity], result of:
            0.019752674 = score(doc=2809,freq=2.0), product of:
              0.080123805 = queryWeight, product of:
                1.2533726 = boost
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.022919867 = queryNorm
              0.2465269 = fieldWeight in 2809, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7891335 = idf(docFreq=7117, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.12087585 = weight(abstract_txt:categories in 2809) [ClassicSimilarity], result of:
            0.12087585 = score(doc=2809,freq=4.0), product of:
              0.1858647 = queryWeight, product of:
                1.5586629 = boost
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.022919867 = queryNorm
              0.65034324 = fieldWeight in 2809, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.202746 = idf(docFreq=636, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
          0.04273632 = weight(abstract_txt:text in 2809) [ClassicSimilarity], result of:
            0.04273632 = score(doc=2809,freq=1.0), product of:
              0.16886996 = queryWeight, product of:
                1.8195986 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.022919867 = queryNorm
              0.25307238 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.0625 = fieldNorm(doc=2809)
        0.28 = coord(7/25)
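
The content-match scores combine per-term weights with a coordination factor. As a rough sketch, assuming each term contributes queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, the score of the last entry (doc 2809) can be recomputed from the numbers in the listing. The helper names are hypothetical; the boosts, frequencies, and queryNorm are copied from the explanation above.

```python
import math

QUERY_NORM = 0.022919867  # queryNorm as printed in the listing
MAX_DOCS = 42596          # maxDocs as printed in the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    # Per-term score = queryWeight * fieldWeight.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (boost, freq, docFreq) for the seven query terms matched in doc 2809.
terms_2809 = [
    (1.0537206, 1.0, 101),   # assigning
    (1.0845976, 1.0, 82),    # classifier
    (1.1332047, 5.0, 59),    # classifiers
    (1.1981593, 6.0, 2121),  # classification
    (1.2533726, 2.0, 7117),  # from
    (1.5586629, 4.0, 636),   # categories
    (1.8195986, 1.0, 2018),  # text
]
FIELD_NORM_2809 = 0.0625

# coord = matched query terms / total query terms (7 of 25 here).
coord = 7 / 25
term_sum = sum(term_score(b, f, df, FIELD_NORM_2809) for b, f, df in terms_2809)
total = coord * term_sum
print(round(total, 2))
```

With these inputs the result rounds to the 0.17 displayed next to the record title. Small differences in the later decimal places are expected, because Lucene computes in single precision and the boosts in the listing are already rounded.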