Document (#34453)

Author
Ko, Y.
Seo, J.
Title
Text classification from unlabeled documents with bootstrapping and feature projection techniques
Source
Information processing and management. 45(2009) no.1, S.70-83
Year
2009
Abstract
Many machine learning algorithms have been applied to text classification tasks. In the machine learning paradigm, a general inductive process automatically builds a text classifier by learning, generally known as supervised learning. However, the supervised learning approaches have some problems. The most notable problem is that they require a large number of labeled training documents for accurate learning. While unlabeled documents are easily collected and plentiful, labeled documents are difficultly generated because a labeling task must be done by human developers. In this paper, we propose a new text classification method based on unsupervised or semi-supervised learning. The proposed method launches text classification tasks with only unlabeled documents and the title word of each category for learning, and then it automatically learns text classifier by using bootstrapping and feature projection techniques. The results of experiments showed that the proposed method achieved reasonably useful performance compared to a supervised method. If the proposed method is used in a text classification task, building text classification systems will become significantly faster and less expensive.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Li, M.; Li, H.; Zhou, Z.-H.: Semi-supervised document retrieval (2009) 0.60
    0.59720963 = sum of:
      0.59720963 = product of:
        1.493024 = sum of:
          0.047757905 = weight(abstract_txt:unsupervised in 4218) [ClassicSimilarity], result of:
            0.047757905 = score(doc=4218,freq=1.0), product of:
              0.10028762 = queryWeight, product of:
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.013162228 = queryNorm
              0.47620937 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.07317686 = weight(abstract_txt:labeling in 4218) [ClassicSimilarity], result of:
            0.07317686 = score(doc=4218,freq=2.0), product of:
              0.10579286 = queryWeight, product of:
                1.0270805 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.013162228 = queryNorm
              0.69169945 = fieldWeight in 4218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.055007163 = weight(abstract_txt:machine in 4218) [ClassicSimilarity], result of:
            0.055007163 = score(doc=4218,freq=3.0), product of:
              0.09626452 = queryWeight, product of:
                1.3855572 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.013162228 = queryNorm
              0.5714168 = fieldWeight in 4218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.031719144 = weight(abstract_txt:proposed in 4218) [ClassicSimilarity], result of:
            0.031719144 = score(doc=4218,freq=1.0), product of:
              0.11010453 = queryWeight, product of:
                1.8148451 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.013162228 = queryNorm
              0.2880821 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.18614002 = weight(abstract_txt:labeled in 4218) [ClassicSimilarity], result of:
            0.18614002 = score(doc=4218,freq=4.0), product of:
              0.19713648 = queryWeight, product of:
                1.9827813 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013162228 = queryNorm
              0.94421905 = fieldWeight in 4218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.053441674 = weight(abstract_txt:documents in 4218) [ClassicSimilarity], result of:
            0.053441674 = score(doc=4218,freq=2.0), product of:
              0.1467069 = queryWeight, product of:
                2.7044976 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.013162228 = queryNorm
              0.36427513 = fieldWeight in 4218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.110067494 = weight(abstract_txt:method in 4218) [ClassicSimilarity], result of:
            0.110067494 = score(doc=4218,freq=5.0), product of:
              0.17498058 = queryWeight, product of:
                2.9536312 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.013162228 = queryNorm
              0.6290269 = fieldWeight in 4218, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.3797706 = weight(abstract_txt:unlabeled in 4218) [ClassicSimilarity], result of:
            0.3797706 = score(doc=4218,freq=2.0), product of:
              0.45736384 = queryWeight, product of:
                3.6988597 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.013162228 = queryNorm
              0.8303468 = fieldWeight in 4218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.31089535 = weight(abstract_txt:supervised in 4218) [ClassicSimilarity], result of:
            0.31089535 = score(doc=4218,freq=3.0), product of:
              0.3848335 = queryWeight, product of:
                3.9178045 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.013162228 = queryNorm
              0.80786973 = fieldWeight in 4218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.24504781 = weight(abstract_txt:learning in 4218) [ClassicSimilarity], result of:
            0.24504781 = score(doc=4218,freq=7.0), product of:
              0.31192368 = queryWeight, product of:
                4.988219 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013162228 = queryNorm
              0.7856018 = fieldWeight in 4218, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
        0.4 = coord(10/25)
    
  2. Suakkaphong, N.; Zhang, Z.; Chen, H.: Disease named entity recognition using semisupervised learning and conditional random fields (2011) 0.52
    0.5189199 = sum of:
      0.5189199 = product of:
        1.2972997 = sum of:
          0.051743854 = weight(abstract_txt:labeling in 4367) [ClassicSimilarity], result of:
            0.051743854 = score(doc=4367,freq=1.0), product of:
              0.10579286 = queryWeight, product of:
                1.0270805 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.013162228 = queryNorm
              0.48910537 = fieldWeight in 4367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.028384931 = weight(abstract_txt:techniques in 4367) [ClassicSimilarity], result of:
            0.028384931 = score(doc=4367,freq=2.0), product of:
              0.07089393 = queryWeight, product of:
                1.1890383 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013162228 = queryNorm
              0.40038592 = fieldWeight in 4367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.025580605 = weight(abstract_txt:task in 4367) [ClassicSimilarity], result of:
            0.025580605 = score(doc=4367,freq=1.0), product of:
              0.08333633 = queryWeight, product of:
                1.289165 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.013162228 = queryNorm
              0.30695623 = fieldWeight in 4367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.04436706 = weight(abstract_txt:feature in 4367) [ClassicSimilarity], result of:
            0.04436706 = score(doc=4367,freq=1.0), product of:
              0.12030055 = queryWeight, product of:
                1.5489062 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.013162228 = queryNorm
              0.36880183 = fieldWeight in 4367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.09307001 = weight(abstract_txt:labeled in 4367) [ClassicSimilarity], result of:
            0.09307001 = score(doc=4367,freq=1.0), product of:
              0.19713648 = queryWeight, product of:
                1.9827813 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013162228 = queryNorm
              0.47210953 = fieldWeight in 4367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.2831268 = weight(abstract_txt:bootstrapping in 4367) [ClassicSimilarity], result of:
            0.2831268 = score(doc=4367,freq=2.0), product of:
              0.32850185 = queryWeight, product of:
                2.55953 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.013162228 = queryNorm
              0.8618728 = fieldWeight in 4367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.3797706 = weight(abstract_txt:unlabeled in 4367) [ClassicSimilarity], result of:
            0.3797706 = score(doc=4367,freq=2.0), product of:
              0.45736384 = queryWeight, product of:
                3.6988597 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.013162228 = queryNorm
              0.8303468 = fieldWeight in 4367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.17949551 = weight(abstract_txt:supervised in 4367) [ClassicSimilarity], result of:
            0.17949551 = score(doc=4367,freq=1.0), product of:
              0.3848335 = queryWeight, product of:
                3.9178045 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.013162228 = queryNorm
              0.4664238 = fieldWeight in 4367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.08077686 = weight(abstract_txt:text in 4367) [ClassicSimilarity], result of:
            0.08077686 = score(doc=4367,freq=2.0), product of:
              0.22599308 = queryWeight, product of:
                4.2458916 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013162228 = queryNorm
              0.3574307 = fieldWeight in 4367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
          0.13098356 = weight(abstract_txt:learning in 4367) [ClassicSimilarity], result of:
            0.13098356 = score(doc=4367,freq=2.0), product of:
              0.31192368 = queryWeight, product of:
                4.988219 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013162228 = queryNorm
              0.41992182 = fieldWeight in 4367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=4367)
        0.4 = coord(10/25)
    
  3. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.35
    0.3458966 = sum of:
      0.3458966 = product of:
        0.9608239 = sum of:
          0.02238303 = weight(abstract_txt:task in 4095) [ClassicSimilarity], result of:
            0.02238303 = score(doc=4095,freq=1.0), product of:
              0.08333633 = queryWeight, product of:
                1.289165 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.013162228 = queryNorm
              0.2685867 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.027788602 = weight(abstract_txt:machine in 4095) [ClassicSimilarity], result of:
            0.027788602 = score(doc=4095,freq=1.0), product of:
              0.09626452 = queryWeight, product of:
                1.3855572 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.013162228 = queryNorm
              0.2886692 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.02775425 = weight(abstract_txt:proposed in 4095) [ClassicSimilarity], result of:
            0.02775425 = score(doc=4095,freq=1.0), product of:
              0.11010453 = queryWeight, product of:
                1.8148451 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.013162228 = queryNorm
              0.25207183 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.081436254 = weight(abstract_txt:labeled in 4095) [ClassicSimilarity], result of:
            0.081436254 = score(doc=4095,freq=1.0), product of:
              0.19713648 = queryWeight, product of:
                1.9827813 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013162228 = queryNorm
              0.41309583 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.06091119 = weight(abstract_txt:method in 4095) [ClassicSimilarity], result of:
            0.06091119 = score(doc=4095,freq=2.0), product of:
              0.17498058 = queryWeight, product of:
                2.9536312 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.013162228 = queryNorm
              0.34810257 = fieldWeight in 4095, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.1081853 = weight(abstract_txt:classification in 4095) [ClassicSimilarity], result of:
            0.1081853 = score(doc=4095,freq=9.0), product of:
              0.16518104 = queryWeight, product of:
                3.1436346 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.013162228 = queryNorm
              0.65494984 = fieldWeight in 4095, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.35119367 = weight(abstract_txt:supervised in 4095) [ClassicSimilarity], result of:
            0.35119367 = score(doc=4095,freq=5.0), product of:
              0.3848335 = queryWeight, product of:
                3.9178045 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.013162228 = queryNorm
              0.912586 = fieldWeight in 4095, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.09995627 = weight(abstract_txt:text in 4095) [ClassicSimilarity], result of:
            0.09995627 = score(doc=4095,freq=4.0), product of:
              0.22599308 = queryWeight, product of:
                4.2458916 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013162228 = queryNorm
              0.4422979 = fieldWeight in 4095, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.1812153 = weight(abstract_txt:learning in 4095) [ClassicSimilarity], result of:
            0.1812153 = score(doc=4095,freq=5.0), product of:
              0.31192368 = queryWeight, product of:
                4.988219 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013162228 = queryNorm
              0.5809604 = fieldWeight in 4095, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
        0.36 = coord(9/25)
    
  4. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.30
    0.29860428 = sum of:
      0.29860428 = product of:
        0.82945627 = sum of:
          0.025088971 = weight(abstract_txt:techniques in 87) [ClassicSimilarity], result of:
            0.025088971 = score(doc=87,freq=1.0), product of:
              0.07089393 = queryWeight, product of:
                1.1890383 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013162228 = queryNorm
              0.3538945 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.06409511 = weight(abstract_txt:automatically in 87) [ClassicSimilarity], result of:
            0.06409511 = score(doc=87,freq=2.0), product of:
              0.10515431 = queryWeight, product of:
                1.4481211 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.013162228 = queryNorm
              0.60953385 = fieldWeight in 87, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.03964893 = weight(abstract_txt:proposed in 87) [ClassicSimilarity], result of:
            0.03964893 = score(doc=87,freq=1.0), product of:
              0.11010453 = queryWeight, product of:
                1.8148451 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.013162228 = queryNorm
              0.36010262 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.10254258 = weight(abstract_txt:classifier in 87) [ClassicSimilarity], result of:
            0.10254258 = score(doc=87,freq=1.0), product of:
              0.18122716 = queryWeight, product of:
                1.9010913 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.013162228 = queryNorm
              0.56582344 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.20150247 = weight(abstract_txt:labeled in 87) [ClassicSimilarity], result of:
            0.20150247 = score(doc=87,freq=3.0), product of:
              0.19713648 = queryWeight, product of:
                1.9827813 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013162228 = queryNorm
              1.022147 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.10562339 = weight(abstract_txt:documents in 87) [ClassicSimilarity], result of:
            0.10562339 = score(doc=87,freq=5.0), product of:
              0.1467069 = queryWeight, product of:
                2.7044976 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.013162228 = queryNorm
              0.719962 = fieldWeight in 87, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.05151681 = weight(abstract_txt:classification in 87) [ClassicSimilarity], result of:
            0.05151681 = score(doc=87,freq=1.0), product of:
              0.16518104 = queryWeight, product of:
                3.1436346 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.013162228 = queryNorm
              0.3118809 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.12366381 = weight(abstract_txt:text in 87) [ClassicSimilarity], result of:
            0.12366381 = score(doc=87,freq=3.0), product of:
              0.22599308 = queryWeight, product of:
                4.2458916 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013162228 = queryNorm
              0.54720175 = fieldWeight in 87, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
          0.11577421 = weight(abstract_txt:learning in 87) [ClassicSimilarity], result of:
            0.11577421 = score(doc=87,freq=1.0), product of:
              0.31192368 = queryWeight, product of:
                4.988219 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013162228 = queryNorm
              0.37116197 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=87)
        0.36 = coord(9/25)
    
  5. Sebastiani, F.: Machine learning in automated text categorization (2002) 0.28
    0.28228846 = sum of:
      0.28228846 = product of:
        0.78413457 = sum of:
          0.06225382 = weight(abstract_txt:inductive in 3389) [ClassicSimilarity], result of:
            0.06225382 = score(doc=3389,freq=1.0), product of:
              0.10313067 = queryWeight, product of:
                1.0140754 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.013162228 = queryNorm
              0.60364026 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.025088971 = weight(abstract_txt:techniques in 3389) [ClassicSimilarity], result of:
            0.025088971 = score(doc=3389,freq=1.0), product of:
              0.07089393 = queryWeight, product of:
                1.1890383 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013162228 = queryNorm
              0.3538945 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.056141455 = weight(abstract_txt:machine in 3389) [ClassicSimilarity], result of:
            0.056141455 = score(doc=3389,freq=2.0), product of:
              0.09626452 = queryWeight, product of:
                1.3855572 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.013162228 = queryNorm
              0.58319986 = fieldWeight in 3389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.045322087 = weight(abstract_txt:automatically in 3389) [ClassicSimilarity], result of:
            0.045322087 = score(doc=3389,freq=1.0), product of:
              0.10515431 = queryWeight, product of:
                1.4481211 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.013162228 = queryNorm
              0.4310055 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.20508516 = weight(abstract_txt:classifier in 3389) [ClassicSimilarity], result of:
            0.20508516 = score(doc=3389,freq=4.0), product of:
              0.18122716 = queryWeight, product of:
                1.9010913 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.013162228 = queryNorm
              1.1316469 = fieldWeight in 3389, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.06680209 = weight(abstract_txt:documents in 3389) [ClassicSimilarity], result of:
            0.06680209 = score(doc=3389,freq=2.0), product of:
              0.1467069 = queryWeight, product of:
                2.7044976 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.013162228 = queryNorm
              0.4553439 = fieldWeight in 3389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.05151681 = weight(abstract_txt:classification in 3389) [ClassicSimilarity], result of:
            0.05151681 = score(doc=3389,freq=1.0), product of:
              0.16518104 = queryWeight, product of:
                3.1436346 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.013162228 = queryNorm
              0.3118809 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.071397334 = weight(abstract_txt:text in 3389) [ClassicSimilarity], result of:
            0.071397334 = score(doc=3389,freq=1.0), product of:
              0.22599308 = queryWeight, product of:
                4.2458916 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013162228 = queryNorm
              0.3159271 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
          0.20052679 = weight(abstract_txt:learning in 3389) [ClassicSimilarity], result of:
            0.20052679 = score(doc=3389,freq=3.0), product of:
              0.31192368 = queryWeight, product of:
                4.988219 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.013162228 = queryNorm
              0.6428713 = fieldWeight in 3389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=3389)
        0.36 = coord(9/25)