Document (#32088)

Author
Hung, C.-M.
Chien, L.-F.
Title
Web-based text classification in the absence of manually labeled training documents
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.1, S.88-96
Year
2007
Abstract
Most text classification techniques assume that manually labeled documents (corpora) can be easily obtained while learning text classifiers. However, labeled training documents are sometimes unavailable or inadequate even if they are available. The goal of this article is to present a self-learned approach to extract high-quality training documents from the Web when the required manually labeled documents are unavailable or of poor quality. To learn a text classifier automatically, we need only a set of user-defined categories and some highly related keywords. Extensive experiments are conducted to evaluate the performance of the proposed approach using the test set from the Reuters-21578 news data set. The experiments show that very promising results can be achieved only by using automatically extracted documents from the Web.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.39
    0.38569912 = sum of:
      0.38569912 = product of:
        0.9642478 = sum of:
          0.012891208 = weight(abstract_txt:using in 2452) [ClassicSimilarity], result of:
            0.012891208 = score(doc=2452,freq=1.0), product of:
              0.059558842 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01636549 = queryNorm
              0.21644491 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.08337758 = weight(abstract_txt:classifier in 2452) [ClassicSimilarity], result of:
            0.08337758 = score(doc=2452,freq=2.0), product of:
              0.13024569 = queryWeight, product of:
                1.0988626 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.01636549 = queryNorm
              0.64015615 = fieldWeight in 2452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.044154268 = weight(abstract_txt:classification in 2452) [ClassicSimilarity], result of:
            0.044154268 = score(doc=2452,freq=5.0), product of:
              0.07914235 = queryWeight, product of:
                1.2113823 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01636549 = queryNorm
              0.5579095 = fieldWeight in 2452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.023569532 = weight(abstract_txt:only in 2452) [ClassicSimilarity], result of:
            0.023569532 = score(doc=2452,freq=1.0), product of:
              0.089053534 = queryWeight, product of:
                1.2849976 = boost
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.01636549 = queryNorm
              0.264667 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.047008354 = weight(abstract_txt:experiments in 2452) [ClassicSimilarity], result of:
            0.047008354 = score(doc=2452,freq=1.0), product of:
              0.14110222 = queryWeight, product of:
                1.6174977 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01636549 = queryNorm
              0.33315104 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.073702954 = weight(abstract_txt:automatically in 2452) [ClassicSimilarity], result of:
            0.073702954 = score(doc=2452,freq=2.0), product of:
              0.15114616 = queryWeight, product of:
                1.6740766 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01636549 = queryNorm
              0.48762706 = fieldWeight in 2452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.062198482 = weight(abstract_txt:training in 2452) [ClassicSimilarity], result of:
            0.062198482 = score(doc=2452,freq=1.0), product of:
              0.19467078 = queryWeight, product of:
                2.3268733 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01636549 = queryNorm
              0.319506 = fieldWeight in 2452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.108607806 = weight(abstract_txt:text in 2452) [ClassicSimilarity], result of:
            0.108607806 = score(doc=2452,freq=7.0), product of:
              0.16241838 = queryWeight, product of:
                2.4541965 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01636549 = queryNorm
              0.6686916 = fieldWeight in 2452, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.1303606 = weight(abstract_txt:documents in 2452) [ClassicSimilarity], result of:
            0.1303606 = score(doc=2452,freq=4.0), product of:
              0.25304738 = queryWeight, product of:
                3.751788 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01636549 = queryNorm
              0.5151628 = fieldWeight in 2452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
          0.37837705 = weight(abstract_txt:labeled in 2452) [ClassicSimilarity], result of:
            0.37837705 = score(doc=2452,freq=2.0), product of:
              0.56671804 = queryWeight, product of:
                4.5843234 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.01636549 = queryNorm
              0.6676637 = fieldWeight in 2452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=2452)
        0.4 = coord(10/25)
    
  2. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.34
    0.336613 = sum of:
      0.336613 = product of:
        0.84153247 = sum of:
          0.016114011 = weight(abstract_txt:using in 2697) [ClassicSimilarity], result of:
            0.016114011 = score(doc=2697,freq=1.0), product of:
              0.059558842 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01636549 = queryNorm
              0.27055615 = fieldWeight in 2697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.10422198 = weight(abstract_txt:classifier in 2697) [ClassicSimilarity], result of:
            0.10422198 = score(doc=2697,freq=2.0), product of:
              0.13024569 = queryWeight, product of:
                1.0988626 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.01636549 = queryNorm
              0.8001952 = fieldWeight in 2697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.14302726 = weight(abstract_txt:classifiers in 2697) [ClassicSimilarity], result of:
            0.14302726 = score(doc=2697,freq=3.0), product of:
              0.14050959 = queryWeight, product of:
                1.1413392 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01636549 = queryNorm
              1.0179181 = fieldWeight in 2697, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.04936597 = weight(abstract_txt:classification in 2697) [ClassicSimilarity], result of:
            0.04936597 = score(doc=2697,freq=4.0), product of:
              0.07914235 = queryWeight, product of:
                1.2113823 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01636549 = queryNorm
              0.6237618 = fieldWeight in 2697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.017376672 = weight(abstract_txt:from in 2697) [ClassicSimilarity], result of:
            0.017376672 = score(doc=2697,freq=2.0), product of:
              0.056903895 = queryWeight, product of:
                1.2580369 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01636549 = queryNorm
              0.30536878 = fieldWeight in 2697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.055268157 = weight(abstract_txt:quality in 2697) [ClassicSimilarity], result of:
            0.055268157 = score(doc=2697,freq=2.0), product of:
              0.1075104 = queryWeight, product of:
                1.4118936 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.01636549 = queryNorm
              0.51407266 = fieldWeight in 2697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.06514483 = weight(abstract_txt:automatically in 2697) [ClassicSimilarity], result of:
            0.06514483 = score(doc=2697,freq=1.0), product of:
              0.15114616 = queryWeight, product of:
                1.6740766 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01636549 = queryNorm
              0.4310055 = fieldWeight in 2697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.1554962 = weight(abstract_txt:training in 2697) [ClassicSimilarity], result of:
            0.1554962 = score(doc=2697,freq=4.0), product of:
              0.19467078 = queryWeight, product of:
                2.3268733 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01636549 = queryNorm
              0.79876494 = fieldWeight in 2697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.07256664 = weight(abstract_txt:text in 2697) [ClassicSimilarity], result of:
            0.07256664 = score(doc=2697,freq=2.0), product of:
              0.16241838 = queryWeight, product of:
                2.4541965 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01636549 = queryNorm
              0.44678837 = fieldWeight in 2697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
          0.16295075 = weight(abstract_txt:documents in 2697) [ClassicSimilarity], result of:
            0.16295075 = score(doc=2697,freq=4.0), product of:
              0.25304738 = queryWeight, product of:
                3.751788 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01636549 = queryNorm
              0.64395356 = fieldWeight in 2697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2697)
        0.4 = coord(10/25)
    
  3. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.30
    0.2996577 = sum of:
      0.2996577 = product of:
        0.74914426 = sum of:
          0.058956847 = weight(abstract_txt:classifier in 1808) [ClassicSimilarity], result of:
            0.058956847 = score(doc=1808,freq=1.0), product of:
              0.13024569 = queryWeight, product of:
                1.0988626 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.01636549 = queryNorm
              0.45265874 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.062362663 = weight(abstract_txt:assume in 1808) [ClassicSimilarity], result of:
            0.062362663 = score(doc=1808,freq=1.0), product of:
              0.13521461 = queryWeight, product of:
                1.1196275 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.01636549 = queryNorm
              0.46121246 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.14771792 = weight(abstract_txt:classifiers in 1808) [ClassicSimilarity], result of:
            0.14771792 = score(doc=1808,freq=5.0), product of:
              0.14050959 = queryWeight, product of:
                1.1413392 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01636549 = queryNorm
              1.0513014 = fieldWeight in 1808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.06731419 = weight(abstract_txt:reuters in 1808) [ClassicSimilarity], result of:
            0.06731419 = score(doc=1808,freq=1.0), product of:
              0.14228036 = queryWeight, product of:
                1.1485084 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.01636549 = queryNorm
              0.47310954 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.048368573 = weight(abstract_txt:classification in 1808) [ClassicSimilarity], result of:
            0.048368573 = score(doc=1808,freq=6.0), product of:
              0.07914235 = queryWeight, product of:
                1.2113823 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01636549 = queryNorm
              0.6111592 = fieldWeight in 1808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.013901338 = weight(abstract_txt:from in 1808) [ClassicSimilarity], result of:
            0.013901338 = score(doc=1808,freq=2.0), product of:
              0.056903895 = queryWeight, product of:
                1.2580369 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01636549 = queryNorm
              0.24429502 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.15081404 = weight(abstract_txt:21578 in 1808) [ClassicSimilarity], result of:
            0.15081404 = score(doc=1808,freq=1.0), product of:
              0.24361369 = queryWeight, product of:
                1.5028394 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01636549 = queryNorm
              0.6190705 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.06647985 = weight(abstract_txt:experiments in 1808) [ClassicSimilarity], result of:
            0.06647985 = score(doc=1808,freq=2.0), product of:
              0.14110222 = queryWeight, product of:
                1.6174977 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01636549 = queryNorm
              0.4711467 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.04104989 = weight(abstract_txt:text in 1808) [ClassicSimilarity], result of:
            0.04104989 = score(doc=1808,freq=1.0), product of:
              0.16241838 = queryWeight, product of:
                2.4541965 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01636549 = queryNorm
              0.25274166 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.092178866 = weight(abstract_txt:documents in 1808) [ClassicSimilarity], result of:
            0.092178866 = score(doc=1808,freq=2.0), product of:
              0.25304738 = queryWeight, product of:
                3.751788 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01636549 = queryNorm
              0.36427513 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
        0.4 = coord(10/25)
    
  4. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 0.24
    0.23667994 = sum of:
      0.23667994 = product of:
        0.59169984 = sum of:
          0.012891208 = weight(abstract_txt:using in 5172) [ClassicSimilarity], result of:
            0.012891208 = score(doc=5172,freq=1.0), product of:
              0.059558842 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01636549 = queryNorm
              0.21644491 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.02824371 = weight(abstract_txt:approach in 5172) [ClassicSimilarity], result of:
            0.02824371 = score(doc=5172,freq=3.0), product of:
              0.069661245 = queryWeight, product of:
                1.1365076 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.01636549 = queryNorm
              0.40544364 = fieldWeight in 5172, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.09342501 = weight(abstract_txt:classifiers in 5172) [ClassicSimilarity], result of:
            0.09342501 = score(doc=5172,freq=2.0), product of:
              0.14050959 = queryWeight, product of:
                1.1413392 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01636549 = queryNorm
              0.6649013 = fieldWeight in 5172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.019659461 = weight(abstract_txt:from in 5172) [ClassicSimilarity], result of:
            0.019659461 = score(doc=5172,freq=4.0), product of:
              0.056903895 = queryWeight, product of:
                1.2580369 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01636549 = queryNorm
              0.34548533 = fieldWeight in 5172, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.023569532 = weight(abstract_txt:only in 5172) [ClassicSimilarity], result of:
            0.023569532 = score(doc=5172,freq=1.0), product of:
              0.089053534 = queryWeight, product of:
                1.2849976 = boost
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.01636549 = queryNorm
              0.264667 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.05211586 = weight(abstract_txt:automatically in 5172) [ClassicSimilarity], result of:
            0.05211586 = score(doc=5172,freq=1.0), product of:
              0.15114616 = queryWeight, product of:
                1.6740766 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01636549 = queryNorm
              0.3448044 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.062198482 = weight(abstract_txt:training in 5172) [ClassicSimilarity], result of:
            0.062198482 = score(doc=5172,freq=1.0), product of:
              0.19467078 = queryWeight, product of:
                2.3268733 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01636549 = queryNorm
              0.319506 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.04104989 = weight(abstract_txt:text in 5172) [ClassicSimilarity], result of:
            0.04104989 = score(doc=5172,freq=1.0), product of:
              0.16241838 = queryWeight, product of:
                2.4541965 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01636549 = queryNorm
              0.25274166 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.1933664 = weight(abstract_txt:manually in 5172) [ClassicSimilarity], result of:
            0.1933664 = score(doc=5172,freq=2.0), product of:
              0.32912302 = queryWeight, product of:
                3.02553 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.01636549 = queryNorm
              0.5875201 = fieldWeight in 5172, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
          0.0651803 = weight(abstract_txt:documents in 5172) [ClassicSimilarity], result of:
            0.0651803 = score(doc=5172,freq=1.0), product of:
              0.25304738 = queryWeight, product of:
                3.751788 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01636549 = queryNorm
              0.2575814 = fieldWeight in 5172, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=5172)
        0.4 = coord(10/25)
    
  5. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.23
    0.2252697 = sum of:
      0.2252697 = product of:
        0.56317425 = sum of:
          0.022328228 = weight(abstract_txt:using in 2804) [ClassicSimilarity], result of:
            0.022328228 = score(doc=2804,freq=3.0), product of:
              0.059558842 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.01636549 = queryNorm
              0.37489358 = fieldWeight in 2804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.13183151 = weight(abstract_txt:classifier in 2804) [ClassicSimilarity], result of:
            0.13183151 = score(doc=2804,freq=5.0), product of:
              0.13024569 = queryWeight, product of:
                1.0988626 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.01636549 = queryNorm
              1.0121757 = fieldWeight in 2804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.02306089 = weight(abstract_txt:approach in 2804) [ClassicSimilarity], result of:
            0.02306089 = score(doc=2804,freq=2.0), product of:
              0.069661245 = queryWeight, product of:
                1.1365076 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.01636549 = queryNorm
              0.33104333 = fieldWeight in 2804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.019746387 = weight(abstract_txt:classification in 2804) [ClassicSimilarity], result of:
            0.019746387 = score(doc=2804,freq=1.0), product of:
              0.07914235 = queryWeight, product of:
                1.2113823 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01636549 = queryNorm
              0.2495047 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.009829731 = weight(abstract_txt:from in 2804) [ClassicSimilarity], result of:
            0.009829731 = score(doc=2804,freq=1.0), product of:
              0.056903895 = queryWeight, product of:
                1.2580369 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01636549 = queryNorm
              0.17274266 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.023569532 = weight(abstract_txt:only in 2804) [ClassicSimilarity], result of:
            0.023569532 = score(doc=2804,freq=1.0), product of:
              0.089053534 = queryWeight, product of:
                1.2849976 = boost
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.01636549 = queryNorm
              0.264667 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.234672 = idf(docFreq=1740, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.05211586 = weight(abstract_txt:automatically in 2804) [ClassicSimilarity], result of:
            0.05211586 = score(doc=2804,freq=1.0), product of:
              0.15114616 = queryWeight, product of:
                1.6740766 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01636549 = queryNorm
              0.3448044 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.087961935 = weight(abstract_txt:training in 2804) [ClassicSimilarity], result of:
            0.087961935 = score(doc=2804,freq=2.0), product of:
              0.19467078 = queryWeight, product of:
                2.3268733 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01636549 = queryNorm
              0.4518497 = fieldWeight in 2804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.10055129 = weight(abstract_txt:text in 2804) [ClassicSimilarity], result of:
            0.10055129 = score(doc=2804,freq=6.0), product of:
              0.16241838 = queryWeight, product of:
                2.4541965 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01636549 = queryNorm
              0.6190881 = fieldWeight in 2804, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.092178866 = weight(abstract_txt:documents in 2804) [ClassicSimilarity], result of:
            0.092178866 = score(doc=2804,freq=2.0), product of:
              0.25304738 = queryWeight, product of:
                3.751788 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01636549 = queryNorm
              0.36427513 = fieldWeight in 2804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
        0.4 = coord(10/25)