Document (#34806)

Author
Mengle, S.S.R.
Goharian, N.
Title
Ambiguity measure feature-selection algorithm
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.5, pp. 1037-1050
Year
2009
Abstract
With the increasing number of digital documents, the ability to automatically classify those documents both efficiently and accurately is becoming more critical and difficult. One of the major problems in text classification is the high dimensionality of the feature space. We present the ambiguity measure (AM) feature-selection algorithm, which selects the most unambiguous features from the feature set. Unambiguous features are those features whose presence in a document indicates a strong degree of confidence that the document belongs to only one specific category. We apply AM feature selection on a naïve Bayes text classifier. We show the effectiveness of our approach in outperforming eight existing feature-selection methods, using five benchmark datasets with a statistical significance of at least 95% confidence. The support vector machine (SVM) text classifier is shown to perform consistently better than the naïve Bayes text classifier. The drawback, however, is the time complexity in training a model. We further explore the effect of using the AM feature-selection method on an SVM text classifier. Our results indicate that the training time for the SVM algorithm can be reduced by more than 50%, while still improving the accuracy of the text classifier. We show the effectiveness of our approach by demonstrating that it statistically significantly (99% confidence) outperforms eight existing feature-selection methods using four standard benchmark datasets.
Theme
Automatic classification
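
The abstract describes unambiguous features as terms whose presence points strongly to a single category. The following is a minimal Python sketch of one plausible reading of that idea, not the authors' published implementation: it assumes a term's score is the largest share of its occurrences that falls in any one category, and it keeps terms whose score meets a threshold. The function names, the toy documents, and the threshold of 0.8 are all illustrative assumptions.

```python
from collections import Counter, defaultdict


def ambiguity_measure(docs):
    """Score each term by how concentrated it is in a single category.

    docs: iterable of (category, list_of_tokens) pairs.
    Returns {term: (score, category)} where score is the largest fraction
    of the term's occurrences found in any one category (assumed definition).
    """
    per_category = defaultdict(Counter)  # category -> term counts
    total = Counter()                    # term counts over all categories
    for category, tokens in docs:
        per_category[category].update(tokens)
        total.update(tokens)

    scores = {}
    for term, n in total.items():
        best_category, best_count = max(
            ((c, counts[term]) for c, counts in per_category.items()),
            key=lambda pair: pair[1],
        )
        scores[term] = (best_count / n, best_category)
    return scores


def select_features(scores, threshold):
    """Keep terms whose score is at least the (hypothetical) threshold."""
    return {term for term, (score, _) in scores.items() if score >= threshold}


# Toy data, invented for illustration only.
docs = [
    ("sports", ["goal", "match", "team", "score"]),
    ("sports", ["team", "score", "season"]),
    ("finance", ["market", "score", "stock"]),
    ("finance", ["stock", "team", "market"]),
]
scores = ambiguity_measure(docs)
selected = select_features(scores, threshold=0.8)
print(sorted(selected))
```

In this toy corpus, "team" and "score" occur in both categories and fall below the threshold, while terms confined to one category are kept.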

Similar documents (author)

  1. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 4.80
    4.797702 = sum of:
      4.797702 = weight(author_txt:goharian in 4766) [ClassicSimilarity], result of:
        4.797702 = fieldWeight in 4766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.5 = fieldNorm(doc=4766)
    
  2. Mengle, S.S.R.; Goharian, N.: Detecting relationships among categories using text classification (2010) 4.80
    4.797702 = sum of:
      4.797702 = weight(author_txt:goharian in 463) [ClassicSimilarity], result of:
        4.797702 = fieldWeight in 463, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.5 = fieldNorm(doc=463)
    
  3. Urbain, J.; Goharian, N.; Frieder, O.: Probabilistic passage models for semantic search of genomics literature (2008) 3.60
    3.5982764 = sum of:
      3.5982764 = weight(author_txt:goharian in 4381) [ClassicSimilarity], result of:
        3.5982764 = fieldWeight in 4381, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.375 = fieldNorm(doc=4381)
    
  4. Soldaini, L.; Yates, A.; Goharian, N.: Learning to reformulate long queries for clinical decision support (2017) 3.60
    3.5982764 = sum of:
      3.5982764 = weight(author_txt:goharian in 5422) [ClassicSimilarity], result of:
        3.5982764 = fieldWeight in 5422, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.375 = fieldNorm(doc=5422)
    
  5. Cohan, A.; Young, S.; Yates, A.; Goharian, N.: Triaging content severity in online mental health forums (2017) 3.00
    2.9985638 = sum of:
      2.9985638 = weight(author_txt:goharian in 5395) [ClassicSimilarity], result of:
        2.9985638 = fieldWeight in 5395, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.595404 = idf(docFreq=7, maxDocs=43254)
          0.3125 = fieldNorm(doc=5395)
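
The author-match scores above are each a single fieldWeight, the product of tf, idf, and fieldNorm. The Python sketch below reproduces that arithmetic. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the numbers printed in this record; other Lucene versions may use a slightly different idf formula. The function names are illustrative and are not Lucene API calls.

```python
import math


def classic_tf(freq):
    # Term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Inverse document frequency, in the form that reproduces
    # idf(docFreq=7, maxDocs=43254) = 9.595404 from the record.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entry 1 (doc 4766): fieldNorm = 0.5
w1 = field_weight(freq=1.0, doc_freq=7, max_docs=43254, field_norm=0.5)
# Entry 3 (doc 4381): fieldNorm = 0.375
w3 = field_weight(freq=1.0, doc_freq=7, max_docs=43254, field_norm=0.375)
print(round(w1, 4), round(w3, 4))
```

Because tf and idf are the same for all five entries, the ranking here is driven entirely by fieldNorm, which is smaller for records with longer author fields.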
    

Similar documents (content)

  1. Maghsoodi, N.; Homayounpour, M.M.: Improving Farsi multiclass text classification using a thesaurus and two-stage feature selection (2011) 0.29
    0.2907725 = sum of:
      0.2907725 = product of:
        0.9086641 = sum of:
          0.047056403 = weight(abstract_txt:training in 1240) [ClassicSimilarity], result of:
            0.047056403 = score(doc=1240,freq=3.0), product of:
              0.08471917 = queryWeight, product of:
                1.1882863 = boost
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.013895183 = queryNorm
              0.55543983 = fieldWeight in 1240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.012639908 = weight(abstract_txt:using in 1240) [ClassicSimilarity], result of:
            0.012639908 = score(doc=1240,freq=1.0), product of:
              0.058228556 = queryWeight, product of:
                1.206546 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.013895183 = queryNorm
              0.21707405 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.031539757 = weight(abstract_txt:indicate in 1240) [ClassicSimilarity], result of:
            0.031539757 = score(doc=1240,freq=1.0), product of:
              0.093579754 = queryWeight, product of:
                1.2488813 = boost
                5.392578 = idf(docFreq=534, maxDocs=43254)
                0.013895183 = queryNorm
              0.33703613 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.392578 = idf(docFreq=534, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.04923343 = weight(abstract_txt:features in 1240) [ClassicSimilarity], result of:
            0.04923343 = score(doc=1240,freq=3.0), product of:
              0.09994776 = queryWeight, product of:
                1.580747 = boost
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.013895183 = queryNorm
              0.49259165 = fieldWeight in 1240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.04007477 = weight(abstract_txt:text in 1240) [ClassicSimilarity], result of:
            0.04007477 = score(doc=1240,freq=1.0), product of:
              0.15833032 = queryWeight, product of:
                2.8136683 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.013895183 = queryNorm
              0.25310862 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.16236912 = weight(abstract_txt:selection in 1240) [ClassicSimilarity], result of:
            0.16236912 = score(doc=1240,freq=3.0), product of:
              0.27900496 = queryWeight, product of:
                3.7350533 = boost
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.013895183 = queryNorm
              0.5819578 = fieldWeight in 1240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.19208862 = weight(abstract_txt:classifier in 1240) [ClassicSimilarity], result of:
            0.19208862 = score(doc=1240,freq=1.0), product of:
              0.4235689 = queryWeight, product of:
                4.2010927 = boost
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.013895183 = queryNorm
              0.45350027 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
          0.3736621 = weight(abstract_txt:feature in 1240) [ClassicSimilarity], result of:
            0.3736621 = score(doc=1240,freq=5.0), product of:
              0.45146665 = queryWeight, product of:
                5.486218 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.013895183 = queryNorm
              0.8276627 = fieldWeight in 1240, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.0625 = fieldNorm(doc=1240)
        0.32 = coord(8/25)
    
  2. Duwairi, R.M.: Machine learning for Arabic text categorization (2006) 0.24
    0.2372757 = sum of:
      0.2372757 = product of:
        0.9886488 = sum of:
          0.022036748 = weight(abstract_txt:show in 116) [ClassicSimilarity], result of:
            0.022036748 = score(doc=116,freq=1.0), product of:
              0.06349916 = queryWeight, product of:
                1.0287603 = boost
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.013895183 = queryNorm
              0.34704 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
          0.033960033 = weight(abstract_txt:training in 116) [ClassicSimilarity], result of:
            0.033960033 = score(doc=116,freq=1.0), product of:
              0.08471917 = queryWeight, product of:
                1.1882863 = boost
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.013895183 = queryNorm
              0.40085417 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
          0.05024866 = weight(abstract_txt:features in 116) [ClassicSimilarity], result of:
            0.05024866 = score(doc=116,freq=2.0), product of:
              0.09994776 = queryWeight, product of:
                1.580747 = boost
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.013895183 = queryNorm
              0.50274926 = fieldWeight in 116, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
          0.05009346 = weight(abstract_txt:text in 116) [ClassicSimilarity], result of:
            0.05009346 = score(doc=116,freq=1.0), product of:
              0.15833032 = queryWeight, product of:
                2.8136683 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.013895183 = queryNorm
              0.31638578 = fieldWeight in 116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
          0.536904 = weight(abstract_txt:classifier in 116) [ClassicSimilarity], result of:
            0.536904 = score(doc=116,freq=5.0), product of:
              0.4235689 = queryWeight, product of:
                4.2010927 = boost
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.013895183 = queryNorm
              1.2675717 = fieldWeight in 116, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
          0.29540583 = weight(abstract_txt:feature in 116) [ClassicSimilarity], result of:
            0.29540583 = score(doc=116,freq=2.0), product of:
              0.45146665 = queryWeight, product of:
                5.486218 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.013895183 = queryNorm
              0.6543248 = fieldWeight in 116, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.078125 = fieldNorm(doc=116)
        0.24 = coord(6/25)
    
  3. Malenica, M.; Smuc, T.; Snajder, J.; Basic, B.D.: Language morphology offset : text classification on a Croatian-English parallel corpus (2008) 0.22
    0.22432473 = sum of:
      0.22432473 = product of:
        0.9346864 = sum of:
          0.015799886 = weight(abstract_txt:using in 4036) [ClassicSimilarity], result of:
            0.015799886 = score(doc=4036,freq=1.0), product of:
              0.058228556 = queryWeight, product of:
                1.206546 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.013895183 = queryNorm
              0.27134258 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
          0.05024866 = weight(abstract_txt:features in 4036) [ClassicSimilarity], result of:
            0.05024866 = score(doc=4036,freq=2.0), product of:
              0.09994776 = queryWeight, product of:
                1.580747 = boost
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.013895183 = queryNorm
              0.50274926 = fieldWeight in 4036, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
          0.05009346 = weight(abstract_txt:text in 4036) [ClassicSimilarity], result of:
            0.05009346 = score(doc=4036,freq=1.0), product of:
              0.15833032 = queryWeight, product of:
                2.8136683 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.013895183 = queryNorm
              0.31638578 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
          0.11717982 = weight(abstract_txt:selection in 4036) [ClassicSimilarity], result of:
            0.11717982 = score(doc=4036,freq=1.0), product of:
              0.27900496 = queryWeight, product of:
                3.7350533 = boost
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.013895183 = queryNorm
              0.41999188 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
          0.3395679 = weight(abstract_txt:classifier in 4036) [ClassicSimilarity], result of:
            0.3395679 = score(doc=4036,freq=2.0), product of:
              0.4235689 = queryWeight, product of:
                4.2010927 = boost
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.013895183 = queryNorm
              0.8016828 = fieldWeight in 4036, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
          0.36179677 = weight(abstract_txt:feature in 4036) [ClassicSimilarity], result of:
            0.36179677 = score(doc=4036,freq=3.0), product of:
              0.45146665 = queryWeight, product of:
                5.486218 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.013895183 = queryNorm
              0.80138093 = fieldWeight in 4036, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.078125 = fieldNorm(doc=4036)
        0.24 = coord(6/25)
    
  4. Yoon, Y.; Lee, G.G.: Efficient implementation of associative classifiers for document classification (2007) 0.17
    0.16994771 = sum of:
      0.16994771 = product of:
        0.6069561 = sum of:
          0.017629398 = weight(abstract_txt:show in 2910) [ClassicSimilarity], result of:
            0.017629398 = score(doc=2910,freq=1.0), product of:
              0.06349916 = queryWeight, product of:
                1.0287603 = boost
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.013895183 = queryNorm
              0.277632 = fieldWeight in 2910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.054336052 = weight(abstract_txt:training in 2910) [ClassicSimilarity], result of:
            0.054336052 = score(doc=2910,freq=4.0), product of:
              0.08471917 = queryWeight, product of:
                1.1882863 = boost
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.013895183 = queryNorm
              0.64136666 = fieldWeight in 2910, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.012639908 = weight(abstract_txt:using in 2910) [ClassicSimilarity], result of:
            0.012639908 = score(doc=2910,freq=1.0), product of:
              0.058228556 = queryWeight, product of:
                1.206546 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.013895183 = queryNorm
              0.21707405 = fieldWeight in 2910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.06941154 = weight(abstract_txt:text in 2910) [ClassicSimilarity], result of:
            0.06941154 = score(doc=2910,freq=3.0), product of:
              0.15833032 = queryWeight, product of:
                2.8136683 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.013895183 = queryNorm
              0.438397 = fieldWeight in 2910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.09374385 = weight(abstract_txt:selection in 2910) [ClassicSimilarity], result of:
            0.09374385 = score(doc=2910,freq=1.0), product of:
              0.27900496 = queryWeight, product of:
                3.7350533 = boost
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.013895183 = queryNorm
              0.3359935 = fieldWeight in 2910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.19208862 = weight(abstract_txt:classifier in 2910) [ClassicSimilarity], result of:
            0.19208862 = score(doc=2910,freq=1.0), product of:
              0.4235689 = queryWeight, product of:
                4.2010927 = boost
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.013895183 = queryNorm
              0.45350027 = fieldWeight in 2910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
          0.16710678 = weight(abstract_txt:feature in 2910) [ClassicSimilarity], result of:
            0.16710678 = score(doc=2910,freq=1.0), product of:
              0.45146665 = queryWeight, product of:
                5.486218 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.013895183 = queryNorm
              0.370142 = fieldWeight in 2910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.0625 = fieldNorm(doc=2910)
        0.28 = coord(7/25)
    
  5. Hu, X.; Choi, K.; Downie, J.S.: A framework for evaluating multimodal music mood classification (2017) 0.17
    0.16696142 = sum of:
      0.16696142 = product of:
        0.59629077 = sum of:
          0.022036748 = weight(abstract_txt:show in 4819) [ClassicSimilarity], result of:
            0.022036748 = score(doc=4819,freq=1.0), product of:
              0.06349916 = queryWeight, product of:
                1.0287603 = boost
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.013895183 = queryNorm
              0.34704 = fieldWeight in 4819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.033960033 = weight(abstract_txt:training in 4819) [ClassicSimilarity], result of:
            0.033960033 = score(doc=4819,freq=1.0), product of:
              0.08471917 = queryWeight, product of:
                1.1882863 = boost
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.013895183 = queryNorm
              0.40085417 = fieldWeight in 4819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1309333 = idf(docFreq=694, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.027366202 = weight(abstract_txt:using in 4819) [ClassicSimilarity], result of:
            0.027366202 = score(doc=4819,freq=3.0), product of:
              0.058228556 = queryWeight, product of:
                1.206546 = boost
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.013895183 = queryNorm
              0.46997908 = fieldWeight in 4819, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4731848 = idf(docFreq=3646, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.05024866 = weight(abstract_txt:features in 4819) [ClassicSimilarity], result of:
            0.05024866 = score(doc=4819,freq=2.0), product of:
              0.09994776 = queryWeight, product of:
                1.580747 = boost
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.013895183 = queryNorm
              0.50274926 = fieldWeight in 4819, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550367 = idf(docFreq=1241, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.05009346 = weight(abstract_txt:text in 4819) [ClassicSimilarity], result of:
            0.05009346 = score(doc=4819,freq=1.0), product of:
              0.15833032 = queryWeight, product of:
                2.8136683 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.013895183 = queryNorm
              0.31638578 = fieldWeight in 4819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.11717982 = weight(abstract_txt:selection in 4819) [ClassicSimilarity], result of:
            0.11717982 = score(doc=4819,freq=1.0), product of:
              0.27900496 = queryWeight, product of:
                3.7350533 = boost
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.013895183 = queryNorm
              0.41999188 = fieldWeight in 4819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375896 = idf(docFreq=543, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
          0.29540583 = weight(abstract_txt:feature in 4819) [ClassicSimilarity], result of:
            0.29540583 = score(doc=4819,freq=2.0), product of:
              0.45146665 = queryWeight, product of:
                5.486218 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.013895183 = queryNorm
              0.6543248 = fieldWeight in 4819, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.078125 = fieldNorm(doc=4819)
        0.28 = coord(7/25)
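
The content-match scores above combine per-term scores (queryWeight times fieldWeight), sum them, and scale the sum by the coord factor. The Python sketch below recomputes the total for the first entry (doc 1240) from the boost, docFreq, and freq values printed in its tree. It assumes the same tf and idf formulas that reproduce the values in this record; the helper names are illustrative and are not Lucene API calls.

```python
import math

MAX_DOCS = 43254
QUERY_NORM = 0.013895183


def idf(doc_freq, max_docs=MAX_DOCS):
    # Form that matches the idf values printed in the record.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (boost, docFreq, freq) for the eight query terms matched in doc 1240.
matched_terms = {
    "training":   (1.1882863, 694, 3.0),
    "using":      (1.206546, 3646, 1.0),
    "indicate":   (1.2488813, 534, 1.0),
    "features":   (1.580747, 1241, 3.0),
    "text":       (2.8136683, 2048, 1.0),
    "selection":  (3.7350533, 543, 3.0),
    "classifier": (4.2010927, 82, 1.0),
    "feature":    (5.486218, 314, 5.0),
}

raw_sum = sum(
    term_score(boost, doc_freq, freq, field_norm=0.0625)
    for boost, doc_freq, freq in matched_terms.values()
)
coord = len(matched_terms) / 25  # 8 of 25 query terms matched
total = raw_sum * coord
print(round(raw_sum, 4), round(total, 4))
```

The coord factor explains why entry 1 outranks entry 2 despite a lower raw sum: it matches 8 of the 25 query terms rather than 6.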