Document (#36776)

Author
Maghsoodi, N.
Homayounpour, M.M.
Title
Improving Farsi multiclass text classification using a thesaurus and two-stage feature selection
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.10, pp.2055-2066
Year
2011
Abstract
The steady growth of information content has made systems for the automatic classification of documents necessary. This article presents a system for the multiclass categorization of Farsi documents that requires fewer training examples and can help to compensate for the shortcoming of the standard training dataset. The proposed idea is to extend the feature vector with words extracted from a thesaurus and then to filter the extended vector with a secondary feature selection step that discards inappropriate features. This secondary step chooses the more appropriate features among those added from the thesaurus, so that using the thesaurus does more to improve the efficiency of the classifier. To evaluate the proposed system, a corpus was gathered from the Farsi Wikipedia website and from articles in the Hamshahri newspaper, the Roshd periodical, and the Soroush magazine. In addition to the role of the thesaurus and of secondary feature selection, the effects of the number of categories, the size of the training dataset, and the average number of words in the test data are also examined. The results indicate that classification efficiency improves with this approach, especially when the available data are not sufficient for some text categories.
Theme
Automatic classification (Automatisches Klassifizieren)
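
The pipeline outlined in the abstract (primary feature selection, thesaurus expansion, then a secondary selection over the added words) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the selection criterion, so a simple class-spread score stands in for it, and the function names, toy thesaurus, and toy documents are all hypothetical.

```python
from collections import Counter

def class_spread_scores(docs, labels):
    """Score each term by the gap between its highest and lowest per-class
    document rate. ASSUMPTION: a stand-in criterion, not the paper's."""
    classes = sorted(set(labels))
    class_sizes = Counter(labels)
    doc_freq = {}
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            doc_freq.setdefault(term, Counter())[label] += 1
    scores = {}
    for term, counts in doc_freq.items():
        rates = [counts[c] / class_sizes[c] for c in classes]
        scores[term] = max(rates) - min(rates)
    return scores

def two_stage_features(docs, labels, thesaurus, k_primary, k_secondary):
    """Select features, expand them via the thesaurus, then filter the additions."""
    scores = class_spread_scores(docs, labels)

    def rank(terms):
        # Highest score first; ties broken alphabetically for determinism.
        return sorted(terms, key=lambda t: (-scores.get(t, 0.0), t))

    # Stage 1: primary selection on the training data.
    primary = rank(scores)[:k_primary]
    # Expansion: words the (hypothetical) thesaurus relates to selected terms.
    added = {rel for t in primary for rel in thesaurus.get(t, [])} - set(primary)
    # Stage 2: secondary selection applied to the added words only.
    kept = [t for t in rank(added) if scores.get(t, 0.0) > 0.0][:k_secondary]
    return primary + kept

# Toy data, invented for illustration.
docs = [
    ["football", "match", "goal"],
    ["football", "team", "league"],
    ["election", "vote", "party"],
    ["election", "parliament", "vote"],
]
labels = ["sport", "sport", "politics", "politics"]
thesaurus = {"football": ["league", "soccer"], "election": ["vote", "ballot"]}

features = two_stage_features(docs, labels, thesaurus, k_primary=2, k_secondary=1)
```

In this sketch, thesaurus words that never occur in the training data score zero and are dropped in the second stage. A real system would need a criterion that can also credit unseen related words, which is where the approach is reported to help when training data are scarce.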

Similar documents (content)

  1. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.49
    0.4888367 = sum of:
      0.4888367 = product of:
        1.1109926 = sum of:
          0.048004683 = weight(abstract_txt:text in 2804) [ClassicSimilarity], result of:
            0.048004683 = score(doc=2804,freq=6.0), product of:
              0.07754096 = queryWeight, product of:
                1.0161468 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01887026 = queryNorm
              0.6190881 = fieldWeight in 2804, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.029338375 = weight(abstract_txt:documents in 2804) [ClassicSimilarity], result of:
            0.029338375 = score(doc=2804,freq=2.0), product of:
              0.080539055 = queryWeight, product of:
                1.035605 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01887026 = queryNorm
              0.36427513 = fieldWeight in 2804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.020917179 = weight(abstract_txt:number in 2804) [ClassicSimilarity], result of:
            0.020917179 = score(doc=2804,freq=1.0), product of:
              0.08098313 = queryWeight, product of:
                1.0384561 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.01887026 = queryNorm
              0.25829056 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.00938572 = weight(abstract_txt:from in 2804) [ClassicSimilarity], result of:
            0.00938572 = score(doc=2804,freq=1.0), product of:
              0.054333538 = queryWeight, product of:
                1.0417668 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01887026 = queryNorm
              0.17274266 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.04743969 = weight(abstract_txt:effect in 2804) [ClassicSimilarity], result of:
            0.04743969 = score(doc=2804,freq=1.0), product of:
              0.13979353 = queryWeight, product of:
                1.364377 = boost
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.01887026 = queryNorm
              0.3393554 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.028281664 = weight(abstract_txt:classification in 2804) [ClassicSimilarity], result of:
            0.028281664 = score(doc=2804,freq=1.0), product of:
              0.113351226 = queryWeight, product of:
                1.5046989 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01887026 = queryNorm
              0.2495047 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.082168944 = weight(abstract_txt:vector in 2804) [ClassicSimilarity], result of:
            0.082168944 = score(doc=2804,freq=1.0), product of:
              0.20161878 = queryWeight, product of:
                1.6385374 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.01887026 = queryNorm
              0.4075461 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.07201011 = weight(abstract_txt:features in 2804) [ClassicSimilarity], result of:
            0.07201011 = score(doc=2804,freq=3.0), product of:
              0.1465474 = queryWeight, product of:
                1.7109036 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.01887026 = queryNorm
              0.49137756 = fieldWeight in 2804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.08398868 = weight(abstract_txt:training in 2804) [ClassicSimilarity], result of:
            0.08398868 = score(doc=2804,freq=2.0), product of:
              0.18587747 = queryWeight, product of:
                1.9268587 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01887026 = queryNorm
              0.4518497 = fieldWeight in 2804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.20614621 = weight(abstract_txt:selection in 2804) [ClassicSimilarity], result of:
            0.20614621 = score(doc=2804,freq=5.0), product of:
              0.2742812 = queryWeight, product of:
                2.7027376 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.01887026 = queryNorm
              0.7515871 = fieldWeight in 2804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.4833113 = weight(abstract_txt:feature in 2804) [ClassicSimilarity], result of:
            0.4833113 = score(doc=2804,freq=7.0), product of:
              0.49531886 = queryWeight, product of:
                4.4482985 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.01887026 = queryNorm
              0.9757579 = fieldWeight in 2804, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
        0.44 = coord(11/25)
    
  2. Duwairi, R.M.: Machine learning for Arabic text categorization (2006) 0.29
    0.2935982 = sum of:
      0.2935982 = product of:
        0.81555057 = sum of:
          0.02449729 = weight(abstract_txt:text in 5115) [ClassicSimilarity], result of:
            0.02449729 = score(doc=5115,freq=1.0), product of:
              0.07754096 = queryWeight, product of:
                1.0161468 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01887026 = queryNorm
              0.3159271 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.05186341 = weight(abstract_txt:documents in 5115) [ClassicSimilarity], result of:
            0.05186341 = score(doc=5115,freq=4.0), product of:
              0.080539055 = queryWeight, product of:
                1.035605 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01887026 = queryNorm
              0.64395356 = fieldWeight in 5115, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.036277413 = weight(abstract_txt:proposed in 5115) [ClassicSimilarity], result of:
            0.036277413 = score(doc=5115,freq=1.0), product of:
              0.10074187 = queryWeight, product of:
                1.1582328 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.01887026 = queryNorm
              0.36010262 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.07272158 = weight(abstract_txt:categories in 5115) [ClassicSimilarity], result of:
            0.07272158 = score(doc=5115,freq=2.0), product of:
              0.12712121 = queryWeight, product of:
                1.3010676 = boost
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.01887026 = queryNorm
              0.5720649 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.05682258 = weight(abstract_txt:words in 5115) [ClassicSimilarity], result of:
            0.05682258 = score(doc=5115,freq=1.0), product of:
              0.13587299 = queryWeight, product of:
                1.3451089 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01887026 = queryNorm
              0.41820365 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.10271118 = weight(abstract_txt:vector in 5115) [ClassicSimilarity], result of:
            0.10271118 = score(doc=5115,freq=1.0), product of:
              0.20161878 = queryWeight, product of:
                1.6385374 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.01887026 = queryNorm
              0.5094326 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.07349501 = weight(abstract_txt:features in 5115) [ClassicSimilarity], result of:
            0.07349501 = score(doc=5115,freq=2.0), product of:
              0.1465474 = queryWeight, product of:
                1.7109036 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.01887026 = queryNorm
              0.50151014 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.07423621 = weight(abstract_txt:training in 5115) [ClassicSimilarity], result of:
            0.07423621 = score(doc=5115,freq=1.0), product of:
              0.18587747 = queryWeight, product of:
                1.9268587 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.01887026 = queryNorm
              0.39938247 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.32292593 = weight(abstract_txt:feature in 5115) [ClassicSimilarity], result of:
            0.32292593 = score(doc=5115,freq=2.0), product of:
              0.49531886 = queryWeight, product of:
                4.4482985 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.01887026 = queryNorm
              0.65195566 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
        0.36 = coord(9/25)
    
  3. Malenica, M.; Smuc, T.; Snajder, J.; Basic, B.D.: Language morphology offset : text classification on a Croatian-English parallel corpus (2008) 0.29
    0.287286 = sum of:
      0.287286 = product of:
        0.89776886 = sum of:
          0.02449729 = weight(abstract_txt:text in 2035) [ClassicSimilarity], result of:
            0.02449729 = score(doc=2035,freq=1.0), product of:
              0.07754096 = queryWeight, product of:
                1.0161468 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01887026 = queryNorm
              0.3159271 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.036976695 = weight(abstract_txt:number in 2035) [ClassicSimilarity], result of:
            0.036976695 = score(doc=2035,freq=2.0), product of:
              0.08098313 = queryWeight, product of:
                1.0384561 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.01887026 = queryNorm
              0.4565975 = fieldWeight in 2035, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.03535208 = weight(abstract_txt:classification in 2035) [ClassicSimilarity], result of:
            0.03535208 = score(doc=2035,freq=1.0), product of:
              0.113351226 = queryWeight, product of:
                1.5046989 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01887026 = queryNorm
              0.3118809 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.10271118 = weight(abstract_txt:vector in 2035) [ClassicSimilarity], result of:
            0.10271118 = score(doc=2035,freq=1.0), product of:
              0.20161878 = queryWeight, product of:
                1.6385374 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.01887026 = queryNorm
              0.5094326 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.07349501 = weight(abstract_txt:features in 2035) [ClassicSimilarity], result of:
            0.07349501 = score(doc=2035,freq=2.0), product of:
              0.1465474 = queryWeight, product of:
                1.7109036 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.01887026 = queryNorm
              0.50151014 = fieldWeight in 2035, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.113995515 = weight(abstract_txt:applying in 2035) [ClassicSimilarity], result of:
            0.113995515 = score(doc=2035,freq=1.0), product of:
              0.24740478 = queryWeight, product of:
                2.2230055 = boost
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.01887026 = queryNorm
              0.4607652 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.115239225 = weight(abstract_txt:selection in 2035) [ClassicSimilarity], result of:
            0.115239225 = score(doc=2035,freq=1.0), product of:
              0.2742812 = queryWeight, product of:
                2.7027376 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.01887026 = queryNorm
              0.42014992 = fieldWeight in 2035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
          0.39550188 = weight(abstract_txt:feature in 2035) [ClassicSimilarity], result of:
            0.39550188 = score(doc=2035,freq=3.0), product of:
              0.49531886 = queryWeight, product of:
                4.4482985 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.01887026 = queryNorm
              0.7984794 = fieldWeight in 2035, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.078125 = fieldNorm(doc=2035)
        0.32 = coord(8/25)
    
  4. Ikae, C.; Savoy, J.: Gender identification on Twitter (2022) 0.28
    0.2779894 = sum of:
      0.2779894 = product of:
        0.86871684 = sum of:
          0.020917179 = weight(abstract_txt:number in 445) [ClassicSimilarity], result of:
            0.020917179 = score(doc=445,freq=1.0), product of:
              0.08098313 = queryWeight, product of:
                1.0384561 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.01887026 = queryNorm
              0.25829056 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.029021928 = weight(abstract_txt:proposed in 445) [ClassicSimilarity], result of:
            0.029021928 = score(doc=445,freq=1.0), product of:
              0.10074187 = queryWeight, product of:
                1.1582328 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.01887026 = queryNorm
              0.2880821 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.045458063 = weight(abstract_txt:words in 445) [ClassicSimilarity], result of:
            0.045458063 = score(doc=445,freq=1.0), product of:
              0.13587299 = queryWeight, product of:
                1.3451089 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01887026 = queryNorm
              0.33456293 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.022116687 = weight(abstract_txt:some in 445) [ClassicSimilarity], result of:
            0.022116687 = score(doc=445,freq=1.0), product of:
              0.096213564 = queryWeight, product of:
                1.3862917 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.01887026 = queryNorm
              0.22987078 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.082168944 = weight(abstract_txt:vector in 445) [ClassicSimilarity], result of:
            0.082168944 = score(doc=445,freq=1.0), product of:
              0.20161878 = queryWeight, product of:
                1.6385374 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.01887026 = queryNorm
              0.4075461 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.09119642 = weight(abstract_txt:applying in 445) [ClassicSimilarity], result of:
            0.09119642 = score(doc=445,freq=1.0), product of:
              0.24740478 = queryWeight, product of:
                2.2230055 = boost
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.01887026 = queryNorm
              0.36861217 = fieldWeight in 445, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.1303783 = weight(abstract_txt:selection in 445) [ClassicSimilarity], result of:
            0.1303783 = score(doc=445,freq=2.0), product of:
              0.2742812 = queryWeight, product of:
                2.7027376 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.01887026 = queryNorm
              0.47534537 = fieldWeight in 445, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
          0.44745934 = weight(abstract_txt:feature in 445) [ClassicSimilarity], result of:
            0.44745934 = score(doc=445,freq=6.0), product of:
              0.49531886 = queryWeight, product of:
                4.4482985 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.01887026 = queryNorm
              0.90337634 = fieldWeight in 445, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=445)
        0.32 = coord(8/25)
    
  5. Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I.: Text mining techniques for patent analysis (2007) 0.24
    0.23940735 = sum of:
      0.23940735 = product of:
        0.5985184 = sum of:
          0.027715517 = weight(abstract_txt:text in 935) [ClassicSimilarity], result of:
            0.027715517 = score(doc=935,freq=2.0), product of:
              0.07754096 = queryWeight, product of:
                1.0161468 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01887026 = queryNorm
              0.3574307 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.020745363 = weight(abstract_txt:documents in 935) [ClassicSimilarity], result of:
            0.020745363 = score(doc=935,freq=1.0), product of:
              0.080539055 = queryWeight, product of:
                1.035605 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01887026 = queryNorm
              0.2575814 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.041043207 = weight(abstract_txt:proposed in 935) [ClassicSimilarity], result of:
            0.041043207 = score(doc=935,freq=2.0), product of:
              0.10074187 = queryWeight, product of:
                1.1582328 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.01887026 = queryNorm
              0.4074096 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.045458063 = weight(abstract_txt:words in 935) [ClassicSimilarity], result of:
            0.045458063 = score(doc=935,freq=1.0), product of:
              0.13587299 = queryWeight, product of:
                1.3451089 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.01887026 = queryNorm
              0.33456293 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.03127772 = weight(abstract_txt:some in 935) [ClassicSimilarity], result of:
            0.03127772 = score(doc=935,freq=2.0), product of:
              0.096213564 = queryWeight, product of:
                1.3862917 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.01887026 = queryNorm
              0.32508639 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.048985276 = weight(abstract_txt:classification in 935) [ClassicSimilarity], result of:
            0.048985276 = score(doc=935,freq=3.0), product of:
              0.113351226 = queryWeight, product of:
                1.5046989 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01887026 = queryNorm
              0.4321548 = fieldWeight in 935, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.066852294 = weight(abstract_txt:efficiency in 935) [ClassicSimilarity], result of:
            0.066852294 = score(doc=935,freq=1.0), product of:
              0.17571278 = queryWeight, product of:
                1.5296516 = boost
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.01887026 = queryNorm
              0.38046345 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.041575056 = weight(abstract_txt:features in 935) [ClassicSimilarity], result of:
            0.041575056 = score(doc=935,freq=1.0), product of:
              0.1465474 = queryWeight, product of:
                1.7109036 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.01887026 = queryNorm
              0.28369698 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.09219138 = weight(abstract_txt:selection in 935) [ClassicSimilarity], result of:
            0.09219138 = score(doc=935,freq=1.0), product of:
              0.2742812 = queryWeight, product of:
                2.7027376 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.01887026 = queryNorm
              0.33611995 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.1826745 = weight(abstract_txt:feature in 935) [ClassicSimilarity], result of:
            0.1826745 = score(doc=935,freq=1.0), product of:
              0.49531886 = queryWeight, product of:
                4.4482985 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.01887026 = queryNorm
              0.36880183 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
        0.4 = coord(10/25)
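
The score trees above follow one pattern per term: a query weight (boost × idf × queryNorm) multiplied by a field weight (tf × idf × fieldNorm), summed over the matching terms and scaled by the coord factor. The sketch below reproduces the `text` term of the first entry (doc 2804) and that entry's final score from the printed values. The function names are hypothetical, and the `tf` and `idf` formulas are inferred from the numbers shown (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), so treat it as an illustration rather than the search engine's own code.

```python
import math

def idf(doc_freq, max_docs):
    # Inferred from the listing: idf(docFreq=2106, maxDocs=44218) = 4.0438666
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one query term in one document, as laid out in the explain tree."""
    tf = math.sqrt(freq)                          # tf(freq=6.0) = 2.4494898
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm  # "queryWeight, product of"
    field_weight = tf * term_idf * field_norm     # "fieldWeight in 2804, product of"
    return query_weight * field_weight

# Term "text" in doc 2804, with all inputs copied from the first entry.
text_score = term_score(
    freq=6.0, doc_freq=2106, max_docs=44218,
    boost=1.0161468, query_norm=0.01887026, field_norm=0.0625,
)

# Final score of the first entry: sum of term scores times coord(11/25).
sum_of_term_scores = 1.1109926
final_score = sum_of_term_scores * (11 / 25)
```

Here `text_score` agrees with the listed 0.048004683 and `final_score` with the listed 0.4888367, up to floating-point rounding. The coord factor is the share of the 25 query terms that the document matches, which is why an entry matching more terms can outrank one with a larger raw sum.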