Document (#35170)

Author
Duwairi, R.
Al-Refai, M.N.
Khasawneh, N.
Title
Feature reduction techniques for Arabic text categorization
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.11, S.2347-2352
Year
2009
Abstract
This paper presents and compares three feature reduction techniques that were applied to Arabic text. The techniques include stemming, light stemming, and word clusters. The effects of the aforementioned techniques were studied and analyzed on the K-nearest-neighbor classifier. Stemming reduces words to their stems. Light stemming, by comparison, removes common affixes from words without reducing them to their stems. Word clusters group synonymous words into clusters and each cluster is represented by a single word. The purpose of employing the previous methods is to reduce the size of document vectors without affecting the accuracy of the classifiers. The comparison metric includes size of document vectors, classification time, and accuracy (in terms of precision and recall). Several experiments were carried out using four different representations of the same corpus: the first version uses stem-vectors, the second uses light stem-vectors, the third uses word clusters, and the fourth uses the original words (without any transformation) as representatives of documents. The corpus consists of 15,000 documents that fall into three categories: sports, economics, and politics. In terms of vector sizes and classification time, the stemmed vectors consumed the smallest size and the least time necessary to classify a testing dataset that consists of 6,000 documents. The light stemmed vectors superseded the other three representations in terms of classification accuracy.

Similar documents (content)

  1. Duwairi, R.M.: Machine learning for Arabic text categorization (2006) 0.44
    0.43586683 = sum of:
      0.43586683 = product of:
        1.2107412 = sum of:
          0.069455884 = weight(abstract_txt:feature in 5115) [ClassicSimilarity], result of:
            0.069455884 = score(doc=5115,freq=2.0), product of:
              0.106534675 = queryWeight, product of:
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.018054187 = queryNorm
              0.65195566 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.054214645 = weight(abstract_txt:corpus in 5115) [ClassicSimilarity], result of:
            0.054214645 = score(doc=5115,freq=1.0), product of:
              0.113790505 = queryWeight, product of:
                1.0334929 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.018054187 = queryNorm
              0.4764426 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.050197225 = weight(abstract_txt:documents in 5115) [ClassicSimilarity], result of:
            0.050197225 = score(doc=5115,freq=4.0), product of:
              0.077951625 = queryWeight, product of:
                1.0476415 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018054187 = queryNorm
              0.64395356 = fieldWeight in 5115, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.14662735 = weight(abstract_txt:arabic in 5115) [ClassicSimilarity], result of:
            0.14662735 = score(doc=5115,freq=2.0), product of:
              0.17531872 = queryWeight, product of:
                1.2828286 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.018054187 = queryNorm
              0.8363474 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.076291785 = weight(abstract_txt:accuracy in 5115) [ClassicSimilarity], result of:
            0.076291785 = score(doc=5115,freq=1.0), product of:
              0.1635726 = queryWeight, product of:
                1.5175933 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.018054187 = queryNorm
              0.46640933 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.060883626 = weight(abstract_txt:uses in 5115) [ClassicSimilarity], result of:
            0.060883626 = score(doc=5115,freq=1.0), product of:
              0.15489542 = queryWeight, product of:
                1.7052529 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.018054187 = queryNorm
              0.39306277 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.07332943 = weight(abstract_txt:words in 5115) [ClassicSimilarity], result of:
            0.07332943 = score(doc=5115,freq=1.0), product of:
              0.17534383 = queryWeight, product of:
                1.8143235 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.018054187 = queryNorm
              0.41820365 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.19754793 = weight(abstract_txt:stemming in 5115) [ClassicSimilarity], result of:
            0.19754793 = score(doc=5115,freq=1.0), product of:
              0.33948448 = queryWeight, product of:
                2.5245237 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018054187 = queryNorm
              0.5819056 = fieldWeight in 5115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
          0.4821932 = weight(abstract_txt:vectors in 5115) [ClassicSimilarity], result of:
            0.4821932 = score(doc=5115,freq=2.0), product of:
              0.5591643 = queryWeight, product of:
                3.9681203 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.018054187 = queryNorm
              0.86234623 = fieldWeight in 5115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.078125 = fieldNorm(doc=5115)
        0.36 = coord(9/25)
    
  2. Hafer, M.A.; Weiss, S.F.: Word segmentation by letter successor varieties (1974) 0.38
    0.37782782 = sum of:
      0.37782782 = product of:
        1.0495217 = sum of:
          0.027373075 = weight(abstract_txt:classification in 4997) [ClassicSimilarity], result of:
            0.027373075 = score(doc=4997,freq=1.0), product of:
              0.07313977 = queryWeight, product of:
                1.0147917 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.018054187 = queryNorm
              0.37425706 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.065057576 = weight(abstract_txt:corpus in 4997) [ClassicSimilarity], result of:
            0.065057576 = score(doc=4997,freq=1.0), product of:
              0.113790505 = queryWeight, product of:
                1.0334929 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.018054187 = queryNorm
              0.57173115 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.030118335 = weight(abstract_txt:documents in 4997) [ClassicSimilarity], result of:
            0.030118335 = score(doc=4997,freq=1.0), product of:
              0.077951625 = queryWeight, product of:
                1.0476415 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018054187 = queryNorm
              0.38637212 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.14730236 = weight(abstract_txt:stem in 4997) [ClassicSimilarity], result of:
            0.14730236 = score(doc=4997,freq=1.0), product of:
              0.19620673 = queryWeight, product of:
                1.3570987 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.018054187 = queryNorm
              0.7507508 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.15479459 = weight(abstract_txt:stems in 4997) [ClassicSimilarity], result of:
            0.15479459 = score(doc=4997,freq=1.0), product of:
              0.20280468 = queryWeight, product of:
                1.3797281 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.018054187 = queryNorm
              0.7632693 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.07306035 = weight(abstract_txt:uses in 4997) [ClassicSimilarity], result of:
            0.07306035 = score(doc=4997,freq=1.0), product of:
              0.15489542 = queryWeight, product of:
                1.7052529 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.018054187 = queryNorm
              0.4716753 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.124444164 = weight(abstract_txt:words in 4997) [ClassicSimilarity], result of:
            0.124444164 = score(doc=4997,freq=2.0), product of:
              0.17534383 = queryWeight, product of:
                1.8143235 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.018054187 = queryNorm
              0.7097151 = fieldWeight in 4997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.09212122 = weight(abstract_txt:word in 4997) [ClassicSimilarity], result of:
            0.09212122 = score(doc=4997,freq=1.0), product of:
              0.18078285 = queryWeight, product of:
                1.8422481 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018054187 = queryNorm
              0.50956833 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
          0.33524996 = weight(abstract_txt:stemming in 4997) [ClassicSimilarity], result of:
            0.33524996 = score(doc=4997,freq=2.0), product of:
              0.33948448 = queryWeight, product of:
                2.5245237 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018054187 = queryNorm
              0.9875266 = fieldWeight in 4997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.09375 = fieldNorm(doc=4997)
        0.36 = coord(9/25)
    
  3. Maheswari, J.U.; Karpagam, G.R.: ¬A conceptual framework for ontology based information retrieval (2010) 0.24
    0.23957443 = sum of:
      0.23957443 = product of:
        0.7486701 = sum of:
          0.03437891 = weight(abstract_txt:feature in 702) [ClassicSimilarity], result of:
            0.03437891 = score(doc=702,freq=1.0), product of:
              0.106534675 = queryWeight, product of:
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.018054187 = queryNorm
              0.3227016 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.03319439 = weight(abstract_txt:terms in 702) [ClassicSimilarity], result of:
            0.03319439 = score(doc=702,freq=4.0), product of:
              0.07504985 = queryWeight, product of:
                1.0279572 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018054187 = queryNorm
              0.4422979 = fieldWeight in 702, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.017916966 = weight(abstract_txt:time in 702) [ClassicSimilarity], result of:
            0.017916966 = score(doc=702,freq=1.0), product of:
              0.07897743 = queryWeight, product of:
                1.0545123 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.018054187 = queryNorm
              0.22686186 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.19027232 = weight(abstract_txt:stemmed in 702) [ClassicSimilarity], result of:
            0.19027232 = score(doc=702,freq=2.0), product of:
              0.26456758 = queryWeight, product of:
                1.5758789 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.018054187 = queryNorm
              0.71918225 = fieldWeight in 702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.042618535 = weight(abstract_txt:uses in 702) [ClassicSimilarity], result of:
            0.042618535 = score(doc=702,freq=1.0), product of:
              0.15489542 = queryWeight, product of:
                1.7052529 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.018054187 = queryNorm
              0.27514392 = fieldWeight in 702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.11477871 = weight(abstract_txt:words in 702) [ClassicSimilarity], result of:
            0.11477871 = score(doc=702,freq=5.0), product of:
              0.17534383 = queryWeight, product of:
                1.8143235 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.018054187 = queryNorm
              0.6545923 = fieldWeight in 702, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.07599613 = weight(abstract_txt:word in 702) [ClassicSimilarity], result of:
            0.07599613 = score(doc=702,freq=2.0), product of:
              0.18078285 = queryWeight, product of:
                1.8422481 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018054187 = queryNorm
              0.42037243 = fieldWeight in 702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
          0.23951414 = weight(abstract_txt:stemming in 702) [ClassicSimilarity], result of:
            0.23951414 = score(doc=702,freq=3.0), product of:
              0.33948448 = queryWeight, product of:
                2.5245237 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018054187 = queryNorm
              0.7055231 = fieldWeight in 702, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0546875 = fieldNorm(doc=702)
        0.32 = coord(8/25)
    
  4. Brychcín, T.; Konopík, M.: HPS: High precision stemmer (2015) 0.22
    0.22348312 = sum of:
      0.22348312 = product of:
        0.798154 = sum of:
          0.04538833 = weight(abstract_txt:consists in 2686) [ClassicSimilarity], result of:
            0.04538833 = score(doc=2686,freq=1.0), product of:
              0.11729093 = queryWeight, product of:
                1.0492687 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.018054187 = queryNorm
              0.38697222 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.024704693 = weight(abstract_txt:three in 2686) [ClassicSimilarity], result of:
            0.024704693 = score(doc=2686,freq=1.0), product of:
              0.0895059 = queryWeight, product of:
                1.1226025 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.018054187 = queryNorm
              0.27601188 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.098201565 = weight(abstract_txt:stem in 2686) [ClassicSimilarity], result of:
            0.098201565 = score(doc=2686,freq=1.0), product of:
              0.19620673 = queryWeight, product of:
                1.3570987 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.018054187 = queryNorm
              0.5005005 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.048706897 = weight(abstract_txt:uses in 2686) [ClassicSimilarity], result of:
            0.048706897 = score(doc=2686,freq=1.0), product of:
              0.15489542 = queryWeight, product of:
                1.7052529 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.018054187 = queryNorm
              0.3144502 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.10160824 = weight(abstract_txt:words in 2686) [ClassicSimilarity], result of:
            0.10160824 = score(doc=2686,freq=3.0), product of:
              0.17534383 = queryWeight, product of:
                1.8143235 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.018054187 = queryNorm
              0.57948 = fieldWeight in 2686, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.06141415 = weight(abstract_txt:word in 2686) [ClassicSimilarity], result of:
            0.06141415 = score(doc=2686,freq=1.0), product of:
              0.18078285 = queryWeight, product of:
                1.8422481 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018054187 = queryNorm
              0.33971223 = fieldWeight in 2686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
          0.41813016 = weight(abstract_txt:stemming in 2686) [ClassicSimilarity], result of:
            0.41813016 = score(doc=2686,freq=7.0), product of:
              0.33948448 = queryWeight, product of:
                2.5245237 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018054187 = queryNorm
              1.231662 = fieldWeight in 2686, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=2686)
        0.28 = coord(7/25)
    
  5. Greengrass, M.: Conflation methods for searching databases of Latin text (1996) 0.22
    0.22340518 = sum of:
      0.22340518 = product of:
        0.930855 = sum of:
          0.14730236 = weight(abstract_txt:stem in 6987) [ClassicSimilarity], result of:
            0.14730236 = score(doc=6987,freq=1.0), product of:
              0.19620673 = queryWeight, product of:
                1.3570987 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.018054187 = queryNorm
              0.7507508 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
          0.15479459 = weight(abstract_txt:stems in 6987) [ClassicSimilarity], result of:
            0.15479459 = score(doc=6987,freq=1.0), product of:
              0.20280468 = queryWeight, product of:
                1.3797281 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.018054187 = queryNorm
              0.7632693 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
          0.23064487 = weight(abstract_txt:stemmed in 6987) [ClassicSimilarity], result of:
            0.23064487 = score(doc=6987,freq=1.0), product of:
              0.26456758 = queryWeight, product of:
                1.5758789 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.018054187 = queryNorm
              0.8717805 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
          0.07306035 = weight(abstract_txt:uses in 6987) [ClassicSimilarity], result of:
            0.07306035 = score(doc=6987,freq=1.0), product of:
              0.15489542 = queryWeight, product of:
                1.7052529 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.018054187 = queryNorm
              0.4716753 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
          0.08799532 = weight(abstract_txt:words in 6987) [ClassicSimilarity], result of:
            0.08799532 = score(doc=6987,freq=1.0), product of:
              0.17534383 = queryWeight, product of:
                1.8143235 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.018054187 = queryNorm
              0.5018444 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
          0.23705752 = weight(abstract_txt:stemming in 6987) [ClassicSimilarity], result of:
            0.23705752 = score(doc=6987,freq=1.0), product of:
              0.33948448 = queryWeight, product of:
                2.5245237 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.018054187 = queryNorm
              0.6982868 = fieldWeight in 6987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.09375 = fieldNorm(doc=6987)
        0.24 = coord(6/25)