Document (#27558)

Author
Ko, Y.
Park, J.
Seo, J.
Title
Improving text categorization using the importance of sentences
Source
Information processing and management. 40(2004) no.1, S.65-79
Year
2004
Abstract
Automatic text categorization is a problem of assigning text documents to pre-defined categories. In order to classify text documents, we must extract useful features. In previous researches, a text document is commonly represented by the term frequency and the inverted document frequency of each feature. Since there is a difference between important sentences and unimportant sentences in a document, the features from more important sentences should be considered more than other features. In this paper, we measure the importance of sentences using text summarization techniques. Then we represent a document as a vector of features with different weights according to the importance of each sentence. To verify our new method, we conduct experiments using two language newsgroup data sets: one written by English and the other written by Korean. Four kinds of classifiers are used in our experiments: Naive Bayes, Rocchio, k-NN, and SVM. We observe that our new method makes a significant improvement in all these classifiers and both data sets.

Similar documents (author)

  1. Park, A.L.: ¬A comparison of a new OCLC/PRISM searches with earlier OCLC derived searches (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 4239) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 4239, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=4239)
    
  2. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1993) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5336) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5336, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5336)
    
  3. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5370) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5370, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5370)
    
  4. Park, A.L.: Automated authority control : making the transition (1992) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 5394) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 5394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=5394)
    
  5. Park, T.K.: Toward a theory of user-based relevance : a call for a new paradigm of inquiry (1994) 4.65
    4.6463795 = sum of:
      4.6463795 = weight(author_txt:park in 6926) [ClassicSimilarity], result of:
        4.6463795 = fieldWeight in 6926, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4342074 = idf(docFreq=70, maxDocs=44218)
          0.625 = fieldNorm(doc=6926)
    

Similar documents (content)

  1. Wei, F.; Li, W.; Lu, Q.; He, Y.: Applying two-level reinforcement ranking in query-oriented multidocument summarization (2009) 0.26
    0.25692457 = sum of:
      0.25692457 = product of:
        0.91758776 = sum of:
          0.029023523 = weight(abstract_txt:documents in 3120) [ClassicSimilarity], result of:
            0.029023523 = score(doc=3120,freq=2.0), product of:
              0.07967473 = queryWeight, product of:
                1.1044952 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017503394 = queryNorm
              0.36427513 = fieldWeight in 3120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.038020108 = weight(abstract_txt:important in 3120) [ClassicSimilarity], result of:
            0.038020108 = score(doc=3120,freq=3.0), product of:
              0.08332954 = queryWeight, product of:
                1.1295437 = boost
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.017503394 = queryNorm
              0.45626205 = fieldWeight in 3120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.01826522 = weight(abstract_txt:using in 3120) [ClassicSimilarity], result of:
            0.01826522 = score(doc=3120,freq=1.0), product of:
              0.084387384 = queryWeight, product of:
                1.392156 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.017503394 = queryNorm
              0.21644491 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.08033152 = weight(abstract_txt:document in 3120) [ClassicSimilarity], result of:
            0.08033152 = score(doc=3120,freq=3.0), product of:
              0.17287177 = queryWeight, product of:
                2.300809 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017503394 = queryNorm
              0.46468848 = fieldWeight in 3120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.05483851 = weight(abstract_txt:features in 3120) [ClassicSimilarity], result of:
            0.05483851 = score(doc=3120,freq=1.0), product of:
              0.19329959 = queryWeight, product of:
                2.4329545 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.017503394 = queryNorm
              0.28369698 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.082254246 = weight(abstract_txt:text in 3120) [ClassicSimilarity], result of:
            0.082254246 = score(doc=3120,freq=2.0), product of:
              0.23012641 = queryWeight, product of:
                3.2512276 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017503394 = queryNorm
              0.3574307 = fieldWeight in 3120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
          0.61485463 = weight(abstract_txt:sentences in 3120) [ClassicSimilarity], result of:
            0.61485463 = score(doc=3120,freq=6.0), product of:
              0.5740394 = queryWeight, product of:
                4.6875334 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.017503394 = queryNorm
              1.0711018 = fieldWeight in 3120, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=3120)
        0.28 = coord(7/25)
    
  2. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.24
    0.24004075 = sum of:
      0.24004075 = product of:
        0.7501274 = sum of:
          0.029023523 = weight(abstract_txt:documents in 1808) [ClassicSimilarity], result of:
            0.029023523 = score(doc=1808,freq=2.0), product of:
              0.07967473 = queryWeight, product of:
                1.1044952 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017503394 = queryNorm
              0.36427513 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.119808525 = weight(abstract_txt:naive in 1808) [ClassicSimilarity], result of:
            0.119808525 = score(doc=1808,freq=2.0), product of:
              0.16273052 = queryWeight, product of:
                1.1161512 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.017503394 = queryNorm
              0.73623884 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.1281613 = weight(abstract_txt:bayes in 1808) [ClassicSimilarity], result of:
            0.1281613 = score(doc=1808,freq=2.0), product of:
              0.17020871 = queryWeight, product of:
                1.1415092 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.017503394 = queryNorm
              0.75296557 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.02673278 = weight(abstract_txt:method in 1808) [ClassicSimilarity], result of:
            0.02673278 = score(doc=1808,freq=1.0), product of:
              0.09502982 = queryWeight, product of:
                1.2062393 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017503394 = queryNorm
              0.28130937 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.06279571 = weight(abstract_txt:experiments in 1808) [ClassicSimilarity], result of:
            0.06279571 = score(doc=1808,freq=2.0), product of:
              0.13328272 = queryWeight, product of:
                1.4285336 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.017503394 = queryNorm
              0.4711467 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.2790636 = weight(abstract_txt:classifiers in 1808) [ClassicSimilarity], result of:
            0.2790636 = score(doc=1808,freq=5.0), product of:
              0.2654459 = queryWeight, product of:
                2.0160046 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.017503394 = queryNorm
              1.0513014 = fieldWeight in 1808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.046379425 = weight(abstract_txt:document in 1808) [ClassicSimilarity], result of:
            0.046379425 = score(doc=1808,freq=1.0), product of:
              0.17287177 = queryWeight, product of:
                2.300809 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017503394 = queryNorm
              0.26828802 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.058162533 = weight(abstract_txt:text in 1808) [ClassicSimilarity], result of:
            0.058162533 = score(doc=1808,freq=1.0), product of:
              0.23012641 = queryWeight, product of:
                3.2512276 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017503394 = queryNorm
              0.25274166 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
        0.32 = coord(8/25)
    
  3. Giannella, C.: ¬An improved algorithm for unsupervised decomposition of a multi-author document (2016) 0.23
    0.22787961 = sum of:
      0.22787961 = product of:
        0.81385577 = sum of:
          0.025605623 = weight(abstract_txt:each in 2642) [ClassicSimilarity], result of:
            0.025605623 = score(doc=2642,freq=1.0), product of:
              0.07957575 = queryWeight, product of:
                1.1038089 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017503394 = queryNorm
              0.32177672 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.033415973 = weight(abstract_txt:method in 2642) [ClassicSimilarity], result of:
            0.033415973 = score(doc=2642,freq=1.0), product of:
              0.09502982 = queryWeight, product of:
                1.2062393 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017503394 = queryNorm
              0.3516367 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.0555041 = weight(abstract_txt:experiments in 2642) [ClassicSimilarity], result of:
            0.0555041 = score(doc=2642,freq=1.0), product of:
              0.13328272 = queryWeight, product of:
                1.4285336 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.017503394 = queryNorm
              0.41643882 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.07079124 = weight(abstract_txt:written in 2642) [ClassicSimilarity], result of:
            0.07079124 = score(doc=2642,freq=1.0), product of:
              0.15675095 = queryWeight, product of:
                1.549204 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.017503394 = queryNorm
              0.45161602 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.081988014 = weight(abstract_txt:document in 2642) [ClassicSimilarity], result of:
            0.081988014 = score(doc=2642,freq=2.0), product of:
              0.17287177 = queryWeight, product of:
                2.300809 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017503394 = queryNorm
              0.4742707 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.1028178 = weight(abstract_txt:text in 2642) [ClassicSimilarity], result of:
            0.1028178 = score(doc=2642,freq=2.0), product of:
              0.23012641 = queryWeight, product of:
                3.2512276 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017503394 = queryNorm
              0.44678837 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
          0.44373307 = weight(abstract_txt:sentences in 2642) [ClassicSimilarity], result of:
            0.44373307 = score(doc=2642,freq=2.0), product of:
              0.5740394 = queryWeight, product of:
                4.6875334 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.017503394 = queryNorm
              0.7730011 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=2642)
        0.28 = coord(7/25)
    
  4. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.21
    0.20712076 = sum of:
      0.20712076 = product of:
        0.739717 = sum of:
          0.029023523 = weight(abstract_txt:documents in 1719) [ClassicSimilarity], result of:
            0.029023523 = score(doc=1719,freq=2.0), product of:
              0.07967473 = queryWeight, product of:
                1.1044952 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017503394 = queryNorm
              0.36427513 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.021950921 = weight(abstract_txt:important in 1719) [ClassicSimilarity], result of:
            0.021950921 = score(doc=1719,freq=1.0), product of:
              0.08332954 = queryWeight, product of:
                1.1295437 = boost
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.017503394 = queryNorm
              0.26342303 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2147684 = idf(docFreq=1775, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.01826522 = weight(abstract_txt:using in 1719) [ClassicSimilarity], result of:
            0.01826522 = score(doc=1719,freq=1.0), product of:
              0.084387384 = queryWeight, product of:
                1.392156 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.017503394 = queryNorm
              0.21644491 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.122708425 = weight(abstract_txt:document in 1719) [ClassicSimilarity], result of:
            0.122708425 = score(doc=1719,freq=7.0), product of:
              0.17287177 = queryWeight, product of:
                2.300809 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.017503394 = queryNorm
              0.70982337 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.05483851 = weight(abstract_txt:features in 1719) [ClassicSimilarity], result of:
            0.05483851 = score(doc=1719,freq=1.0), product of:
              0.19329959 = queryWeight, product of:
                2.4329545 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.017503394 = queryNorm
              0.28369698 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.058162533 = weight(abstract_txt:text in 1719) [ClassicSimilarity], result of:
            0.058162533 = score(doc=1719,freq=1.0), product of:
              0.23012641 = queryWeight, product of:
                3.2512276 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017503394 = queryNorm
              0.25274166 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.43476784 = weight(abstract_txt:sentences in 1719) [ClassicSimilarity], result of:
            0.43476784 = score(doc=1719,freq=3.0), product of:
              0.5740394 = queryWeight, product of:
                4.6875334 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.017503394 = queryNorm
              0.7573833 = fieldWeight in 1719, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
        0.28 = coord(7/25)
    
  5. Craven, T.C.: Customized extracts based on Boolean queries and sentence dependency structures (1989) 0.19
    0.18829454 = sum of:
      0.18829454 = product of:
        0.9414727 = sum of:
          0.030726748 = weight(abstract_txt:each in 789) [ClassicSimilarity], result of:
            0.030726748 = score(doc=789,freq=1.0), product of:
              0.07957575 = queryWeight, product of:
                1.1038089 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.017503394 = queryNorm
              0.38613206 = fieldWeight in 789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.09375 = fieldNorm(doc=789)
          0.05670879 = weight(abstract_txt:method in 789) [ClassicSimilarity], result of:
            0.05670879 = score(doc=789,freq=2.0), product of:
              0.09502982 = queryWeight, product of:
                1.2062393 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017503394 = queryNorm
              0.5967473 = fieldWeight in 789, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.09375 = fieldNorm(doc=789)
          0.02739783 = weight(abstract_txt:using in 789) [ClassicSimilarity], result of:
            0.02739783 = score(doc=789,freq=1.0), product of:
              0.084387384 = queryWeight, product of:
                1.392156 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.017503394 = queryNorm
              0.32466736 = fieldWeight in 789, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=789)
          0.17448759 = weight(abstract_txt:text in 789) [ClassicSimilarity], result of:
            0.17448759 = score(doc=789,freq=4.0), product of:
              0.23012641 = queryWeight, product of:
                3.2512276 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017503394 = queryNorm
              0.75822496 = fieldWeight in 789, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=789)
          0.65215176 = weight(abstract_txt:sentences in 789) [ClassicSimilarity], result of:
            0.65215176 = score(doc=789,freq=3.0), product of:
              0.5740394 = queryWeight, product of:
                4.6875334 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.017503394 = queryNorm
              1.1360749 = fieldWeight in 789, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.09375 = fieldNorm(doc=789)
        0.2 = coord(5/25)