Document (#21057)

Author
Cheng, K.-S.
Young, G.H.
Wong, K.-F.
Title
¬A study on word-based and integral-bit Chinese text compression algorithms
Source
Journal of the American Society for Information Science. 50(1999) no.3, pp.218-228
Year
1999
Abstract
Experimental results show that a word-based arithmetic coding scheme can achieve higher compression performance for Chinese text. However, arithmetic coding is a fractional-bit compression algorithm, which is known to be time consuming. In this article, we change direction and study how to cascade the word segmentation model with a faster alternative, the integral-bit compression algorithm. It is shown that the cascaded algorithm is more suitable for practical usage.
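
The trade-off named in the abstract can be illustrated with a small sketch. It assumes Huffman coding as a stand-in for an integral-bit coder (the abstract does not say which integral-bit algorithm the authors used) and uses the Shannon entropy as the cost a fractional-bit coder such as arithmetic coding can approach. The word counts and all function names are hypothetical and are not taken from the article.

```python
import heapq
import math


def huffman_code_lengths(freqs):
    """Return {symbol: code length in bits} for a Huffman code over freqs."""
    if len(freqs) == 1:
        return {next(iter(freqs)): 1}
    # Heap entries are (weight, tie_breaker, {symbol: depth}); the unique
    # tie_breaker ensures the dicts themselves are never compared.
    heap = [(w, i, {s: 0}) for i, (s, w) in enumerate(freqs.items())]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every contained symbol one level deeper.
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (w1 + w2, counter, merged))
        counter += 1
    return heap[0][2]


def average_bits(freqs, lengths):
    """Average code length in bits per symbol (whole bits per symbol)."""
    total = sum(freqs.values())
    return sum(freqs[s] * lengths[s] for s in freqs) / total


def entropy_bits(freqs):
    """Shannon entropy in bits per symbol (fractional-bit lower bound)."""
    total = sum(freqs.values())
    return -sum((w / total) * math.log2(w / total) for w in freqs.values())


# Hypothetical, heavily skewed counts for four segmented words.
toy_word_counts = {"w1": 90, "w2": 5, "w3": 3, "w4": 2}

lengths = huffman_code_lengths(toy_word_counts)
huffman_avg = average_bits(toy_word_counts, lengths)
ideal_avg = entropy_bits(toy_word_counts)

print(f"integral-bit (Huffman) average:  {huffman_avg:.3f} bits/word")
print(f"fractional-bit (entropy) bound: {ideal_avg:.3f} bits/word")
```

An integral-bit coder must spend at least one whole bit on every symbol, so on a skewed distribution like this one it cannot reach the entropy bound. In exchange, encoding and decoding reduce to table lookups, which is the speed advantage the abstract refers to.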

Similar documents (author)

  1. Wong, M.L.; Leung, K.S.; Cheng, J.C.Y.: Discovering knowledge from noisy databases using genetic programming (2000) 2.37
    2.3735623 = sum of:
      2.3735623 = product of:
        3.5603435 = sum of:
          1.7384325 = weight(author_txt:wong in 4863) [ClassicSimilarity], result of:
            1.7384325 = score(doc=4863,freq=1.0), product of:
              0.56531775 = queryWeight, product of:
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.068938 = queryNorm
              3.0751424 = fieldWeight in 4863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.375 = fieldNorm(doc=4863)
          1.821911 = weight(author_txt:cheng in 4863) [ClassicSimilarity], result of:
            1.821911 = score(doc=4863,freq=1.0), product of:
              0.5832734 = queryWeight, product of:
                1.0157568 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.068938 = queryNorm
              3.123597 = fieldWeight in 4863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.375 = fieldNorm(doc=4863)
        0.6666667 = coord(2/3)
    
  2. Young, J.B.: Crisis in cataloging revisited : the year's work in subject analysis, 1990 (1991) 1.01
    1.0121728 = sum of:
      1.0121728 = product of:
        3.0365183 = sum of:
          3.0365183 = weight(author_txt:young in 316) [ClassicSimilarity], result of:
            3.0365183 = score(doc=316,freq=1.0), product of:
              0.5832734 = queryWeight, product of:
                1.0157568 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.068938 = queryNorm
              5.2059946 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.625 = fieldNorm(doc=316)
        0.33333334 = coord(1/3)
    
  3. Young, W.F.: Methods for evaluating reference desk performance (1985) 1.01
    1.0121728 = sum of:
      1.0121728 = product of:
        3.0365183 = sum of:
          3.0365183 = weight(author_txt:young in 4620) [ClassicSimilarity], result of:
            3.0365183 = score(doc=4620,freq=1.0), product of:
              0.5832734 = queryWeight, product of:
                1.0157568 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.068938 = queryNorm
              5.2059946 = fieldWeight in 4620, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.625 = fieldNorm(doc=4620)
        0.33333334 = coord(1/3)
    
  4. Young, A.: Cutting gold : a guide to in-house CD-ROM production (1994) 1.01
    1.0121728 = sum of:
      1.0121728 = product of:
        3.0365183 = sum of:
          3.0365183 = weight(author_txt:young in 7054) [ClassicSimilarity], result of:
            3.0365183 = score(doc=7054,freq=1.0), product of:
              0.5832734 = queryWeight, product of:
                1.0157568 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.068938 = queryNorm
              5.2059946 = fieldWeight in 7054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.625 = fieldNorm(doc=7054)
        0.33333334 = coord(1/3)
    
  5. Young, E.: Cataloguing interactive multimedia (1995) 1.01
    1.0121728 = sum of:
      1.0121728 = product of:
        3.0365183 = sum of:
          3.0365183 = weight(author_txt:young in 4681) [ClassicSimilarity], result of:
            3.0365183 = score(doc=4681,freq=1.0), product of:
              0.5832734 = queryWeight, product of:
                1.0157568 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.068938 = queryNorm
              5.2059946 = fieldWeight in 4681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.625 = fieldNorm(doc=4681)
        0.33333334 = coord(1/3)
    
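The score trees above follow Lucene's ClassicSimilarity, where tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). As a rough check, the sketch below recomputes the first author hit (doc 4863) from the values printed in its tree. The queryNorm, fieldNorm, and boost values are copied from the listing rather than derived, and the function names are hypothetical.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    """Term frequency factor as used by ClassicSimilarity."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """score = queryWeight * fieldWeight for a single query term."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 44218
QUERY_NORM = 0.068938  # copied from the listing
FIELD_NORM = 0.375     # fieldNorm(doc=4863), copied from the listing

wong = term_score(1.0, 32, MAX_DOCS, QUERY_NORM, FIELD_NORM)
cheng = term_score(1.0, 28, MAX_DOCS, QUERY_NORM, FIELD_NORM, boost=1.0157568)

# Two of the three author terms matched, hence coord(2/3).
total = (wong + cheng) * (2 / 3)

print(f"wong={wong:.5f} cheng={cheng:.5f} total={total:.5f}")
```

The results agree with the listing (1.7384325, 1.821911, and 2.3735623) up to rounding, because the listing prints single-precision values.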

Similar documents (content)

  1. Moffat, A.; Isal, R.Y.K.: Word-based text compression using the Burrows-Wheeler transform (2005) 0.32
    0.32157823 = sum of:
      0.32157823 = product of:
        1.3399093 = sum of:
          0.017885305 = weight(abstract_txt:based in 1044) [ClassicSimilarity], result of:
            0.017885305 = score(doc=1044,freq=2.0), product of:
              0.050778847 = queryWeight, product of:
                1.203757 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.013232306 = queryNorm
              0.35221958 = fieldWeight in 1044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
          0.025813472 = weight(abstract_txt:text in 1044) [ClassicSimilarity], result of:
            0.025813472 = score(doc=1044,freq=1.0), product of:
              0.08170705 = queryWeight, product of:
                1.5269583 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013232306 = queryNorm
              0.3159271 = fieldWeight in 1044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
          0.17017439 = weight(abstract_txt:coding in 1044) [ClassicSimilarity], result of:
            0.17017439 = score(doc=1044,freq=2.0), product of:
              0.22800696 = queryWeight, product of:
                2.550771 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.013232306 = queryNorm
              0.7463561 = fieldWeight in 1044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
          0.13297062 = weight(abstract_txt:word in 1044) [ClassicSimilarity], result of:
            0.13297062 = score(doc=1044,freq=2.0), product of:
              0.22142135 = queryWeight, product of:
                3.0785966 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.013232306 = queryNorm
              0.60053205 = fieldWeight in 1044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
          0.4102451 = weight(abstract_txt:arithmetic in 1044) [ClassicSimilarity], result of:
            0.4102451 = score(doc=1044,freq=2.0), product of:
              0.40993425 = queryWeight, product of:
                3.4202237 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.013232306 = queryNorm
              1.0007583 = fieldWeight in 1044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
          0.5828205 = weight(abstract_txt:compression in 1044) [ClassicSimilarity], result of:
            0.5828205 = score(doc=1044,freq=3.0), product of:
              0.5701924 = queryWeight, product of:
                5.7045727 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013232306 = queryNorm
              1.022147 = fieldWeight in 1044, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=1044)
        0.24 = coord(6/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.24
    0.23917767 = sum of:
      0.23917767 = product of:
        0.85420597 = sum of:
          0.050293457 = weight(abstract_txt:algorithms in 604) [ClassicSimilarity], result of:
            0.050293457 = score(doc=604,freq=3.0), product of:
              0.081394024 = queryWeight, product of:
                1.0776523 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.013232306 = queryNorm
              0.6179011 = fieldWeight in 604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.010117456 = weight(abstract_txt:based in 604) [ClassicSimilarity], result of:
            0.010117456 = score(doc=604,freq=1.0), product of:
              0.050778847 = queryWeight, product of:
                1.203757 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.013232306 = queryNorm
              0.19924548 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.24675702 = weight(abstract_txt:segmentation in 604) [ClassicSimilarity], result of:
            0.24675702 = score(doc=604,freq=10.0), product of:
              0.1573276 = queryWeight, product of:
                1.4982522 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.013232306 = queryNorm
              1.5684279 = fieldWeight in 604, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.020650776 = weight(abstract_txt:text in 604) [ClassicSimilarity], result of:
            0.020650776 = score(doc=604,freq=1.0), product of:
              0.08170705 = queryWeight, product of:
                1.5269583 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013232306 = queryNorm
              0.25274166 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.20691432 = weight(abstract_txt:chinese in 604) [ClassicSimilarity], result of:
            0.20691432 = score(doc=604,freq=7.0), product of:
              0.19851637 = queryWeight, product of:
                2.3801022 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013232306 = queryNorm
              1.0423036 = fieldWeight in 604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.10637649 = weight(abstract_txt:word in 604) [ClassicSimilarity], result of:
            0.10637649 = score(doc=604,freq=2.0), product of:
              0.22142135 = queryWeight, product of:
                3.0785966 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.013232306 = queryNorm
              0.48042563 = fieldWeight in 604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.2130965 = weight(abstract_txt:algorithm in 604) [ClassicSimilarity], result of:
            0.2130965 = score(doc=604,freq=6.0), product of:
              0.24396798 = queryWeight, product of:
                3.2315395 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.013232306 = queryNorm
              0.87346095 = fieldWeight in 604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
        0.28 = coord(7/25)
    
  3. Cannane, A.; Williams, H.E.: General-purpose compression for efficient retrieval (2001) 0.24
    0.23867767 = sum of:
      0.23867767 = product of:
        1.1933883 = sum of:
          0.03404953 = weight(abstract_txt:shown in 5705) [ClassicSimilarity], result of:
            0.03404953 = score(doc=5705,freq=1.0), product of:
              0.07799966 = queryWeight, product of:
                1.0549425 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.013232306 = queryNorm
              0.43653435 = fieldWeight in 5705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.078125 = fieldNorm(doc=5705)
          0.012646819 = weight(abstract_txt:based in 5705) [ClassicSimilarity], result of:
            0.012646819 = score(doc=5705,freq=1.0), product of:
              0.050778847 = queryWeight, product of:
                1.203757 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.013232306 = queryNorm
              0.24905685 = fieldWeight in 5705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=5705)
          0.025813472 = weight(abstract_txt:text in 5705) [ClassicSimilarity], result of:
            0.025813472 = score(doc=5705,freq=1.0), product of:
              0.08170705 = queryWeight, product of:
                1.5269583 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013232306 = queryNorm
              0.3159271 = fieldWeight in 5705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=5705)
          0.11140381 = weight(abstract_txt:scheme in 5705) [ClassicSimilarity], result of:
            0.11140381 = score(doc=5705,freq=3.0), product of:
              0.15017174 = queryWeight, product of:
                2.070101 = boost
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.013232306 = queryNorm
              0.7418427 = fieldWeight in 5705, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4822793 = idf(docFreq=499, maxDocs=44218)
                0.078125 = fieldNorm(doc=5705)
          1.0094748 = weight(abstract_txt:compression in 5705) [ClassicSimilarity], result of:
            1.0094748 = score(doc=5705,freq=9.0), product of:
              0.5701924 = queryWeight, product of:
                5.7045727 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.013232306 = queryNorm
              1.7704107 = fieldWeight in 5705, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=5705)
        0.2 = coord(5/25)
    
  4. Lee, K.H.; Ng, M.K.M.; Lu, Q.: Text segmentation for Chinese spell checking (1999) 0.17
    0.17105646 = sum of:
      0.17105646 = product of:
        0.61091596 = sum of:
          0.028049078 = weight(abstract_txt:usage in 3913) [ClassicSimilarity], result of:
            0.028049078 = score(doc=3913,freq=1.0), product of:
              0.07953733 = queryWeight, product of:
                1.0652902 = boost
                5.642448 = idf(docFreq=425, maxDocs=44218)
                0.013232306 = queryNorm
              0.352653 = fieldWeight in 3913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.642448 = idf(docFreq=425, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.03472308 = weight(abstract_txt:suitable in 3913) [ClassicSimilarity], result of:
            0.03472308 = score(doc=3913,freq=1.0), product of:
              0.09170031 = queryWeight, product of:
                1.1438468 = boost
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.013232306 = queryNorm
              0.3786583 = fieldWeight in 3913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.014308243 = weight(abstract_txt:based in 3913) [ClassicSimilarity], result of:
            0.014308243 = score(doc=3913,freq=2.0), product of:
              0.050778847 = queryWeight, product of:
                1.203757 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.013232306 = queryNorm
              0.28177565 = fieldWeight in 3913, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.15606283 = weight(abstract_txt:segmentation in 3913) [ClassicSimilarity], result of:
            0.15606283 = score(doc=3913,freq=4.0), product of:
              0.1573276 = queryWeight, product of:
                1.4982522 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.013232306 = queryNorm
              0.9919609 = fieldWeight in 3913, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.035768192 = weight(abstract_txt:text in 3913) [ClassicSimilarity], result of:
            0.035768192 = score(doc=3913,freq=3.0), product of:
              0.08170705 = queryWeight, product of:
                1.5269583 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013232306 = queryNorm
              0.4377614 = fieldWeight in 3913, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.19156545 = weight(abstract_txt:chinese in 3913) [ClassicSimilarity], result of:
            0.19156545 = score(doc=3913,freq=6.0), product of:
              0.19851637 = queryWeight, product of:
                2.3801022 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013232306 = queryNorm
              0.96498567 = fieldWeight in 3913, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
          0.15043908 = weight(abstract_txt:word in 3913) [ClassicSimilarity], result of:
            0.15043908 = score(doc=3913,freq=4.0), product of:
              0.22142135 = queryWeight, product of:
                3.0785966 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.013232306 = queryNorm
              0.67942446 = fieldWeight in 3913, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=3913)
        0.28 = coord(7/25)
    
  5. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for Chinese text segmentation (2005) 0.16
    0.16046229 = sum of:
      0.16046229 = product of:
        0.6685929 = sum of:
          0.024558371 = weight(abstract_txt:experimental in 4580) [ClassicSimilarity], result of:
            0.024558371 = score(doc=4580,freq=1.0), product of:
              0.07279334 = queryWeight, product of:
                1.0191269 = boost
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.013232306 = queryNorm
              0.3373711 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.014308243 = weight(abstract_txt:based in 4580) [ClassicSimilarity], result of:
            0.014308243 = score(doc=4580,freq=2.0), product of:
              0.050778847 = queryWeight, product of:
                1.203757 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.013232306 = queryNorm
              0.28177565 = fieldWeight in 4580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.23409423 = weight(abstract_txt:segmentation in 4580) [ClassicSimilarity], result of:
            0.23409423 = score(doc=4580,freq=9.0), product of:
              0.1573276 = queryWeight, product of:
                1.4982522 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.013232306 = queryNorm
              1.4879413 = fieldWeight in 4580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.054636817 = weight(abstract_txt:text in 4580) [ClassicSimilarity], result of:
            0.054636817 = score(doc=4580,freq=7.0), product of:
              0.08170705 = queryWeight, product of:
                1.5269583 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013232306 = queryNorm
              0.6686916 = fieldWeight in 4580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.23461878 = weight(abstract_txt:chinese in 4580) [ClassicSimilarity], result of:
            0.23461878 = score(doc=4580,freq=9.0), product of:
              0.19851637 = queryWeight, product of:
                2.3801022 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.013232306 = queryNorm
              1.1818612 = fieldWeight in 4580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.10637649 = weight(abstract_txt:word in 4580) [ClassicSimilarity], result of:
            0.10637649 = score(doc=4580,freq=2.0), product of:
              0.22142135 = queryWeight, product of:
                3.0785966 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.013232306 = queryNorm
              0.48042563 = fieldWeight in 4580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
        0.24 = coord(6/25)
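
Each content hit's final score is the sum of its matched term weights multiplied by coord(matched/25). The sketch below applies that to the first three content hits, using the per-term weights printed in their trees. The helper name and variable names are hypothetical.

```python
def coord_score(term_weights, query_term_count):
    """Sum the matched term weights and scale by the coord factor."""
    coord = len(term_weights) / query_term_count
    return sum(term_weights) * coord


QUERY_TERMS = 25  # denominator of coord(n/25) in the listing

# Per-term weights copied from the listing.
moffat_weights = [0.017885305, 0.025813472, 0.17017439,
                  0.13297062, 0.4102451, 0.5828205]            # doc 1044
wang_weights = [0.050293457, 0.010117456, 0.24675702, 0.020650776,
                0.20691432, 0.10637649, 0.2130965]             # doc 604
cannane_weights = [0.03404953, 0.012646819, 0.025813472,
                   0.11140381, 1.0094748]                      # doc 5705

moffat = coord_score(moffat_weights, QUERY_TERMS)
wang = coord_score(wang_weights, QUERY_TERMS)
cannane = coord_score(cannane_weights, QUERY_TERMS)

print(f"doc 1044: {moffat:.8f}")
print(f"doc 604:  {wang:.8f}")
print(f"doc 5705: {cannane:.8f}")
```

Doc 5705 has the larger raw sum (about 1.19 against 0.85), but doc 604 matches seven query terms instead of five, so its coord factor of 0.28 lifts it just above doc 5705 in the ranking.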