Document (#5422)

Author
Tseng, Y.-H.
Title
Automatic cataloguing and searching for retrospective data by use of OCR text
Source
Journal of the American Society for Information Science and technology. 52(2001) no.5, S.378-390
Year
2001
Abstract
This article describes our efforts in supporting information retrieval from OCR degraded text. In particular, we report our approach to an automatic cataloging and searching contest for books in multiple languages. In this contest, 500 books in English, German, French, and Italian published during the 1770s to 1970s are scanned into images and OCRed to digital text. The goal is to use only automatic ways to extract information for sophisticated searching. We adopted the vector space retrieval model, an n-gram indexing method, and a special weighting scheme to tackle this problem. Although the performance by this approach is slightly inferior to the best approach, which is mainly based on regular expression match, one advantage of our approach is that it is less language dependent and less layout sensitive, thus is readily applicable to other languages and document collections. Problems of OCR text retrieval for some Asian languages are also discussed in this article, and solutions are suggested
Theme
Kataloganreicherung
Object
OCR

Similar documents (author)

  1. Tseng, Y.-H.: Keyword extraction techniques and relevance feedback (1997) 4.55
    4.5489707 = sum of:
      4.5489707 = weight(author_txt:tseng in 2831) [ClassicSimilarity], result of:
        4.5489707 = fieldWeight in 2831, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.097941 = idf(docFreq=12, maxDocs=42740)
          0.5 = fieldNorm(doc=2831)
    
  2. Tseng, Y.-H.: Solving vocabulary problems with interactive query expansion (1998) 4.55
    4.5489707 = sum of:
      4.5489707 = weight(author_txt:tseng in 75) [ClassicSimilarity], result of:
        4.5489707 = fieldWeight in 75, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.097941 = idf(docFreq=12, maxDocs=42740)
          0.5 = fieldNorm(doc=75)
    
  3. Tseng, Y.H.; Lin, Y.I.: Evaluation of fuzzy search, term suggestion, and term relevance feedback in an OPAC system (1998) 4.55
    4.5489707 = sum of:
      4.5489707 = weight(author_txt:tseng in 431) [ClassicSimilarity], result of:
        4.5489707 = fieldWeight in 431, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.097941 = idf(docFreq=12, maxDocs=42740)
          0.5 = fieldNorm(doc=431)
    
  4. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 4.55
    4.5489707 = sum of:
      4.5489707 = weight(author_txt:tseng in 227) [ClassicSimilarity], result of:
        4.5489707 = fieldWeight in 227, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.097941 = idf(docFreq=12, maxDocs=42740)
          0.5 = fieldNorm(doc=227)
    
  5. Drenth, H.; Morris, A.; Tseng, G.: Expert systems as information intermediaries (1991) 3.41
    3.411728 = sum of:
      3.411728 = weight(author_txt:tseng in 3695) [ClassicSimilarity], result of:
        3.411728 = fieldWeight in 3695, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.097941 = idf(docFreq=12, maxDocs=42740)
          0.375 = fieldNorm(doc=3695)
    

Similar documents (content)

  1. Lee, Y.-S.; Wu, Y.-C.; Yang, J.-C.: BVideoQA : Online English/Chinese bilingual video question answering (2009) 0.15
    0.14766842 = sum of:
      0.14766842 = product of:
        0.4614638 = sum of:
          0.061179236 = weight(abstract_txt:weighting in 4740) [ClassicSimilarity], result of:
            0.061179236 = score(doc=4740,freq=1.0), product of:
              0.14022368 = queryWeight, product of:
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.020087168 = queryNorm
              0.43629745 = fieldWeight in 4740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.02032921 = weight(abstract_txt:article in 4740) [ClassicSimilarity], result of:
            0.02032921 = score(doc=4740,freq=1.0), product of:
              0.08475702 = queryWeight, product of:
                1.0994922 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.020087168 = queryNorm
              0.23985283 = fieldWeight in 4740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.085072845 = weight(abstract_txt:asian in 4740) [ClassicSimilarity], result of:
            0.085072845 = score(doc=4740,freq=1.0), product of:
              0.17469452 = queryWeight, product of:
                1.1161665 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.020087168 = queryNorm
              0.4869806 = fieldWeight in 4740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.092057 = weight(abstract_txt:gram in 4740) [ClassicSimilarity], result of:
            0.092057 = score(doc=4740,freq=1.0), product of:
              0.18412943 = queryWeight, product of:
                1.1459111 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.020087168 = queryNorm
              0.49995807 = fieldWeight in 4740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.03886985 = weight(abstract_txt:retrieval in 4740) [ClassicSimilarity], result of:
            0.03886985 = score(doc=4740,freq=3.0), product of:
              0.10363201 = queryWeight, product of:
                1.4890076 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.020087168 = queryNorm
              0.37507573 = fieldWeight in 4740, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.018541692 = weight(abstract_txt:this in 4740) [ClassicSimilarity], result of:
            0.018541692 = score(doc=4740,freq=2.0), product of:
              0.085868046 = queryWeight, product of:
                1.7498069 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.020087168 = queryNorm
              0.21593238 = fieldWeight in 4740, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.03861691 = weight(abstract_txt:approach in 4740) [ClassicSimilarity], result of:
            0.03861691 = score(doc=4740,freq=1.0), product of:
              0.16379112 = queryWeight, product of:
                2.161546 = boost
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.020087168 = queryNorm
              0.23576926 = fieldWeight in 4740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
          0.10679703 = weight(abstract_txt:languages in 4740) [ClassicSimilarity], result of:
            0.10679703 = score(doc=4740,freq=2.0), product of:
              0.23271367 = queryWeight, product of:
                2.2313151 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.020087168 = queryNorm
              0.45892033 = fieldWeight in 4740, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.0625 = fieldNorm(doc=4740)
        0.32 = coord(8/25)
    
  2. Seki, K.; Mostafa, J.: Gene ontology annotation as text categorization : an empirical study (2008) 0.13
    0.12795621 = sum of:
      0.12795621 = product of:
        0.39986318 = sum of:
          0.061179236 = weight(abstract_txt:weighting in 4124) [ClassicSimilarity], result of:
            0.061179236 = score(doc=4124,freq=1.0), product of:
              0.14022368 = queryWeight, product of:
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.020087168 = queryNorm
              0.43629745 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.08012285 = weight(abstract_txt:tackle in 4124) [ClassicSimilarity], result of:
            0.08012285 = score(doc=4124,freq=1.0), product of:
              0.16785061 = queryWeight, product of:
                1.0940843 = boost
                7.637539 = idf(docFreq=55, maxDocs=42740)
                0.020087168 = queryNorm
              0.47734618 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.637539 = idf(docFreq=55, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.02032921 = weight(abstract_txt:article in 4124) [ClassicSimilarity], result of:
            0.02032921 = score(doc=4124,freq=1.0), product of:
              0.08475702 = queryWeight, product of:
                1.0994922 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.020087168 = queryNorm
              0.23985283 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.022441521 = weight(abstract_txt:retrieval in 4124) [ClassicSimilarity], result of:
            0.022441521 = score(doc=4124,freq=1.0), product of:
              0.10363201 = queryWeight, product of:
                1.4890076 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.020087168 = queryNorm
              0.21655008 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.018541692 = weight(abstract_txt:this in 4124) [ClassicSimilarity], result of:
            0.018541692 = score(doc=4124,freq=2.0), product of:
              0.085868046 = queryWeight, product of:
                1.7498069 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.020087168 = queryNorm
              0.21593238 = fieldWeight in 4124, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.03861691 = weight(abstract_txt:approach in 4124) [ClassicSimilarity], result of:
            0.03861691 = score(doc=4124,freq=1.0), product of:
              0.16379112 = queryWeight, product of:
                2.161546 = boost
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.020087168 = queryNorm
              0.23576926 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.07585645 = weight(abstract_txt:automatic in 4124) [ClassicSimilarity], result of:
            0.07585645 = score(doc=4124,freq=1.0), product of:
              0.2334107 = queryWeight, product of:
                2.2346542 = boost
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.020087168 = queryNorm
              0.32499132 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
          0.08277532 = weight(abstract_txt:text in 4124) [ClassicSimilarity], result of:
            0.08277532 = score(doc=4124,freq=3.0), product of:
              0.18879862 = queryWeight, product of:
                2.3206985 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020087168 = queryNorm
              0.43843177 = fieldWeight in 4124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=4124)
        0.32 = coord(8/25)
    
  3. Bookstein, A.; Klein, S.T.; Raita, T.: Clumping properties of content-bearing words (1998) 0.12
    0.12018351 = sum of:
      0.12018351 = product of:
        0.42922682 = sum of:
          0.08861379 = weight(abstract_txt:sensitive in 1443) [ClassicSimilarity], result of:
            0.08861379 = score(doc=1443,freq=1.0), product of:
              0.15469617 = queryWeight, product of:
                1.050338 = boost
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.020087168 = queryNorm
              0.5728248 = fieldWeight in 1443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.025411515 = weight(abstract_txt:article in 1443) [ClassicSimilarity], result of:
            0.025411515 = score(doc=1443,freq=1.0), product of:
              0.08475702 = queryWeight, product of:
                1.0994922 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.020087168 = queryNorm
              0.29981604 = fieldWeight in 1443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.03967138 = weight(abstract_txt:retrieval in 1443) [ClassicSimilarity], result of:
            0.03967138 = score(doc=1443,freq=2.0), product of:
              0.10363201 = queryWeight, product of:
                1.4890076 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.020087168 = queryNorm
              0.3828101 = fieldWeight in 1443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.028386053 = weight(abstract_txt:this in 1443) [ClassicSimilarity], result of:
            0.028386053 = score(doc=1443,freq=3.0), product of:
              0.085868046 = queryWeight, product of:
                1.7498069 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.020087168 = queryNorm
              0.3305776 = fieldWeight in 1443, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.0682657 = weight(abstract_txt:approach in 1443) [ClassicSimilarity], result of:
            0.0682657 = score(doc=1443,freq=2.0), product of:
              0.16379112 = queryWeight, product of:
                2.161546 = boost
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.020087168 = queryNorm
              0.41678512 = fieldWeight in 1443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.772308 = idf(docFreq=2671, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.09439614 = weight(abstract_txt:languages in 1443) [ClassicSimilarity], result of:
            0.09439614 = score(doc=1443,freq=1.0), product of:
              0.23271367 = queryWeight, product of:
                2.2313151 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.020087168 = queryNorm
              0.4056321 = fieldWeight in 1443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
          0.08448221 = weight(abstract_txt:text in 1443) [ClassicSimilarity], result of:
            0.08448221 = score(doc=1443,freq=2.0), product of:
              0.18879862 = queryWeight, product of:
                2.3206985 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020087168 = queryNorm
              0.44747257 = fieldWeight in 1443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=1443)
        0.28 = coord(7/25)
    
  4. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 0.11
    0.11446174 = sum of:
      0.11446174 = product of:
        0.5723087 = sum of:
          0.2323571 = weight(abstract_txt:scanned in 31) [ClassicSimilarity], result of:
            0.2323571 = score(doc=31,freq=2.0), product of:
              0.1865609 = queryWeight, product of:
                1.1534523 = boost
                8.051972 = idf(docFreq=36, maxDocs=42740)
                0.020087168 = queryNorm
              1.2454759 = fieldWeight in 31, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.051972 = idf(docFreq=36, maxDocs=42740)
                0.109375 = fieldNorm(doc=31)
          0.03927266 = weight(abstract_txt:retrieval in 31) [ClassicSimilarity], result of:
            0.03927266 = score(doc=31,freq=1.0), product of:
              0.10363201 = queryWeight, product of:
                1.4890076 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.020087168 = queryNorm
              0.37896264 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.109375 = fieldNorm(doc=31)
          0.0945359 = weight(abstract_txt:less in 31) [ClassicSimilarity], result of:
            0.0945359 = score(doc=31,freq=1.0), product of:
              0.16260523 = queryWeight, product of:
                1.5229006 = boost
                5.315501 = idf(docFreq=570, maxDocs=42740)
                0.020087168 = queryNorm
              0.58138293 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.315501 = idf(docFreq=570, maxDocs=42740)
                0.109375 = fieldNorm(doc=31)
          0.07339431 = weight(abstract_txt:searching in 31) [ClassicSimilarity], result of:
            0.07339431 = score(doc=31,freq=1.0), product of:
              0.15723239 = queryWeight, product of:
                1.8340913 = boost
                4.267783 = idf(docFreq=1627, maxDocs=42740)
                0.020087168 = queryNorm
              0.46678877 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.267783 = idf(docFreq=1627, maxDocs=42740)
                0.109375 = fieldNorm(doc=31)
          0.1327488 = weight(abstract_txt:automatic in 31) [ClassicSimilarity], result of:
            0.1327488 = score(doc=31,freq=1.0), product of:
              0.2334107 = queryWeight, product of:
                2.2346542 = boost
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.020087168 = queryNorm
              0.5687348 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.199861 = idf(docFreq=640, maxDocs=42740)
                0.109375 = fieldNorm(doc=31)
        0.2 = coord(5/25)
    
  5. Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.11
    0.11151246 = sum of:
      0.11151246 = product of:
        0.46463525 = sum of:
          0.025411515 = weight(abstract_txt:article in 4140) [ClassicSimilarity], result of:
            0.025411515 = score(doc=4140,freq=1.0), product of:
              0.08475702 = queryWeight, product of:
                1.0994922 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.020087168 = queryNorm
              0.29981604 = fieldWeight in 4140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
          0.16929163 = weight(abstract_txt:degraded in 4140) [ClassicSimilarity], result of:
            0.16929163 = score(doc=4140,freq=1.0), product of:
              0.23817836 = queryWeight, product of:
                1.3032882 = boost
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.020087168 = queryNorm
              0.7107767 = fieldWeight in 4140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
          0.03967138 = weight(abstract_txt:retrieval in 4140) [ClassicSimilarity], result of:
            0.03967138 = score(doc=4140,freq=2.0), product of:
              0.10363201 = queryWeight, product of:
                1.4890076 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.020087168 = queryNorm
              0.3828101 = fieldWeight in 4140, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
          0.016388696 = weight(abstract_txt:this in 4140) [ClassicSimilarity], result of:
            0.016388696 = score(doc=4140,freq=1.0), product of:
              0.085868046 = queryWeight, product of:
                1.7498069 = boost
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.020087168 = queryNorm
              0.19085906 = fieldWeight in 4140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.442996 = idf(docFreq=10095, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
          0.09439614 = weight(abstract_txt:languages in 4140) [ClassicSimilarity], result of:
            0.09439614 = score(doc=4140,freq=1.0), product of:
              0.23271367 = queryWeight, product of:
                2.2313151 = boost
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.020087168 = queryNorm
              0.4056321 = fieldWeight in 4140, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.192091 = idf(docFreq=645, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
          0.119475886 = weight(abstract_txt:text in 4140) [ClassicSimilarity], result of:
            0.119475886 = score(doc=4140,freq=4.0), product of:
              0.18879862 = queryWeight, product of:
                2.3206985 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020087168 = queryNorm
              0.6328218 = fieldWeight in 4140, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=4140)
        0.24 = coord(6/25)