Document (#5422)

Author
Tseng, Y.-H.
Title
Automatic cataloguing and searching for retrospective data by use of OCR text
Source
Journal of the American Society for Information Science and Technology. 52(2001) no.5, pp.378-390
Year
2001
Abstract
This article describes our efforts in supporting information retrieval from OCR-degraded text. In particular, we report our approach to an automatic cataloging and searching contest for books in multiple languages. In this contest, 500 books in English, German, French, and Italian, published from the 1770s to the 1970s, were scanned into images and OCRed to digital text. The goal is to use only automatic means to extract information for sophisticated searching. We adopted the vector space retrieval model, an n-gram indexing method, and a special weighting scheme to tackle this problem. Although the performance of this approach is slightly inferior to that of the best approach, which is mainly based on regular expression matching, our approach has the advantage of being less language dependent and less layout sensitive, and is thus readily applicable to other languages and document collections. Problems of OCR text retrieval for some Asian languages are also discussed in this article, and solutions are suggested.
Theme
Catalog enrichment (Kataloganreicherung)
Object
OCR
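
The abstract names a vector space model combined with n-gram indexing as the way to cope with OCR errors. The following is a minimal sketch of that general idea, not the article's implementation: the trigram length, the raw-count weighting, and the sample strings are all assumptions chosen for illustration, and the article's "special weighting scheme" is not reproduced here.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Return overlapping character n-grams of a lowercased string.

    Runs of whitespace are collapsed first so layout differences do not
    change the n-grams. The default n=3 is an illustrative assumption.
    """
    cleaned = " ".join(text.lower().split())
    return [cleaned[i:i + n] for i in range(len(cleaned) - n + 1)]


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[g] * b[g] for g in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical sample strings; "catal0guing" imitates an OCR error (o -> 0).
query = Counter(char_ngrams("cataloguing"))
clean = Counter(char_ngrams("automatic cataloguing of books"))
noisy = Counter(char_ngrams("automatic catal0guing of books"))
unrelated = Counter(char_ngrams("gene ontology annotation"))

sim_clean = cosine(query, clean)
sim_noisy = cosine(query, noisy)
sim_unrelated = cosine(query, unrelated)

print(f"clean={sim_clean:.3f} noisy={sim_noisy:.3f} unrelated={sim_unrelated:.3f}")
```

A whole-word index would not match "cataloguing" against the misrecognized "catal0guing" at all, whereas the n-gram vectors still share most of their trigrams, so the damaged text keeps a clearly higher similarity than the unrelated one.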

Similar documents (author)

  1. Tseng, Y.-H.: Keyword extraction techniques and relevance feedback (1997) 4.57
    4.565969 = sum of:
      4.565969 = weight(author_txt:tseng in 1830) [ClassicSimilarity], result of:
        4.565969 = fieldWeight in 1830, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.5 = fieldNorm(doc=1830)
    
  2. Tseng, Y.-H.: Solving vocabulary problems with interactive query expansion (1998) 4.57
    4.565969 = sum of:
      4.565969 = weight(author_txt:tseng in 5159) [ClassicSimilarity], result of:
        4.565969 = fieldWeight in 5159, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.5 = fieldNorm(doc=5159)
    
  3. Tseng, Y.H.; Lin, Y.I.: Evaluation of fuzzy search, term suggestion, and term relevance feedback in an OPAC system (1998) 4.57
    4.565969 = sum of:
      4.565969 = weight(author_txt:tseng in 6430) [ClassicSimilarity], result of:
        4.565969 = fieldWeight in 6430, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.5 = fieldNorm(doc=6430)
    
  4. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 4.57
    4.565969 = sum of:
      4.565969 = weight(author_txt:tseng in 5226) [ClassicSimilarity], result of:
        4.565969 = fieldWeight in 5226, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.5 = fieldNorm(doc=5226)
    
  5. Drenth, H.; Morris, A.; Tseng, G.: Expert systems as information intermediaries (1991) 3.42
    3.4244766 = sum of:
      3.4244766 = weight(author_txt:tseng in 3695) [ClassicSimilarity], result of:
        3.4244766 = fieldWeight in 3695, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.375 = fieldNorm(doc=3695)
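
The author scores above can be reproduced by hand. The sketch below assumes the standard Lucene ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the fieldNorm values are simply copied from the listing rather than derived, and the function names are hypothetical.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the listing: the term "tseng" occurs in 12 of 44218 records.
idf_tseng = classic_idf(12, 44218)
score_entry_1 = field_weight(1.0, 12, 44218, 0.5)    # fieldNorm of entries 1-4
score_entry_5 = field_weight(1.0, 12, 44218, 0.375)  # fieldNorm of entry 5

print(round(idf_tseng, 4), round(score_entry_1, 4), round(score_entry_5, 4))
```

Only fieldNorm differs between the entries, which is why entry 5 ranks below the other four; the smaller norm presumably reflects a longer author field. Small differences in the last digits against the listing are expected, since Lucene computes in single precision.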
    

Similar documents (content)

  1. Lee, Y.-S.; Wu, Y.-C.; Yang, J.-C.: BVideoQA : Online English/Chinese bilingual video question answering (2009) 0.15
    0.14641148 = sum of:
      0.14641148 = product of:
        0.4575359 = sum of:
          0.061178364 = weight(abstract_txt:weighting in 2739) [ClassicSimilarity], result of:
            0.061178364 = score(doc=2739,freq=1.0), product of:
              0.14026932 = queryWeight, product of:
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.020100534 = queryNorm
              0.43614927 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.019820064 = weight(abstract_txt:article in 2739) [ClassicSimilarity], result of:
            0.019820064 = score(doc=2739,freq=1.0), product of:
              0.0833638 = queryWeight, product of:
                1.0902407 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.020100534 = queryNorm
              0.23775385 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.0849348 = weight(abstract_txt:asian in 2739) [ClassicSimilarity], result of:
            0.0849348 = score(doc=2739,freq=1.0), product of:
              0.17456396 = queryWeight, product of:
                1.1155677 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.020100534 = queryNorm
              0.48655403 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.089967586 = weight(abstract_txt:gram in 2739) [ClassicSimilarity], result of:
            0.089967586 = score(doc=2739,freq=1.0), product of:
              0.18139341 = queryWeight, product of:
                1.1371804 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.020100534 = queryNorm
              0.49598044 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.03925826 = weight(abstract_txt:retrieval in 2739) [ClassicSimilarity], result of:
            0.03925826 = score(doc=2739,freq=3.0), product of:
              0.104356185 = queryWeight, product of:
                1.4939579 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020100534 = queryNorm
              0.37619486 = fieldWeight in 2739, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.01788541 = weight(abstract_txt:this in 2739) [ClassicSimilarity], result of:
            0.01788541 = score(doc=2739,freq=2.0), product of:
              0.08385779 = queryWeight, product of:
                1.7289218 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020100534 = queryNorm
              0.21328263 = fieldWeight in 2739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.03783224 = weight(abstract_txt:approach in 2739) [ClassicSimilarity], result of:
            0.03783224 = score(doc=2739,freq=1.0), product of:
              0.16161892 = queryWeight, product of:
                2.1468155 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.020100534 = queryNorm
              0.234083 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.10665918 = weight(abstract_txt:languages in 2739) [ClassicSimilarity], result of:
            0.10665918 = score(doc=2739,freq=2.0), product of:
              0.23259126 = queryWeight, product of:
                2.230365 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.020100534 = queryNorm
              0.45856917 = fieldWeight in 2739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
        0.32 = coord(8/25)
    
  2. Seki, K.; Mostafa, J.: Gene ontology annotation as text categorization : an empirical study (2008) 0.13
    0.126622 = sum of:
      0.126622 = product of:
        0.39569378 = sum of:
          0.061178364 = weight(abstract_txt:weighting in 2123) [ClassicSimilarity], result of:
            0.061178364 = score(doc=2123,freq=1.0), product of:
              0.14026932 = queryWeight, product of:
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.020100534 = queryNorm
              0.43614927 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.07808679 = weight(abstract_txt:tackle in 2123) [ClassicSimilarity], result of:
            0.07808679 = score(doc=2123,freq=1.0), product of:
              0.16505013 = queryWeight, product of:
                1.0847423 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.020100534 = queryNorm
              0.47310954 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.019820064 = weight(abstract_txt:article in 2123) [ClassicSimilarity], result of:
            0.019820064 = score(doc=2123,freq=1.0), product of:
              0.0833638 = queryWeight, product of:
                1.0902407 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.020100534 = queryNorm
              0.23775385 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.022665767 = weight(abstract_txt:retrieval in 2123) [ClassicSimilarity], result of:
            0.022665767 = score(doc=2123,freq=1.0), product of:
              0.104356185 = queryWeight, product of:
                1.4939579 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020100534 = queryNorm
              0.21719621 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.01788541 = weight(abstract_txt:this in 2123) [ClassicSimilarity], result of:
            0.01788541 = score(doc=2123,freq=2.0), product of:
              0.08385779 = queryWeight, product of:
                1.7289218 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020100534 = queryNorm
              0.21328263 = fieldWeight in 2123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.03783224 = weight(abstract_txt:approach in 2123) [ClassicSimilarity], result of:
            0.03783224 = score(doc=2123,freq=1.0), product of:
              0.16161892 = queryWeight, product of:
                2.1468155 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.020100534 = queryNorm
              0.234083 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.0757461 = weight(abstract_txt:automatic in 2123) [ClassicSimilarity], result of:
            0.0757461 = score(doc=2123,freq=1.0), product of:
              0.23326239 = queryWeight, product of:
                2.2335806 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020100534 = queryNorm
              0.32472485 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
          0.08247904 = weight(abstract_txt:text in 2123) [ClassicSimilarity], result of:
            0.08247904 = score(doc=2123,freq=3.0), product of:
              0.18841094 = queryWeight, product of:
                2.3179374 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020100534 = queryNorm
              0.4377614 = fieldWeight in 2123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2123)
        0.32 = coord(8/25)
    
  3. Bookstein, A.; Klein, S.T.; Raita, T.: Clumping properties of content-bearing words (1998) 0.12
    0.119177386 = sum of:
      0.119177386 = product of:
        0.42563352 = sum of:
          0.08807662 = weight(abstract_txt:sensitive in 442) [ClassicSimilarity], result of:
            0.08807662 = score(doc=442,freq=1.0), product of:
              0.15412198 = queryWeight, product of:
                1.0482163 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.020100534 = queryNorm
              0.5714735 = fieldWeight in 442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.02477508 = weight(abstract_txt:article in 442) [ClassicSimilarity], result of:
            0.02477508 = score(doc=442,freq=1.0), product of:
              0.0833638 = queryWeight, product of:
                1.0902407 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.020100534 = queryNorm
              0.2971923 = fieldWeight in 442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.040067792 = weight(abstract_txt:retrieval in 442) [ClassicSimilarity], result of:
            0.040067792 = score(doc=442,freq=2.0), product of:
              0.104356185 = queryWeight, product of:
                1.4939579 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020100534 = queryNorm
              0.38395226 = fieldWeight in 442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.027381327 = weight(abstract_txt:this in 442) [ClassicSimilarity], result of:
            0.027381327 = score(doc=442,freq=3.0), product of:
              0.08385779 = queryWeight, product of:
                1.7289218 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020100534 = queryNorm
              0.32652098 = fieldWeight in 442, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.06687858 = weight(abstract_txt:approach in 442) [ClassicSimilarity], result of:
            0.06687858 = score(doc=442,freq=2.0), product of:
              0.16161892 = queryWeight, product of:
                2.1468155 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.020100534 = queryNorm
              0.41380417 = fieldWeight in 442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.09427429 = weight(abstract_txt:languages in 442) [ClassicSimilarity], result of:
            0.09427429 = score(doc=442,freq=1.0), product of:
              0.23259126 = queryWeight, product of:
                2.230365 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.020100534 = queryNorm
              0.40532172 = fieldWeight in 442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
          0.08417982 = weight(abstract_txt:text in 442) [ClassicSimilarity], result of:
            0.08417982 = score(doc=442,freq=2.0), product of:
              0.18841094 = queryWeight, product of:
                2.3179374 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020100534 = queryNorm
              0.44678837 = fieldWeight in 442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=442)
        0.28 = coord(7/25)
    
  4. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 0.11
    0.114544414 = sum of:
      0.114544414 = product of:
        0.5727221 = sum of:
          0.23322642 = weight(abstract_txt:scanned in 6961) [ClassicSimilarity], result of:
            0.23322642 = score(doc=6961,freq=2.0), product of:
              0.18708858 = queryWeight, product of:
                1.1548944 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.020100534 = queryNorm
              1.2466096 = fieldWeight in 6961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.03966509 = weight(abstract_txt:retrieval in 6961) [ClassicSimilarity], result of:
            0.03966509 = score(doc=6961,freq=1.0), product of:
              0.104356185 = queryWeight, product of:
                1.4939579 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020100534 = queryNorm
              0.38009337 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.09292829 = weight(abstract_txt:less in 6961) [ClassicSimilarity], result of:
            0.09292829 = score(doc=6961,freq=1.0), product of:
              0.16081038 = queryWeight, product of:
                1.514226 = boost
                5.283428 = idf(docFreq=609, maxDocs=44218)
                0.020100534 = queryNorm
              0.57787496 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.283428 = idf(docFreq=609, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.074346624 = weight(abstract_txt:searching in 6961) [ClassicSimilarity], result of:
            0.074346624 = score(doc=6961,freq=1.0), product of:
              0.15864268 = queryWeight, product of:
                1.8419986 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.020100534 = queryNorm
              0.46864203 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
          0.13255566 = weight(abstract_txt:automatic in 6961) [ClassicSimilarity], result of:
            0.13255566 = score(doc=6961,freq=1.0), product of:
              0.23326239 = queryWeight, product of:
                2.2335806 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.020100534 = queryNorm
              0.5682685 = fieldWeight in 6961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.109375 = fieldNorm(doc=6961)
        0.2 = coord(5/25)
    
  5. Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.11
    0.11168223 = sum of:
      0.11168223 = product of:
        0.46534264 = sum of:
          0.02477508 = weight(abstract_txt:article in 4071) [ClassicSimilarity], result of:
            0.02477508 = score(doc=4071,freq=1.0), product of:
              0.0833638 = queryWeight, product of:
                1.0902407 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.020100534 = queryNorm
              0.2971923 = fieldWeight in 4071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
          0.17136864 = weight(abstract_txt:degraded in 4071) [ClassicSimilarity], result of:
            0.17136864 = score(doc=4071,freq=1.0), product of:
              0.24020296 = queryWeight, product of:
                1.3086027 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.020100534 = queryNorm
              0.71343267 = fieldWeight in 4071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
          0.040067792 = weight(abstract_txt:retrieval in 4071) [ClassicSimilarity], result of:
            0.040067792 = score(doc=4071,freq=2.0), product of:
              0.104356185 = queryWeight, product of:
                1.4939579 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020100534 = queryNorm
              0.38395226 = fieldWeight in 4071, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
          0.015808618 = weight(abstract_txt:this in 4071) [ClassicSimilarity], result of:
            0.015808618 = score(doc=4071,freq=1.0), product of:
              0.08385779 = queryWeight, product of:
                1.7289218 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.020100534 = queryNorm
              0.18851699 = fieldWeight in 4071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
          0.09427429 = weight(abstract_txt:languages in 4071) [ClassicSimilarity], result of:
            0.09427429 = score(doc=4071,freq=1.0), product of:
              0.23259126 = queryWeight, product of:
                2.230365 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.020100534 = queryNorm
              0.40532172 = fieldWeight in 4071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
          0.11904824 = weight(abstract_txt:text in 4071) [ClassicSimilarity], result of:
            0.11904824 = score(doc=4071,freq=4.0), product of:
              0.18841094 = queryWeight, product of:
                2.3179374 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020100534 = queryNorm
              0.6318542 = fieldWeight in 4071, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=4071)
        0.24 = coord(6/25)
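
The content-based scores combine more factors than the author scores. As a worked check, the sketch below recomputes entry 4 (doc 6961) from the numbers shown in its listing, assuming the standard Lucene ClassicSimilarity formulas: queryWeight = boost * idf * queryNorm, fieldWeight = sqrt(freq) * idf * fieldNorm, and a final coord factor equal to the fraction of query terms matched. The boosts, queryNorm, and fieldNorm are copied from the listing, not derived, and the variable names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.020100534   # copied from the listing
FIELD_NORM = 0.109375      # fieldNorm(doc=6961), copied from the listing
QUERY_TERMS = 25           # denominator of coord(5/25)

# (term, boost, docFreq, termFreq in doc 6961), all taken from the listing.
matched_terms = [
    ("scanned",   1.1548944,   37, 2.0),
    ("retrieval", 1.4939579, 3720, 1.0),
    ("less",      1.514226,   609, 1.0),
    ("searching", 1.8419986, 1655, 1.0),
    ("automatic", 2.2335806,  665, 1.0),
]


def idf(doc_freq):
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost, doc_freq, freq):
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(b, df, f) for _, b, df, f in matched_terms)
coord = len(matched_terms) / QUERY_TERMS
total = raw_sum * coord

print(round(raw_sum, 5), coord, round(total, 5))
```

The result agrees with the listed 0.5727221 (sum) and 0.114544414 (final score) up to single-precision rounding. Terms whose listing shows no boost line, such as "weighting" in entries 1 and 2, correspond to a boost of 1.0 in this formula.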