Document (#40623)

Author
Collovini de Abreu, S.
Vieira, R.
Title
RelP: Portuguese open relation extraction
Source
Knowledge organization. 44(2017) no.3, S.163-177
Year
2017
Abstract
Natural language texts are valuable data sources in many human activities. NLP techniques are being widely used in order to help find the right information to specific needs. In this paper, we present one such technique: relation extraction from texts. This task aims at identifying and classifying semantic relations that occur between entities in a text. For example, the sentence "Roberto Marinho is the founder of Rede Globo" expresses a relation occurring between "Roberto Marinho" and "Rede Globo." This work presents a system for Portuguese Open Relation Extraction, named RelP, which extracts any relation descriptor that describes an explicit relation between named entities in the organisation domain by applying the Conditional Random Fields. For implementing RelP, we define the representation scheme, features based on previous work, and a reference corpus. RelP achieved state of the art results for open relation extraction; the F-measure rate was around 60% between the named entities person, organisation and place. For better understanding of the output, we present a way for organizing the output from the mining of the extracted relation descriptors. This organization can be useful to classify relation types, to cluster the entities involved in a common relation and to populate datasets.
Content
Beitrag in einem Special Issue "New Trends for Knowledge Organization, Guest Editor: Renato Rocha Souza".
Theme
Computerlinguistik

Similar documents (author)

  1. Vieira, L.: Modèle d'analyse pur une classification du document iconographique (1999) 5.91
    5.9139314 = sum of:
      5.9139314 = weight(author_txt:vieira in 321) [ClassicSimilarity], result of:
        5.9139314 = score(doc=321,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          5.913932 = fieldWeight in 321, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.625 = fieldNorm(doc=321)
    
  2. Vieira, S. Bastos => Bastos Vieira, S.: 5.02
    5.018137 = sum of:
      5.018137 = weight(author_txt:vieira in 326) [ClassicSimilarity], result of:
        5.018137 = score(doc=326,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          5.0181375 = fieldWeight in 326, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.375 = fieldNorm(doc=326)
    
  3. Vieira, E.S.; Cabral, J.A.S.; Gomes, J.A.N.F.: Definition of a model based on bibliometric indicators for assessing applicants to academic positions (2014) 3.55
    3.5483587 = sum of:
      3.5483587 = weight(author_txt:vieira in 2222) [ClassicSimilarity], result of:
        3.5483587 = score(doc=2222,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          3.548359 = fieldWeight in 2222, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.375 = fieldNorm(doc=2222)
    
  4. Carvalho, J.R. de; Cordeiro, M.I.; Lopes, A.; Vieira, M.: Meta-information about MARC : an XML framework for validation, explanation and help systems (2004) 2.96
    2.9569657 = sum of:
      2.9569657 = weight(author_txt:vieira in 3849) [ClassicSimilarity], result of:
        2.9569657 = score(doc=3849,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          2.956966 = fieldWeight in 3849, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.3125 = fieldNorm(doc=3849)
    
  5. Bastos Vieira, S.; DeBrito, M.; Mustafa El Hadi, W.; Zumer, M.: Developing imaged KOS with the FRSAD Model : a conceptual methodology (2016) 2.37
    2.3655725 = sum of:
      2.3655725 = weight(author_txt:vieira in 4110) [ClassicSimilarity], result of:
        2.3655725 = score(doc=4110,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          2.3655727 = fieldWeight in 4110, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.25 = fieldNorm(doc=4110)
    

Similar documents (content)

  1. Vo, D.-T.; Bagheri, E.: Feature-enriched matrix factorization for relation extraction (2019) 0.33
    0.32913992 = sum of:
      0.32913992 = product of:
        0.91427755 = sum of:
          0.03166461 = weight(abstract_txt:datasets in 703) [ClassicSimilarity], result of:
            0.03166461 = score(doc=703,freq=1.0), product of:
              0.08563114 = queryWeight, product of:
                1.0086501 = boost
                6.761676 = idf(docFreq=133, maxDocs=42596)
                0.012555582 = queryNorm
              0.36977914 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.761676 = idf(docFreq=133, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.023001589 = weight(abstract_txt:work in 703) [ClassicSimilarity], result of:
            0.023001589 = score(doc=703,freq=4.0), product of:
              0.054921728 = queryWeight, product of:
                1.1423831 = boost
                3.82909 = idf(docFreq=2515, maxDocs=42596)
                0.012555582 = queryNorm
              0.41880673 = fieldWeight in 703, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.82909 = idf(docFreq=2515, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.008462444 = weight(abstract_txt:this in 703) [ClassicSimilarity], result of:
            0.008462444 = score(doc=703,freq=2.0), product of:
              0.044763375 = queryWeight, product of:
                1.4585323 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.012555582 = queryNorm
              0.1890484 = fieldWeight in 703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.024585791 = weight(abstract_txt:between in 703) [ClassicSimilarity], result of:
            0.024585791 = score(doc=703,freq=2.0), product of:
              0.091141276 = queryWeight, product of:
                2.0811923 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.012555582 = queryNorm
              0.26975474 = fieldWeight in 703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.03633758 = weight(abstract_txt:open in 703) [ClassicSimilarity], result of:
            0.03633758 = score(doc=703,freq=1.0), product of:
              0.13537133 = queryWeight, product of:
                2.196588 = boost
                4.9084144 = idf(docFreq=854, maxDocs=42596)
                0.012555582 = queryNorm
              0.26842892 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9084144 = idf(docFreq=854, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.0993601 = weight(abstract_txt:named in 703) [ClassicSimilarity], result of:
            0.0993601 = score(doc=703,freq=1.0), product of:
              0.26470616 = queryWeight, product of:
                3.0716186 = boost
                6.863725 = idf(docFreq=120, maxDocs=42596)
                0.012555582 = queryNorm
              0.37535998 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.863725 = idf(docFreq=120, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.0832398 = weight(abstract_txt:entities in 703) [ClassicSimilarity], result of:
            0.0832398 = score(doc=703,freq=1.0), product of:
              0.25891447 = queryWeight, product of:
                3.5077837 = boost
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.012555582 = queryNorm
              0.32149535 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.17017244 = weight(abstract_txt:extraction in 703) [ClassicSimilarity], result of:
            0.17017244 = score(doc=703,freq=3.0), product of:
              0.28917098 = queryWeight, product of:
                3.7070804 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.012555582 = queryNorm
              0.5884838 = fieldWeight in 703, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
          0.43745324 = weight(abstract_txt:relation in 703) [ClassicSimilarity], result of:
            0.43745324 = score(doc=703,freq=7.0), product of:
              0.5552698 = queryWeight, product of:
                8.122256 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.012555582 = queryNorm
              0.7878211 = fieldWeight in 703, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0546875 = fieldNorm(doc=703)
        0.36 = coord(9/25)
    
  2. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.27
    0.2678208 = sum of:
      0.2678208 = product of:
        1.1159201 = sum of:
          0.008548359 = weight(abstract_txt:this in 2791) [ClassicSimilarity], result of:
            0.008548359 = score(doc=2791,freq=1.0), product of:
              0.044763375 = queryWeight, product of:
                1.4585323 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.012555582 = queryNorm
              0.19096771 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.0248354 = weight(abstract_txt:between in 2791) [ClassicSimilarity], result of:
            0.0248354 = score(doc=2791,freq=1.0), product of:
              0.091141276 = queryWeight, product of:
                2.0811923 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.012555582 = queryNorm
              0.27249345 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.141943 = weight(abstract_txt:named in 2791) [ClassicSimilarity], result of:
            0.141943 = score(doc=2791,freq=1.0), product of:
              0.26470616 = queryWeight, product of:
                3.0716186 = boost
                6.863725 = idf(docFreq=120, maxDocs=42596)
                0.012555582 = queryNorm
              0.53622854 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.863725 = idf(docFreq=120, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.118914 = weight(abstract_txt:entities in 2791) [ClassicSimilarity], result of:
            0.118914 = score(doc=2791,freq=1.0), product of:
              0.25891447 = queryWeight, product of:
                3.5077837 = boost
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.012555582 = queryNorm
              0.4592791 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.2431035 = weight(abstract_txt:extraction in 2791) [ClassicSimilarity], result of:
            0.2431035 = score(doc=2791,freq=3.0), product of:
              0.28917098 = queryWeight, product of:
                3.7070804 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.012555582 = queryNorm
              0.8406912 = fieldWeight in 2791, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
          0.5785758 = weight(abstract_txt:relation in 2791) [ClassicSimilarity], result of:
            0.5785758 = score(doc=2791,freq=6.0), product of:
              0.5552698 = queryWeight, product of:
                8.122256 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.012555582 = queryNorm
              1.0419724 = fieldWeight in 2791, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.078125 = fieldNorm(doc=2791)
        0.24 = coord(6/25)
    
  3. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.24
    0.24146609 = sum of:
      0.24146609 = product of:
        1.0061088 = sum of:
          0.053920954 = weight(abstract_txt:sentence in 653) [ClassicSimilarity], result of:
            0.053920954 = score(doc=653,freq=2.0), product of:
              0.08866442 = queryWeight, product of:
                1.0263591 = boost
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.012555582 = queryNorm
              0.6081465 = fieldWeight in 653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.0068386877 = weight(abstract_txt:this in 653) [ClassicSimilarity], result of:
            0.0068386877 = score(doc=653,freq=1.0), product of:
              0.044763375 = queryWeight, product of:
                1.4585323 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.012555582 = queryNorm
              0.15277417 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.028098047 = weight(abstract_txt:between in 653) [ClassicSimilarity], result of:
            0.028098047 = score(doc=653,freq=2.0), product of:
              0.091141276 = queryWeight, product of:
                2.0811923 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.012555582 = queryNorm
              0.30829114 = fieldWeight in 653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.0951312 = weight(abstract_txt:entities in 653) [ClassicSimilarity], result of:
            0.0951312 = score(doc=653,freq=1.0), product of:
              0.25891447 = queryWeight, product of:
                3.5077837 = boost
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.012555582 = queryNorm
              0.36742327 = fieldWeight in 653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.2245694 = weight(abstract_txt:extraction in 653) [ClassicSimilarity], result of:
            0.2245694 = score(doc=653,freq=4.0), product of:
              0.28917098 = queryWeight, product of:
                3.7070804 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.012555582 = queryNorm
              0.77659726 = fieldWeight in 653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
          0.59755045 = weight(abstract_txt:relation in 653) [ClassicSimilarity], result of:
            0.59755045 = score(doc=653,freq=10.0), product of:
              0.5552698 = queryWeight, product of:
                8.122256 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.012555582 = queryNorm
              1.0761443 = fieldWeight in 653, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0625 = fieldNorm(doc=653)
        0.24 = coord(6/25)
    
  4. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 0.18
    0.17772298 = sum of:
      0.17772298 = product of:
        0.8886149 = sum of:
          0.013677375 = weight(abstract_txt:this in 2107) [ClassicSimilarity], result of:
            0.013677375 = score(doc=2107,freq=4.0), product of:
              0.044763375 = queryWeight, product of:
                1.4585323 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.012555582 = queryNorm
              0.30554834 = fieldWeight in 2107, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.0625 = fieldNorm(doc=2107)
          0.019868322 = weight(abstract_txt:between in 2107) [ClassicSimilarity], result of:
            0.019868322 = score(doc=2107,freq=1.0), product of:
              0.091141276 = queryWeight, product of:
                2.0811923 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.012555582 = queryNorm
              0.21799476 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.0625 = fieldNorm(doc=2107)
          0.0951312 = weight(abstract_txt:entities in 2107) [ClassicSimilarity], result of:
            0.0951312 = score(doc=2107,freq=1.0), product of:
              0.25891447 = queryWeight, product of:
                3.5077837 = boost
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.012555582 = queryNorm
              0.36742327 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.0625 = fieldNorm(doc=2107)
          0.2970774 = weight(abstract_txt:extraction in 2107) [ClassicSimilarity], result of:
            0.2970774 = score(doc=2107,freq=7.0), product of:
              0.28917098 = queryWeight, product of:
                3.7070804 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.012555582 = queryNorm
              1.0273416 = fieldWeight in 2107, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0625 = fieldNorm(doc=2107)
          0.4628606 = weight(abstract_txt:relation in 2107) [ClassicSimilarity], result of:
            0.4628606 = score(doc=2107,freq=6.0), product of:
              0.5552698 = queryWeight, product of:
                8.122256 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.012555582 = queryNorm
              0.8335779 = fieldWeight in 2107, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0625 = fieldNorm(doc=2107)
        0.2 = coord(5/25)
    
  5. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.16
    0.16444862 = sum of:
      0.16444862 = product of:
        0.8222431 = sum of:
          0.009671365 = weight(abstract_txt:this in 3235) [ClassicSimilarity], result of:
            0.009671365 = score(doc=3235,freq=2.0), product of:
              0.044763375 = queryWeight, product of:
                1.4585323 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.012555582 = queryNorm
              0.2160553 = fieldWeight in 3235, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.0625 = fieldNorm(doc=3235)
          0.019868322 = weight(abstract_txt:between in 3235) [ClassicSimilarity], result of:
            0.019868322 = score(doc=3235,freq=1.0), product of:
              0.091141276 = queryWeight, product of:
                2.0811923 = boost
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.012555582 = queryNorm
              0.21799476 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4879162 = idf(docFreq=3538, maxDocs=42596)
                0.0625 = fieldNorm(doc=3235)
          0.0951312 = weight(abstract_txt:entities in 3235) [ClassicSimilarity], result of:
            0.0951312 = score(doc=3235,freq=1.0), product of:
              0.25891447 = queryWeight, product of:
                3.5077837 = boost
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.012555582 = queryNorm
              0.36742327 = fieldWeight in 3235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8787723 = idf(docFreq=323, maxDocs=42596)
                0.0625 = fieldNorm(doc=3235)
          0.2750402 = weight(abstract_txt:extraction in 3235) [ClassicSimilarity], result of:
            0.2750402 = score(doc=3235,freq=6.0), product of:
              0.28917098 = queryWeight, product of:
                3.7070804 = boost
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.012555582 = queryNorm
              0.95113355 = fieldWeight in 3235, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.212778 = idf(docFreq=231, maxDocs=42596)
                0.0625 = fieldNorm(doc=3235)
          0.422532 = weight(abstract_txt:relation in 3235) [ClassicSimilarity], result of:
            0.422532 = score(doc=3235,freq=5.0), product of:
              0.5552698 = queryWeight, product of:
                8.122256 = boost
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.012555582 = queryNorm
              0.760949 = fieldWeight in 3235, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4449077 = idf(docFreq=499, maxDocs=42596)
                0.0625 = fieldNorm(doc=3235)
        0.2 = coord(5/25)