Document (#34850)

Author
Cortez, E.
Silva, A.S. da
Gonçalves, M.A.
Mesquita, F.
Moura, E.S. de
Title
A flexible approach for extracting metadata from bibliographic citations
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.6, pp.1144-1158
Year
2009
Abstract
In this article we present FLUX-CiM, a novel method for extracting components (e.g., author names, article titles, venues, page numbers) from bibliographic citations. Our method does not rely on patterns encoding specific delimiters used in a particular citation style. This feature yields a high degree of automation and flexibility, and allows FLUX-CiM to extract from citations in any given format. Unlike previous methods that are based on models learned from user-driven training, our method relies on a knowledge base automatically constructed from an existing set of sample metadata records from a given field (e.g., computer science, health sciences, social sciences). These records are usually available on the Web or in other public data repositories. To demonstrate the effectiveness and applicability of our proposed method, we present a series of experiments in which we apply it to extract bibliographic data from citations in articles of different fields. Results of these experiments exhibit precision and recall levels above 94% for all fields, and perfect extraction for the large majority of citations tested. In addition, in a comparison against a state-of-the-art information-extraction method, ours produced superior results without the training phase required by that method. Finally, we present a strategy for using bibliographic data resulting from the extraction process with FLUX-CiM to automatically update and expand the knowledge base of a given domain. We show that this strategy can be used to achieve good extraction results even if only a very small initial sample of bibliographic records is available for building the knowledge base.
Theme
Descriptive cataloguing
Object
FLUX-CiM

Similar documents (author)

  1. Cortez, E.; Herrera, M.R.; Silva, A.S. da; Moura, E.S. de; Neubert, M.: Lightweight methods for large-scale product categorization (2011) 2.44
    2.4440625 = sum of:
      2.4440625 = product of:
        3.25875 = sum of:
          0.71179897 = weight(author_txt:silva in 1759) [ClassicSimilarity], result of:
            0.71179897 = score(doc=1759,freq=1.0), product of:
              0.37782484 = queryWeight, product of:
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.050137617 = queryNorm
              1.883939 = fieldWeight in 1759, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.25 = fieldNorm(doc=1759)
          1.0829465 = weight(author_txt:moura in 1759) [ClassicSimilarity], result of:
            1.0829465 = score(doc=1759,freq=1.0), product of:
              0.4997931 = queryWeight, product of:
                1.1501378 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.050137617 = queryNorm
              2.1667895 = fieldWeight in 1759, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.25 = fieldNorm(doc=1759)
          1.4640045 = weight(author_txt:cortez in 1759) [ClassicSimilarity], result of:
            1.4640045 = score(doc=1759,freq=1.0), product of:
              0.6110554 = queryWeight, product of:
                1.2717303 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.050137617 = queryNorm
              2.3958623 = fieldWeight in 1759, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.25 = fieldNorm(doc=1759)
        0.75 = coord(3/4)
    
  2. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.12
    2.1196074 = sum of:
      2.1196074 = product of:
        2.8261433 = sum of:
          0.71179897 = weight(author_txt:silva in 1120) [ClassicSimilarity], result of:
            0.71179897 = score(doc=1120,freq=1.0), product of:
              0.37782484 = queryWeight, product of:
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.050137617 = queryNorm
              1.883939 = fieldWeight in 1120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.25 = fieldNorm(doc=1120)
          1.0313978 = weight(author_txt:gonçalves in 1120) [ClassicSimilarity], result of:
            1.0313978 = score(doc=1120,freq=1.0), product of:
              0.48380432 = queryWeight, product of:
                1.1315913 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.050137617 = queryNorm
              2.131849 = fieldWeight in 1120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.25 = fieldNorm(doc=1120)
          1.0829465 = weight(author_txt:moura in 1120) [ClassicSimilarity], result of:
            1.0829465 = score(doc=1120,freq=1.0), product of:
              0.4997931 = queryWeight, product of:
                1.1501378 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.050137617 = queryNorm
              2.1667895 = fieldWeight in 1120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.25 = fieldNorm(doc=1120)
        0.75 = coord(3/4)
    
  3. Silva, R.M.; Gonçalves, M.A.; Veloso, A.: A Two-stage active learning method for learning to rank (2014) 1.31
    1.3073976 = sum of:
      1.3073976 = product of:
        2.6147952 = sum of:
          1.0676985 = weight(author_txt:silva in 3185) [ClassicSimilarity], result of:
            1.0676985 = score(doc=3185,freq=1.0), product of:
              0.37782484 = queryWeight, product of:
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.050137617 = queryNorm
              2.8259087 = fieldWeight in 3185, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.535756 = idf(docFreq=61, maxDocs=42740)
                0.375 = fieldNorm(doc=3185)
          1.5470966 = weight(author_txt:gonçalves in 3185) [ClassicSimilarity], result of:
            1.5470966 = score(doc=3185,freq=1.0), product of:
              0.48380432 = queryWeight, product of:
                1.1315913 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.050137617 = queryNorm
              3.1977735 = fieldWeight in 3185, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.375 = fieldNorm(doc=3185)
        0.5 = coord(2/4)
    
  4. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 1.06
    1.0571722 = sum of:
      1.0571722 = product of:
        2.1143444 = sum of:
          1.0313978 = weight(author_txt:gonçalves in 922) [ClassicSimilarity], result of:
            1.0313978 = score(doc=922,freq=1.0), product of:
              0.48380432 = queryWeight, product of:
                1.1315913 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.050137617 = queryNorm
              2.131849 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.25 = fieldNorm(doc=922)
          1.0829465 = weight(author_txt:moura in 922) [ClassicSimilarity], result of:
            1.0829465 = score(doc=922,freq=1.0), product of:
              0.4997931 = queryWeight, product of:
                1.1501378 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.050137617 = queryNorm
              2.1667895 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.25 = fieldNorm(doc=922)
        0.5 = coord(2/4)
    
  5. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: A comparative study of citations and links in document classification (2006) 1.06
    1.0571722 = sum of:
      1.0571722 = product of:
        2.1143444 = sum of:
          1.0313978 = weight(author_txt:gonçalves in 4532) [ClassicSimilarity], result of:
            1.0313978 = score(doc=4532,freq=1.0), product of:
              0.48380432 = queryWeight, product of:
                1.1315913 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.050137617 = queryNorm
              2.131849 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.25 = fieldNorm(doc=4532)
          1.0829465 = weight(author_txt:moura in 4532) [ClassicSimilarity], result of:
            1.0829465 = score(doc=4532,freq=1.0), product of:
              0.4997931 = queryWeight, product of:
                1.1501378 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.050137617 = queryNorm
              2.1667895 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.25 = fieldNorm(doc=4532)
        0.5 = coord(2/4)
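
The author-based scores above can be re-derived from the numbers printed in each explain tree. The following is a minimal sketch, not the catalog's actual code: it assumes each term score is queryWeight × fieldWeight, with queryWeight = boost × idf × queryNorm and fieldWeight = tf × idf × fieldNorm, and that the sum of term scores is multiplied by the coord factor. The function and variable names are hypothetical; the numeric inputs are copied from the tree for doc 1759.

```python
import math

def term_score(idf, query_norm, field_norm, freq=1.0, boost=1.0):
    """Score of one matching term, following the structure of the explain tree."""
    tf = math.sqrt(freq)                      # tf(freq=1.0) is shown as 1.0
    query_weight = boost * idf * query_norm   # "queryWeight, product of"
    field_weight = tf * idf * field_norm      # "fieldWeight in <doc>, product of"
    return query_weight * field_weight

# Values copied from the explain output for doc 1759 (result 1 in the author list).
QUERY_NORM = 0.050137617
FIELD_NORM = 0.25
terms = {
    "silva":  {"idf": 7.535756, "boost": 1.0},        # no boost line is shown for this term
    "moura":  {"idf": 8.667158, "boost": 1.1501378},
    "cortez": {"idf": 9.583449, "boost": 1.2717303},
}

scores = {
    name: term_score(t["idf"], QUERY_NORM, FIELD_NORM, boost=t["boost"])
    for name, t in terms.items()
}

coord = 3 / 4                 # 3 of the 4 query terms matched: coord(3/4)
total = sum(scores.values()) * coord

for name, value in scores.items():
    print(f"{name}: {value:.7f}")
print(f"total: {total:.7f}")
```

Under these assumptions the total agrees with the displayed 2.4440625 up to floating-point rounding. The same arithmetic explains why results 3 to 5 rank lower: only two of the four author terms match, so coord drops to 0.5.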
    

Similar documents (content)
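
The explain trees in this section print idf and tf values without stating how they are derived. The printed numbers are consistent with idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which is the classic Lucene TF-IDF form. The sketch below checks this for the term "citations" in doc 3655; the helper names are hypothetical, and the boost, queryNorm, and fieldNorm values are copied from the output rather than computed.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency in the form the printed values agree with."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def classic_tf(freq):
    """Term frequency factor: square root of the raw count."""
    return math.sqrt(freq)

MAX_DOCS = 42740

# abstract_txt:citations in doc 3655: docFreq=543, freq=5.0
idf_citations = classic_idf(543, MAX_DOCS)
tf_citations = classic_tf(5.0)

# Remaining factors copied from the explain tree.
boost = 2.8652596
query_norm = 0.01924106
field_norm = 0.0625

query_weight = boost * idf_citations * query_norm
field_weight = tf_citations * idf_citations * field_norm
score = query_weight * field_weight

print(f"idf={idf_citations:.6f} tf={tf_citations:.6f} score={score:.8f}")
```

Because idf appears in both queryWeight and fieldWeight, a rare term contributes roughly in proportion to idf squared, which is why "extraction" and "citations" dominate the content-based scores while a common word such as "from" adds little.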

  1. Lawson, M.: Automatic extraction of citations from the text of English-language patents : an example of template mining (1996) 0.31
    0.31144756 = sum of:
      0.31144756 = product of:
        0.7786189 = sum of:
          0.0330391 = weight(abstract_txt:data in 3655) [ClassicSimilarity], result of:
            0.0330391 = score(doc=3655,freq=5.0), product of:
              0.07011251 = queryWeight, product of:
                1.0806844 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.01924106 = queryNorm
              0.47122973 = fieldWeight in 3655, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.016627891 = weight(abstract_txt:results in 3655) [ClassicSimilarity], result of:
            0.016627891 = score(doc=3655,freq=1.0), product of:
              0.075856276 = queryWeight, product of:
                1.1240791 = boost
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.01924106 = queryNorm
              0.21920258 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.043037254 = weight(abstract_txt:automatically in 3655) [ClassicSimilarity], result of:
            0.043037254 = score(doc=3655,freq=1.0), product of:
              0.124920204 = queryWeight, product of:
                1.1778008 = boost
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.01924106 = queryNorm
              0.34451798 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.044386987 = weight(abstract_txt:sample in 3655) [ClassicSimilarity], result of:
            0.044386987 = score(doc=3655,freq=1.0), product of:
              0.12751856 = queryWeight, product of:
                1.189987 = boost
                5.5693207 = idf(docFreq=442, maxDocs=42740)
                0.01924106 = queryNorm
              0.34808254 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5693207 = idf(docFreq=442, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.07474653 = weight(abstract_txt:extract in 3655) [ClassicSimilarity], result of:
            0.07474653 = score(doc=3655,freq=1.0), product of:
              0.18049437 = queryWeight, product of:
                1.4157524 = boost
                6.625938 = idf(docFreq=153, maxDocs=42740)
                0.01924106 = queryNorm
              0.41412112 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.625938 = idf(docFreq=153, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.08672169 = weight(abstract_txt:extracting in 3655) [ClassicSimilarity], result of:
            0.08672169 = score(doc=3655,freq=1.0), product of:
              0.1992912 = queryWeight, product of:
                1.4876459 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01924106 = queryNorm
              0.43515062 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.04834951 = weight(abstract_txt:bibliographic in 3655) [ClassicSimilarity], result of:
            0.04834951 = score(doc=3655,freq=1.0), product of:
              0.18322203 = queryWeight, product of:
                2.255352 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.01924106 = queryNorm
              0.2638848 = fieldWeight in 3655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.17456578 = weight(abstract_txt:extraction in 3655) [ClassicSimilarity], result of:
            0.17456578 = score(doc=3655,freq=2.0), product of:
              0.31771842 = queryWeight, product of:
                2.656389 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.01924106 = queryNorm
              0.5494355 = fieldWeight in 3655, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.03546438 = weight(abstract_txt:from in 3655) [ClassicSimilarity], result of:
            0.03546438 = score(doc=3655,freq=2.0), product of:
              0.14387722 = queryWeight, product of:
                2.6813765 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01924106 = queryNorm
              0.24649057 = fieldWeight in 3655, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
          0.22167973 = weight(abstract_txt:citations in 3655) [ClassicSimilarity], result of:
            0.22167973 = score(doc=3655,freq=5.0), product of:
              0.29571745 = queryWeight, product of:
                2.8652596 = boost
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.01924106 = queryNorm
              0.7496336 = fieldWeight in 3655, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.0625 = fieldNorm(doc=3655)
        0.4 = coord(10/25)
    
  2. Cota, R.G.; Ferreira, A.A.; Nascimento, C.; Gonçalves, M.A.; Laender, A.H.F.: An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations (2010) 0.24
    0.24091001 = sum of:
      0.24091001 = product of:
        0.66919446 = sum of:
          0.10536323 = weight(abstract_txt:ours in 987) [ClassicSimilarity], result of:
            0.10536323 = score(doc=987,freq=1.0), product of:
              0.1801022 = queryWeight, product of:
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.01924106 = queryNorm
              0.5850191 = fieldWeight in 987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.034701023 = weight(abstract_txt:training in 987) [ClassicSimilarity], result of:
            0.034701023 = score(doc=987,freq=1.0), product of:
              0.10821758 = queryWeight, product of:
                1.0962368 = boost
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.01924106 = queryNorm
              0.32065976 = fieldWeight in 987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.028800352 = weight(abstract_txt:results in 987) [ClassicSimilarity], result of:
            0.028800352 = score(doc=987,freq=3.0), product of:
              0.075856276 = queryWeight, product of:
                1.1240791 = boost
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.01924106 = queryNorm
              0.37967 = fieldWeight in 987, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.039737027 = weight(abstract_txt:experiments in 987) [ClassicSimilarity], result of:
            0.039737027 = score(doc=987,freq=1.0), product of:
              0.118449494 = queryWeight, product of:
                1.1468909 = boost
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.01924106 = queryNorm
              0.33547655 = fieldWeight in 987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.045719262 = weight(abstract_txt:present in 987) [ClassicSimilarity], result of:
            0.045719262 = score(doc=987,freq=2.0), product of:
              0.11816519 = queryWeight, product of:
                1.402962 = boost
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.01924106 = queryNorm
              0.38690975 = fieldWeight in 987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.377384 = idf(docFreq=1458, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.04834951 = weight(abstract_txt:bibliographic in 987) [ClassicSimilarity], result of:
            0.04834951 = score(doc=987,freq=1.0), product of:
              0.18322203 = queryWeight, product of:
                2.255352 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.01924106 = queryNorm
              0.2638848 = fieldWeight in 987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.03546438 = weight(abstract_txt:from in 987) [ClassicSimilarity], result of:
            0.03546438 = score(doc=987,freq=2.0), product of:
              0.14387722 = queryWeight, product of:
                2.6813765 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01924106 = queryNorm
              0.24649057 = fieldWeight in 987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.17171238 = weight(abstract_txt:citations in 987) [ClassicSimilarity], result of:
            0.17171238 = score(doc=987,freq=3.0), product of:
              0.29571745 = queryWeight, product of:
                2.8652596 = boost
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.01924106 = queryNorm
              0.5806637 = fieldWeight in 987, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
          0.15934731 = weight(abstract_txt:method in 987) [ClassicSimilarity], result of:
            0.15934731 = score(doc=987,freq=5.0), product of:
              0.25216407 = queryWeight, product of:
                2.898396 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.01924106 = queryNorm
              0.6319192 = fieldWeight in 987, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=987)
        0.36 = coord(9/25)
    
  3. Riloff, E.: An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.23
    0.22854273 = sum of:
      0.22854273 = product of:
        0.714196 = sum of:
          0.060726795 = weight(abstract_txt:training in 6821) [ClassicSimilarity], result of:
            0.060726795 = score(doc=6821,freq=1.0), product of:
              0.10821758 = queryWeight, product of:
                1.0962368 = boost
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.01924106 = queryNorm
              0.5611546 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.02909881 = weight(abstract_txt:results in 6821) [ClassicSimilarity], result of:
            0.02909881 = score(doc=6821,freq=1.0), product of:
              0.075856276 = queryWeight, product of:
                1.1240791 = boost
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.01924106 = queryNorm
              0.38360453 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.098344125 = weight(abstract_txt:experiments in 6821) [ClassicSimilarity], result of:
            0.098344125 = score(doc=6821,freq=2.0), product of:
              0.118449494 = queryWeight, product of:
                1.1468909 = boost
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.01924106 = queryNorm
              0.83026206 = fieldWeight in 6821, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.03119433 = weight(abstract_txt:knowledge in 6821) [ClassicSimilarity], result of:
            0.03119433 = score(doc=6821,freq=1.0), product of:
              0.07945571 = queryWeight, product of:
                1.1504393 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.01924106 = queryNorm
              0.3926002 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.0753152 = weight(abstract_txt:automatically in 6821) [ClassicSimilarity], result of:
            0.0753152 = score(doc=6821,freq=1.0), product of:
              0.124920204 = queryWeight, product of:
                1.1778008 = boost
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.01924106 = queryNorm
              0.60290647 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.07014174 = weight(abstract_txt:given in 6821) [ClassicSimilarity], result of:
            0.07014174 = score(doc=6821,freq=1.0), product of:
              0.13637215 = queryWeight, product of:
                1.5071759 = boost
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.01924106 = queryNorm
              0.51434064 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.3054901 = weight(abstract_txt:extraction in 6821) [ClassicSimilarity], result of:
            0.3054901 = score(doc=6821,freq=2.0), product of:
              0.31771842 = queryWeight, product of:
                2.656389 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.01924106 = queryNorm
              0.9615121 = fieldWeight in 6821, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
          0.04388493 = weight(abstract_txt:from in 6821) [ClassicSimilarity], result of:
            0.04388493 = score(doc=6821,freq=1.0), product of:
              0.14387722 = queryWeight, product of:
                2.6813765 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01924106 = queryNorm
              0.30501652 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.109375 = fieldNorm(doc=6821)
        0.32 = coord(8/25)
    
  4. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.23
    0.22824806 = sum of:
      0.22824806 = product of:
        0.63402236 = sum of:
          0.02955107 = weight(abstract_txt:data in 1056) [ClassicSimilarity], result of:
            0.02955107 = score(doc=1056,freq=4.0), product of:
              0.07011251 = queryWeight, product of:
                1.0806844 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.01924106 = queryNorm
              0.4214807 = fieldWeight in 1056, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.034701023 = weight(abstract_txt:training in 1056) [ClassicSimilarity], result of:
            0.034701023 = score(doc=1056,freq=1.0), product of:
              0.10821758 = queryWeight, product of:
                1.0962368 = boost
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.01924106 = queryNorm
              0.32065976 = fieldWeight in 1056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.130556 = idf(docFreq=686, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.016627891 = weight(abstract_txt:results in 1056) [ClassicSimilarity], result of:
            0.016627891 = score(doc=1056,freq=1.0), product of:
              0.075856276 = queryWeight, product of:
                1.1240791 = boost
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.01924106 = queryNorm
              0.21920258 = fieldWeight in 1056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.01782533 = weight(abstract_txt:knowledge in 1056) [ClassicSimilarity], result of:
            0.01782533 = score(doc=1056,freq=1.0), product of:
              0.07945571 = queryWeight, product of:
                1.1504393 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.01924106 = queryNorm
              0.22434297 = fieldWeight in 1056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.06086387 = weight(abstract_txt:automatically in 1056) [ClassicSimilarity], result of:
            0.06086387 = score(doc=1056,freq=2.0), product of:
              0.124920204 = queryWeight, product of:
                1.1778008 = boost
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.01924106 = queryNorm
              0.487222 = fieldWeight in 1056, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.06868559 = weight(abstract_txt:base in 1056) [ClassicSimilarity], result of:
            0.06868559 = score(doc=1056,freq=1.0), product of:
              0.19528872 = queryWeight, product of:
                1.8035978 = boost
                5.627409 = idf(docFreq=417, maxDocs=42740)
                0.01924106 = queryNorm
              0.35171306 = fieldWeight in 1056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.627409 = idf(docFreq=417, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.24687329 = weight(abstract_txt:extraction in 1056) [ClassicSimilarity], result of:
            0.24687329 = score(doc=1056,freq=4.0), product of:
              0.31771842 = queryWeight, product of:
                2.656389 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.01924106 = queryNorm
              0.77701914 = fieldWeight in 1056, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.03546438 = weight(abstract_txt:from in 1056) [ClassicSimilarity], result of:
            0.03546438 = score(doc=1056,freq=2.0), product of:
              0.14387722 = queryWeight, product of:
                2.6813765 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01924106 = queryNorm
              0.24649057 = fieldWeight in 1056, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
          0.123429894 = weight(abstract_txt:method in 1056) [ClassicSimilarity], result of:
            0.123429894 = score(doc=1056,freq=3.0), product of:
              0.25216407 = queryWeight, product of:
                2.898396 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.01924106 = queryNorm
              0.4894825 = fieldWeight in 1056, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=1056)
        0.36 = coord(9/25)
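The per-term weights in this tree follow the TF-IDF formulas of Lucene's ClassicSimilarity. As a minimal sketch, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the `extraction` weight for doc 1056 can be recomputed from the values listed above. The function names are illustrative and not part of Lucene.

```python
import math

def tf(freq):
    # ClassicSimilarity term-frequency factor (assumed: square root of the raw frequency)
    return math.sqrt(freq)

def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency (assumed: 1 + ln(maxDocs / (docFreq + 1)))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, mirroring the two "product of" branches in the tree
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight

# Values copied from the abstract_txt:extraction entry for doc 1056
extraction_1056 = term_score(
    freq=4.0, doc_freq=231, max_docs=42740,
    boost=2.656389, query_norm=0.01924106, field_norm=0.0625,
)
print(extraction_1056)
```

The result should agree with the listed 0.24687329 up to single-precision rounding, since Lucene computes these values in 32-bit floats.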
    
  5. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.22
    0.2183141 = sum of:
      0.2183141 = product of:
        0.68223155 = sum of:
          0.020784862 = weight(abstract_txt:results in 3612) [ClassicSimilarity], result of:
            0.020784862 = score(doc=3612,freq=1.0), product of:
              0.075856276 = queryWeight, product of:
                1.1240791 = boost
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.01924106 = queryNorm
              0.2740032 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5072412 = idf(docFreq=3482, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.049671285 = weight(abstract_txt:experiments in 3612) [ClassicSimilarity], result of:
            0.049671285 = score(doc=3612,freq=1.0), product of:
              0.118449494 = queryWeight, product of:
                1.1468909 = boost
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.01924106 = queryNorm
              0.41934568 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3676248 = idf(docFreq=541, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.022281662 = weight(abstract_txt:knowledge in 3612) [ClassicSimilarity], result of:
            0.022281662 = score(doc=3612,freq=1.0), product of:
              0.07945571 = queryWeight, product of:
                1.1504393 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.01924106 = queryNorm
              0.2804287 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.07607984 = weight(abstract_txt:automatically in 3612) [ClassicSimilarity], result of:
            0.07607984 = score(doc=3612,freq=2.0), product of:
              0.124920204 = queryWeight, product of:
                1.1778008 = boost
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.01924106 = queryNorm
              0.6090275 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5122876 = idf(docFreq=468, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.09343316 = weight(abstract_txt:extract in 3612) [ClassicSimilarity], result of:
            0.09343316 = score(doc=3612,freq=1.0), product of:
              0.18049437 = queryWeight, product of:
                1.4157524 = boost
                6.625938 = idf(docFreq=153, maxDocs=42740)
                0.01924106 = queryNorm
              0.5176514 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.625938 = idf(docFreq=153, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.10840211 = weight(abstract_txt:extracting in 3612) [ClassicSimilarity], result of:
            0.10840211 = score(doc=3612,freq=1.0), product of:
              0.1992912 = queryWeight, product of:
                1.4876459 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01924106 = queryNorm
              0.5439383 = fieldWeight in 3612, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.26724818 = weight(abstract_txt:extraction in 3612) [ClassicSimilarity], result of:
            0.26724818 = score(doc=3612,freq=3.0), product of:
              0.31771842 = queryWeight, product of:
                2.656389 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.01924106 = queryNorm
              0.8411479 = fieldWeight in 3612, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
          0.044330474 = weight(abstract_txt:from in 3612) [ClassicSimilarity], result of:
            0.044330474 = score(doc=3612,freq=2.0), product of:
              0.14387722 = queryWeight, product of:
                2.6813765 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.01924106 = queryNorm
              0.30811322 = fieldWeight in 3612, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.078125 = fieldNorm(doc=3612)
        0.32 = coord(8/25)
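The top-level score for doc 3612 combines the eight term weights with the coordination factor. The sketch below sums the weights exactly as listed and scales them by coord(8/25). The dictionary and function names are hypothetical, chosen only for this illustration.

```python
# Per-term weights copied from the explanation for doc 3612
term_weights_3612 = {
    "results": 0.020784862,
    "experiments": 0.049671285,
    "knowledge": 0.022281662,
    "automatically": 0.07607984,
    "extract": 0.09343316,
    "extracting": 0.10840211,
    "extraction": 0.26724818,
    "from": 0.044330474,
}

QUERY_TERMS = 25  # total query terms, taken from coord(8/25)

def coord(matched, total):
    # Fraction of query terms that matched the document
    return matched / total

def final_score(weights, total_terms):
    # Sum of matched term weights, scaled by the coordination factor
    return sum(weights.values()) * coord(len(weights), total_terms)

score_3612 = final_score(term_weights_3612, QUERY_TERMS)
print(round(score_3612, 7))
```

The sum reproduces the listed 0.68223155 and the product the listed 0.2183141, which is displayed as 0.22 in the result heading.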