Document (#36449)

Author
Pereira, D.A.
Ribeiro-Neto, B.
Ziviani, N.
Laender, A.H.F.
Gonçalves, M.A.
Title
¬A generic Web-based entity resolution framework
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.5, S.919-932
Year
2011
Abstract
Web data repositories usually contain references to thousands of real-world entities from multiple sources. It is not uncommon that multiple entities share the same label (polysemes) and that distinct label variations are associated with the same entity (synonyms), which frequently leads to ambiguous interpretations. Further, spelling variants, acronyms, abbreviated forms, and misspellings compound to worsen the problem. Solving this problem requires identifying which labels correspond to the same real-world entity, a process known as entity resolution. One approach to solve the entity resolution problem is to associate an authority identifier and a list of variant forms with each entity-a data structure known as an authority file. In this work, we propose a generic framework for implementing a method for generating authority files. Our method uses information from the Web to improve the quality of the authority file and, because of that, is referred to as WER-Web-based Entity Resolution. Our contribution here is threefold: (a) we discuss how to implement the WER framework, which is flexible and easy to adapt to new domains; (b) we run extended experimentation with our WER framework to show that it outperforms selected baselines; and (c) we compare the results of a specialized solution for author name resolution with those produced by the generic WER framework, and show that the WER results remain competitive.
Theme
Internet

Similar documents (author)

  1. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 2.52
    2.5236473 = sum of:
      2.5236473 = product of:
        4.416383 = sum of:
          0.89511544 = weight(author_txt:ribeiro in 700) [ClassicSimilarity], result of:
            0.89511544 = score(doc=700,freq=1.0), product of:
              0.32782993 = queryWeight, product of:
                1.0223553 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.03670002 = queryNorm
              2.7304263 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.3125 = fieldNorm(doc=700)
          1.1449639 = weight(author_txt:neto in 700) [ClassicSimilarity], result of:
            1.1449639 = score(doc=700,freq=1.0), product of:
              0.38629916 = queryWeight, product of:
                1.1097865 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.03670002 = queryNorm
              2.9639306 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.3125 = fieldNorm(doc=700)
          1.1881518 = weight(author_txt:laender in 700) [ClassicSimilarity], result of:
            1.1881518 = score(doc=700,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              3.0007381 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.3125 = fieldNorm(doc=700)
          1.1881518 = weight(author_txt:a.h.f in 700) [ClassicSimilarity], result of:
            1.1881518 = score(doc=700,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              3.0007381 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.3125 = fieldNorm(doc=700)
        0.5714286 = coord(4/7)
    
  2. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 2.04
    2.0353765 = sum of:
      2.0353765 = product of:
        3.561909 = sum of:
          0.6701368 = weight(author_txt:gonçalves in 1217) [ClassicSimilarity], result of:
            0.6701368 = score(doc=1217,freq=1.0), product of:
              0.3136497 = queryWeight, product of:
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.03670002 = queryNorm
              2.1365771 = fieldWeight in 1217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.25 = fieldNorm(doc=1217)
          0.9505214 = weight(author_txt:laender in 1217) [ClassicSimilarity], result of:
            0.9505214 = score(doc=1217,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              2.4005904 = fieldWeight in 1217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.25 = fieldNorm(doc=1217)
          0.9505214 = weight(author_txt:a.h.f in 1217) [ClassicSimilarity], result of:
            0.9505214 = score(doc=1217,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              2.4005904 = fieldWeight in 1217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.25 = fieldNorm(doc=1217)
          0.9907294 = weight(author_txt:ziviani in 1217) [ClassicSimilarity], result of:
            0.9907294 = score(doc=1217,freq=1.0), product of:
              0.40704206 = queryWeight, product of:
                1.1391927 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.03670002 = queryNorm
              2.433973 = fieldWeight in 1217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.25 = fieldNorm(doc=1217)
        0.5714286 = coord(4/7)
    
  3. Freitas-Junior, H.R.; Ribeiro-Neto, B.A.; Freitas-Vale, R. de; Laender, A.H.F.; Lima, L.R.S. de: Categorization-driven cross-language retrieval of medical information (2006) 2.02
    2.018918 = sum of:
      2.018918 = product of:
        3.5331063 = sum of:
          0.71609235 = weight(author_txt:ribeiro in 280) [ClassicSimilarity], result of:
            0.71609235 = score(doc=280,freq=1.0), product of:
              0.32782993 = queryWeight, product of:
                1.0223553 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.03670002 = queryNorm
              2.184341 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.25 = fieldNorm(doc=280)
          0.91597116 = weight(author_txt:neto in 280) [ClassicSimilarity], result of:
            0.91597116 = score(doc=280,freq=1.0), product of:
              0.38629916 = queryWeight, product of:
                1.1097865 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.03670002 = queryNorm
              2.3711445 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.25 = fieldNorm(doc=280)
          0.9505214 = weight(author_txt:laender in 280) [ClassicSimilarity], result of:
            0.9505214 = score(doc=280,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              2.4005904 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.25 = fieldNorm(doc=280)
          0.9505214 = weight(author_txt:a.h.f in 280) [ClassicSimilarity], result of:
            0.9505214 = score(doc=280,freq=1.0), product of:
              0.39595318 = queryWeight, product of:
                1.1235683 = boost
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.03670002 = queryNorm
              2.4005904 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.602362 = idf(docFreq=7, maxDocs=43556)
                0.25 = fieldNorm(doc=280)
        0.5714286 = coord(4/7)
    
  4. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 1.88
    1.8816742 = sum of:
      1.8816742 = product of:
        3.2929296 = sum of:
          0.6701368 = weight(author_txt:gonçalves in 919) [ClassicSimilarity], result of:
            0.6701368 = score(doc=919,freq=1.0), product of:
              0.3136497 = queryWeight, product of:
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.03670002 = queryNorm
              2.1365771 = fieldWeight in 919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.25 = fieldNorm(doc=919)
          0.71609235 = weight(author_txt:ribeiro in 919) [ClassicSimilarity], result of:
            0.71609235 = score(doc=919,freq=1.0), product of:
              0.32782993 = queryWeight, product of:
                1.0223553 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.03670002 = queryNorm
              2.184341 = fieldWeight in 919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.25 = fieldNorm(doc=919)
          0.91597116 = weight(author_txt:neto in 919) [ClassicSimilarity], result of:
            0.91597116 = score(doc=919,freq=1.0), product of:
              0.38629916 = queryWeight, product of:
                1.1097865 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.03670002 = queryNorm
              2.3711445 = fieldWeight in 919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.25 = fieldNorm(doc=919)
          0.9907294 = weight(author_txt:ziviani in 919) [ClassicSimilarity], result of:
            0.9907294 = score(doc=919,freq=1.0), product of:
              0.40704206 = queryWeight, product of:
                1.1391927 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.03670002 = queryNorm
              2.433973 = fieldWeight in 919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.25 = fieldNorm(doc=919)
        0.5714286 = coord(4/7)
    
  5. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 1.88
    1.8816742 = sum of:
      1.8816742 = product of:
        3.2929296 = sum of:
          0.6701368 = weight(author_txt:gonçalves in 4529) [ClassicSimilarity], result of:
            0.6701368 = score(doc=4529,freq=1.0), product of:
              0.3136497 = queryWeight, product of:
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.03670002 = queryNorm
              2.1365771 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.25 = fieldNorm(doc=4529)
          0.71609235 = weight(author_txt:ribeiro in 4529) [ClassicSimilarity], result of:
            0.71609235 = score(doc=4529,freq=1.0), product of:
              0.32782993 = queryWeight, product of:
                1.0223553 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.03670002 = queryNorm
              2.184341 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.25 = fieldNorm(doc=4529)
          0.91597116 = weight(author_txt:neto in 4529) [ClassicSimilarity], result of:
            0.91597116 = score(doc=4529,freq=1.0), product of:
              0.38629916 = queryWeight, product of:
                1.1097865 = boost
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.03670002 = queryNorm
              2.3711445 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.484578 = idf(docFreq=8, maxDocs=43556)
                0.25 = fieldNorm(doc=4529)
          0.9907294 = weight(author_txt:ziviani in 4529) [ClassicSimilarity], result of:
            0.9907294 = score(doc=4529,freq=1.0), product of:
              0.40704206 = queryWeight, product of:
                1.1391927 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.03670002 = queryNorm
              2.433973 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.25 = fieldNorm(doc=4529)
        0.5714286 = coord(4/7)
    

Similar documents (content)

  1. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.34
    0.33603534 = sum of:
      0.33603534 = product of:
        1.2001262 = sum of:
          0.019792289 = weight(abstract_txt:which in 3846) [ClassicSimilarity], result of:
            0.019792289 = score(doc=3846,freq=3.0), product of:
              0.0499875 = queryWeight, product of:
                1.010606 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.016904235 = queryNorm
              0.39594477 = fieldWeight in 3846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.019203033 = weight(abstract_txt:with in 3846) [ClassicSimilarity], result of:
            0.019203033 = score(doc=3846,freq=4.0), product of:
              0.048990358 = queryWeight, product of:
                1.1552497 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.016904235 = queryNorm
              0.3919758 = fieldWeight in 3846, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.060689315 = weight(abstract_txt:known in 3846) [ClassicSimilarity], result of:
            0.060689315 = score(doc=3846,freq=2.0), product of:
              0.105504796 = queryWeight, product of:
                1.198786 = boost
                5.20637 = idf(docFreq=648, maxDocs=43556)
                0.016904235 = queryNorm
              0.57522804 = fieldWeight in 3846, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.20637 = idf(docFreq=648, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.06096716 = weight(abstract_txt:entities in 3846) [ClassicSimilarity], result of:
            0.06096716 = score(doc=3846,freq=1.0), product of:
              0.1333331 = queryWeight, product of:
                1.3476421 = boost
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.016904235 = queryNorm
              0.45725447 = fieldWeight in 3846, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.017789394 = weight(abstract_txt:that in 3846) [ClassicSimilarity], result of:
            0.017789394 = score(doc=3846,freq=3.0), product of:
              0.055197712 = queryWeight, product of:
                1.3709958 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.016904235 = queryNorm
              0.322285 = fieldWeight in 3846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.48778424 = weight(abstract_txt:resolution in 3846) [ClassicSimilarity], result of:
            0.48778424 = score(doc=3846,freq=3.0), product of:
              0.5019173 = queryWeight, product of:
                4.1342015 = boost
                7.181993 = idf(docFreq=89, maxDocs=43556)
                0.016904235 = queryNorm
              0.9718419 = fieldWeight in 3846, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.181993 = idf(docFreq=89, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
          0.5339007 = weight(abstract_txt:entity in 3846) [ClassicSimilarity], result of:
            0.5339007 = score(doc=3846,freq=4.0), product of:
              0.5418142 = queryWeight, product of:
                5.082352 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.016904235 = queryNorm
              0.9853944 = fieldWeight in 3846, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.078125 = fieldNorm(doc=3846)
        0.28 = coord(7/25)
    
  2. Liu, X.; Zheng, W.; Fang, H.: ¬An exploration of ranking models and feedback method for related entity finding (2013) 0.25
    0.24641237 = sum of:
      0.24641237 = product of:
        0.68447876 = sum of:
          0.009141668 = weight(abstract_txt:which in 4712) [ClassicSimilarity], result of:
            0.009141668 = score(doc=4712,freq=1.0), product of:
              0.0499875 = queryWeight, product of:
                1.010606 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.016904235 = queryNorm
              0.18287908 = fieldWeight in 4712, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.021206656 = weight(abstract_txt:show in 4712) [ClassicSimilarity], result of:
            0.021206656 = score(doc=4712,freq=1.0), product of:
              0.076523624 = queryWeight, product of:
                1.0209473 = boost
                4.43401 = idf(docFreq=1404, maxDocs=43556)
                0.016904235 = queryNorm
              0.27712563 = fieldWeight in 4712, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.43401 = idf(docFreq=1404, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.031610776 = weight(abstract_txt:method in 4712) [ClassicSimilarity], result of:
            0.031610776 = score(doc=4712,freq=2.0), product of:
              0.07925515 = queryWeight, product of:
                1.039009 = boost
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.016904235 = queryNorm
              0.39884824 = fieldWeight in 4712, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.007681214 = weight(abstract_txt:with in 4712) [ClassicSimilarity], result of:
            0.007681214 = score(doc=4712,freq=1.0), product of:
              0.048990358 = queryWeight, product of:
                1.1552497 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.016904235 = queryNorm
              0.15679032 = fieldWeight in 4712, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.12904315 = weight(abstract_txt:entities in 4712) [ClassicSimilarity], result of:
            0.12904315 = score(doc=4712,freq=7.0), product of:
              0.1333331 = queryWeight, product of:
                1.3476421 = boost
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.016904235 = queryNorm
              0.96782523 = fieldWeight in 4712, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.011619984 = weight(abstract_txt:that in 4712) [ClassicSimilarity], result of:
            0.011619984 = score(doc=4712,freq=2.0), product of:
              0.055197712 = queryWeight, product of:
                1.3709958 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.016904235 = queryNorm
              0.2105157 = fieldWeight in 4712, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.045623925 = weight(abstract_txt:problem in 4712) [ClassicSimilarity], result of:
            0.045623925 = score(doc=4712,freq=2.0), product of:
              0.11586783 = queryWeight, product of:
                1.5386245 = boost
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.016904235 = queryNorm
              0.39375833 = fieldWeight in 4712, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.05865413 = weight(abstract_txt:framework in 4712) [ClassicSimilarity], result of:
            0.05865413 = score(doc=4712,freq=1.0), product of:
              0.20464122 = queryWeight, product of:
                2.6398067 = boost
                4.5859094 = idf(docFreq=1206, maxDocs=43556)
                0.016904235 = queryNorm
              0.28661934 = fieldWeight in 4712, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5859094 = idf(docFreq=1206, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
          0.36989725 = weight(abstract_txt:entity in 4712) [ClassicSimilarity], result of:
            0.36989725 = score(doc=4712,freq=3.0), product of:
              0.5418142 = queryWeight, product of:
                5.082352 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.016904235 = queryNorm
              0.6827013 = fieldWeight in 4712, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.0625 = fieldNorm(doc=4712)
        0.36 = coord(9/25)
    
  3. Vechtomova, O.; Robertson, S.E.: ¬A domain-independent approach to finding related entities (2012) 0.23
    0.22830719 = sum of:
      0.22830719 = product of:
        0.95128 = sum of:
          0.027940243 = weight(abstract_txt:method in 4731) [ClassicSimilarity], result of:
            0.027940243 = score(doc=4731,freq=1.0), product of:
              0.07925515 = queryWeight, product of:
                1.039009 = boost
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.016904235 = queryNorm
              0.35253537 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
          0.009601517 = weight(abstract_txt:with in 4731) [ClassicSimilarity], result of:
            0.009601517 = score(doc=4731,freq=1.0), product of:
              0.048990358 = queryWeight, product of:
                1.1552497 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.016904235 = queryNorm
              0.1959879 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
          0.14933842 = weight(abstract_txt:entities in 4731) [ClassicSimilarity], result of:
            0.14933842 = score(doc=4731,freq=6.0), product of:
              0.1333331 = queryWeight, product of:
                1.3476421 = boost
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.016904235 = queryNorm
              1.1200402 = fieldWeight in 4731, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
          0.017789394 = weight(abstract_txt:that in 4731) [ClassicSimilarity], result of:
            0.017789394 = score(doc=4731,freq=3.0), product of:
              0.055197712 = queryWeight, product of:
                1.3709958 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.016904235 = queryNorm
              0.322285 = fieldWeight in 4731, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
          0.04032623 = weight(abstract_txt:problem in 4731) [ClassicSimilarity], result of:
            0.04032623 = score(doc=4731,freq=1.0), product of:
              0.11586783 = queryWeight, product of:
                1.5386245 = boost
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.016904235 = queryNorm
              0.34803647 = fieldWeight in 4731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
          0.70628417 = weight(abstract_txt:entity in 4731) [ClassicSimilarity], result of:
            0.70628417 = score(doc=4731,freq=7.0), product of:
              0.5418142 = queryWeight, product of:
                5.082352 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.016904235 = queryNorm
              1.3035542 = fieldWeight in 4731, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.078125 = fieldNorm(doc=4731)
        0.24 = coord(6/25)
    
  4. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.21
    0.21041869 = sum of:
      0.21041869 = product of:
        0.5844963 = sum of:
          0.021206656 = weight(abstract_txt:show in 2662) [ClassicSimilarity], result of:
            0.021206656 = score(doc=2662,freq=1.0), product of:
              0.076523624 = queryWeight, product of:
                1.0209473 = boost
                4.43401 = idf(docFreq=1404, maxDocs=43556)
                0.016904235 = queryNorm
              0.27712563 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.43401 = idf(docFreq=1404, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.007681214 = weight(abstract_txt:with in 2662) [ClassicSimilarity], result of:
            0.007681214 = score(doc=2662,freq=1.0), product of:
              0.048990358 = queryWeight, product of:
                1.1552497 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.016904235 = queryNorm
              0.15679032 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.033387728 = weight(abstract_txt:multiple in 2662) [ClassicSimilarity], result of:
            0.033387728 = score(doc=2662,freq=1.0), product of:
              0.10356316 = queryWeight, product of:
                1.187704 = boost
                5.1582403 = idf(docFreq=680, maxDocs=43556)
                0.016904235 = queryNorm
              0.32239002 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1582403 = idf(docFreq=680, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.039753538 = weight(abstract_txt:forms in 2662) [ClassicSimilarity], result of:
            0.039753538 = score(doc=2662,freq=1.0), product of:
              0.11634058 = queryWeight, product of:
                1.2588419 = boost
                5.4671946 = idf(docFreq=499, maxDocs=43556)
                0.016904235 = queryNorm
              0.34169966 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4671946 = idf(docFreq=499, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.09754745 = weight(abstract_txt:entities in 2662) [ClassicSimilarity], result of:
            0.09754745 = score(doc=2662,freq=4.0), product of:
              0.1333331 = queryWeight, product of:
                1.3476421 = boost
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.016904235 = queryNorm
              0.73160714 = fieldWeight in 2662, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.011619984 = weight(abstract_txt:that in 2662) [ClassicSimilarity], result of:
            0.011619984 = score(doc=2662,freq=2.0), product of:
              0.055197712 = queryWeight, product of:
                1.3709958 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.016904235 = queryNorm
              0.2105157 = fieldWeight in 2662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.032260984 = weight(abstract_txt:problem in 2662) [ClassicSimilarity], result of:
            0.032260984 = score(doc=2662,freq=1.0), product of:
              0.11586783 = queryWeight, product of:
                1.5386245 = boost
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.016904235 = queryNorm
              0.27842918 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.03901893 = weight(abstract_txt:same in 2662) [ClassicSimilarity], result of:
            0.03901893 = score(doc=2662,freq=1.0), product of:
              0.13153097 = queryWeight, product of:
                1.6393255 = boost
                4.7464323 = idf(docFreq=1027, maxDocs=43556)
                0.016904235 = queryNorm
              0.29665202 = fieldWeight in 2662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7464323 = idf(docFreq=1027, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
          0.30201983 = weight(abstract_txt:entity in 2662) [ClassicSimilarity], result of:
            0.30201983 = score(doc=2662,freq=2.0), product of:
              0.5418142 = queryWeight, product of:
                5.082352 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.016904235 = queryNorm
              0.55742323 = fieldWeight in 2662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.0625 = fieldNorm(doc=2662)
        0.36 = coord(9/25)
    
  5. Li, X.; Schijvenaars, B.J.A.; Rijke, M.de: Investigating queries and search failures in academic search (2017) 0.21
    0.20976023 = sum of:
      0.20976023 = product of:
        0.5244006 = sum of:
          0.007998959 = weight(abstract_txt:which in 1319) [ClassicSimilarity], result of:
            0.007998959 = score(doc=1319,freq=1.0), product of:
              0.0499875 = queryWeight, product of:
                1.010606 = boost
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.016904235 = queryNorm
              0.16001919 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9260652 = idf(docFreq=6346, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.02765943 = weight(abstract_txt:method in 1319) [ClassicSimilarity], result of:
            0.02765943 = score(doc=1319,freq=2.0), product of:
              0.07925515 = queryWeight, product of:
                1.039009 = boost
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.016904235 = queryNorm
              0.3489922 = fieldWeight in 1319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5124526 = idf(docFreq=1298, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.006721062 = weight(abstract_txt:with in 1319) [ClassicSimilarity], result of:
            0.006721062 = score(doc=1319,freq=1.0), product of:
              0.048990358 = queryWeight, product of:
                1.1552497 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.016904235 = queryNorm
              0.13719153 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.030039677 = weight(abstract_txt:known in 1319) [ClassicSimilarity], result of:
            0.030039677 = score(doc=1319,freq=1.0), product of:
              0.105504796 = queryWeight, product of:
                1.198786 = boost
                5.20637 = idf(docFreq=648, maxDocs=43556)
                0.016904235 = queryNorm
              0.28472334 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.20637 = idf(docFreq=648, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.04267701 = weight(abstract_txt:entities in 1319) [ClassicSimilarity], result of:
            0.04267701 = score(doc=1319,freq=1.0), product of:
              0.1333331 = queryWeight, product of:
                1.3476421 = boost
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.016904235 = queryNorm
              0.32007813 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.852857 = idf(docFreq=339, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.016076207 = weight(abstract_txt:that in 1319) [ClassicSimilarity], result of:
            0.016076207 = score(doc=1319,freq=5.0), product of:
              0.055197712 = queryWeight, product of:
                1.3709958 = boost
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.016904235 = queryNorm
              0.29124773 = fieldWeight in 1319, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3817132 = idf(docFreq=10938, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.039920934 = weight(abstract_txt:problem in 1319) [ClassicSimilarity], result of:
            0.039920934 = score(doc=1319,freq=2.0), product of:
              0.11586783 = queryWeight, product of:
                1.5386245 = boost
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.016904235 = queryNorm
              0.34453854 = fieldWeight in 1319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454867 = idf(docFreq=1375, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.08279781 = weight(abstract_txt:label in 1319) [ClassicSimilarity], result of:
            0.08279781 = score(doc=1319,freq=1.0), product of:
              0.20740595 = queryWeight, product of:
                1.6808006 = boost
                7.299776 = idf(docFreq=79, maxDocs=43556)
                0.016904235 = queryNorm
              0.39920652 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.299776 = idf(docFreq=79, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.08364428 = weight(abstract_txt:generic in 1319) [ClassicSimilarity], result of:
            0.08364428 = score(doc=1319,freq=1.0), product of:
              0.23903596 = queryWeight, product of:
                2.2099519 = boost
                6.398599 = idf(docFreq=196, maxDocs=43556)
                0.016904235 = queryNorm
              0.3499234 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.398599 = idf(docFreq=196, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
          0.18686524 = weight(abstract_txt:entity in 1319) [ClassicSimilarity], result of:
            0.18686524 = score(doc=1319,freq=1.0), product of:
              0.5418142 = queryWeight, product of:
                5.082352 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.016904235 = queryNorm
              0.34488803 = fieldWeight in 1319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1319)
        0.4 = coord(10/25)