Document (#36452)

Author
Pereira, D.A.
Ribeiro-Neto, B.
Ziviani, N.
Laender, A.H.F.
Gonçalves, M.A.
Title
¬A generic Web-based entity resolution framework
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.5, S.919-932
Year
2011
Abstract
Web data repositories usually contain references to thousands of real-world entities from multiple sources. It is not uncommon that multiple entities share the same label (polysemes) and that distinct label variations are associated with the same entity (synonyms), which frequently leads to ambiguous interpretations. Further, spelling variants, acronyms, abbreviated forms, and misspellings compound to worsen the problem. Solving this problem requires identifying which labels correspond to the same real-world entity, a process known as entity resolution. One approach to solve the entity resolution problem is to associate an authority identifier and a list of variant forms with each entity-a data structure known as an authority file. In this work, we propose a generic framework for implementing a method for generating authority files. Our method uses information from the Web to improve the quality of the authority file and, because of that, is referred to as WER-Web-based Entity Resolution. Our contribution here is threefold: (a) we discuss how to implement the WER framework, which is flexible and easy to adapt to new domains; (b) we run extended experimentation with our WER framework to show that it outperforms selected baselines; and (c) we compare the results of a specialized solution for author name resolution with those produced by the generic WER framework, and show that the WER results remain competitive.
Theme
Internet

Similar documents (author)

  1. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 3.21
    3.209461 = sum of:
      3.209461 = product of:
        4.8141913 = sum of:
          0.97506964 = weight(author_txt:ribeiro in 703) [ClassicSimilarity], result of:
            0.97506964 = score(doc=703,freq=1.0), product of:
              0.35830674 = queryWeight, product of:
                1.0171233 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.040452994 = queryNorm
              2.7213266 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.3125 = fieldNorm(doc=703)
          1.2482213 = weight(author_txt:neto in 703) [ClassicSimilarity], result of:
            1.2482213 = score(doc=703,freq=1.0), product of:
              0.42243406 = queryWeight, product of:
                1.104398 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.040452994 = queryNorm
              2.9548311 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.3125 = fieldNorm(doc=703)
          1.2954503 = weight(author_txt:a.h.f in 703) [ClassicSimilarity], result of:
            1.2954503 = score(doc=703,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.9916382 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.3125 = fieldNorm(doc=703)
          1.2954503 = weight(author_txt:laender in 703) [ClassicSimilarity], result of:
            1.2954503 = score(doc=703,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.9916382 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.3125 = fieldNorm(doc=703)
        0.6666667 = coord(4/6)
    
  2. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 2.60
    2.5962493 = sum of:
      2.5962493 = product of:
        3.894374 = sum of:
          0.7413184 = weight(author_txt:gonçalves in 1220) [ClassicSimilarity], result of:
            0.7413184 = score(doc=1220,freq=1.0), product of:
              0.34634405 = queryWeight, product of:
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.040452994 = queryNorm
              2.1404104 = fieldWeight in 1220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=1220)
          1.0363603 = weight(author_txt:a.h.f in 1220) [ClassicSimilarity], result of:
            1.0363603 = score(doc=1220,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.3933105 = fieldWeight in 1220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.25 = fieldNorm(doc=1220)
          1.0363603 = weight(author_txt:laender in 1220) [ClassicSimilarity], result of:
            1.0363603 = score(doc=1220,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.3933105 = fieldWeight in 1220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.25 = fieldNorm(doc=1220)
          1.0803348 = weight(author_txt:ziviani in 1220) [ClassicSimilarity], result of:
            1.0803348 = score(doc=1220,freq=1.0), product of:
              0.44518802 = queryWeight, product of:
                1.1337515 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.040452994 = queryNorm
              2.4266934 = fieldWeight in 1220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.25 = fieldNorm(doc=1220)
        0.6666667 = coord(4/6)
    
  3. Freitas-Junior, H.R.; Ribeiro-Neto, B.A.; Freitas-Vale, R. de; Laender, A.H.F.; Lima, L.R.S. de: Categorization-driven cross-language retrieval of medical information (2006) 2.57
    2.567569 = sum of:
      2.567569 = product of:
        3.8513534 = sum of:
          0.78005576 = weight(author_txt:ribeiro in 283) [ClassicSimilarity], result of:
            0.78005576 = score(doc=283,freq=1.0), product of:
              0.35830674 = queryWeight, product of:
                1.0171233 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.040452994 = queryNorm
              2.1770613 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=283)
          0.99857706 = weight(author_txt:neto in 283) [ClassicSimilarity], result of:
            0.99857706 = score(doc=283,freq=1.0), product of:
              0.42243406 = queryWeight, product of:
                1.104398 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.040452994 = queryNorm
              2.363865 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=283)
          1.0363603 = weight(author_txt:a.h.f in 283) [ClassicSimilarity], result of:
            1.0363603 = score(doc=283,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.3933105 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.25 = fieldNorm(doc=283)
          1.0363603 = weight(author_txt:laender in 283) [ClassicSimilarity], result of:
            1.0363603 = score(doc=283,freq=1.0), product of:
              0.43302375 = queryWeight, product of:
                1.118155 = boost
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.040452994 = queryNorm
              2.3933105 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.573242 = idf(docFreq=7, maxDocs=42306)
                0.25 = fieldNorm(doc=283)
        0.6666667 = coord(4/6)
    
  4. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 2.40
    2.4001908 = sum of:
      2.4001908 = product of:
        3.600286 = sum of:
          0.7413184 = weight(author_txt:gonçalves in 922) [ClassicSimilarity], result of:
            0.7413184 = score(doc=922,freq=1.0), product of:
              0.34634405 = queryWeight, product of:
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.040452994 = queryNorm
              2.1404104 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          0.78005576 = weight(author_txt:ribeiro in 922) [ClassicSimilarity], result of:
            0.78005576 = score(doc=922,freq=1.0), product of:
              0.35830674 = queryWeight, product of:
                1.0171233 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.040452994 = queryNorm
              2.1770613 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          0.99857706 = weight(author_txt:neto in 922) [ClassicSimilarity], result of:
            0.99857706 = score(doc=922,freq=1.0), product of:
              0.42243406 = queryWeight, product of:
                1.104398 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.040452994 = queryNorm
              2.363865 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
          1.0803348 = weight(author_txt:ziviani in 922) [ClassicSimilarity], result of:
            1.0803348 = score(doc=922,freq=1.0), product of:
              0.44518802 = queryWeight, product of:
                1.1337515 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.040452994 = queryNorm
              2.4266934 = fieldWeight in 922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.25 = fieldNorm(doc=922)
        0.6666667 = coord(4/6)
    
  5. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 2.40
    2.4001908 = sum of:
      2.4001908 = product of:
        3.600286 = sum of:
          0.7413184 = weight(author_txt:gonçalves in 351) [ClassicSimilarity], result of:
            0.7413184 = score(doc=351,freq=1.0), product of:
              0.34634405 = queryWeight, product of:
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.040452994 = queryNorm
              2.1404104 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          0.78005576 = weight(author_txt:ribeiro in 351) [ClassicSimilarity], result of:
            0.78005576 = score(doc=351,freq=1.0), product of:
              0.35830674 = queryWeight, product of:
                1.0171233 = boost
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.040452994 = queryNorm
              2.1770613 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.708245 = idf(docFreq=18, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          0.99857706 = weight(author_txt:neto in 351) [ClassicSimilarity], result of:
            0.99857706 = score(doc=351,freq=1.0), product of:
              0.42243406 = queryWeight, product of:
                1.104398 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.040452994 = queryNorm
              2.363865 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
          1.0803348 = weight(author_txt:ziviani in 351) [ClassicSimilarity], result of:
            1.0803348 = score(doc=351,freq=1.0), product of:
              0.44518802 = queryWeight, product of:
                1.1337515 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.040452994 = queryNorm
              2.4266934 = fieldWeight in 351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.25 = fieldNorm(doc=351)
        0.6666667 = coord(4/6)
    

Similar documents (content)

  1. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.34
    0.3378271 = sum of:
      0.3378271 = product of:
        1.2065253 = sum of:
          0.019919092 = weight(abstract_txt:which in 3849) [ClassicSimilarity], result of:
            0.019919092 = score(doc=3849,freq=3.0), product of:
              0.0500874 = queryWeight, product of:
                1.0124673 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.016832829 = queryNorm
              0.3976867 = fieldWeight in 3849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.019485587 = weight(abstract_txt:with in 3849) [ClassicSimilarity], result of:
            0.019485587 = score(doc=3849,freq=4.0), product of:
              0.04935803 = queryWeight, product of:
                1.1605531 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016832829 = queryNorm
              0.39478052 = fieldWeight in 3849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.061601054 = weight(abstract_txt:known in 3849) [ClassicSimilarity], result of:
            0.061601054 = score(doc=3849,freq=2.0), product of:
              0.10631818 = queryWeight, product of:
                1.2044115 = boost
                5.2441554 = idf(docFreq=606, maxDocs=42306)
                0.016832829 = queryNorm
              0.5794028 = fieldWeight in 3849, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2441554 = idf(docFreq=606, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.061735116 = weight(abstract_txt:entities in 3849) [ClassicSimilarity], result of:
            0.061735116 = score(doc=3849,freq=1.0), product of:
              0.1341468 = queryWeight, product of:
                1.3528862 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.016832829 = queryNorm
              0.46020567 = fieldWeight in 3849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.01818922 = weight(abstract_txt:that in 3849) [ClassicSimilarity], result of:
            0.01818922 = score(doc=3849,freq=3.0), product of:
              0.055895187 = queryWeight, product of:
                1.3807923 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.016832829 = queryNorm
              0.32541656 = fieldWeight in 3849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.48314324 = weight(abstract_txt:resolution in 3849) [ClassicSimilarity], result of:
            0.48314324 = score(doc=3849,freq=3.0), product of:
              0.49760222 = queryWeight, product of:
                4.1198583 = boost
                7.1753473 = idf(docFreq=87, maxDocs=42306)
                0.016832829 = queryNorm
              0.9709427 = fieldWeight in 3849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1753473 = idf(docFreq=87, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
          0.54245204 = weight(abstract_txt:entity in 3849) [ClassicSimilarity], result of:
            0.54245204 = score(doc=3849,freq=4.0), product of:
              0.5463476 = queryWeight, product of:
                5.107868 = boost
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.016832829 = queryNorm
              0.9928698 = fieldWeight in 3849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.078125 = fieldNorm(doc=3849)
        0.28 = coord(7/25)
    
  2. Liu, X.; Zheng, W.; Fang, H.: ¬An exploration of ranking models and feedback method for related entity finding (2013) 0.25
    0.2500526 = sum of:
      0.2500526 = product of:
        0.6945905 = sum of:
          0.009200235 = weight(abstract_txt:which in 4715) [ClassicSimilarity], result of:
            0.009200235 = score(doc=4715,freq=1.0), product of:
              0.0500874 = queryWeight, product of:
                1.0124673 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.016832829 = queryNorm
              0.18368362 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.021486606 = weight(abstract_txt:show in 4715) [ClassicSimilarity], result of:
            0.021486606 = score(doc=4715,freq=1.0), product of:
              0.07702127 = queryWeight, product of:
                1.0251242 = boost
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.016832829 = queryNorm
              0.27896976 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.03186309 = weight(abstract_txt:method in 4715) [ClassicSimilarity], result of:
            0.03186309 = score(doc=4715,freq=2.0), product of:
              0.07949639 = queryWeight, product of:
                1.0414654 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.016832829 = queryNorm
              0.4008118 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.0077942354 = weight(abstract_txt:with in 4715) [ClassicSimilarity], result of:
            0.0077942354 = score(doc=4715,freq=1.0), product of:
              0.04935803 = queryWeight, product of:
                1.1605531 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016832829 = queryNorm
              0.15791221 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.13066861 = weight(abstract_txt:entities in 4715) [ClassicSimilarity], result of:
            0.13066861 = score(doc=4715,freq=7.0), product of:
              0.1341468 = queryWeight, product of:
                1.3528862 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.016832829 = queryNorm
              0.9740718 = fieldWeight in 4715, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.011881148 = weight(abstract_txt:that in 4715) [ClassicSimilarity], result of:
            0.011881148 = score(doc=4715,freq=2.0), product of:
              0.055895187 = queryWeight, product of:
                1.3807923 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.016832829 = queryNorm
              0.2125612 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.04571917 = weight(abstract_txt:problem in 4715) [ClassicSimilarity], result of:
            0.04571917 = score(doc=4715,freq=2.0), product of:
              0.115767 = queryWeight, product of:
                1.5392499 = boost
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.016832829 = queryNorm
              0.394924 = fieldWeight in 4715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.060155556 = weight(abstract_txt:framework in 4715) [ClassicSimilarity], result of:
            0.060155556 = score(doc=4715,freq=1.0), product of:
              0.20764874 = queryWeight, product of:
                2.6613731 = boost
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.016832829 = queryNorm
              0.28969863 = fieldWeight in 4715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635178 = idf(docFreq=1115, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
          0.37582183 = weight(abstract_txt:entity in 4715) [ClassicSimilarity], result of:
            0.37582183 = score(doc=4715,freq=3.0), product of:
              0.5463476 = queryWeight, product of:
                5.107868 = boost
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.016832829 = queryNorm
              0.6878804 = fieldWeight in 4715, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.0625 = fieldNorm(doc=4715)
        0.36 = coord(9/25)
    
  3. Vechtomova, O.; Robertson, S.E.: ¬A domain-independent approach to finding related entities (2012) 0.23
    0.23167723 = sum of:
      0.23167723 = product of:
        0.96532184 = sum of:
          0.028163262 = weight(abstract_txt:method in 4734) [ClassicSimilarity], result of:
            0.028163262 = score(doc=4734,freq=1.0), product of:
              0.07949639 = queryWeight, product of:
                1.0414654 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.016832829 = queryNorm
              0.35427094 = fieldWeight in 4734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
          0.009742794 = weight(abstract_txt:with in 4734) [ClassicSimilarity], result of:
            0.009742794 = score(doc=4734,freq=1.0), product of:
              0.04935803 = queryWeight, product of:
                1.1605531 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016832829 = queryNorm
              0.19739026 = fieldWeight in 4734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
          0.15121955 = weight(abstract_txt:entities in 4734) [ClassicSimilarity], result of:
            0.15121955 = score(doc=4734,freq=6.0), product of:
              0.1341468 = queryWeight, product of:
                1.3528862 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.016832829 = queryNorm
              1.1272691 = fieldWeight in 4734, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
          0.01818922 = weight(abstract_txt:that in 4734) [ClassicSimilarity], result of:
            0.01818922 = score(doc=4734,freq=3.0), product of:
              0.055895187 = queryWeight, product of:
                1.3807923 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.016832829 = queryNorm
              0.32541656 = fieldWeight in 4734, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
          0.040410418 = weight(abstract_txt:problem in 4734) [ClassicSimilarity], result of:
            0.040410418 = score(doc=4734,freq=1.0), product of:
              0.115767 = queryWeight, product of:
                1.5392499 = boost
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.016832829 = queryNorm
              0.34906682 = fieldWeight in 4734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
          0.7175966 = weight(abstract_txt:entity in 4734) [ClassicSimilarity], result of:
            0.7175966 = score(doc=4734,freq=7.0), product of:
              0.5463476 = queryWeight, product of:
                5.107868 = boost
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.016832829 = queryNorm
              1.3134433 = fieldWeight in 4734, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.078125 = fieldNorm(doc=4734)
        0.24 = coord(6/25)
    
  4. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.21
    0.21325675 = sum of:
      0.21325675 = product of:
        0.5923798 = sum of:
          0.021486606 = weight(abstract_txt:show in 2665) [ClassicSimilarity], result of:
            0.021486606 = score(doc=2665,freq=1.0), product of:
              0.07702127 = queryWeight, product of:
                1.0251242 = boost
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.016832829 = queryNorm
              0.27896976 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.0077942354 = weight(abstract_txt:with in 2665) [ClassicSimilarity], result of:
            0.0077942354 = score(doc=2665,freq=1.0), product of:
              0.04935803 = queryWeight, product of:
                1.1605531 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016832829 = queryNorm
              0.15791221 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.0337412 = weight(abstract_txt:multiple in 2665) [ClassicSimilarity], result of:
            0.0337412 = score(doc=2665,freq=1.0), product of:
              0.10405728 = queryWeight, product of:
                1.1915365 = boost
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.016832829 = queryNorm
              0.324256 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188096 = idf(docFreq=641, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.04019879 = weight(abstract_txt:forms in 2665) [ClassicSimilarity], result of:
            0.04019879 = score(doc=2665,freq=1.0), product of:
              0.11694297 = queryWeight, product of:
                1.2631595 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.016832829 = queryNorm
              0.34374696 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.098776184 = weight(abstract_txt:entities in 2665) [ClassicSimilarity], result of:
            0.098776184 = score(doc=2665,freq=4.0), product of:
              0.1341468 = queryWeight, product of:
                1.3528862 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.016832829 = queryNorm
              0.7363291 = fieldWeight in 2665, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.011881148 = weight(abstract_txt:that in 2665) [ClassicSimilarity], result of:
            0.011881148 = score(doc=2665,freq=2.0), product of:
              0.055895187 = queryWeight, product of:
                1.3807923 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.016832829 = queryNorm
              0.2125612 = fieldWeight in 2665, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.032328334 = weight(abstract_txt:problem in 2665) [ClassicSimilarity], result of:
            0.032328334 = score(doc=2665,freq=1.0), product of:
              0.115767 = queryWeight, product of:
                1.5392499 = boost
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.016832829 = queryNorm
              0.27925345 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.039316084 = weight(abstract_txt:same in 2665) [ClassicSimilarity], result of:
            0.039316084 = score(doc=2665,freq=1.0), product of:
              0.13189937 = queryWeight, product of:
                1.643002 = boost
                4.769222 = idf(docFreq=975, maxDocs=42306)
                0.016832829 = queryNorm
              0.29807636 = fieldWeight in 2665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.769222 = idf(docFreq=975, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
          0.3068572 = weight(abstract_txt:entity in 2665) [ClassicSimilarity], result of:
            0.3068572 = score(doc=2665,freq=2.0), product of:
              0.5463476 = queryWeight, product of:
                5.107868 = boost
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.016832829 = queryNorm
              0.56165195 = fieldWeight in 2665, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.0625 = fieldNorm(doc=2665)
        0.36 = coord(9/25)
    
  5. Li, X.; Schijvenaars, B.J.A.; Rijke, M.de: Investigating queries and search failures in academic search (2017) 0.21
    0.21192212 = sum of:
      0.21192212 = product of:
        0.5298053 = sum of:
          0.008050205 = weight(abstract_txt:which in 1952) [ClassicSimilarity], result of:
            0.008050205 = score(doc=1952,freq=1.0), product of:
              0.0500874 = queryWeight, product of:
                1.0124673 = boost
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.016832829 = queryNorm
              0.16072316 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.027880205 = weight(abstract_txt:method in 1952) [ClassicSimilarity], result of:
            0.027880205 = score(doc=1952,freq=2.0), product of:
              0.07949639 = queryWeight, product of:
                1.0414654 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.016832829 = queryNorm
              0.35071033 = fieldWeight in 1952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.0068199555 = weight(abstract_txt:with in 1952) [ClassicSimilarity], result of:
            0.0068199555 = score(doc=1952,freq=1.0), product of:
              0.04935803 = queryWeight, product of:
                1.1605531 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.016832829 = queryNorm
              0.13817318 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.030490965 = weight(abstract_txt:known in 1952) [ClassicSimilarity], result of:
            0.030490965 = score(doc=1952,freq=1.0), product of:
              0.10631818 = queryWeight, product of:
                1.2044115 = boost
                5.2441554 = idf(docFreq=606, maxDocs=42306)
                0.016832829 = queryNorm
              0.28678975 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2441554 = idf(docFreq=606, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.043214582 = weight(abstract_txt:entities in 1952) [ClassicSimilarity], result of:
            0.043214582 = score(doc=1952,freq=1.0), product of:
              0.1341468 = queryWeight, product of:
                1.3528862 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.016832829 = queryNorm
              0.32214397 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.016437527 = weight(abstract_txt:that in 1952) [ClassicSimilarity], result of:
            0.016437527 = score(doc=1952,freq=5.0), product of:
              0.055895187 = queryWeight, product of:
                1.3807923 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.016832829 = queryNorm
              0.2940777 = fieldWeight in 1952, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.040004276 = weight(abstract_txt:problem in 1952) [ClassicSimilarity], result of:
            0.040004276 = score(doc=1952,freq=2.0), product of:
              0.115767 = queryWeight, product of:
                1.5392499 = boost
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.016832829 = queryNorm
              0.34555852 = fieldWeight in 1952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4680552 = idf(docFreq=1318, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.08389995 = weight(abstract_txt:label in 1952) [ClassicSimilarity], result of:
            0.08389995 = score(doc=1952,freq=1.0), product of:
              0.20876992 = queryWeight, product of:
                1.6877382 = boost
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.016832829 = queryNorm
              0.4018776 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.348619 = idf(docFreq=73, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.08314944 = weight(abstract_txt:generic in 1952) [ClassicSimilarity], result of:
            0.08314944 = score(doc=1952,freq=1.0), product of:
              0.23755458 = queryWeight, product of:
                2.2049487 = boost
                6.4004107 = idf(docFreq=190, maxDocs=42306)
                0.016832829 = queryNorm
              0.35002246 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4004107 = idf(docFreq=190, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
          0.18985823 = weight(abstract_txt:entity in 1952) [ClassicSimilarity], result of:
            0.18985823 = score(doc=1952,freq=1.0), product of:
              0.5463476 = queryWeight, product of:
                5.107868 = boost
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.016832829 = queryNorm
              0.34750444 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.354367 = idf(docFreq=199, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1952)
        0.4 = coord(10/25)