Document (#35987)

Author
Cota, R.G.
Ferreira, A.A.
Nascimento, C.
Gonçalves, M.A.
Laender, A.H.F.
Title
¬An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.9, S.1853-1870
Year
2010
Abstract
Name ambiguity in the context of bibliographic citations is a difficult problem which, despite the many efforts from the research community, still has a lot of room for improvement. In this article, we present a heuristic-based hierarchical clustering method to deal with this problem. The method successively fuses clusters of citations of similar author names based on several heuristics and similarity measures on the components of the citations (e.g., coauthor names, work title, and publication venue title). During the disambiguation task, the information about fused clusters is aggregated providing more information for the next round of fusion. In order to demonstrate the effectiveness of our method, we ran a series of experiments in two different collections extracted from real-world digital libraries and compared it, under two metrics, with four representative methods described in the literature. We present comparisons of results using each considered attribute separately (i.e., coauthor names, work title, and publication venue title) with the author name attribute and using all attributes together. These results show that our unsupervised method, when using all attributes, performs competitively against all other methods, under both metrics, loosing only in one case against a supervised method, whose result was very close to ours. Moreover, such results are achieved without the burden of any training and without using any privileged information such as knowing a priori the correct number of clusters.

Similar documents (author)

  1. Ferreira, A.A.; Veloso, A.; Gonçalves, M.A.; Laender, A.H.F.: Self-training author name disambiguation for information scarce scenarios (2014) 4.02
    4.0151706 = sum of:
      4.0151706 = product of:
        5.018963 = sum of:
          1.0052938 = weight(author_txt:gonçalves in 1292) [ClassicSimilarity], result of:
            1.0052938 = score(doc=1292,freq=1.0), product of:
              0.37574962 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04388884 = queryNorm
              2.6754353 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.3125 = fieldNorm(doc=1292)
          1.1635104 = weight(author_txt:ferreira in 1292) [ClassicSimilarity], result of:
            1.1635104 = score(doc=1292,freq=1.0), product of:
              0.41420633 = queryWeight, product of:
                1.049927 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.04388884 = queryNorm
              2.8090117 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.3125 = fieldNorm(doc=1292)
          1.4250792 = weight(author_txt:laender in 1292) [ClassicSimilarity], result of:
            1.4250792 = score(doc=1292,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=1292)
          1.4250792 = weight(author_txt:a.h.f in 1292) [ClassicSimilarity], result of:
            1.4250792 = score(doc=1292,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=1292)
        0.8 = coord(4/5)
    
  2. Santana, A.F.; Gonçalves, M.A.; Laender, A.H.F.; Ferreira, A.A.: Incremental author name disambiguation by exploiting domain-specific heuristics (2017) 4.02
    4.0151706 = sum of:
      4.0151706 = product of:
        5.018963 = sum of:
          1.0052938 = weight(author_txt:gonçalves in 3587) [ClassicSimilarity], result of:
            1.0052938 = score(doc=3587,freq=1.0), product of:
              0.37574962 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04388884 = queryNorm
              2.6754353 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.3125 = fieldNorm(doc=3587)
          1.1635104 = weight(author_txt:ferreira in 3587) [ClassicSimilarity], result of:
            1.1635104 = score(doc=3587,freq=1.0), product of:
              0.41420633 = queryWeight, product of:
                1.049927 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.04388884 = queryNorm
              2.8090117 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.3125 = fieldNorm(doc=3587)
          1.4250792 = weight(author_txt:laender in 3587) [ClassicSimilarity], result of:
            1.4250792 = score(doc=3587,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=3587)
          1.4250792 = weight(author_txt:a.h.f in 3587) [ClassicSimilarity], result of:
            1.4250792 = score(doc=3587,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 3587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=3587)
        0.8 = coord(4/5)
    
  3. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 1.85
    1.850617 = sum of:
      1.850617 = product of:
        3.0843616 = sum of:
          0.80423504 = weight(author_txt:gonçalves in 4219) [ClassicSimilarity], result of:
            0.80423504 = score(doc=4219,freq=1.0), product of:
              0.37574962 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04388884 = queryNorm
              2.1403482 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1400633 = weight(author_txt:laender in 4219) [ClassicSimilarity], result of:
            1.1400633 = score(doc=4219,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              2.4043615 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1400633 = weight(author_txt:a.h.f in 4219) [ClassicSimilarity], result of:
            1.1400633 = score(doc=4219,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              2.4043615 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
        0.6 = coord(3/5)
    
  4. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 1.85
    1.850617 = sum of:
      1.850617 = product of:
        3.0843616 = sum of:
          0.80423504 = weight(author_txt:gonçalves in 4450) [ClassicSimilarity], result of:
            0.80423504 = score(doc=4450,freq=1.0), product of:
              0.37574962 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04388884 = queryNorm
              2.1403482 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.1400633 = weight(author_txt:laender in 4450) [ClassicSimilarity], result of:
            1.1400633 = score(doc=4450,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              2.4043615 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.1400633 = weight(author_txt:a.h.f in 4450) [ClassicSimilarity], result of:
            1.1400633 = score(doc=4450,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              2.4043615 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
        0.6 = coord(3/5)
    
  5. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 1.14
    1.1400634 = sum of:
      1.1400634 = product of:
        2.8501585 = sum of:
          1.4250792 = weight(author_txt:laender in 5702) [ClassicSimilarity], result of:
            1.4250792 = score(doc=5702,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
          1.4250792 = weight(author_txt:a.h.f in 5702) [ClassicSimilarity], result of:
            1.4250792 = score(doc=5702,freq=1.0), product of:
              0.4741647 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.04388884 = queryNorm
              3.005452 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
        0.4 = coord(2/5)
    

Similar documents (content)

  1. Ferreira, A.A.; Veloso, A.; Gonçalves, M.A.; Laender, A.H.F.: Self-training author name disambiguation for information scarce scenarios (2014) 0.52
    0.5210096 = sum of:
      0.5210096 = product of:
        1.1841128 = sum of:
          0.071025185 = weight(abstract_txt:author in 1292) [ClassicSimilarity], result of:
            0.071025185 = score(doc=1292,freq=5.0), product of:
              0.102094755 = queryWeight, product of:
                1.0706266 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019156734 = queryNorm
              0.69567907 = fieldWeight in 1292, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.05938543 = weight(abstract_txt:attributes in 1292) [ClassicSimilarity], result of:
            0.05938543 = score(doc=1292,freq=1.0), product of:
              0.15494293 = queryWeight, product of:
                1.3189315 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.019156734 = queryNorm
              0.38327292 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.021390831 = weight(abstract_txt:using in 1292) [ClassicSimilarity], result of:
            0.021390831 = score(doc=1292,freq=1.0), product of:
              0.09882806 = queryWeight, product of:
                1.4896748 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.019156734 = queryNorm
              0.21644491 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.17639086 = weight(abstract_txt:disambiguation in 1292) [ClassicSimilarity], result of:
            0.17639086 = score(doc=1292,freq=3.0), product of:
              0.22198765 = queryWeight, product of:
                1.5787041 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.019156734 = queryNorm
              0.7945976 = fieldWeight in 1292, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.113906726 = weight(abstract_txt:unsupervised in 1292) [ClassicSimilarity], result of:
            0.113906726 = score(doc=1292,freq=1.0), product of:
              0.23919463 = queryWeight, product of:
                1.6387475 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.019156734 = queryNorm
              0.47620937 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.103646405 = weight(abstract_txt:name in 1292) [ClassicSimilarity], result of:
            0.103646405 = score(doc=1292,freq=2.0), product of:
              0.2040681 = queryWeight, product of:
                1.8538283 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019156734 = queryNorm
              0.5079011 = fieldWeight in 1292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.076669045 = weight(abstract_txt:names in 1292) [ClassicSimilarity], result of:
            0.076669045 = score(doc=1292,freq=1.0), product of:
              0.21029502 = queryWeight, product of:
                1.8818996 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019156734 = queryNorm
              0.36457852 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.19136745 = weight(abstract_txt:venue in 1292) [ClassicSimilarity], result of:
            0.19136745 = score(doc=1292,freq=1.0), product of:
              0.33803672 = queryWeight, product of:
                1.9481316 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.019156734 = queryNorm
              0.56611437 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.15106569 = weight(abstract_txt:clusters in 1292) [ClassicSimilarity], result of:
            0.15106569 = score(doc=1292,freq=2.0), product of:
              0.26233092 = queryWeight, product of:
                2.101874 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.019156734 = queryNorm
              0.57585925 = fieldWeight in 1292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.07838199 = weight(abstract_txt:citations in 1292) [ClassicSimilarity], result of:
            0.07838199 = score(doc=1292,freq=1.0), product of:
              0.23489442 = queryWeight, product of:
                2.2966123 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.019156734 = queryNorm
              0.33369032 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.14088325 = weight(abstract_txt:method in 1292) [ClassicSimilarity], result of:
            0.14088325 = score(doc=1292,freq=4.0), product of:
              0.25040627 = queryWeight, product of:
                2.9041533 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019156734 = queryNorm
              0.56261873 = fieldWeight in 1292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
        0.44 = coord(11/25)
    
  2. Liu, Y.; Li, W.; Huang, Z.; Fang, Q.: ¬A fast method based on multiple clustering for name disambiguation in bibliographic citations (2015) 0.47
    0.4720393 = sum of:
      0.4720393 = product of:
        1.0728166 = sum of:
          0.012514464 = weight(abstract_txt:based in 1672) [ClassicSimilarity], result of:
            0.012514464 = score(doc=1672,freq=1.0), product of:
              0.06280927 = queryWeight, product of:
                1.0284753 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019156734 = queryNorm
              0.19924548 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.016312897 = weight(abstract_txt:results in 1672) [ClassicSimilarity], result of:
            0.016312897 = score(doc=1672,freq=1.0), product of:
              0.07494966 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019156734 = queryNorm
              0.21765138 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.039039064 = weight(abstract_txt:publication in 1672) [ClassicSimilarity], result of:
            0.039039064 = score(doc=1672,freq=1.0), product of:
              0.117143475 = queryWeight, product of:
                1.1468203 = boost
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.019156734 = queryNorm
              0.33325854 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.083983675 = weight(abstract_txt:attributes in 1672) [ClassicSimilarity], result of:
            0.083983675 = score(doc=1672,freq=2.0), product of:
              0.15494293 = queryWeight, product of:
                1.3189315 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.019156734 = queryNorm
              0.54202974 = fieldWeight in 1672, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.10183931 = weight(abstract_txt:disambiguation in 1672) [ClassicSimilarity], result of:
            0.10183931 = score(doc=1672,freq=1.0), product of:
              0.22198765 = queryWeight, product of:
                1.5787041 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.019156734 = queryNorm
              0.45876116 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.12694041 = weight(abstract_txt:name in 1672) [ClassicSimilarity], result of:
            0.12694041 = score(doc=1672,freq=3.0), product of:
              0.2040681 = queryWeight, product of:
                1.8538283 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019156734 = queryNorm
              0.6220493 = fieldWeight in 1672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.18702778 = weight(abstract_txt:coauthor in 1672) [ClassicSimilarity], result of:
            0.18702778 = score(doc=1672,freq=1.0), product of:
              0.33290675 = queryWeight, product of:
                1.9332927 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.019156734 = queryNorm
              0.5618023 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.19136745 = weight(abstract_txt:venue in 1672) [ClassicSimilarity], result of:
            0.19136745 = score(doc=1672,freq=1.0), product of:
              0.33803672 = queryWeight, product of:
                1.9481316 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.019156734 = queryNorm
              0.56611437 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.10681957 = weight(abstract_txt:clusters in 1672) [ClassicSimilarity], result of:
            0.10681957 = score(doc=1672,freq=1.0), product of:
              0.26233092 = queryWeight, product of:
                2.101874 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.019156734 = queryNorm
              0.407194 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.13653043 = weight(abstract_txt:title in 1672) [ClassicSimilarity], result of:
            0.13653043 = score(doc=1672,freq=2.0), product of:
              0.26990122 = queryWeight, product of:
                2.4618056 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.019156734 = queryNorm
              0.50585335 = fieldWeight in 1672, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
          0.070441626 = weight(abstract_txt:method in 1672) [ClassicSimilarity], result of:
            0.070441626 = score(doc=1672,freq=1.0), product of:
              0.25040627 = queryWeight, product of:
                2.9041533 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019156734 = queryNorm
              0.28130937 = fieldWeight in 1672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1672)
        0.44 = coord(11/25)
    
  3. Pellack, L.J.; Kappmeyer, L.O.: ¬The ripple effect of women's name changes in indexing, citation, and authority control (2011) 0.27
    0.2652965 = sum of:
      0.2652965 = product of:
        0.73693466 = sum of:
          0.071025185 = weight(abstract_txt:author in 4347) [ClassicSimilarity], result of:
            0.071025185 = score(doc=4347,freq=5.0), product of:
              0.102094755 = queryWeight, product of:
                1.0706266 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019156734 = queryNorm
              0.69567907 = fieldWeight in 4347, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.035293784 = weight(abstract_txt:under in 4347) [ClassicSimilarity], result of:
            0.035293784 = score(doc=4347,freq=1.0), product of:
              0.10952602 = queryWeight, product of:
                1.1089066 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.019156734 = queryNorm
              0.32224107 = fieldWeight in 4347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.016312897 = weight(abstract_txt:results in 4347) [ClassicSimilarity], result of:
            0.016312897 = score(doc=4347,freq=1.0), product of:
              0.07494966 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019156734 = queryNorm
              0.21765138 = fieldWeight in 4347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.039039064 = weight(abstract_txt:publication in 4347) [ClassicSimilarity], result of:
            0.039039064 = score(doc=4347,freq=1.0), product of:
              0.117143475 = queryWeight, product of:
                1.1468203 = boost
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.019156734 = queryNorm
              0.33325854 = fieldWeight in 4347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.021390831 = weight(abstract_txt:using in 4347) [ClassicSimilarity], result of:
            0.021390831 = score(doc=4347,freq=1.0), product of:
              0.09882806 = queryWeight, product of:
                1.4896748 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.019156734 = queryNorm
              0.21644491 = fieldWeight in 4347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.103646405 = weight(abstract_txt:name in 4347) [ClassicSimilarity], result of:
            0.103646405 = score(doc=4347,freq=2.0), product of:
              0.2040681 = queryWeight, product of:
                1.8538283 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019156734 = queryNorm
              0.5079011 = fieldWeight in 4347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.20284723 = weight(abstract_txt:names in 4347) [ClassicSimilarity], result of:
            0.20284723 = score(doc=4347,freq=7.0), product of:
              0.21029502 = queryWeight, product of:
                1.8818996 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019156734 = queryNorm
              0.96458405 = fieldWeight in 4347, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.11084888 = weight(abstract_txt:citations in 4347) [ClassicSimilarity], result of:
            0.11084888 = score(doc=4347,freq=2.0), product of:
              0.23489442 = queryWeight, product of:
                2.2966123 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.019156734 = queryNorm
              0.47190937 = fieldWeight in 4347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
          0.13653043 = weight(abstract_txt:title in 4347) [ClassicSimilarity], result of:
            0.13653043 = score(doc=4347,freq=2.0), product of:
              0.26990122 = queryWeight, product of:
                2.4618056 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.019156734 = queryNorm
              0.50585335 = fieldWeight in 4347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0625 = fieldNorm(doc=4347)
        0.36 = coord(9/25)
    
  4. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.25
    0.25219488 = sum of:
      0.25219488 = product of:
        0.90069604 = sum of:
          0.017698124 = weight(abstract_txt:based in 311) [ClassicSimilarity], result of:
            0.017698124 = score(doc=311,freq=2.0), product of:
              0.06280927 = queryWeight, product of:
                1.0284753 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019156734 = queryNorm
              0.28177565 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.04492027 = weight(abstract_txt:author in 311) [ClassicSimilarity], result of:
            0.04492027 = score(doc=311,freq=2.0), product of:
              0.102094755 = queryWeight, product of:
                1.0706266 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019156734 = queryNorm
              0.43998608 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.016312897 = weight(abstract_txt:results in 311) [ClassicSimilarity], result of:
            0.016312897 = score(doc=311,freq=1.0), product of:
              0.07494966 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019156734 = queryNorm
              0.21765138 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.22771962 = weight(abstract_txt:disambiguation in 311) [ClassicSimilarity], result of:
            0.22771962 = score(doc=311,freq=5.0), product of:
              0.22198765 = queryWeight, product of:
                1.5787041 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.019156734 = queryNorm
              1.0258211 = fieldWeight in 311, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.27422264 = weight(abstract_txt:name in 311) [ClassicSimilarity], result of:
            0.27422264 = score(doc=311,freq=14.0), product of:
              0.2040681 = queryWeight, product of:
                1.8538283 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.019156734 = queryNorm
              1.34378 = fieldWeight in 311, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.13279468 = weight(abstract_txt:names in 311) [ClassicSimilarity], result of:
            0.13279468 = score(doc=311,freq=3.0), product of:
              0.21029502 = queryWeight, product of:
                1.8818996 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019156734 = queryNorm
              0.6314685 = fieldWeight in 311, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.18702778 = weight(abstract_txt:coauthor in 311) [ClassicSimilarity], result of:
            0.18702778 = score(doc=311,freq=1.0), product of:
              0.33290675 = queryWeight, product of:
                1.9332927 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.019156734 = queryNorm
              0.5618023 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
        0.28 = coord(7/25)
    
  5. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 0.24
    0.24190386 = sum of:
      0.24190386 = product of:
        0.60475963 = sum of:
          0.09059021 = weight(abstract_txt:ours in 2848) [ClassicSimilarity], result of:
            0.09059021 = score(doc=2848,freq=1.0), product of:
              0.17813832 = queryWeight, product of:
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.019156734 = queryNorm
              0.5085386 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.0109501565 = weight(abstract_txt:based in 2848) [ClassicSimilarity], result of:
            0.0109501565 = score(doc=2848,freq=1.0), product of:
              0.06280927 = queryWeight, product of:
                1.0284753 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019156734 = queryNorm
              0.1743398 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.027793 = weight(abstract_txt:author in 2848) [ClassicSimilarity], result of:
            0.027793 = score(doc=2848,freq=1.0), product of:
              0.102094755 = queryWeight, product of:
                1.0706266 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019156734 = queryNorm
              0.2722275 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.02472292 = weight(abstract_txt:results in 2848) [ClassicSimilarity], result of:
            0.02472292 = score(doc=2848,freq=3.0), product of:
              0.07494966 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019156734 = queryNorm
              0.32986036 = fieldWeight in 2848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.032161307 = weight(abstract_txt:without in 2848) [ClassicSimilarity], result of:
            0.032161307 = score(doc=2848,freq=1.0), product of:
              0.112530164 = queryWeight, product of:
                1.1240116 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.019156734 = queryNorm
              0.28580165 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.044593416 = weight(abstract_txt:against in 2848) [ClassicSimilarity], result of:
            0.044593416 = score(doc=2848,freq=1.0), product of:
              0.13992447 = queryWeight, product of:
                1.2533811 = boost
                5.8275905 = idf(docFreq=353, maxDocs=44218)
                0.019156734 = queryNorm
              0.31869635 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8275905 = idf(docFreq=353, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.018716976 = weight(abstract_txt:using in 2848) [ClassicSimilarity], result of:
            0.018716976 = score(doc=2848,freq=1.0), product of:
              0.09882806 = queryWeight, product of:
                1.4896748 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.019156734 = queryNorm
              0.18938929 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.067085415 = weight(abstract_txt:names in 2848) [ClassicSimilarity], result of:
            0.067085415 = score(doc=2848,freq=1.0), product of:
              0.21029502 = queryWeight, product of:
                1.8818996 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019156734 = queryNorm
              0.3190062 = fieldWeight in 2848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.13716848 = weight(abstract_txt:citations in 2848) [ClassicSimilarity], result of:
            0.13716848 = score(doc=2848,freq=4.0), product of:
              0.23489442 = queryWeight, product of:
                2.2966123 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.019156734 = queryNorm
              0.583958 = fieldWeight in 2848, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
          0.15097779 = weight(abstract_txt:method in 2848) [ClassicSimilarity], result of:
            0.15097779 = score(doc=2848,freq=6.0), product of:
              0.25040627 = queryWeight, product of:
                2.9041533 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019156734 = queryNorm
              0.6029314 = fieldWeight in 2848, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2848)
        0.4 = coord(10/25)