Document (#38241)

Author
Liu, W.
Doğan, R.I.
Kim, S.
Comeau, D.C.
Kim, W.
Yeganova, L.
Lu, Z.
Wilbur, W.J.
Title
Author name disambiguation for PubMed
Source
Journal of the Association for Information Science and Technology. 65(2014) no.4, p.765-781
Year
2014
Abstract
Log analysis shows that PubMed users frequently use author names in queries for retrieving scientific literature. However, author name ambiguity may lead to irrelevant retrieval results. To improve the PubMed user experience with author name queries, we designed an author name disambiguation system consisting of similarity estimation and agglomerative clustering. A machine-learning method was employed to score the features for disambiguating a pair of papers with ambiguous names. These features enable the computation of pairwise similarity scores to estimate the probability of a pair of papers belonging to the same author, which drives an agglomerative clustering algorithm regulated by two factors: name compatibility and probability level. With transitivity violation correction, high-precision author clustering is achieved by focusing on minimizing false-positive pairing. Disambiguation performance is evaluated with manual verification of random samples of pairs from clustering results. When compared with a state-of-the-art system, our evaluation shows that among all the pairs, the lumping error rate drops from 10.1% to 2.2% for our system, while the splitting error rises from 1.8% to 7.7%. This results in an overall error rate of 9.9%, compared with 11.9% for the state-of-the-art method. Other evaluations based on gold standard data also show the increase in accuracy of our clustering. We attribute the performance improvement to the machine-learning method driven by a large-scale training set and the clustering algorithm regulated by a name compatibility scheme preferring precision. With integration of the author name disambiguation system into the PubMed search engine, the overall click-through rate of PubMed users on author name query results improved from 34.9% to 36.9%.
Object
PubMed
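
The abstract describes agglomerative clustering driven by pairwise same-author probabilities and gated by name compatibility and a probability level. The record gives no further detail, so the following is only a minimal sketch of that general idea, not the authors' method: it uses plain average-linkage merging, and the compatibility rule (`names_compatible`), the threshold of 0.9, and all names and sample data are hypothetical. Transitivity violation correction is not modelled.

```python
from itertools import combinations

def names_compatible(a, b):
    """Hypothetical rule: same surname, and one given-name string is a prefix of the other."""
    surname_a, given_a = a
    surname_b, given_b = b
    if surname_a != surname_b:
        return False
    return given_a.startswith(given_b) or given_b.startswith(given_a)

def cluster(papers, prob, threshold=0.9):
    """Greedy average-linkage clustering.

    papers: dict paper_id -> (surname, given name or initials)
    prob:   dict frozenset({id1, id2}) -> estimated same-author probability
    Two clusters merge only if every cross pair of names is compatible and
    the mean cross-pair probability is at least `threshold`.
    """
    clusters = [{pid} for pid in papers]
    while True:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            a, b = clusters[i], clusters[j]
            if not all(names_compatible(papers[x], papers[y]) for x in a for y in b):
                continue
            scores = [prob.get(frozenset((x, y)), 0.0) for x in a for y in b]
            mean = sum(scores) / len(scores)
            if mean >= threshold and (best is None or mean > best[0]):
                best = (mean, i, j)
        if best is None:
            return clusters
        _, i, j = best
        clusters[i] |= clusters[j]
        del clusters[j]

# Illustrative data only: four papers signed by a "Kim".
papers = {1: ("kim", "s"), 2: ("kim", "sun"), 3: ("kim", "w"), 4: ("kim", "s")}
prob = {
    frozenset((1, 2)): 0.95,
    frozenset((1, 4)): 0.97,
    frozenset((2, 4)): 0.92,
    frozenset((1, 3)): 0.99,  # high score, but the names are incompatible
}
result = cluster(papers, prob)
```

In this toy run, papers 1, 2 and 4 end up together while paper 3 stays alone despite its high pairwise score, which illustrates how a name-compatibility gate favours precision over recall.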

Similar documents (author)

  1. Wilbur, W.J.: Global term weights for document retrieval learned from TREC data (2001) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 2647) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 2647, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=2647)
    
  2. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 6607) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 6607, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=6607)
    
  3. Wilbur, W.J.: A comparison of group and individual performance among subject experts and untrained workers at the document retrieval task (1998) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 3263) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 3263, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=3263)
    
  4. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1999) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 4539) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4539, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4539)
    
  5. Wilbur, W.J.: A retrieval system based on automatic relevance weighting of search terms (1992) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:wilbur in 5269) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5269)
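
The five author-match explanations above all reduce to the same arithmetic. As a minimal sketch, assuming the classic Lucene TF-IDF definitions that reproduce the printed values (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))), the numbers can be recomputed in Python. The function names are illustrative and not part of Lucene, and fieldNorm is copied from the output rather than derived.

```python
import math

def tf(freq):
    # Term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def idf(doc_freq, max_docs):
    # Inverse document frequency in the form that matches the printed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explanation tree.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# Values taken from the author_txt:wilbur explanations above.
author_idf = idf(doc_freq=14, max_docs=44218)
author_score = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.625)
print(author_idf, author_score)  # close to the printed 8.988837 and 5.6180234
```

Because all five records contain the query term once and share the same fieldNorm, they receive identical scores; small differences in the last digits versus Lucene's output come from its single-precision arithmetic.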
    

Similar documents (content)

  1. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.38
    0.3773886 = sum of:
      0.3773886 = product of:
        1.1793394 = sum of:
          0.026318857 = weight(abstract_txt:machine in 311) [ClassicSimilarity], result of:
            0.026318857 = score(doc=311,freq=1.0), product of:
              0.079776436 = queryWeight, product of:
                1.0006175 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.015104076 = queryNorm
              0.32990766 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.061521046 = weight(abstract_txt:names in 311) [ClassicSimilarity], result of:
            0.061521046 = score(doc=311,freq=3.0), product of:
              0.097425364 = queryWeight, product of:
                1.1057751 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.015104076 = queryNorm
              0.6314685 = fieldWeight in 311, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.097417004 = weight(abstract_txt:pairs in 311) [ClassicSimilarity], result of:
            0.097417004 = score(doc=311,freq=3.0), product of:
              0.13235673 = queryWeight, product of:
                1.2888542 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.015104076 = queryNorm
              0.7360185 = fieldWeight in 311, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.015114862 = weight(abstract_txt:results in 311) [ClassicSimilarity], result of:
            0.015114862 = score(doc=311,freq=1.0), product of:
              0.06944528 = queryWeight, product of:
                1.3202834 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015104076 = queryNorm
              0.21765138 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.013835475 = weight(abstract_txt:with in 311) [ClassicSimilarity], result of:
            0.013835475 = score(doc=311,freq=2.0), product of:
              0.06261889 = queryWeight, product of:
                1.6585077 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015104076 = queryNorm
              0.22094731 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.31649348 = weight(abstract_txt:disambiguation in 311) [ClassicSimilarity], result of:
            0.31649348 = score(doc=311,freq=5.0), product of:
              0.308527 = queryWeight, product of:
                2.7828665 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015104076 = queryNorm
              1.0258211 = fieldWeight in 311, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.1404718 = weight(abstract_txt:author in 311) [ClassicSimilarity], result of:
            0.1404718 = score(doc=311,freq=2.0), product of:
              0.3192642 = queryWeight, product of:
                4.2463145 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.015104076 = queryNorm
              0.43998608 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.50816685 = weight(abstract_txt:name in 311) [ClassicSimilarity], result of:
            0.50816685 = score(doc=311,freq=14.0), product of:
              0.37816224 = queryWeight, product of:
                4.3571234 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.015104076 = queryNorm
              1.34378 = fieldWeight in 311, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
        0.32 = coord(8/25)
    
  2. Zhang, L.; Lu, W.; Yang, J.: LAGOS-AND : a large gold standard dataset for scholarly author name disambiguation (2023) 0.36
    0.3560056 = sum of:
      0.3560056 = product of:
        0.9889044 = sum of:
          0.007556466 = weight(abstract_txt:from in 883) [ClassicSimilarity], result of:
            0.007556466 = score(doc=883,freq=1.0), product of:
              0.04374406 = queryWeight, product of:
                1.047865 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015104076 = queryNorm
              0.17274266 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.061521046 = weight(abstract_txt:names in 883) [ClassicSimilarity], result of:
            0.061521046 = score(doc=883,freq=3.0), product of:
              0.097425364 = queryWeight, product of:
                1.1057751 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.015104076 = queryNorm
              0.6314685 = fieldWeight in 883, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.013725931 = weight(abstract_txt:system in 883) [ClassicSimilarity], result of:
            0.013725931 = score(doc=883,freq=1.0), product of:
              0.065123014 = queryWeight, product of:
                1.2785362 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015104076 = queryNorm
              0.21076928 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.03461375 = weight(abstract_txt:method in 883) [ClassicSimilarity], result of:
            0.03461375 = score(doc=883,freq=2.0), product of:
              0.08700606 = queryWeight, product of:
                1.2798268 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.015104076 = queryNorm
              0.3978315 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.20016807 = weight(abstract_txt:disambiguation in 883) [ClassicSimilarity], result of:
            0.20016807 = score(doc=883,freq=2.0), product of:
              0.308527 = queryWeight, product of:
                2.7828665 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015104076 = queryNorm
              0.64878625 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.12895444 = weight(abstract_txt:clustering in 883) [ClassicSimilarity], result of:
            0.12895444 = score(doc=883,freq=1.0), product of:
              0.3319158 = queryWeight, product of:
                3.53513 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.015104076 = queryNorm
              0.38851553 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.20789424 = weight(abstract_txt:pubmed in 883) [ClassicSimilarity], result of:
            0.20789424 = score(doc=883,freq=1.0), product of:
              0.4294424 = queryWeight, product of:
                3.670737 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.015104076 = queryNorm
              0.48410273 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.19865714 = weight(abstract_txt:author in 883) [ClassicSimilarity], result of:
            0.19865714 = score(doc=883,freq=4.0), product of:
              0.3192642 = queryWeight, product of:
                4.2463145 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.015104076 = queryNorm
              0.6222343 = fieldWeight in 883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
          0.1358133 = weight(abstract_txt:name in 883) [ClassicSimilarity], result of:
            0.1358133 = score(doc=883,freq=1.0), product of:
              0.37816224 = queryWeight, product of:
                4.3571234 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.015104076 = queryNorm
              0.3591403 = fieldWeight in 883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=883)
        0.36 = coord(9/25)
    
  3. Cota, R.G.; Ferreira, A.A.; Nascimento, C.; Gonçalves, M.A.; Laender, A.H.F.: An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations (2010) 0.32
    0.32334355 = sum of:
      0.32334355 = product of:
        0.80835885 = sum of:
          0.010686456 = weight(abstract_txt:from in 3986) [ClassicSimilarity], result of:
            0.010686456 = score(doc=3986,freq=2.0), product of:
              0.04374406 = queryWeight, product of:
                1.047865 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015104076 = queryNorm
              0.24429502 = fieldWeight in 3986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.035262156 = weight(abstract_txt:similarity in 3986) [ClassicSimilarity], result of:
            0.035262156 = score(doc=3986,freq=1.0), product of:
              0.09695477 = queryWeight, product of:
                1.1031013 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.015104076 = queryNorm
              0.36369696 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.061521046 = weight(abstract_txt:names in 3986) [ClassicSimilarity], result of:
            0.061521046 = score(doc=3986,freq=3.0), product of:
              0.097425364 = queryWeight, product of:
                1.1057751 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.015104076 = queryNorm
              0.6314685 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.054729152 = weight(abstract_txt:method in 3986) [ClassicSimilarity], result of:
            0.054729152 = score(doc=3986,freq=5.0), product of:
              0.08700606 = queryWeight, product of:
                1.2798268 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.015104076 = queryNorm
              0.6290269 = fieldWeight in 3986, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.026179709 = weight(abstract_txt:results in 3986) [ClassicSimilarity], result of:
            0.026179709 = score(doc=3986,freq=3.0), product of:
              0.06944528 = queryWeight, product of:
                1.3202834 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015104076 = queryNorm
              0.37698326 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.016944926 = weight(abstract_txt:with in 3986) [ClassicSimilarity], result of:
            0.016944926 = score(doc=3986,freq=3.0), product of:
              0.06261889 = queryWeight, product of:
                1.6585077 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015104076 = queryNorm
              0.27060407 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.1415402 = weight(abstract_txt:disambiguation in 3986) [ClassicSimilarity], result of:
            0.1415402 = score(doc=3986,freq=1.0), product of:
              0.308527 = queryWeight, product of:
                2.7828665 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015104076 = queryNorm
              0.45876116 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.12895444 = weight(abstract_txt:clustering in 3986) [ClassicSimilarity], result of:
            0.12895444 = score(doc=3986,freq=1.0), product of:
              0.3319158 = queryWeight, product of:
                3.53513 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.015104076 = queryNorm
              0.38851553 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.1404718 = weight(abstract_txt:author in 3986) [ClassicSimilarity], result of:
            0.1404718 = score(doc=3986,freq=2.0), product of:
              0.3192642 = queryWeight, product of:
                4.2463145 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.015104076 = queryNorm
              0.43998608 = fieldWeight in 3986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.19206901 = weight(abstract_txt:name in 3986) [ClassicSimilarity], result of:
            0.19206901 = score(doc=3986,freq=2.0), product of:
              0.37816224 = queryWeight, product of:
                4.3571234 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.015104076 = queryNorm
              0.5079011 = fieldWeight in 3986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
        0.4 = coord(10/25)
    
  4. Zhao, D.; Strotmann, A.: Counting first, last, or all authors in citation analysis : a comprehensive comparison in the highly collaborative stem cell research field (2011) 0.31
    0.3130506 = sum of:
      0.3130506 = product of:
        0.869585 = sum of:
          0.029421465 = weight(abstract_txt:overall in 4368) [ClassicSimilarity], result of:
            0.029421465 = score(doc=4368,freq=1.0), product of:
              0.08592895 = queryWeight, product of:
                1.0384858 = boost
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.015104076 = queryNorm
              0.34239295 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.007556466 = weight(abstract_txt:from in 4368) [ClassicSimilarity], result of:
            0.007556466 = score(doc=4368,freq=1.0), product of:
              0.04374406 = queryWeight, product of:
                1.047865 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015104076 = queryNorm
              0.17274266 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.024475621 = weight(abstract_txt:method in 4368) [ClassicSimilarity], result of:
            0.024475621 = score(doc=4368,freq=1.0), product of:
              0.08700606 = queryWeight, product of:
                1.2798268 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.015104076 = queryNorm
              0.28130937 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.015114862 = weight(abstract_txt:results in 4368) [ClassicSimilarity], result of:
            0.015114862 = score(doc=4368,freq=1.0), product of:
              0.06944528 = queryWeight, product of:
                1.3202834 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.015104076 = queryNorm
              0.21765138 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.009783158 = weight(abstract_txt:with in 4368) [ClassicSimilarity], result of:
            0.009783158 = score(doc=4368,freq=1.0), product of:
              0.06261889 = queryWeight, product of:
                1.6585077 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015104076 = queryNorm
              0.15623334 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.1415402 = weight(abstract_txt:disambiguation in 4368) [ClassicSimilarity], result of:
            0.1415402 = score(doc=4368,freq=1.0), product of:
              0.308527 = queryWeight, product of:
                2.7828665 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015104076 = queryNorm
              0.45876116 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.20789424 = weight(abstract_txt:pubmed in 4368) [ClassicSimilarity], result of:
            0.20789424 = score(doc=4368,freq=1.0), product of:
              0.4294424 = queryWeight, product of:
                3.670737 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.015104076 = queryNorm
              0.48410273 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.2979857 = weight(abstract_txt:author in 4368) [ClassicSimilarity], result of:
            0.2979857 = score(doc=4368,freq=9.0), product of:
              0.3192642 = queryWeight, product of:
                4.2463145 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.015104076 = queryNorm
              0.9333514 = fieldWeight in 4368, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
          0.1358133 = weight(abstract_txt:name in 4368) [ClassicSimilarity], result of:
            0.1358133 = score(doc=4368,freq=1.0), product of:
              0.37816224 = queryWeight, product of:
                4.3571234 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.015104076 = queryNorm
              0.3591403 = fieldWeight in 4368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=4368)
        0.36 = coord(9/25)
    
  5. Torvik, V.I.; Weeber, M.; Swanson, D.R.; Smalheiser, N.R.: A probabilistic similarity metric for Medline records : a model for author name disambiguation (2005) 0.31
    0.31126565 = sum of:
      0.31126565 = product of:
        0.86462677 = sum of:
          0.007556466 = weight(abstract_txt:from in 3308) [ClassicSimilarity], result of:
            0.007556466 = score(doc=3308,freq=1.0), product of:
              0.04374406 = queryWeight, product of:
                1.047865 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015104076 = queryNorm
              0.17274266 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.06107584 = weight(abstract_txt:similarity in 3308) [ClassicSimilarity], result of:
            0.06107584 = score(doc=3308,freq=3.0), product of:
              0.09695477 = queryWeight, product of:
                1.1031013 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.015104076 = queryNorm
              0.6299416 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.050231725 = weight(abstract_txt:names in 3308) [ClassicSimilarity], result of:
            0.050231725 = score(doc=3308,freq=2.0), product of:
              0.097425364 = queryWeight, product of:
                1.1057751 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.015104076 = queryNorm
              0.51559186 = fieldWeight in 3308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.056243733 = weight(abstract_txt:pairs in 3308) [ClassicSimilarity], result of:
            0.056243733 = score(doc=3308,freq=1.0), product of:
              0.13235673 = queryWeight, product of:
                1.2888542 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.015104076 = queryNorm
              0.42494047 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.08258487 = weight(abstract_txt:probability in 3308) [ClassicSimilarity], result of:
            0.08258487 = score(doc=3308,freq=2.0), product of:
              0.13571264 = queryWeight, product of:
                1.3050914 = boost
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.015104076 = queryNorm
              0.6085275 = fieldWeight in 3308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8847027 = idf(docFreq=122, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.11425936 = weight(abstract_txt:pair in 3308) [ClassicSimilarity], result of:
            0.11425936 = score(doc=3308,freq=2.0), product of:
              0.16850565 = queryWeight, product of:
                1.4542465 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.015104076 = queryNorm
              0.67807436 = fieldWeight in 3308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.016944926 = weight(abstract_txt:with in 3308) [ClassicSimilarity], result of:
            0.016944926 = score(doc=3308,freq=3.0), product of:
              0.06261889 = queryWeight, product of:
                1.6585077 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015104076 = queryNorm
              0.27060407 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.17204212 = weight(abstract_txt:author in 3308) [ClassicSimilarity], result of:
            0.17204212 = score(doc=3308,freq=3.0), product of:
              0.3192642 = queryWeight, product of:
                4.2463145 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.015104076 = queryNorm
              0.5388707 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.30368778 = weight(abstract_txt:name in 3308) [ClassicSimilarity], result of:
            0.30368778 = score(doc=3308,freq=5.0), product of:
              0.37816224 = queryWeight, product of:
                4.3571234 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.015104076 = queryNorm
              0.80306214 = fieldWeight in 3308, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
        0.36 = coord(9/25)
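
The content-based scores above add a query-side factor and a coordination factor to the per-field weight. The following is a hedged sketch, assuming the usual ClassicSimilarity composition visible in the trees (queryWeight = boost × idf × queryNorm, term score = queryWeight × fieldWeight, document score = sum of term scores × coord). It is checked against the `pair` term and the totals of entry 5 (doc 3308). Helper names are illustrative; boost, idf, queryNorm, and fieldNorm are copied from the output.

```python
import math

def term_score(freq, idf, boost, query_norm, field_norm):
    # Query side: how much this term matters in the query.
    query_weight = boost * idf * query_norm
    # Document side: tf * idf * fieldNorm, with tf = sqrt(freq).
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, matched, total_terms):
    # coord rewards documents that match more of the query terms.
    return sum(term_scores) * (matched / total_terms)

# abstract_txt:pair in doc 3308
pair = term_score(freq=2.0, idf=7.6715355, boost=1.4542465,
                  query_norm=0.015104076, field_norm=0.0625)

# The nine per-term scores listed for doc 3308, in order of appearance.
doc_3308_terms = [0.007556466, 0.06107584, 0.050231725, 0.056243733,
                  0.08258487, 0.11425936, 0.016944926, 0.17204212, 0.30368778]
total = document_score(doc_3308_terms, matched=9, total_terms=25)
print(pair, total)  # close to the printed 0.11425936 and 0.31126565
```

The coord factor explains why entry 3, which matches 10 of the 25 query terms, is scaled by 0.4 while the entries matching 8 or 9 terms are scaled by 0.32 or 0.36.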