Document (#43313)

Author
Kim, J.
Kim, J.
Owen-Smith, J.
Title
Ethnicity-based name partitioning for author name disambiguation using supervised machine learning
Source
Journal of the Association for Information Science and Technology. 72(2021) no.8, S.979-994
Year
2021
Abstract
In several author name disambiguation studies, some ethnic name groups such as East Asian names are reported to be more difficult to disambiguate than others. This implies that disambiguation approaches might be improved if ethnic name groups are distinguished before disambiguation. We explore the potential of ethnic name partitioning by comparing performance of four machine learning algorithms trained and tested on the entire data or specifically on individual name groups. Results show that ethnicity-based name partitioning can substantially improve disambiguation performance because the individual models are better suited for their respective name group. The improvements occur across all ethnic name groups with different magnitudes. Performance gains in predicting matched name pairs outweigh losses in predicting nonmatched pairs. Feature (e.g., coauthor name) similarities of name pairs vary across ethnic name groups. Such differences may enable the development of ethnicity-specific feature weights to improve prediction for specific ethic name categories. These findings are observed for three labeled data with a natural distribution of problem sizes as well as one in which all ethnic name groups are controlled for the same sizes of ambiguous names. This study is expected to motive scholars to group author names based on ethnicity prior to disambiguation.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24459.
Theme
Formalerschließung

Similar documents (author)

  1. Owen, T.: Success at the enquiry desk (1996) 2.55
    2.5527885 = sum of:
      2.5527885 = product of:
        5.105577 = sum of:
          5.105577 = weight(author_txt:owen in 6115) [ClassicSimilarity], result of:
            5.105577 = score(doc=6115,freq=1.0), product of:
              0.8945443 = queryWeight, product of:
                1.4146769 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.069243915 = queryNorm
              5.7074614 = fieldWeight in 6115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=6115)
        0.5 = coord(1/2)
    
  2. Owen, P.: Structured for success : the continuing role of quality indexing in intelligent information retrieval systems (1994) 2.55
    2.5527885 = sum of:
      2.5527885 = product of:
        5.105577 = sum of:
          5.105577 = weight(author_txt:owen in 1866) [ClassicSimilarity], result of:
            5.105577 = score(doc=1866,freq=1.0), product of:
              0.8945443 = queryWeight, product of:
                1.4146769 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.069243915 = queryNorm
              5.7074614 = fieldWeight in 1866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=1866)
        0.5 = coord(1/2)
    
  3. Owen, C.: ¬The influence of CD-ROM databases on information selection (1996) 2.55
    2.5527885 = sum of:
      2.5527885 = product of:
        5.105577 = sum of:
          5.105577 = weight(author_txt:owen in 5499) [ClassicSimilarity], result of:
            5.105577 = score(doc=5499,freq=1.0), product of:
              0.8945443 = queryWeight, product of:
                1.4146769 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.069243915 = queryNorm
              5.7074614 = fieldWeight in 5499, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=5499)
        0.5 = coord(1/2)
    
  4. Owen, C.: Metadata for music and movies (1998) 2.55
    2.5527885 = sum of:
      2.5527885 = product of:
        5.105577 = sum of:
          5.105577 = weight(author_txt:owen in 2472) [ClassicSimilarity], result of:
            5.105577 = score(doc=2472,freq=1.0), product of:
              0.8945443 = queryWeight, product of:
                1.4146769 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.069243915 = queryNorm
              5.7074614 = fieldWeight in 2472, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=2472)
        0.5 = coord(1/2)
    
  5. Owen, T.: Success at the enquiry desk : Successful enquiry answering - every time (1998) 2.55
    2.5527885 = sum of:
      2.5527885 = product of:
        5.105577 = sum of:
          5.105577 = weight(author_txt:owen in 1440) [ClassicSimilarity], result of:
            5.105577 = score(doc=1440,freq=1.0), product of:
              0.8945443 = queryWeight, product of:
                1.4146769 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.069243915 = queryNorm
              5.7074614 = fieldWeight in 1440, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=1440)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Liu, W.; Dog(an, R.I.; Kim, S.; Comeau, D.C.; Kim, W.; Yeganova, L.; Lu, Z.; Wilbur, W.J.: Author name disambiguation for PubMed (2014) 0.40
    0.39630324 = sum of:
      0.39630324 = product of:
        0.99075806 = sum of:
          0.017837796 = weight(abstract_txt:learning in 1240) [ClassicSimilarity], result of:
            0.017837796 = score(doc=1240,freq=2.0), product of:
              0.04854726 = queryWeight, product of:
                1.0981675 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.009305135 = queryNorm
              0.36743158 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.0057163998 = weight(abstract_txt:based in 1240) [ClassicSimilarity], result of:
            0.0057163998 = score(doc=1240,freq=1.0), product of:
              0.03278884 = queryWeight, product of:
                1.1053375 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009305135 = queryNorm
              0.1743398 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.014352584 = weight(abstract_txt:improve in 1240) [ClassicSimilarity], result of:
            0.014352584 = score(doc=1240,freq=1.0), product of:
              0.0529136 = queryWeight, product of:
                1.146489 = boost
                4.9599204 = idf(docFreq=842, maxDocs=44218)
                0.009305135 = queryNorm
              0.27124566 = fieldWeight in 1240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9599204 = idf(docFreq=842, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.024465723 = weight(abstract_txt:machine in 1240) [ClassicSimilarity], result of:
            0.024465723 = score(doc=1240,freq=2.0), product of:
              0.05992977 = queryWeight, product of:
                1.220134 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.009305135 = queryNorm
              0.4082399 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.024772704 = weight(abstract_txt:performance in 1240) [ClassicSimilarity], result of:
            0.024772704 = score(doc=1240,freq=2.0), product of:
              0.069175124 = queryWeight, product of:
                1.6054871 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.009305135 = queryNorm
              0.3581158 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.061556507 = weight(abstract_txt:author in 1240) [ClassicSimilarity], result of:
            0.061556507 = score(doc=1240,freq=8.0), product of:
              0.07994604 = queryWeight, product of:
                1.7259585 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.009305135 = queryNorm
              0.76997566 = fieldWeight in 1240, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.049527377 = weight(abstract_txt:names in 1240) [ClassicSimilarity], result of:
            0.049527377 = score(doc=1240,freq=2.0), product of:
              0.109782025 = queryWeight, product of:
                2.0225418 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.009305135 = queryNorm
              0.45114288 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.07842535 = weight(abstract_txt:pairs in 1240) [ClassicSimilarity], result of:
            0.07842535 = score(doc=1240,freq=2.0), product of:
              0.14914383 = queryWeight, product of:
                2.3574066 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.009305135 = queryNorm
              0.525837 = fieldWeight in 1240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.24171726 = weight(abstract_txt:disambiguation in 1240) [ClassicSimilarity], result of:
            0.24171726 = score(doc=1240,freq=3.0), product of:
              0.3476581 = queryWeight, product of:
                5.0900617 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.009305135 = queryNorm
              0.6952729 = fieldWeight in 1240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
          0.47238636 = weight(abstract_txt:name in 1240) [ClassicSimilarity], result of:
            0.47238636 = score(doc=1240,freq=7.0), product of:
              0.56816715 = queryWeight, product of:
                10.625987 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.009305135 = queryNorm
              0.83142143 = fieldWeight in 1240, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1240)
        0.4 = coord(10/25)
    
  2. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.28
    0.28160766 = sum of:
      0.28160766 = product of:
        1.1733652 = sum of:
          0.011315537 = weight(abstract_txt:based in 3706) [ClassicSimilarity], result of:
            0.011315537 = score(doc=3706,freq=3.0), product of:
              0.03278884 = queryWeight, product of:
                1.1053375 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009305135 = queryNorm
              0.3451033 = fieldWeight in 3706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.07812356 = weight(abstract_txt:feature in 3706) [ClassicSimilarity], result of:
            0.07812356 = score(doc=3706,freq=8.0), product of:
              0.074893475 = queryWeight, product of:
                1.3639808 = boost
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.009305135 = queryNorm
              1.0431291 = fieldWeight in 3706, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.9008293 = idf(docFreq=328, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.0534889 = weight(abstract_txt:groups in 3706) [ClassicSimilarity], result of:
            0.0534889 = score(doc=3706,freq=1.0), product of:
              0.16781682 = queryWeight, product of:
                3.5364263 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.009305135 = queryNorm
              0.31873384 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.44677138 = weight(abstract_txt:ethnicity in 3706) [ClassicSimilarity], result of:
            0.44677138 = score(doc=3706,freq=5.0), product of:
              0.35293618 = queryWeight, product of:
                4.187447 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.009305135 = queryNorm
              1.2658702 = fieldWeight in 3706, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.23023781 = weight(abstract_txt:ethnic in 3706) [ClassicSimilarity], result of:
            0.23023781 = score(doc=3706,freq=1.0), product of:
              0.44406253 = queryWeight, product of:
                5.752663 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.009305135 = queryNorm
              0.5184806 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.35342798 = weight(abstract_txt:name in 3706) [ClassicSimilarity], result of:
            0.35342798 = score(doc=3706,freq=3.0), product of:
              0.56816715 = queryWeight, product of:
                10.625987 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.009305135 = queryNorm
              0.6220493 = fieldWeight in 3706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
        0.24 = coord(6/25)
    
  3. Kim, J.; Diesner, J.: Distortive effects of initial-based name disambiguation on measurements of large-scale coauthorship networks (2016) 0.26
    0.2585473 = sum of:
      0.2585473 = product of:
        0.92338324 = sum of:
          0.048817832 = weight(abstract_txt:coauthor in 2936) [ClassicSimilarity], result of:
            0.048817832 = score(doc=2936,freq=1.0), product of:
              0.08689503 = queryWeight, product of:
                1.038888 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009305135 = queryNorm
              0.5618023 = fieldWeight in 2936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.014608295 = weight(abstract_txt:based in 2936) [ClassicSimilarity], result of:
            0.014608295 = score(doc=2936,freq=5.0), product of:
              0.03278884 = queryWeight, product of:
                1.1053375 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009305135 = queryNorm
              0.44552645 = fieldWeight in 2936, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.02001937 = weight(abstract_txt:performance in 2936) [ClassicSimilarity], result of:
            0.02001937 = score(doc=2936,freq=1.0), product of:
              0.069175124 = queryWeight, product of:
                1.6054871 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.009305135 = queryNorm
              0.28940126 = fieldWeight in 2936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.035175145 = weight(abstract_txt:author in 2936) [ClassicSimilarity], result of:
            0.035175145 = score(doc=2936,freq=2.0), product of:
              0.07994604 = queryWeight, product of:
                1.7259585 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.009305135 = queryNorm
              0.43998608 = fieldWeight in 2936, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.04002417 = weight(abstract_txt:names in 2936) [ClassicSimilarity], result of:
            0.04002417 = score(doc=2936,freq=1.0), product of:
              0.109782025 = queryWeight, product of:
                2.0225418 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.009305135 = queryNorm
              0.36457852 = fieldWeight in 2936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.356635 = weight(abstract_txt:disambiguation in 2936) [ClassicSimilarity], result of:
            0.356635 = score(doc=2936,freq=5.0), product of:
              0.3476581 = queryWeight, product of:
                5.0900617 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.009305135 = queryNorm
              1.0258211 = fieldWeight in 2936, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
          0.40810344 = weight(abstract_txt:name in 2936) [ClassicSimilarity], result of:
            0.40810344 = score(doc=2936,freq=4.0), product of:
              0.56816715 = queryWeight, product of:
                10.625987 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.009305135 = queryNorm
              0.7182806 = fieldWeight in 2936, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=2936)
        0.28 = coord(7/25)
    
  4. Kim, J.(im); Kim, J.(enna): Effect of forename string on author name disambiguation (2020) 0.25
    0.25201458 = sum of:
      0.25201458 = product of:
        0.9000521 = sum of:
          0.014415117 = weight(abstract_txt:learning in 5930) [ClassicSimilarity], result of:
            0.014415117 = score(doc=5930,freq=1.0), product of:
              0.04854726 = queryWeight, product of:
                1.0981675 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.009305135 = queryNorm
              0.29692957 = fieldWeight in 5930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.006533028 = weight(abstract_txt:based in 5930) [ClassicSimilarity], result of:
            0.006533028 = score(doc=5930,freq=1.0), product of:
              0.03278884 = queryWeight, product of:
                1.1053375 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009305135 = queryNorm
              0.19924548 = fieldWeight in 5930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.01977129 = weight(abstract_txt:machine in 5930) [ClassicSimilarity], result of:
            0.01977129 = score(doc=5930,freq=1.0), product of:
              0.05992977 = queryWeight, product of:
                1.220134 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.009305135 = queryNorm
              0.32990766 = fieldWeight in 5930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.02831166 = weight(abstract_txt:performance in 5930) [ClassicSimilarity], result of:
            0.02831166 = score(doc=5930,freq=2.0), product of:
              0.069175124 = queryWeight, product of:
                1.6054871 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.009305135 = queryNorm
              0.40927517 = fieldWeight in 5930, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.05561679 = weight(abstract_txt:author in 5930) [ClassicSimilarity], result of:
            0.05561679 = score(doc=5930,freq=5.0), product of:
              0.07994604 = queryWeight, product of:
                1.7259585 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.009305135 = queryNorm
              0.69567907 = fieldWeight in 5930, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.42197624 = weight(abstract_txt:disambiguation in 5930) [ClassicSimilarity], result of:
            0.42197624 = score(doc=5930,freq=7.0), product of:
              0.3476581 = queryWeight, product of:
                5.0900617 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.009305135 = queryNorm
              1.2137679 = fieldWeight in 5930, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
          0.35342798 = weight(abstract_txt:name in 5930) [ClassicSimilarity], result of:
            0.35342798 = score(doc=5930,freq=3.0), product of:
              0.56816715 = queryWeight, product of:
                10.625987 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.009305135 = queryNorm
              0.6220493 = fieldWeight in 5930, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=5930)
        0.28 = coord(7/25)
    
  5. Torvik, V.I.; Weeber, M.; Swanson, D.R.; Smalheiser, N.R.: ¬A probabilistic similarity metric for medline mecords : a model for author name disambiguation (2005) 0.19
    0.19327773 = sum of:
      0.19327773 = product of:
        0.69027764 = sum of:
          0.048817832 = weight(abstract_txt:coauthor in 3308) [ClassicSimilarity], result of:
            0.048817832 = score(doc=3308,freq=1.0), product of:
              0.08689503 = queryWeight, product of:
                1.038888 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009305135 = queryNorm
              0.5618023 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.006533028 = weight(abstract_txt:based in 3308) [ClassicSimilarity], result of:
            0.006533028 = score(doc=3308,freq=1.0), product of:
              0.03278884 = queryWeight, product of:
                1.1053375 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009305135 = queryNorm
              0.19924548 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.015592709 = weight(abstract_txt:individual in 3308) [ClassicSimilarity], result of:
            0.015592709 = score(doc=3308,freq=1.0), product of:
              0.051156443 = queryWeight, product of:
                1.1272919 = boost
                4.8768706 = idf(docFreq=915, maxDocs=44218)
                0.009305135 = queryNorm
              0.3048044 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8768706 = idf(docFreq=915, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.04308058 = weight(abstract_txt:author in 3308) [ClassicSimilarity], result of:
            0.04308058 = score(doc=3308,freq=3.0), product of:
              0.07994604 = queryWeight, product of:
                1.7259585 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.009305135 = queryNorm
              0.5388707 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.05660272 = weight(abstract_txt:names in 3308) [ClassicSimilarity], result of:
            0.05660272 = score(doc=3308,freq=2.0), product of:
              0.109782025 = queryWeight, product of:
                2.0225418 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.009305135 = queryNorm
              0.51559186 = fieldWeight in 3308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.063377246 = weight(abstract_txt:pairs in 3308) [ClassicSimilarity], result of:
            0.063377246 = score(doc=3308,freq=1.0), product of:
              0.14914383 = queryWeight, product of:
                2.3574066 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.009305135 = queryNorm
              0.42494047 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
          0.45627353 = weight(abstract_txt:name in 3308) [ClassicSimilarity], result of:
            0.45627353 = score(doc=3308,freq=5.0), product of:
              0.56816715 = queryWeight, product of:
                10.625987 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.009305135 = queryNorm
              0.80306214 = fieldWeight in 3308, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=3308)
        0.28 = coord(7/25)