Document (#38892)

Author
Thomas, B.
Title
Name disambiguation : learning from more user-friendly models
Source
Cataloging and classification quarterly. 49(2011) no.3, S.223-232
Year
2011
Abstract
Library catalogs do not provide catalog users with the assistance they need to easily and confidently select the person they are interested in. Examples are provided of Web services that do a better job of helping information seekers differentiate the person they are seeking from those with similar names. Some of the reasons for this failure in library catalogs are examined. This article then looks at how much information is necessary to help users disambiguate names, how that information could be captured and shared, and some ways the information could be displayed in library catalogs.
Theme
Formalerschließung

Similar documents (author)

  1. Thomas, D.: Book indexing principles and standards (1989) 4.66
    4.659586 = sum of:
      4.659586 = weight(author_txt:thomas in 865) [ClassicSimilarity], result of:
        4.659586 = fieldWeight in 865, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4553375 = idf(docFreq=67, maxDocs=43254)
          0.625 = fieldNorm(doc=865)
    
  2. Thomas, A.R.: Options in the arrangement of library materials and the new edition of the Bliss Bibliographic Classification (1992) 4.66
    4.659586 = sum of:
      4.659586 = weight(author_txt:thomas in 3934) [ClassicSimilarity], result of:
        4.659586 = fieldWeight in 3934, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4553375 = idf(docFreq=67, maxDocs=43254)
          0.625 = fieldNorm(doc=3934)
    
  3. Thomas, A.: Bliss regained : the second edition of the Bliss Bibliographic Classification (1993) 4.66
    4.659586 = sum of:
      4.659586 = weight(author_txt:thomas in 5077) [ClassicSimilarity], result of:
        4.659586 = fieldWeight in 5077, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4553375 = idf(docFreq=67, maxDocs=43254)
          0.625 = fieldNorm(doc=5077)
    
  4. Thomas, S.E.: CatTutor: a prototypical hypertext tutorial for catalogers (1992) 4.66
    4.659586 = sum of:
      4.659586 = weight(author_txt:thomas in 2453) [ClassicSimilarity], result of:
        4.659586 = fieldWeight in 2453, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4553375 = idf(docFreq=67, maxDocs=43254)
          0.625 = fieldNorm(doc=2453)
    
  5. Thomas, A.R.: CAPS (Counseling and Personnel Services Clearinghouse) : the work of ERIC Clearinghouse. (1989) 4.66
    4.659586 = sum of:
      4.659586 = weight(author_txt:thomas in 2606) [ClassicSimilarity], result of:
        4.659586 = fieldWeight in 2606, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4553375 = idf(docFreq=67, maxDocs=43254)
          0.625 = fieldNorm(doc=2606)
    

Similar documents (content)

  1. Vu, Q.M.; Takasu, A.; Adachi, J.: Improving the performance of personal name disambiguation using web directories (2008) 0.24
    0.24350819 = sum of:
      0.24350819 = product of:
        0.8696721 = sum of:
          0.097786516 = weight(abstract_txt:name in 4109) [ClassicSimilarity], result of:
            0.097786516 = score(doc=4109,freq=3.0), product of:
              0.12566547 = queryWeight, product of:
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.021852624 = queryNorm
              0.7781494 = fieldWeight in 4109, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.038229898 = weight(abstract_txt:users in 4109) [ClassicSimilarity], result of:
            0.038229898 = score(doc=4109,freq=2.0), product of:
              0.096903436 = queryWeight, product of:
                1.2418714 = boost
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.021852624 = queryNorm
              0.3945154 = fieldWeight in 4109, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.16720979 = weight(abstract_txt:disambiguation in 4109) [ClassicSimilarity], result of:
            0.16720979 = score(doc=4109,freq=2.0), product of:
              0.20570026 = queryWeight, product of:
                1.2794092 = boost
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.021852624 = queryNorm
              0.81288075 = fieldWeight in 4109, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.27938274 = weight(abstract_txt:disambiguate in 4109) [ClassicSimilarity], result of:
            0.27938274 = score(doc=4109,freq=2.0), product of:
              0.28964105 = queryWeight, product of:
                1.5181758 = boost
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.021852624 = queryNorm
              0.9645826 = fieldWeight in 4109, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.016974913 = weight(abstract_txt:information in 4109) [ClassicSimilarity], result of:
            0.016974913 = score(doc=4109,freq=1.0), product of:
              0.08952866 = queryWeight, product of:
                1.688119 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.021852624 = queryNorm
              0.18960312 = fieldWeight in 4109, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.11862202 = weight(abstract_txt:names in 4109) [ClassicSimilarity], result of:
            0.11862202 = score(doc=4109,freq=1.0), product of:
              0.25973108 = queryWeight, product of:
                2.033148 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.021852624 = queryNorm
              0.4567109 = fieldWeight in 4109, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
          0.15146627 = weight(abstract_txt:person in 4109) [ClassicSimilarity], result of:
            0.15146627 = score(doc=4109,freq=1.0), product of:
              0.30569687 = queryWeight, product of:
                2.205731 = boost
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.021852624 = queryNorm
              0.49547863 = fieldWeight in 4109, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.078125 = fieldNorm(doc=4109)
        0.28 = coord(7/25)
    
  2. Delgado, A.D.; Martínez, R.; Montalvo, S.; Fresno, V.: Person name disambiguation in the Web using adaptive threshold clustering (2017) 0.17
    0.16519782 = sum of:
      0.16519782 = product of:
        0.6883243 = sum of:
          0.056457072 = weight(abstract_txt:name in 5159) [ClassicSimilarity], result of:
            0.056457072 = score(doc=5159,freq=1.0), product of:
              0.12566547 = queryWeight, product of:
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.021852624 = queryNorm
              0.4492648 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
          0.11823518 = weight(abstract_txt:disambiguation in 5159) [ClassicSimilarity], result of:
            0.11823518 = score(doc=5159,freq=1.0), product of:
              0.20570026 = queryWeight, product of:
                1.2794092 = boost
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.021852624 = queryNorm
              0.5747935 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
          0.06514501 = weight(abstract_txt:could in 5159) [ClassicSimilarity], result of:
            0.06514501 = score(doc=5159,freq=1.0), product of:
              0.17418115 = queryWeight, product of:
                1.6649746 = boost
                4.7872925 = idf(docFreq=979, maxDocs=43254)
                0.021852624 = queryNorm
              0.37400723 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7872925 = idf(docFreq=979, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
          0.067517735 = weight(abstract_txt:they in 5159) [ClassicSimilarity], result of:
            0.067517735 = score(doc=5159,freq=2.0), product of:
              0.16207378 = queryWeight, product of:
                1.9670212 = boost
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.021852624 = queryNorm
              0.41658643 = fieldWeight in 5159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
          0.11862202 = weight(abstract_txt:names in 5159) [ClassicSimilarity], result of:
            0.11862202 = score(doc=5159,freq=1.0), product of:
              0.25973108 = queryWeight, product of:
                2.033148 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.021852624 = queryNorm
              0.4567109 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
          0.26234728 = weight(abstract_txt:person in 5159) [ClassicSimilarity], result of:
            0.26234728 = score(doc=5159,freq=3.0), product of:
              0.30569687 = queryWeight, product of:
                2.205731 = boost
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.021852624 = queryNorm
              0.8581942 = fieldWeight in 5159, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.078125 = fieldNorm(doc=5159)
        0.24 = coord(6/25)
    
  3. Crane, G.; Jones, A.: Text, information, knowledge and the evolving record of humanity (2006) 0.16
    0.16262133 = sum of:
      0.16262133 = product of:
        0.40655333 = sum of:
          0.044184648 = weight(abstract_txt:name in 3183) [ClassicSimilarity], result of:
            0.044184648 = score(doc=3183,freq=5.0), product of:
              0.12566547 = queryWeight, product of:
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.021852624 = queryNorm
              0.35160533 = fieldWeight in 3183, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.013380464 = weight(abstract_txt:users in 3183) [ClassicSimilarity], result of:
            0.013380464 = score(doc=3183,freq=2.0), product of:
              0.096903436 = queryWeight, product of:
                1.2418714 = boost
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.021852624 = queryNorm
              0.13808039 = fieldWeight in 3183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.014669335 = weight(abstract_txt:some in 3183) [ClassicSimilarity], result of:
            0.014669335 = score(doc=3183,freq=2.0), product of:
              0.10303038 = queryWeight, product of:
                1.2805297 = boost
                3.6819005 = idf(docFreq=2959, maxDocs=43254)
                0.021852624 = queryNorm
              0.14237873 = fieldWeight in 3183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6819005 = idf(docFreq=2959, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.04207497 = weight(abstract_txt:captured in 3183) [ClassicSimilarity], result of:
            0.04207497 = score(doc=3183,freq=1.0), product of:
              0.20798926 = queryWeight, product of:
                1.286508 = boost
                7.398179 = idf(docFreq=71, maxDocs=43254)
                0.021852624 = queryNorm
              0.20229396 = fieldWeight in 3183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.398179 = idf(docFreq=71, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.020147556 = weight(abstract_txt:library in 3183) [ClassicSimilarity], result of:
            0.020147556 = score(doc=3183,freq=4.0), product of:
              0.1156628 = queryWeight, product of:
                1.6616881 = boost
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.021852624 = queryNorm
              0.17419219 = fieldWeight in 3183, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.032245133 = weight(abstract_txt:could in 3183) [ClassicSimilarity], result of:
            0.032245133 = score(doc=3183,freq=2.0), product of:
              0.17418115 = queryWeight, product of:
                1.6649746 = boost
                4.7872925 = idf(docFreq=979, maxDocs=43254)
                0.021852624 = queryNorm
              0.18512413 = fieldWeight in 3183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7872925 = idf(docFreq=979, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.0145529555 = weight(abstract_txt:information in 3183) [ClassicSimilarity], result of:
            0.0145529555 = score(doc=3183,freq=6.0), product of:
              0.08952866 = queryWeight, product of:
                1.688119 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.021852624 = queryNorm
              0.1625508 = fieldWeight in 3183, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.023631208 = weight(abstract_txt:they in 3183) [ClassicSimilarity], result of:
            0.023631208 = score(doc=3183,freq=2.0), product of:
              0.16207378 = queryWeight, product of:
                1.9670212 = boost
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.021852624 = queryNorm
              0.14580525 = fieldWeight in 3183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.10984552 = weight(abstract_txt:names in 3183) [ClassicSimilarity], result of:
            0.10984552 = score(doc=3183,freq=7.0), product of:
              0.25973108 = queryWeight, product of:
                2.033148 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.021852624 = queryNorm
              0.4229202 = fieldWeight in 3183, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
          0.091821544 = weight(abstract_txt:person in 3183) [ClassicSimilarity], result of:
            0.091821544 = score(doc=3183,freq=3.0), product of:
              0.30569687 = queryWeight, product of:
                2.205731 = boost
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.021852624 = queryNorm
              0.30036795 = fieldWeight in 3183, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3421264 = idf(docFreq=206, maxDocs=43254)
                0.02734375 = fieldNorm(doc=3183)
        0.4 = coord(10/25)
    
  4. Sardo, L.: Multiple names (2004) 0.16
    0.15509807 = sum of:
      0.15509807 = product of:
        0.7754903 = sum of:
          0.090331316 = weight(abstract_txt:name in 117) [ClassicSimilarity], result of:
            0.090331316 = score(doc=117,freq=1.0), product of:
              0.12566547 = queryWeight, product of:
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.021852624 = queryNorm
              0.7188237 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7505894 = idf(docFreq=373, maxDocs=43254)
                0.125 = fieldNorm(doc=117)
          0.047418453 = weight(abstract_txt:some in 117) [ClassicSimilarity], result of:
            0.047418453 = score(doc=117,freq=1.0), product of:
              0.10303038 = queryWeight, product of:
                1.2805297 = boost
                3.6819005 = idf(docFreq=2959, maxDocs=43254)
                0.021852624 = queryNorm
              0.46023756 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6819005 = idf(docFreq=2959, maxDocs=43254)
                0.125 = fieldNorm(doc=117)
          0.046051558 = weight(abstract_txt:library in 117) [ClassicSimilarity], result of:
            0.046051558 = score(doc=117,freq=1.0), product of:
              0.1156628 = queryWeight, product of:
                1.6616881 = boost
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.021852624 = queryNorm
              0.39815357 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.125 = fieldNorm(doc=117)
          0.26841098 = weight(abstract_txt:names in 117) [ClassicSimilarity], result of:
            0.26841098 = score(doc=117,freq=2.0), product of:
              0.25973108 = queryWeight, product of:
                2.033148 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.021852624 = queryNorm
              1.0334188 = fieldWeight in 117, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.125 = fieldNorm(doc=117)
          0.323278 = weight(abstract_txt:catalogs in 117) [ClassicSimilarity], result of:
            0.323278 = score(doc=117,freq=1.0), product of:
              0.42404792 = queryWeight, product of:
                3.1817067 = boost
                6.098896 = idf(docFreq=263, maxDocs=43254)
                0.021852624 = queryNorm
              0.762362 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.098896 = idf(docFreq=263, maxDocs=43254)
                0.125 = fieldNorm(doc=117)
        0.2 = coord(5/25)
    
  5. Senzig, D.: Library catalogs for library users (1984) 0.14
    0.14461343 = sum of:
      0.14461343 = product of:
        0.72306716 = sum of:
          0.07569134 = weight(abstract_txt:users in 831) [ClassicSimilarity], result of:
            0.07569134 = score(doc=831,freq=4.0), product of:
              0.096903436 = queryWeight, product of:
                1.2418714 = boost
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.021852624 = queryNorm
              0.7811007 = fieldWeight in 831, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.570746 = idf(docFreq=3307, maxDocs=43254)
                0.109375 = fieldNorm(doc=831)
          0.056985892 = weight(abstract_txt:library in 831) [ClassicSimilarity], result of:
            0.056985892 = score(doc=831,freq=2.0), product of:
              0.1156628 = queryWeight, product of:
                1.6616881 = boost
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.021852624 = queryNorm
              0.4926899 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1852286 = idf(docFreq=4863, maxDocs=43254)
                0.109375 = fieldNorm(doc=831)
          0.03360861 = weight(abstract_txt:information in 831) [ClassicSimilarity], result of:
            0.03360861 = score(doc=831,freq=2.0), product of:
              0.08952866 = queryWeight, product of:
                1.688119 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.021852624 = queryNorm
              0.37539503 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.109375 = fieldNorm(doc=831)
          0.06683915 = weight(abstract_txt:they in 831) [ClassicSimilarity], result of:
            0.06683915 = score(doc=831,freq=1.0), product of:
              0.16207378 = queryWeight, product of:
                1.9670212 = boost
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.021852624 = queryNorm
              0.41239956 = fieldWeight in 831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7705102 = idf(docFreq=2708, maxDocs=43254)
                0.109375 = fieldNorm(doc=831)
          0.4899422 = weight(abstract_txt:catalogs in 831) [ClassicSimilarity], result of:
            0.4899422 = score(doc=831,freq=3.0), product of:
              0.42404792 = queryWeight, product of:
                3.1817067 = boost
                6.098896 = idf(docFreq=263, maxDocs=43254)
                0.021852624 = queryNorm
              1.1553935 = fieldWeight in 831, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.098896 = idf(docFreq=263, maxDocs=43254)
                0.109375 = fieldNorm(doc=831)
        0.2 = coord(5/25)