Document (#32615)

Author
Galvez, C.
Moya-Anegón, F.
Title
Approximate personal name-matching through finite-state graphs
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.13, S.1960-1976
Year
2007
Abstract
This article shows how finite-state methods can be employed in a new and different task: the conflation of personal name variants in standard forms. In bibliographic databases and citation index systems, variant forms create problems of inaccuracy that affect information retrieval, the quality of information from databases, and the citation statistics used for the evaluation of scientists' work. A number of approximate string matching techniques have been developed to validate variant forms, based on similarity and equivalence relations. We classify the personal name variants as nonvalid and valid forms. In establishing an equivalence relation between valid variants and the standard form of its equivalence class, we defend the application of finite-state transducers. The process of variant identification requires the elaboration of: (a) binary matrices and (b) finite-state graphs. This procedure was tested on samples of author names from bibliographic records, selected from the Library and Information Science Abstracts and Science Citation Index Expanded databases. The evaluation involved calculating the measures of precision and recall, based on completeness and accuracy. The results demonstrate the usefulness of this approach, although it should be complemented with methods based on similarity relations for the recognition of spelling variants and misspellings.

Similar documents (author)

  1. Anegón, F. de Moya -> Moya Anegón, F. de: 5.15
    5.145693 = sum of:
      5.145693 = sum of:
        2.5141854 = weight(author_txt:moya in 3455) [ClassicSimilarity], result of:
          2.5141854 = score(doc=3455,freq=2.0), product of:
            0.69627726 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.08521816 = queryNorm
            3.6108968 = fieldWeight in 3455, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.3125 = fieldNorm(doc=3455)
        2.6315076 = weight(author_txt:anegón in 3455) [ClassicSimilarity], result of:
          2.6315076 = score(doc=3455,freq=2.0), product of:
            0.717773 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.08521816 = queryNorm
            3.6662114 = fieldWeight in 3455, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.3125 = fieldNorm(doc=3455)
    
  2. Bornmann, L.; Moya Anegón, F.de: What proportion of excellent papers makes an institution one of the best worldwide? : Specifying thresholds for the interpretation of the results of the SCImago Institutions Ranking and the Leiden Ranking (2014) 5.09
    5.093976 = sum of:
      5.093976 = sum of:
        2.4889164 = weight(author_txt:moya in 1235) [ClassicSimilarity], result of:
          2.4889164 = score(doc=1235,freq=1.0), product of:
            0.69627726 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.08521816 = queryNorm
            3.5746055 = fieldWeight in 1235, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.4375 = fieldNorm(doc=1235)
        2.6050596 = weight(author_txt:anegón in 1235) [ClassicSimilarity], result of:
          2.6050596 = score(doc=1235,freq=1.0), product of:
            0.717773 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.08521816 = queryNorm
            3.6293643 = fieldWeight in 1235, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.4375 = fieldNorm(doc=1235)
    
  3. Guerrero, V.P.; Moya Anegón, F. de: Reduction of the dimension of a document space using the fuzzified output of a Kohonen network (2001) 4.37
    4.3662653 = sum of:
      4.3662653 = sum of:
        2.133357 = weight(author_txt:moya in 6935) [ClassicSimilarity], result of:
          2.133357 = score(doc=6935,freq=1.0), product of:
            0.69627726 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.08521816 = queryNorm
            3.0639474 = fieldWeight in 6935, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.375 = fieldNorm(doc=6935)
        2.2329085 = weight(author_txt:anegón in 6935) [ClassicSimilarity], result of:
          2.2329085 = score(doc=6935,freq=1.0), product of:
            0.717773 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.08521816 = queryNorm
            3.1108837 = fieldWeight in 6935, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.375 = fieldNorm(doc=6935)
    
  4. Moya Anegón, F. de; López-Huertas, M.J.: ¬An automatic model for updating the conceptual structure of a scientific discipline (2000) 4.37
    4.3662653 = sum of:
      4.3662653 = sum of:
        2.133357 = weight(author_txt:moya in 126) [ClassicSimilarity], result of:
          2.133357 = score(doc=126,freq=1.0), product of:
            0.69627726 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.08521816 = queryNorm
            3.0639474 = fieldWeight in 126, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.375 = fieldNorm(doc=126)
        2.2329085 = weight(author_txt:anegón in 126) [ClassicSimilarity], result of:
          2.2329085 = score(doc=126,freq=1.0), product of:
            0.717773 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.08521816 = queryNorm
            3.1108837 = fieldWeight in 126, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.375 = fieldNorm(doc=126)
    
  5. Herrero-Solana, V.; Moya Anegón, F. de: Graphical Table of Contents (GTOC) for library collections : the application of UDC codes for the subject maps (2003) 4.37
    4.3662653 = sum of:
      4.3662653 = sum of:
        2.133357 = weight(author_txt:moya in 2758) [ClassicSimilarity], result of:
          2.133357 = score(doc=2758,freq=1.0), product of:
            0.69627726 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.08521816 = queryNorm
            3.0639474 = fieldWeight in 2758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.375 = fieldNorm(doc=2758)
        2.2329085 = weight(author_txt:anegón in 2758) [ClassicSimilarity], result of:
          2.2329085 = score(doc=2758,freq=1.0), product of:
            0.717773 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.08521816 = queryNorm
            3.1108837 = fieldWeight in 2758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.375 = fieldNorm(doc=2758)
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de; Solana, V.H.: Term conflation methods in information retrieval : non-linguistic and linguistic approaches (2005) 0.35
    0.34711543 = sum of:
      0.34711543 = product of:
        1.2396979 = sum of:
          0.19804317 = weight(abstract_txt:conflation in 4394) [ClassicSimilarity], result of:
            0.19804317 = score(doc=4394,freq=3.0), product of:
              0.15406395 = queryWeight, product of:
                1.1356871 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.014280196 = queryNorm
              1.2854607 = fieldWeight in 4394, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.045510456 = weight(abstract_txt:relations in 4394) [ClassicSimilarity], result of:
            0.045510456 = score(doc=4394,freq=1.0), product of:
              0.10503136 = queryWeight, product of:
                1.3261195 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.014280196 = queryNorm
              0.43330348 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.059009418 = weight(abstract_txt:matching in 4394) [ClassicSimilarity], result of:
            0.059009418 = score(doc=4394,freq=1.0), product of:
              0.124889456 = queryWeight, product of:
                1.4460591 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.014280196 = queryNorm
              0.4724932 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.060112353 = weight(abstract_txt:state in 4394) [ClassicSimilarity], result of:
            0.060112353 = score(doc=4394,freq=1.0), product of:
              0.15930547 = queryWeight, product of:
                2.309689 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.014280196 = queryNorm
              0.37734017 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.17032665 = weight(abstract_txt:equivalence in 4394) [ClassicSimilarity], result of:
            0.17032665 = score(doc=4394,freq=1.0), product of:
              0.28982136 = queryWeight, product of:
                2.6979506 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.014280196 = queryNorm
              0.5876953 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.31921944 = weight(abstract_txt:variants in 4394) [ClassicSimilarity], result of:
            0.31921944 = score(doc=4394,freq=2.0), product of:
              0.38486147 = queryWeight, product of:
                3.5899663 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.014280196 = queryNorm
              0.8294398 = fieldWeight in 4394, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
          0.38747638 = weight(abstract_txt:finite in 4394) [ClassicSimilarity], result of:
            0.38747638 = score(doc=4394,freq=1.0), product of:
              0.55176187 = queryWeight, product of:
                4.298471 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.014280196 = queryNorm
              0.7022529 = fieldWeight in 4394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=4394)
        0.28 = coord(7/25)
    
  2. Galvez, C.; Moya-Anegón, F. de: ¬An evaluation of conflation accuracy using finite-state transducers (2006) 0.34
    0.33641586 = sum of:
      0.33641586 = product of:
        1.2014852 = sum of:
          0.19804317 = weight(abstract_txt:conflation in 5599) [ClassicSimilarity], result of:
            0.19804317 = score(doc=5599,freq=3.0), product of:
              0.15406395 = queryWeight, product of:
                1.1356871 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.014280196 = queryNorm
              1.2854607 = fieldWeight in 5599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.018333118 = weight(abstract_txt:based in 5599) [ClassicSimilarity], result of:
            0.018333118 = score(doc=5599,freq=2.0), product of:
              0.052050255 = queryWeight, product of:
                1.1433527 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.014280196 = queryNorm
              0.35221958 = fieldWeight in 5599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.060112353 = weight(abstract_txt:state in 5599) [ClassicSimilarity], result of:
            0.060112353 = score(doc=5599,freq=1.0), product of:
              0.15930547 = queryWeight, product of:
                2.309689 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.014280196 = queryNorm
              0.37734017 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.15012892 = weight(abstract_txt:forms in 5599) [ClassicSimilarity], result of:
            0.15012892 = score(doc=5599,freq=3.0), product of:
              0.20332496 = queryWeight, product of:
                2.609357 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.014280196 = queryNorm
              0.73836935 = fieldWeight in 5599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.161669 = weight(abstract_txt:variant in 5599) [ClassicSimilarity], result of:
            0.161669 = score(doc=5599,freq=1.0), product of:
              0.27991518 = queryWeight, product of:
                2.6514413 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.014280196 = queryNorm
              0.57756424 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.22572224 = weight(abstract_txt:variants in 5599) [ClassicSimilarity], result of:
            0.22572224 = score(doc=5599,freq=1.0), product of:
              0.38486147 = queryWeight, product of:
                3.5899663 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.014280196 = queryNorm
              0.58650255 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.38747638 = weight(abstract_txt:finite in 5599) [ClassicSimilarity], result of:
            0.38747638 = score(doc=5599,freq=1.0), product of:
              0.55176187 = queryWeight, product of:
                4.298471 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.014280196 = queryNorm
              0.7022529 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
        0.28 = coord(7/25)
    
  3. French, J.C.; Powell, A.L.; Schulman, E.: Using clustering strategies for creating authority files (2000) 0.27
    0.27417374 = sum of:
      0.27417374 = product of:
        0.97919196 = sum of:
          0.09911678 = weight(abstract_txt:misspellings in 4811) [ClassicSimilarity], result of:
            0.09911678 = score(doc=4811,freq=1.0), product of:
              0.14006609 = queryWeight, product of:
                1.0828658 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014280196 = queryNorm
              0.707643 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.08345192 = weight(abstract_txt:matching in 4811) [ClassicSimilarity], result of:
            0.08345192 = score(doc=4811,freq=2.0), product of:
              0.124889456 = queryWeight, product of:
                1.4460591 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.014280196 = queryNorm
              0.6682063 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.034381494 = weight(abstract_txt:databases in 4811) [ClassicSimilarity], result of:
            0.034381494 = score(doc=4811,freq=1.0), product of:
              0.09972984 = queryWeight, product of:
                1.5826372 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.014280196 = queryNorm
              0.3447463 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.1853053 = weight(abstract_txt:approximate in 4811) [ClassicSimilarity], result of:
            0.1853053 = score(doc=4811,freq=2.0), product of:
              0.21256582 = queryWeight, product of:
                1.8865567 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.014280196 = queryNorm
              0.8717549 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.12257975 = weight(abstract_txt:forms in 4811) [ClassicSimilarity], result of:
            0.12257975 = score(doc=4811,freq=2.0), product of:
              0.20332496 = queryWeight, product of:
                2.609357 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.014280196 = queryNorm
              0.60287607 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.22863449 = weight(abstract_txt:variant in 4811) [ClassicSimilarity], result of:
            0.22863449 = score(doc=4811,freq=2.0), product of:
              0.27991518 = queryWeight, product of:
                2.6514413 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.014280196 = queryNorm
              0.81679916 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.22572224 = weight(abstract_txt:variants in 4811) [ClassicSimilarity], result of:
            0.22572224 = score(doc=4811,freq=1.0), product of:
              0.38486147 = queryWeight, product of:
                3.5899663 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.014280196 = queryNorm
              0.58650255 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
        0.28 = coord(7/25)
    
  4. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.19
    0.18885468 = sum of:
      0.18885468 = product of:
        0.7868945 = sum of:
          0.039586592 = weight(abstract_txt:index in 3223) [ClassicSimilarity], result of:
            0.039586592 = score(doc=3223,freq=3.0), product of:
              0.0770034 = queryWeight, product of:
                1.1354764 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.014280196 = queryNorm
              0.5140889 = fieldWeight in 3223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.010370778 = weight(abstract_txt:based in 3223) [ClassicSimilarity], result of:
            0.010370778 = score(doc=3223,freq=1.0), product of:
              0.052050255 = queryWeight, product of:
                1.1433527 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.014280196 = queryNorm
              0.19924548 = fieldWeight in 3223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.09441507 = weight(abstract_txt:matching in 3223) [ClassicSimilarity], result of:
            0.09441507 = score(doc=3223,freq=4.0), product of:
              0.124889456 = queryWeight, product of:
                1.4460591 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.014280196 = queryNorm
              0.75598913 = fieldWeight in 3223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.20964903 = weight(abstract_txt:approximate in 3223) [ClassicSimilarity], result of:
            0.20964903 = score(doc=3223,freq=4.0), product of:
              0.21256582 = queryWeight, product of:
                1.8865567 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.014280196 = queryNorm
              0.9862781 = fieldWeight in 3223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.120103136 = weight(abstract_txt:forms in 3223) [ClassicSimilarity], result of:
            0.120103136 = score(doc=3223,freq=3.0), product of:
              0.20332496 = queryWeight, product of:
                2.609357 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.014280196 = queryNorm
              0.5906955 = fieldWeight in 3223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.3127699 = weight(abstract_txt:variants in 3223) [ClassicSimilarity], result of:
            0.3127699 = score(doc=3223,freq=3.0), product of:
              0.38486147 = queryWeight, product of:
                3.5899663 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.014280196 = queryNorm
              0.81268173 = fieldWeight in 3223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
        0.24 = coord(6/25)
    
  5. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 0.13
    0.1340102 = sum of:
      0.1340102 = product of:
        0.55837584 = sum of:
          0.07929342 = weight(abstract_txt:misspellings in 4450) [ClassicSimilarity], result of:
            0.07929342 = score(doc=4450,freq=1.0), product of:
              0.14006609 = queryWeight, product of:
                1.0828658 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014280196 = queryNorm
              0.56611437 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.010370778 = weight(abstract_txt:based in 4450) [ClassicSimilarity], result of:
            0.010370778 = score(doc=4450,freq=1.0), product of:
              0.052050255 = queryWeight, product of:
                1.1433527 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.014280196 = queryNorm
              0.19924548 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.0607349 = weight(abstract_txt:name in 4450) [ClassicSimilarity], result of:
            0.0607349 = score(doc=4450,freq=1.0), product of:
              0.16911191 = queryWeight, product of:
                2.0608952 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.014280196 = queryNorm
              0.3591403 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.0980638 = weight(abstract_txt:forms in 4450) [ClassicSimilarity], result of:
            0.0980638 = score(doc=4450,freq=2.0), product of:
              0.20332496 = queryWeight, product of:
                2.609357 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.014280196 = queryNorm
              0.48230085 = fieldWeight in 4450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.1293352 = weight(abstract_txt:variant in 4450) [ClassicSimilarity], result of:
            0.1293352 = score(doc=4450,freq=1.0), product of:
              0.27991518 = queryWeight, product of:
                2.6514413 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.014280196 = queryNorm
              0.4620514 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.18057778 = weight(abstract_txt:variants in 4450) [ClassicSimilarity], result of:
            0.18057778 = score(doc=4450,freq=1.0), product of:
              0.38486147 = queryWeight, product of:
                3.5899663 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.014280196 = queryNorm
              0.46920204 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
        0.24 = coord(6/25)