Document (#32616)

Author
Galvez, C.
Moya-Anegón, F.
Title
Approximate personal name-matching through finite-state graphs
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.13, pp.1960-1976
Year
2007
Abstract
This article shows how finite-state methods can be employed in a new and different task: the conflation of personal name variants into standard forms. In bibliographic databases and citation index systems, variant forms create problems of inaccuracy that affect information retrieval, the quality of information from databases, and the citation statistics used for the evaluation of scientists' work. A number of approximate string matching techniques have been developed to validate variant forms, based on similarity and equivalence relations. We classify the personal name variants as nonvalid and valid forms. In establishing an equivalence relation between valid variants and the standard form of their equivalence class, we defend the application of finite-state transducers. The process of variant identification requires the elaboration of (a) binary matrices and (b) finite-state graphs. This procedure was tested on samples of author names from bibliographic records, selected from the Library and Information Science Abstracts and Science Citation Index Expanded databases. The evaluation involved calculating the measures of precision and recall, based on completeness and accuracy. The results demonstrate the usefulness of this approach, although it should be complemented with methods based on similarity relations for the recognition of spelling variants and misspellings.

Similar documents (author)

  1. Anegón, F. de Moya -> Moya Anegón, F. de: 5.12
    5.118066 = sum of:
      5.118066 = sum of:
        2.500372 = weight(author_txt:moya in 3524) [ClassicSimilarity], result of:
          2.500372 = score(doc=3524,freq=2.0), product of:
            0.6962184 = queryWeight, product of:
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.08567446 = queryNorm
            3.5913615 = fieldWeight in 3524, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.3125 = fieldNorm(doc=3524)
        2.6176941 = weight(author_txt:anegón in 3524) [ClassicSimilarity], result of:
          2.6176941 = score(doc=3524,freq=2.0), product of:
            0.7178301 = queryWeight, product of:
              1.0154022 = boost
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.08567446 = queryNorm
            3.6466763 = fieldWeight in 3524, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.3125 = fieldNorm(doc=3524)
    
  2. Bornmann, L.; Moya Anegón, F. de: What proportion of excellent papers makes an institution one of the best worldwide? : Specifying thresholds for the interpretation of the results of the SCImago Institutions Ranking and the Leiden Ranking (2014) 5.07
    5.0666265 = sum of:
      5.0666265 = sum of:
        2.475242 = weight(author_txt:moya in 3236) [ClassicSimilarity], result of:
          2.475242 = score(doc=3236,freq=1.0), product of:
            0.6962184 = queryWeight, product of:
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.08567446 = queryNorm
            3.5552666 = fieldWeight in 3236, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.4375 = fieldNorm(doc=3236)
        2.591385 = weight(author_txt:anegón in 3236) [ClassicSimilarity], result of:
          2.591385 = score(doc=3236,freq=1.0), product of:
            0.7178301 = queryWeight, product of:
              1.0154022 = boost
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.08567446 = queryNorm
            3.6100254 = fieldWeight in 3236, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.4375 = fieldNorm(doc=3236)
    
  3. Guerrero, V.P.; Moya Anegón, F. de: Reduction of the dimension of a document space using the fuzzified output of a Kohonen network (2001) 4.34
    4.342823 = sum of:
      4.342823 = sum of:
        2.121636 = weight(author_txt:moya in 936) [ClassicSimilarity], result of:
          2.121636 = score(doc=936,freq=1.0), product of:
            0.6962184 = queryWeight, product of:
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.08567446 = queryNorm
            3.0473714 = fieldWeight in 936, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.375 = fieldNorm(doc=936)
        2.221187 = weight(author_txt:anegón in 936) [ClassicSimilarity], result of:
          2.221187 = score(doc=936,freq=1.0), product of:
            0.7178301 = queryWeight, product of:
              1.0154022 = boost
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.08567446 = queryNorm
            3.0943074 = fieldWeight in 936, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.375 = fieldNorm(doc=936)
    
  4. Moya Anegón, F. de; López-Huertas, M.J.: An automatic model for updating the conceptual structure of a scientific discipline (2000) 4.34
    4.342823 = sum of:
      4.342823 = sum of:
        2.121636 = weight(author_txt:moya in 1127) [ClassicSimilarity], result of:
          2.121636 = score(doc=1127,freq=1.0), product of:
            0.6962184 = queryWeight, product of:
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.08567446 = queryNorm
            3.0473714 = fieldWeight in 1127, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.375 = fieldNorm(doc=1127)
        2.221187 = weight(author_txt:anegón in 1127) [ClassicSimilarity], result of:
          2.221187 = score(doc=1127,freq=1.0), product of:
            0.7178301 = queryWeight, product of:
              1.0154022 = boost
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.08567446 = queryNorm
            3.0943074 = fieldWeight in 1127, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.375 = fieldNorm(doc=1127)
    
  5. Herrero-Solana, V.; Moya Anegón, F. de: Graphical Table of Contents (GTOC) for library collections : the application of UDC codes for the subject maps (2003) 4.34
    4.342823 = sum of:
      4.342823 = sum of:
        2.121636 = weight(author_txt:moya in 3759) [ClassicSimilarity], result of:
          2.121636 = score(doc=3759,freq=1.0), product of:
            0.6962184 = queryWeight, product of:
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.08567446 = queryNorm
            3.0473714 = fieldWeight in 3759, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.126324 = idf(docFreq=33, maxDocs=42306)
              0.375 = fieldNorm(doc=3759)
        2.221187 = weight(author_txt:anegón in 3759) [ClassicSimilarity], result of:
          2.221187 = score(doc=3759,freq=1.0), product of:
            0.7178301 = queryWeight, product of:
              1.0154022 = boost
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.08567446 = queryNorm
            3.0943074 = fieldWeight in 3759, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.251487 = idf(docFreq=29, maxDocs=42306)
              0.375 = fieldNorm(doc=3759)
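
The per-term weights in the explain trees above can be recomputed by hand. The following is a minimal sketch, assuming the ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are consistent with the figures printed above. The function names are illustrative and are not part of the Lucene API; queryNorm and fieldNorm are copied from the output rather than derived. Lucene works in 32-bit floats, so the results agree only approximately.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Inverse document frequency as assumed above: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Term frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, mirroring the "product of" lines.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight

# Values copied from entry 1 (doc 3524) of the author list.
moya = term_score(2.0, 33, 42306, 0.08567446, 0.3125)
anegon = term_score(2.0, 29, 42306, 0.08567446, 0.3125, boost=1.0154022)
total = moya + anegon  # should be close to the 5.118066 reported for doc 3524
print(moya, anegon, total)
```

Because idf enters both queryWeight and fieldWeight, each term's contribution grows with the square of its idf, which is why the rarer term (anegón, docFreq=29) outweighs moya (docFreq=33) in every entry.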
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de; Solana, V.H.: Term conflation methods in information retrieval : non-linguistic and linguistic approaches (2005) 0.35
    0.3477675 = sum of:
      0.3477675 = product of:
        1.2420268 = sum of:
          0.195703 = weight(abstract_txt:conflation in 395) [ClassicSimilarity], result of:
            0.195703 = score(doc=395,freq=3.0), product of:
              0.15295517 = queryWeight, product of:
                1.1269064 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.014354686 = queryNorm
              1.2794794 = fieldWeight in 395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.046607528 = weight(abstract_txt:relations in 395) [ClassicSimilarity], result of:
            0.046607528 = score(doc=395,freq=1.0), product of:
              0.10678747 = queryWeight, product of:
                1.3316218 = boost
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.014354686 = queryNorm
              0.43645126 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.05942934 = weight(abstract_txt:matching in 395) [ClassicSimilarity], result of:
            0.05942934 = score(doc=395,freq=1.0), product of:
              0.12556933 = queryWeight, product of:
                1.4439845 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.014354686 = queryNorm
              0.47327912 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.06185497 = weight(abstract_txt:state in 395) [ClassicSimilarity], result of:
            0.06185497 = score(doc=395,freq=1.0), product of:
              0.16248353 = queryWeight, product of:
                2.322953 = boost
                4.872762 = idf(docFreq=879, maxDocs=42306)
                0.014354686 = queryNorm
              0.38068455 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.872762 = idf(docFreq=879, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.16873947 = weight(abstract_txt:equivalence in 395) [ClassicSimilarity], result of:
            0.16873947 = score(doc=395,freq=1.0), product of:
              0.28822026 = queryWeight, product of:
                2.679345 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.5854532 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.31817818 = weight(abstract_txt:variants in 395) [ClassicSimilarity], result of:
            0.31817818 = score(doc=395,freq=2.0), product of:
              0.38429365 = queryWeight, product of:
                3.5724597 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.82795584 = fieldWeight in 395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
          0.39151433 = weight(abstract_txt:finite in 395) [ClassicSimilarity], result of:
            0.39151433 = score(doc=395,freq=1.0), product of:
              0.5559786 = queryWeight, product of:
                4.296994 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.014354686 = queryNorm
              0.7041896 = fieldWeight in 395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.078125 = fieldNorm(doc=395)
        0.28 = coord(7/25)
    
  2. Galvez, C.; Moya-Anegón, F. de: An evaluation of conflation accuracy using finite-state transducers (2006) 0.34
    0.3392595 = sum of:
      0.3392595 = product of:
        1.2116411 = sum of:
          0.195703 = weight(abstract_txt:conflation in 600) [ClassicSimilarity], result of:
            0.195703 = score(doc=600,freq=3.0), product of:
              0.15295517 = queryWeight, product of:
                1.1269064 = boost
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.014354686 = queryNorm
              1.2794794 = fieldWeight in 600, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.45546 = idf(docFreq=8, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.018847015 = weight(abstract_txt:based in 600) [ClassicSimilarity], result of:
            0.018847015 = score(doc=600,freq=2.0), product of:
              0.053055663 = queryWeight, product of:
                1.1495616 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014354686 = queryNorm
              0.355231 = fieldWeight in 600, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.06185497 = weight(abstract_txt:state in 600) [ClassicSimilarity], result of:
            0.06185497 = score(doc=600,freq=1.0), product of:
              0.16248353 = queryWeight, product of:
                2.322953 = boost
                4.872762 = idf(docFreq=879, maxDocs=42306)
                0.014354686 = queryNorm
              0.38068455 = fieldWeight in 600, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.872762 = idf(docFreq=879, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.15405865 = weight(abstract_txt:forms in 600) [ClassicSimilarity], result of:
            0.15405865 = score(doc=600,freq=3.0), product of:
              0.20700298 = queryWeight, product of:
                2.6219478 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.014354686 = queryNorm
              0.74423397 = fieldWeight in 600, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.16467722 = weight(abstract_txt:variant in 600) [ClassicSimilarity], result of:
            0.16467722 = score(doc=600,freq=1.0), product of:
              0.2835757 = queryWeight, product of:
                2.657669 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.014354686 = queryNorm
              0.5807169 = fieldWeight in 600, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.22498594 = weight(abstract_txt:variants in 600) [ClassicSimilarity], result of:
            0.22498594 = score(doc=600,freq=1.0), product of:
              0.38429365 = queryWeight, product of:
                3.5724597 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.5854532 = fieldWeight in 600, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
          0.39151433 = weight(abstract_txt:finite in 600) [ClassicSimilarity], result of:
            0.39151433 = score(doc=600,freq=1.0), product of:
              0.5559786 = queryWeight, product of:
                4.296994 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.014354686 = queryNorm
              0.7041896 = fieldWeight in 600, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.078125 = fieldNorm(doc=600)
        0.28 = coord(7/25)
    
  3. French, J.C.; Powell, A.L.; Schulman, E.: Using clustering strategies for creating authority files (2000) 0.32
    0.31708598 = sum of:
      0.31708598 = product of:
        0.9908937 = sum of:
          0.00877265 = weight(abstract_txt:from in 5812) [ClassicSimilarity], result of:
            0.00877265 = score(doc=5812,freq=1.0), product of:
              0.040148307 = queryWeight, product of:
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.014354686 = queryNorm
              0.2185061 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.09787858 = weight(abstract_txt:misspellings in 5812) [ClassicSimilarity], result of:
            0.09787858 = score(doc=5812,freq=1.0), product of:
              0.13899465 = queryWeight, product of:
                1.0742486 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.014354686 = queryNorm
              0.7041896 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.08404578 = weight(abstract_txt:matching in 5812) [ClassicSimilarity], result of:
            0.08404578 = score(doc=5812,freq=2.0), product of:
              0.12556933 = queryWeight, product of:
                1.4439845 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.014354686 = queryNorm
              0.6693178 = fieldWeight in 5812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.033941295 = weight(abstract_txt:databases in 5812) [ClassicSimilarity], result of:
            0.033941295 = score(doc=5812,freq=1.0), product of:
              0.09894617 = queryWeight, product of:
                1.5698779 = boost
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.014354686 = queryNorm
              0.3430279 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.390757 = idf(docFreq=1424, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.18259229 = weight(abstract_txt:approximate in 5812) [ClassicSimilarity], result of:
            0.18259229 = score(doc=5812,freq=2.0), product of:
              0.21063372 = queryWeight, product of:
                1.8701855 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.014354686 = queryNorm
              0.8668711 = fieldWeight in 5812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.12578838 = weight(abstract_txt:forms in 5812) [ClassicSimilarity], result of:
            0.12578838 = score(doc=5812,freq=2.0), product of:
              0.20700298 = queryWeight, product of:
                2.6219478 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.014354686 = queryNorm
              0.6076645 = fieldWeight in 5812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.23288876 = weight(abstract_txt:variant in 5812) [ClassicSimilarity], result of:
            0.23288876 = score(doc=5812,freq=2.0), product of:
              0.2835757 = queryWeight, product of:
                2.657669 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.014354686 = queryNorm
              0.82125777 = fieldWeight in 5812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
          0.22498594 = weight(abstract_txt:variants in 5812) [ClassicSimilarity], result of:
            0.22498594 = score(doc=5812,freq=1.0), product of:
              0.38429365 = queryWeight, product of:
                3.5724597 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.5854532 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.078125 = fieldNorm(doc=5812)
        0.32 = coord(8/25)
    
  4. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.22
    0.22303979 = sum of:
      0.22303979 = product of:
        0.79657066 = sum of:
          0.0099251205 = weight(abstract_txt:from in 142) [ClassicSimilarity], result of:
            0.0099251205 = score(doc=142,freq=2.0), product of:
              0.040148307 = queryWeight, product of:
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.014354686 = queryNorm
              0.24721143 = fieldWeight in 142, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.039320912 = weight(abstract_txt:index in 142) [ClassicSimilarity], result of:
            0.039320912 = score(doc=142,freq=3.0), product of:
              0.07671229 = queryWeight, product of:
                1.1286342 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.014354686 = queryNorm
              0.51257646 = fieldWeight in 142, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.010661482 = weight(abstract_txt:based in 142) [ClassicSimilarity], result of:
            0.010661482 = score(doc=142,freq=1.0), product of:
              0.053055663 = queryWeight, product of:
                1.1495616 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014354686 = queryNorm
              0.20094898 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.09508695 = weight(abstract_txt:matching in 142) [ClassicSimilarity], result of:
            0.09508695 = score(doc=142,freq=4.0), product of:
              0.12556933 = queryWeight, product of:
                1.4439845 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.014354686 = queryNorm
              0.7572466 = fieldWeight in 142, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.2065796 = weight(abstract_txt:approximate in 142) [ClassicSimilarity], result of:
            0.2065796 = score(doc=142,freq=4.0), product of:
              0.21063372 = queryWeight, product of:
                1.8701855 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.014354686 = queryNorm
              0.9807527 = fieldWeight in 142, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.123246916 = weight(abstract_txt:forms in 142) [ClassicSimilarity], result of:
            0.123246916 = score(doc=142,freq=3.0), product of:
              0.20700298 = queryWeight, product of:
                2.6219478 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.014354686 = queryNorm
              0.59538716 = fieldWeight in 142, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.31174967 = weight(abstract_txt:variants in 142) [ClassicSimilarity], result of:
            0.31174967 = score(doc=142,freq=3.0), product of:
              0.38429365 = queryWeight, product of:
                3.5724597 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.81122774 = fieldWeight in 142, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
        0.28 = coord(7/25)
    
  5. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: A generic Web-based entity resolution framework (2011) 0.16
    0.16032378 = sum of:
      0.16032378 = product of:
        0.5725849 = sum of:
          0.0099251205 = weight(abstract_txt:from in 1451) [ClassicSimilarity], result of:
            0.0099251205 = score(doc=1451,freq=2.0), product of:
              0.040148307 = queryWeight, product of:
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.014354686 = queryNorm
              0.24721143 = fieldWeight in 1451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.796878 = idf(docFreq=7014, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.07830287 = weight(abstract_txt:misspellings in 1451) [ClassicSimilarity], result of:
            0.07830287 = score(doc=1451,freq=1.0), product of:
              0.13899465 = queryWeight, product of:
                1.0742486 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.014354686 = queryNorm
              0.5633517 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.010661482 = weight(abstract_txt:based in 1451) [ClassicSimilarity], result of:
            0.010661482 = score(doc=1451,freq=1.0), product of:
              0.053055663 = queryWeight, product of:
                1.1495616 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.014354686 = queryNorm
              0.20094898 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.061334226 = weight(abstract_txt:name in 1451) [ClassicSimilarity], result of:
            0.061334226 = score(doc=1451,freq=1.0), product of:
              0.17034209 = queryWeight, product of:
                2.0598109 = boost
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.014354686 = queryNorm
              0.360065 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.76104 = idf(docFreq=361, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.10063069 = weight(abstract_txt:forms in 1451) [ClassicSimilarity], result of:
            0.10063069 = score(doc=1451,freq=2.0), product of:
              0.20700298 = queryWeight, product of:
                2.6219478 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.014354686 = queryNorm
              0.4861316 = fieldWeight in 1451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.13174178 = weight(abstract_txt:variant in 1451) [ClassicSimilarity], result of:
            0.13174178 = score(doc=1451,freq=1.0), product of:
              0.2835757 = queryWeight, product of:
                2.657669 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.014354686 = queryNorm
              0.46457353 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.17998876 = weight(abstract_txt:variants in 1451) [ClassicSimilarity], result of:
            0.17998876 = score(doc=1451,freq=1.0), product of:
              0.38429365 = queryWeight, product of:
                3.5724597 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.014354686 = queryNorm
              0.46836257 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
        0.28 = coord(7/25)
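
In the content-based list, each document score is the sum of its matching term weights multiplied by a coordination factor, coord(matched/total). The sketch below recomputes the score of entry 1 (doc 395) from the term weights printed in its explain tree, assuming coord is the plain ratio of matched query terms to the 25 query terms. The function names are illustrative and are not Lucene API.

```python
def coord(overlap, max_overlap):
    # Fraction of query terms that matched the document, e.g. 7/25 = 0.28.
    return overlap / max_overlap

def document_score(term_scores, query_terms):
    # Sum of the per-term weights, scaled by the coordination factor.
    return sum(term_scores) * coord(len(term_scores), query_terms)

# Per-term weights copied from the explain tree of doc 395:
# conflation, relations, matching, state, equivalence, variants, finite.
doc_395_terms = [
    0.195703,
    0.046607528,
    0.05942934,
    0.06185497,
    0.16873947,
    0.31817818,
    0.39151433,
]

score = document_score(doc_395_terms, 25)  # close to the 0.3477675 reported
print(score)
```

The coordination factor explains why entry 3 (doc 5812) ranks close to the first two despite a lower raw sum: it matches 8 of the 25 query terms, so its sum is scaled by 0.32 rather than 0.28.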