Document (#4210)

Author
Robertson, A.M.
Willett, P.
Title
Retrieval techniques for historical English text : searching the sixteenth and seventeenth century titles in the Catalogue of Caterbury Cathedral Library using spelling-correction methods
Source
Online information 92. Proc. of the 16th Int. Online Information Meeting, London, 8-10.12.1992. Ed. by David I. Raitt
Imprint
Oxford : Learned Information Ltd.
Year
1992
Pages
S.389-398
Abstract
A range of techniques has been developed for the correction of misspellings in machine readable texts. Discusses the use of such techniques for the identification of words in the sixteenth and seventeenth century titles from the Catalogue of Canterbury Cathedral Library that are most similar to query words in modern English. The experiments used digram matching, non phonetic coding, and dynamic programming methods for spelling correction. These allow very high recall searches to be carried out, although the latter methods are very demanding of computer resources

Similar documents (author)

  1. Robertson, A.M.; Willett, P.: Generation of equifrequent groups of words using a genetic algorithm (1994) 5.43
    5.426506 = sum of:
      5.426506 = sum of:
        2.3344345 = weight(author_txt:robertson in 8158) [ClassicSimilarity], result of:
          2.3344345 = score(doc=8158,freq=1.0), product of:
            0.63827175 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.087256856 = queryNorm
            3.6574304 = fieldWeight in 8158, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.5 = fieldNorm(doc=8158)
        3.0920718 = weight(author_txt:willett in 8158) [ClassicSimilarity], result of:
          3.0920718 = score(doc=8158,freq=1.0), product of:
            0.76981115 = queryWeight, product of:
              1.0982199 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.087256856 = queryNorm
            4.0166626 = fieldWeight in 8158, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.5 = fieldNorm(doc=8158)
    
  2. Robertson, A.M.; Willett, P.: Identification of word-variants in historical text databases : report for the period October 1990 to September 1992 (1994) 5.43
    5.426506 = sum of:
      5.426506 = sum of:
        2.3344345 = weight(author_txt:robertson in 939) [ClassicSimilarity], result of:
          2.3344345 = score(doc=939,freq=1.0), product of:
            0.63827175 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.087256856 = queryNorm
            3.6574304 = fieldWeight in 939, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.5 = fieldNorm(doc=939)
        3.0920718 = weight(author_txt:willett in 939) [ClassicSimilarity], result of:
          3.0920718 = score(doc=939,freq=1.0), product of:
            0.76981115 = queryWeight, product of:
              1.0982199 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.087256856 = queryNorm
            4.0166626 = fieldWeight in 939, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.5 = fieldNorm(doc=939)
    
  3. Robertson, A.M.; Willett, P.: Use of genetic algorithms in information retrieval (1995) 5.43
    5.426506 = sum of:
      5.426506 = sum of:
        2.3344345 = weight(author_txt:robertson in 2418) [ClassicSimilarity], result of:
          2.3344345 = score(doc=2418,freq=1.0), product of:
            0.63827175 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.087256856 = queryNorm
            3.6574304 = fieldWeight in 2418, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.5 = fieldNorm(doc=2418)
        3.0920718 = weight(author_txt:willett in 2418) [ClassicSimilarity], result of:
          3.0920718 = score(doc=2418,freq=1.0), product of:
            0.76981115 = queryWeight, product of:
              1.0982199 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.087256856 = queryNorm
            4.0166626 = fieldWeight in 2418, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.5 = fieldNorm(doc=2418)
    
  4. Robertson, M.; Willett, P.: ¬An upperbound to the performance of ranked output searching : optimal weighting of query terms using a genetic algorithms (1996) 5.43
    5.426506 = sum of:
      5.426506 = sum of:
        2.3344345 = weight(author_txt:robertson in 6977) [ClassicSimilarity], result of:
          2.3344345 = score(doc=6977,freq=1.0), product of:
            0.63827175 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.087256856 = queryNorm
            3.6574304 = fieldWeight in 6977, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.5 = fieldNorm(doc=6977)
        3.0920718 = weight(author_txt:willett in 6977) [ClassicSimilarity], result of:
          3.0920718 = score(doc=6977,freq=1.0), product of:
            0.76981115 = queryWeight, product of:
              1.0982199 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.087256856 = queryNorm
            4.0166626 = fieldWeight in 6977, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.5 = fieldNorm(doc=6977)
    
  5. Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998) 5.43
    5.426506 = sum of:
      5.426506 = sum of:
        2.3344345 = weight(author_txt:robertson in 4715) [ClassicSimilarity], result of:
          2.3344345 = score(doc=4715,freq=1.0), product of:
            0.63827175 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.087256856 = queryNorm
            3.6574304 = fieldWeight in 4715, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.5 = fieldNorm(doc=4715)
        3.0920718 = weight(author_txt:willett in 4715) [ClassicSimilarity], result of:
          3.0920718 = score(doc=4715,freq=1.0), product of:
            0.76981115 = queryWeight, product of:
              1.0982199 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.087256856 = queryNorm
            4.0166626 = fieldWeight in 4715, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.5 = fieldNorm(doc=4715)
    

Similar documents (content)

  1. Bakar, Z.A.; Sembok, T.M.T.; Yusoff, M.: ¬An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on Malay texts (2000) 0.39
    0.39227098 = sum of:
      0.39227098 = product of:
        1.0896416 = sum of:
          0.03517675 = weight(abstract_txt:recall in 4804) [ClassicSimilarity], result of:
            0.03517675 = score(doc=4804,freq=1.0), product of:
              0.07832214 = queryWeight, product of:
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.013623962 = queryNorm
              0.44912907 = fieldWeight in 4804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.05029746 = weight(abstract_txt:dynamic in 4804) [ClassicSimilarity], result of:
            0.05029746 = score(doc=4804,freq=2.0), product of:
              0.078898385 = queryWeight, product of:
                1.0036719 = boost
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.013623962 = queryNorm
              0.6374967 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.040957075 = weight(abstract_txt:matching in 4804) [ClassicSimilarity], result of:
            0.040957075 = score(doc=4804,freq=1.0), product of:
              0.086682886 = queryWeight, product of:
                1.0520209 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.013623962 = queryNorm
              0.4724932 = fieldWeight in 4804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.08339998 = weight(abstract_txt:programming in 4804) [ClassicSimilarity], result of:
            0.08339998 = score(doc=4804,freq=2.0), product of:
              0.11053031 = queryWeight, product of:
                1.1879506 = boost
                6.829353 = idf(docFreq=129, maxDocs=44218)
                0.013623962 = queryNorm
              0.754544 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.829353 = idf(docFreq=129, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.11359669 = weight(abstract_txt:words in 4804) [ClassicSimilarity], result of:
            0.11359669 = score(doc=4804,freq=4.0), product of:
              0.13581504 = queryWeight, product of:
                1.8622872 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.013623962 = queryNorm
              0.8364073 = fieldWeight in 4804, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.06859909 = weight(abstract_txt:methods in 4804) [ClassicSimilarity], result of:
            0.06859909 = score(doc=4804,freq=3.0), product of:
              0.12225303 = queryWeight, product of:
                2.1639547 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013623962 = queryNorm
              0.56112385 = fieldWeight in 4804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.05162795 = weight(abstract_txt:techniques in 4804) [ClassicSimilarity], result of:
            0.05162795 = score(doc=4804,freq=1.0), product of:
              0.14588514 = queryWeight, product of:
                2.3638716 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013623962 = queryNorm
              0.3538945 = fieldWeight in 4804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.23479816 = weight(abstract_txt:spelling in 4804) [ClassicSimilarity], result of:
            0.23479816 = score(doc=4804,freq=2.0), product of:
              0.27765822 = queryWeight, product of:
                2.6627352 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.013623962 = queryNorm
              0.8456373 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
          0.41118833 = weight(abstract_txt:correction in 4804) [ClassicSimilarity], result of:
            0.41118833 = score(doc=4804,freq=2.0), product of:
              0.46178344 = queryWeight, product of:
                4.2056923 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.013623962 = queryNorm
              0.89043546 = fieldWeight in 4804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.078125 = fieldNorm(doc=4804)
        0.36 = coord(9/25)
    
  2. Blair, A: Too much to know : managing scholarly information before the modern age (2011) 0.25
    0.25272214 = sum of:
      0.25272214 = product of:
        0.9025791 = sum of:
          0.050863493 = weight(abstract_txt:modern in 4474) [ClassicSimilarity], result of:
            0.050863493 = score(doc=4474,freq=2.0), product of:
              0.079489216 = queryWeight, product of:
                1.0074229 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.013623962 = queryNorm
              0.63987917 = fieldWeight in 4474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.04414082 = weight(abstract_txt:very in 4474) [ClassicSimilarity], result of:
            0.04414082 = score(doc=4474,freq=1.0), product of:
              0.11480241 = queryWeight, product of:
                1.7121753 = boost
                4.921521 = idf(docFreq=875, maxDocs=44218)
                0.013623962 = queryNorm
              0.38449383 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.921521 = idf(docFreq=875, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.06702181 = weight(abstract_txt:century in 4474) [ClassicSimilarity], result of:
            0.06702181 = score(doc=4474,freq=1.0), product of:
              0.151659 = queryWeight, product of:
                1.9679171 = boost
                5.6566324 = idf(docFreq=419, maxDocs=44218)
                0.013623962 = queryNorm
              0.4419244 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6566324 = idf(docFreq=419, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.0396057 = weight(abstract_txt:methods in 4474) [ClassicSimilarity], result of:
            0.0396057 = score(doc=4474,freq=1.0), product of:
              0.12225303 = queryWeight, product of:
                2.1639547 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013623962 = queryNorm
              0.32396498 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.05162795 = weight(abstract_txt:techniques in 4474) [ClassicSimilarity], result of:
            0.05162795 = score(doc=4474,freq=1.0), product of:
              0.14588514 = queryWeight, product of:
                2.3638716 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013623962 = queryNorm
              0.3538945 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.28946856 = weight(abstract_txt:seventeenth in 4474) [ClassicSimilarity], result of:
            0.28946856 = score(doc=4474,freq=1.0), product of:
              0.40221506 = queryWeight, product of:
                3.204807 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.013623962 = queryNorm
              0.71968603 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.35985082 = weight(abstract_txt:sixteenth in 4474) [ClassicSimilarity], result of:
            0.35985082 = score(doc=4474,freq=1.0), product of:
              0.4650208 = queryWeight, product of:
                3.4459496 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.013623962 = queryNorm
              0.7738381 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
        0.28 = coord(7/25)
    
  3. Drabenstott, K.M.; Weller, M.S.: Handling spelling errors in online catalog searches (1996) 0.21
    0.20749086 = sum of:
      0.20749086 = product of:
        0.8645453 = sum of:
          0.19064939 = weight(abstract_txt:misspellings in 5973) [ClassicSimilarity], result of:
            0.19064939 = score(doc=5973,freq=3.0), product of:
              0.19443329 = queryWeight, product of:
                1.5755893 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.013623962 = queryNorm
              0.98053885 = fieldWeight in 5973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.03873147 = weight(abstract_txt:catalogue in 5973) [ClassicSimilarity], result of:
            0.03873147 = score(doc=5973,freq=1.0), product of:
              0.12209749 = queryWeight, product of:
                1.7657373 = boost
                5.0754814 = idf(docFreq=750, maxDocs=44218)
                0.013623962 = queryNorm
              0.3172176 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0754814 = idf(docFreq=750, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.06425999 = weight(abstract_txt:words in 5973) [ClassicSimilarity], result of:
            0.06425999 = score(doc=5973,freq=2.0), product of:
              0.13581504 = queryWeight, product of:
                1.8622872 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.013623962 = queryNorm
              0.47314343 = fieldWeight in 5973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.041302357 = weight(abstract_txt:techniques in 5973) [ClassicSimilarity], result of:
            0.041302357 = score(doc=5973,freq=1.0), product of:
              0.14588514 = queryWeight, product of:
                2.3638716 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013623962 = queryNorm
              0.2831156 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.29699883 = weight(abstract_txt:spelling in 5973) [ClassicSimilarity], result of:
            0.29699883 = score(doc=5973,freq=5.0), product of:
              0.27765822 = queryWeight, product of:
                2.6627352 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.013623962 = queryNorm
              1.0696561 = fieldWeight in 5973, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.23260324 = weight(abstract_txt:correction in 5973) [ClassicSimilarity], result of:
            0.23260324 = score(doc=5973,freq=1.0), product of:
              0.46178344 = queryWeight, product of:
                4.2056923 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.013623962 = queryNorm
              0.50370634 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
        0.24 = coord(6/25)
    
  4. Robertson, A.M.; Willett, P.: Identification of word-variants in historical text databases : report for the period October 1990 to September 1992 (1994) 0.20
    0.19990408 = sum of:
      0.19990408 = product of:
        0.9995204 = sum of:
          0.057545476 = weight(abstract_txt:modern in 939) [ClassicSimilarity], result of:
            0.057545476 = score(doc=939,freq=1.0), product of:
              0.079489216 = queryWeight, product of:
                1.0074229 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.013623962 = queryNorm
              0.7239407 = fieldWeight in 939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.125 = fieldNorm(doc=939)
          0.12851998 = weight(abstract_txt:words in 939) [ClassicSimilarity], result of:
            0.12851998 = score(doc=939,freq=2.0), product of:
              0.13581504 = queryWeight, product of:
                1.8622872 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.013623962 = queryNorm
              0.94628686 = fieldWeight in 939, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.125 = fieldNorm(doc=939)
          0.082604714 = weight(abstract_txt:techniques in 939) [ClassicSimilarity], result of:
            0.082604714 = score(doc=939,freq=1.0), product of:
              0.14588514 = queryWeight, product of:
                2.3638716 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013623962 = queryNorm
              0.5662312 = fieldWeight in 939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.125 = fieldNorm(doc=939)
          0.2656438 = weight(abstract_txt:spelling in 939) [ClassicSimilarity], result of:
            0.2656438 = score(doc=939,freq=1.0), product of:
              0.27765822 = queryWeight, product of:
                2.6627352 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.013623962 = queryNorm
              0.9567295 = fieldWeight in 939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.125 = fieldNorm(doc=939)
          0.46520647 = weight(abstract_txt:correction in 939) [ClassicSimilarity], result of:
            0.46520647 = score(doc=939,freq=1.0), product of:
              0.46178344 = queryWeight, product of:
                4.2056923 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.013623962 = queryNorm
              1.0074127 = fieldWeight in 939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.125 = fieldNorm(doc=939)
        0.2 = coord(5/25)
    
  5. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.14
    0.13730253 = sum of:
      0.13730253 = product of:
        0.4290704 = sum of:
          0.028141402 = weight(abstract_txt:recall in 614) [ClassicSimilarity], result of:
            0.028141402 = score(doc=614,freq=1.0), product of:
              0.07832214 = queryWeight, product of:
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.013623962 = queryNorm
              0.35930327 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.029572332 = weight(abstract_txt:identification in 614) [ClassicSimilarity], result of:
            0.029572332 = score(doc=614,freq=1.0), product of:
              0.08095515 = queryWeight, product of:
                1.0166699 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.013623962 = queryNorm
              0.3652928 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.03276566 = weight(abstract_txt:matching in 614) [ClassicSimilarity], result of:
            0.03276566 = score(doc=614,freq=1.0), product of:
              0.086682886 = queryWeight, product of:
                1.0520209 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.013623962 = queryNorm
              0.37799457 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.009586575 = weight(abstract_txt:library in 614) [ClassicSimilarity], result of:
            0.009586575 = score(doc=614,freq=1.0), product of:
              0.048132647 = queryWeight, product of:
                1.1086452 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.013623962 = queryNorm
              0.19916992 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.11007148 = weight(abstract_txt:misspellings in 614) [ClassicSimilarity], result of:
            0.11007148 = score(doc=614,freq=1.0), product of:
              0.19443329 = queryWeight, product of:
                1.5755893 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.013623962 = queryNorm
              0.56611437 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.044808738 = weight(abstract_txt:methods in 614) [ClassicSimilarity], result of:
            0.044808738 = score(doc=614,freq=2.0), product of:
              0.12225303 = queryWeight, product of:
                2.1639547 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.013623962 = queryNorm
              0.36652455 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.041302357 = weight(abstract_txt:techniques in 614) [ClassicSimilarity], result of:
            0.041302357 = score(doc=614,freq=1.0), product of:
              0.14588514 = queryWeight, product of:
                2.3638716 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.013623962 = queryNorm
              0.2831156 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.1328219 = weight(abstract_txt:spelling in 614) [ClassicSimilarity], result of:
            0.1328219 = score(doc=614,freq=1.0), product of:
              0.27765822 = queryWeight, product of:
                2.6627352 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.013623962 = queryNorm
              0.47836474 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
        0.32 = coord(8/25)