Document (#22812)

Author
French, J.C.
Powell, A.L.
Schulman, E.
Title
Using clustering strategies for creating authority files
Source
Journal of the American Society for Information Science. 51(2000) no.8, p.774-786
Year
2000
Abstract
As more online databases are integrated into digital libraries, the issue of quality control of the data becomes increasingly important, especially as it relates to the effective retrieval of information. Authority work, the need to discover and reconcile variant forms of strings in bibliographical entries, will become more critical in the future. Spelling variants, misspellings, and transliteration differences will all increase the difficulty of retrieving information. We investigate a number of approximate string matching techniques that have traditionally been used to help with this problem. We then introduce the notion of approximate word matching and show how it can be used to improve detection and categorization of variant forms. We demonstrate the utility of these approaches using data from the Astrophysics Data System and show how we can reduce the human effort involved in the creation of authority files.
Theme
Authority files
Computational linguistics
Retrieval algorithms

Similar documents (author)

  1. French, J.C.; Knight, J.C.; Powell, A.L.: Applying hypertext structures to software documentation (1997) 4.76
    4.760104 = sum of:
      4.760104 = sum of:
        2.1237423 = weight(author_txt:powell in 3257) [ClassicSimilarity], result of:
          2.1237423 = score(doc=3257,freq=1.0), product of:
            0.65453935 = queryWeight, product of:
              8.652365 = idf(docFreq=20, maxDocs=44218)
              0.075648606 = queryNorm
            3.2446368 = fieldWeight in 3257, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.652365 = idf(docFreq=20, maxDocs=44218)
              0.375 = fieldNorm(doc=3257)
        2.6363618 = weight(author_txt:french in 3257) [ClassicSimilarity], result of:
          2.6363618 = score(doc=3257,freq=1.0), product of:
            0.756028 = queryWeight, product of:
              1.0747342 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.075648606 = queryNorm
            3.487122 = fieldWeight in 3257, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.375 = fieldNorm(doc=3257)
    
  2. French, J.C.; Powell, A.L.; Gey, F.; Perelman, N.: Exploiting manual indexing to improve collection selection and retrieval effectiveness (2002) 3.97
    3.9667537 = sum of:
      3.9667537 = sum of:
        1.7697854 = weight(author_txt:powell in 3896) [ClassicSimilarity], result of:
          1.7697854 = score(doc=3896,freq=1.0), product of:
            0.65453935 = queryWeight, product of:
              8.652365 = idf(docFreq=20, maxDocs=44218)
              0.075648606 = queryNorm
            2.703864 = fieldWeight in 3896, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.652365 = idf(docFreq=20, maxDocs=44218)
              0.3125 = fieldNorm(doc=3896)
        2.1969683 = weight(author_txt:french in 3896) [ClassicSimilarity], result of:
          2.1969683 = score(doc=3896,freq=1.0), product of:
            0.756028 = queryWeight, product of:
              1.0747342 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.075648606 = queryNorm
            2.905935 = fieldWeight in 3896, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.3125 = fieldNorm(doc=3896)
    
  3. French, J.: Changes in reference services (1995) 2.20
    2.1969683 = sum of:
      2.1969683 = product of:
        4.3939366 = sum of:
          4.3939366 = weight(author_txt:french in 3680) [ClassicSimilarity], result of:
            4.3939366 = score(doc=3680,freq=1.0), product of:
              0.756028 = queryWeight, product of:
                1.0747342 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.075648606 = queryNorm
              5.81187 = fieldWeight in 3680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=3680)
        0.5 = coord(1/2)
    
  4. Powell, A.P.: ZYindex: bringing order to electronic chaos (1989) 1.77
    1.7697854 = sum of:
      1.7697854 = product of:
        3.5395708 = sum of:
          3.5395708 = weight(author_txt:powell in 3233) [ClassicSimilarity], result of:
            3.5395708 = score(doc=3233,freq=1.0), product of:
              0.65453935 = queryWeight, product of:
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.075648606 = queryNorm
              5.407728 = fieldWeight in 3233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.625 = fieldNorm(doc=3233)
        0.5 = coord(1/2)
    
  5. Powell, J.: Spinning the World-Wide Web : an HTML primer (1995) 1.77
    1.7697854 = sum of:
      1.7697854 = product of:
        3.5395708 = sum of:
          3.5395708 = weight(author_txt:powell in 6013) [ClassicSimilarity], result of:
            3.5395708 = score(doc=6013,freq=1.0), product of:
              0.65453935 = queryWeight, product of:
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.075648606 = queryNorm
              5.407728 = fieldWeight in 6013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.625 = fieldNorm(doc=6013)
        0.5 = coord(1/2)
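
The score trees above follow Lucene's ClassicSimilarity (TF-IDF) pattern: each term score is queryWeight times fieldWeight, and entries that match only one of the two author terms are scaled by coord(1/2). The sketch below recomputes two of the listed scores from the values shown in the trees. The function names are illustrative and not part of any library, and the idf and tf formulas (1 + ln(maxDocs / (docFreq + 1)) and sqrt(freq)) are assumed from ClassicSimilarity's documented defaults; they reproduce the listed idf values to within rounding.

```python
import math

MAX_DOCS = 44218          # maxDocs as shown in the score trees
QUERY_NORM = 0.075648606  # queryNorm of the author query, copied from the listing


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed ClassicSimilarity default: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0, query_norm=QUERY_NORM):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# Entry 1 (doc 3257): term "powell", docFreq=20, fieldNorm=0.375, no boost
powell_3257 = term_score(freq=1.0, doc_freq=20, field_norm=0.375)

# Entry 3 (doc 3680): only "french" matches, so the sum is scaled by coord(1/2)
french_3680 = term_score(freq=1.0, doc_freq=10, field_norm=0.625, boost=1.0747342)
total_3680 = french_3680 * (1 / 2)

print(f"powell in 3257: {powell_3257:.4f} (listing shows 2.1237423)")
print(f"doc 3680 total: {total_3680:.4f} (listing shows 2.1969683)")
```

Small differences in the last digits are expected, since the listing reflects single-precision arithmetic while this sketch uses Python floats.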
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.42
    0.4183879 = sum of:
      0.4183879 = product of:
        1.1621885 = sum of:
          0.014791193 = weight(abstract_txt:used in 614) [ClassicSimilarity], result of:
            0.014791193 = score(doc=614,freq=1.0), product of:
              0.07044895 = queryWeight, product of:
                1.0193685 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.020572858 = queryNorm
              0.2099562 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.07206274 = weight(abstract_txt:string in 614) [ClassicSimilarity], result of:
            0.07206274 = score(doc=614,freq=1.0), product of:
              0.1606952 = queryWeight, product of:
                1.0886302 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020572858 = queryNorm
              0.44844365 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.16508086 = weight(abstract_txt:variants in 614) [ClassicSimilarity], result of:
            0.16508086 = score(doc=614,freq=4.0), product of:
              0.17591661 = queryWeight, product of:
                1.1390227 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020572858 = queryNorm
              0.9384041 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.0874711 = weight(abstract_txt:spelling in 614) [ClassicSimilarity], result of:
            0.0874711 = score(doc=614,freq=1.0), product of:
              0.1828544 = queryWeight, product of:
                1.1612659 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.020572858 = queryNorm
              0.47836474 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.14497718 = weight(abstract_txt:misspellings in 614) [ClassicSimilarity], result of:
            0.14497718 = score(doc=614,freq=1.0), product of:
              0.25609168 = queryWeight, product of:
                1.3742846 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.020572858 = queryNorm
              0.56611437 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.12678158 = weight(abstract_txt:forms in 614) [ClassicSimilarity], result of:
            0.12678158 = score(doc=614,freq=4.0), product of:
              0.18587592 = queryWeight, product of:
                1.655791 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.020572858 = queryNorm
              0.6820764 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.08631251 = weight(abstract_txt:matching in 614) [ClassicSimilarity], result of:
            0.08631251 = score(doc=614,freq=1.0), product of:
              0.22834326 = queryWeight, product of:
                1.8352196 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.020572858 = queryNorm
              0.37799457 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.27305406 = weight(abstract_txt:variant in 614) [ClassicSimilarity], result of:
            0.27305406 = score(doc=614,freq=3.0), product of:
              0.3411911 = queryWeight, product of:
                2.243328 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.020572858 = queryNorm
              0.8002965 = fieldWeight in 614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.19165727 = weight(abstract_txt:approximate in 614) [ClassicSimilarity], result of:
            0.19165727 = score(doc=614,freq=1.0), product of:
              0.38864753 = queryWeight, product of:
                2.3942633 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.020572858 = queryNorm
              0.49313906 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
        0.36 = coord(9/25)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: A generic Web-based entity resolution framework (2011) 0.32
    0.3201652 = sum of:
      0.3201652 = product of:
        0.88934773 = sum of:
          0.08254043 = weight(abstract_txt:variants in 4450) [ClassicSimilarity], result of:
            0.08254043 = score(doc=4450,freq=1.0), product of:
              0.17591661 = queryWeight, product of:
                1.1390227 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020572858 = queryNorm
              0.46920204 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.0874711 = weight(abstract_txt:spelling in 4450) [ClassicSimilarity], result of:
            0.0874711 = score(doc=4450,freq=1.0), product of:
              0.1828544 = queryWeight, product of:
                1.1612659 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.020572858 = queryNorm
              0.47836474 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.04719323 = weight(abstract_txt:show in 4450) [ClassicSimilarity], result of:
            0.04719323 = score(doc=4450,freq=2.0), product of:
              0.12118499 = queryWeight, product of:
                1.3369598 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.020572858 = queryNorm
              0.3894313 = fieldWeight in 4450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.14497718 = weight(abstract_txt:misspellings in 4450) [ClassicSimilarity], result of:
            0.14497718 = score(doc=4450,freq=1.0), product of:
              0.25609168 = queryWeight, product of:
                1.3742846 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.020572858 = queryNorm
              0.56611437 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.030738125 = weight(abstract_txt:data in 4450) [ClassicSimilarity], result of:
            0.030738125 = score(doc=4450,freq=2.0), product of:
              0.10423439 = queryWeight, product of:
                1.518606 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020572858 = queryNorm
              0.29489428 = fieldWeight in 4450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.08964811 = weight(abstract_txt:forms in 4450) [ClassicSimilarity], result of:
            0.08964811 = score(doc=4450,freq=2.0), product of:
              0.18587592 = queryWeight, product of:
                1.655791 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.020572858 = queryNorm
              0.48230085 = fieldWeight in 4450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.07304111 = weight(abstract_txt:files in 4450) [ClassicSimilarity], result of:
            0.07304111 = score(doc=4450,freq=1.0), product of:
              0.20429164 = queryWeight, product of:
                1.7358782 = boost
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.020572858 = queryNorm
              0.3575335 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.720536 = idf(docFreq=393, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.15764783 = weight(abstract_txt:variant in 4450) [ClassicSimilarity], result of:
            0.15764783 = score(doc=4450,freq=1.0), product of:
              0.3411911 = queryWeight, product of:
                2.243328 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.020572858 = queryNorm
              0.4620514 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
          0.17609067 = weight(abstract_txt:authority in 4450) [ClassicSimilarity], result of:
            0.17609067 = score(doc=4450,freq=4.0), product of:
              0.26487464 = queryWeight, product of:
                2.4208047 = boost
                5.318461 = idf(docFreq=588, maxDocs=44218)
                0.020572858 = queryNorm
              0.6648076 = fieldWeight in 4450, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.318461 = idf(docFreq=588, maxDocs=44218)
                0.0625 = fieldNorm(doc=4450)
        0.36 = coord(9/25)
    
  3. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.32
    0.31973937 = sum of:
      0.31973937 = product of:
        0.99918556 = sum of:
          0.014791193 = weight(abstract_txt:used in 3223) [ClassicSimilarity], result of:
            0.014791193 = score(doc=3223,freq=1.0), product of:
              0.07044895 = queryWeight, product of:
                1.0193685 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.020572858 = queryNorm
              0.2099562 = fieldWeight in 3223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.015363626 = weight(abstract_txt:more in 3223) [ClassicSimilarity], result of:
            0.015363626 = score(doc=3223,freq=1.0), product of:
              0.072255045 = queryWeight, product of:
                1.0323526 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.020572858 = queryNorm
              0.2126305 = fieldWeight in 3223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.01620538 = weight(abstract_txt:using in 3223) [ClassicSimilarity], result of:
            0.01620538 = score(doc=3223,freq=1.0), product of:
              0.07487069 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020572858 = queryNorm
              0.21644491 = fieldWeight in 3223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.14412548 = weight(abstract_txt:string in 3223) [ClassicSimilarity], result of:
            0.14412548 = score(doc=3223,freq=4.0), product of:
              0.1606952 = queryWeight, product of:
                1.0886302 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020572858 = queryNorm
              0.8968873 = fieldWeight in 3223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.14296421 = weight(abstract_txt:variants in 3223) [ClassicSimilarity], result of:
            0.14296421 = score(doc=3223,freq=3.0), product of:
              0.17591661 = queryWeight, product of:
                1.1390227 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020572858 = queryNorm
              0.81268173 = fieldWeight in 3223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.10979607 = weight(abstract_txt:forms in 3223) [ClassicSimilarity], result of:
            0.10979607 = score(doc=3223,freq=3.0), product of:
              0.18587592 = queryWeight, product of:
                1.655791 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.020572858 = queryNorm
              0.5906955 = fieldWeight in 3223, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.17262502 = weight(abstract_txt:matching in 3223) [ClassicSimilarity], result of:
            0.17262502 = score(doc=3223,freq=4.0), product of:
              0.22834326 = queryWeight, product of:
                1.8352196 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.020572858 = queryNorm
              0.75598913 = fieldWeight in 3223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
          0.38331455 = weight(abstract_txt:approximate in 3223) [ClassicSimilarity], result of:
            0.38331455 = score(doc=3223,freq=4.0), product of:
              0.38864753 = queryWeight, product of:
                2.3942633 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.020572858 = queryNorm
              0.9862781 = fieldWeight in 3223, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=3223)
        0.32 = coord(8/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.28
    0.2758657 = sum of:
      0.2758657 = product of:
        0.86208034 = sum of:
          0.01620538 = weight(abstract_txt:using in 2372) [ClassicSimilarity], result of:
            0.01620538 = score(doc=2372,freq=1.0), product of:
              0.07487069 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020572858 = queryNorm
              0.21644491 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.1019121 = weight(abstract_txt:string in 2372) [ClassicSimilarity], result of:
            0.1019121 = score(doc=2372,freq=2.0), product of:
              0.1606952 = queryWeight, product of:
                1.0886302 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020572858 = queryNorm
              0.6341951 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.08254043 = weight(abstract_txt:variants in 2372) [ClassicSimilarity], result of:
            0.08254043 = score(doc=2372,freq=1.0), product of:
              0.17591661 = queryWeight, product of:
                1.1390227 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020572858 = queryNorm
              0.46920204 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.1237028 = weight(abstract_txt:spelling in 2372) [ClassicSimilarity], result of:
            0.1237028 = score(doc=2372,freq=2.0), product of:
              0.1828544 = queryWeight, product of:
                1.1612659 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.020572858 = queryNorm
              0.67650986 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.1906274 = weight(abstract_txt:transliteration in 2372) [ClassicSimilarity], result of:
            0.1906274 = score(doc=2372,freq=3.0), product of:
              0.2131141 = queryWeight, product of:
                1.2536752 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.020572858 = queryNorm
              0.8944852 = fieldWeight in 2372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.03337065 = weight(abstract_txt:show in 2372) [ClassicSimilarity], result of:
            0.03337065 = score(doc=2372,freq=1.0), product of:
              0.12118499 = queryWeight, product of:
                1.3369598 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.020572858 = queryNorm
              0.27536952 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.12206432 = weight(abstract_txt:matching in 2372) [ClassicSimilarity], result of:
            0.12206432 = score(doc=2372,freq=2.0), product of:
              0.22834326 = queryWeight, product of:
                1.8352196 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.020572858 = queryNorm
              0.53456503 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.19165727 = weight(abstract_txt:approximate in 2372) [ClassicSimilarity], result of:
            0.19165727 = score(doc=2372,freq=1.0), product of:
              0.38864753 = queryWeight, product of:
                2.3942633 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.020572858 = queryNorm
              0.49313906 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
        0.32 = coord(8/25)
    
  5. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.25
    0.25131854 = sum of:
      0.25131854 = product of:
        0.8975662 = sum of:
          0.031376857 = weight(abstract_txt:used in 3886) [ClassicSimilarity], result of:
            0.031376857 = score(doc=3886,freq=2.0), product of:
              0.07044895 = queryWeight, product of:
                1.0193685 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.020572858 = queryNorm
              0.44538432 = fieldWeight in 3886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.02430807 = weight(abstract_txt:using in 3886) [ClassicSimilarity], result of:
            0.02430807 = score(doc=3886,freq=1.0), product of:
              0.07487069 = queryWeight, product of:
                1.0508721 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.020572858 = queryNorm
              0.32466736 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.12381065 = weight(abstract_txt:variants in 3886) [ClassicSimilarity], result of:
            0.12381065 = score(doc=3886,freq=1.0), product of:
              0.17591661 = queryWeight, product of:
                1.1390227 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.020572858 = queryNorm
              0.70380306 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.05005598 = weight(abstract_txt:show in 3886) [ClassicSimilarity], result of:
            0.05005598 = score(doc=3886,freq=1.0), product of:
              0.12118499 = queryWeight, product of:
                1.3369598 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.020572858 = queryNorm
              0.4130543 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.046107188 = weight(abstract_txt:data in 3886) [ClassicSimilarity], result of:
            0.046107188 = score(doc=3886,freq=2.0), product of:
              0.10423439 = queryWeight, product of:
                1.518606 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020572858 = queryNorm
              0.44234142 = fieldWeight in 3886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.33442155 = weight(abstract_txt:variant in 3886) [ClassicSimilarity], result of:
            0.33442155 = score(doc=3886,freq=2.0), product of:
              0.3411911 = queryWeight, product of:
                2.243328 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.020572858 = queryNorm
              0.98015904 = fieldWeight in 3886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
          0.28748593 = weight(abstract_txt:approximate in 3886) [ClassicSimilarity], result of:
            0.28748593 = score(doc=3886,freq=1.0), product of:
              0.38864753 = queryWeight, product of:
                2.3942633 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.020572858 = queryNorm
              0.7397086 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.09375 = fieldNorm(doc=3886)
        0.28 = coord(7/25)