Document (#22813)

Author
French, J.C.
Powell, A.L.
Schulman, E.
Title
Using clustering strategies for creating authority files
Source
Journal of the American Society for Information Science. 51(2000) no.8, S.774-786
Year
2000
Abstract
As more online databases are integrated into digital libraries, the issue of quality control of the data becomes increasingly important, especially as it relates to the effective retrieval of information. Authority work, the need to discover and reconcile variant forms of strings in bibliographical entries, will become more critical in the future. Spelling variants, misspellings, and transliteration differences will all increase the difficulty of retrieving information. We investigate a number of approximate string matching techniques that have traditionally been used to help with this problem. We then introduce the notion of approximate word matching and show how it can be used to improve detection and categorization of variant forms. We demonstrate the utility of these approaches using data from the Astrophysics Data System and show how we can reduce the human effort involved in the creation of authority files
Theme
Normdateien
Computerlinguistik
Retrievalalgorithmen

Similar documents (author)

  1. French, J.C.; Knight, J.C.; Powell, A.L.: Applying hypertext structures to software documentation (1997) 4.74
    4.736663 = sum of:
      4.736663 = sum of:
        2.1120303 = weight(author_txt:powell in 3257) [ClassicSimilarity], result of:
          2.1120303 = score(doc=3257,freq=1.0), product of:
            0.6542722 = queryWeight, product of:
              8.608162 = idf(docFreq=20, maxDocs=42306)
              0.076006025 = queryNorm
            3.2280607 = fieldWeight in 3257, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.608162 = idf(docFreq=20, maxDocs=42306)
              0.375 = fieldNorm(doc=3257)
        2.6246324 = weight(author_txt:french in 3257) [ClassicSimilarity], result of:
          2.6246324 = score(doc=3257,freq=1.0), product of:
            0.7562592 = queryWeight, product of:
              1.075118 = boost
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.076006025 = queryNorm
            3.470546 = fieldWeight in 3257, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.375 = fieldNorm(doc=3257)
    
  2. French, J.C.; Powell, A.L.; Gey, F.; Perelman, N.: Exploiting manual indexing to improve collection selection and retrieval effectiveness (2002) 3.95
    3.947219 = sum of:
      3.947219 = sum of:
        1.7600253 = weight(author_txt:powell in 4897) [ClassicSimilarity], result of:
          1.7600253 = score(doc=4897,freq=1.0), product of:
            0.6542722 = queryWeight, product of:
              8.608162 = idf(docFreq=20, maxDocs=42306)
              0.076006025 = queryNorm
            2.6900506 = fieldWeight in 4897, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.608162 = idf(docFreq=20, maxDocs=42306)
              0.3125 = fieldNorm(doc=4897)
        2.1871936 = weight(author_txt:french in 4897) [ClassicSimilarity], result of:
          2.1871936 = score(doc=4897,freq=1.0), product of:
            0.7562592 = queryWeight, product of:
              1.075118 = boost
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.076006025 = queryNorm
            2.8921218 = fieldWeight in 4897, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.254789 = idf(docFreq=10, maxDocs=42306)
              0.3125 = fieldNorm(doc=4897)
    
  3. French, J.: Changes in reference services (1995) 2.19
    2.1871936 = sum of:
      2.1871936 = product of:
        4.3743873 = sum of:
          4.3743873 = weight(author_txt:french in 3749) [ClassicSimilarity], result of:
            4.3743873 = score(doc=3749,freq=1.0), product of:
              0.7562592 = queryWeight, product of:
                1.075118 = boost
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.076006025 = queryNorm
              5.7842436 = fieldWeight in 3749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.254789 = idf(docFreq=10, maxDocs=42306)
                0.625 = fieldNorm(doc=3749)
        0.5 = coord(1/2)
    
  4. Powell, A.P.: ZYindex: bringing order to electronic chaos (1989) 1.76
    1.7600253 = sum of:
      1.7600253 = product of:
        3.5200505 = sum of:
          3.5200505 = weight(author_txt:powell in 3233) [ClassicSimilarity], result of:
            3.5200505 = score(doc=3233,freq=1.0), product of:
              0.6542722 = queryWeight, product of:
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.076006025 = queryNorm
              5.380101 = fieldWeight in 3233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.625 = fieldNorm(doc=3233)
        0.5 = coord(1/2)
    
  5. Powell, J.: Spinning the World-Wide Web : an HTML primer (1995) 1.76
    1.7600253 = sum of:
      1.7600253 = product of:
        3.5200505 = sum of:
          3.5200505 = weight(author_txt:powell in 6013) [ClassicSimilarity], result of:
            3.5200505 = score(doc=6013,freq=1.0), product of:
              0.6542722 = queryWeight, product of:
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.076006025 = queryNorm
              5.380101 = fieldWeight in 6013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.608162 = idf(docFreq=20, maxDocs=42306)
                0.625 = fieldNorm(doc=6013)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.42
    0.4179298 = sum of:
      0.4179298 = product of:
        1.1609161 = sum of:
          0.015062108 = weight(abstract_txt:used in 2615) [ClassicSimilarity], result of:
            0.015062108 = score(doc=2615,freq=1.0), product of:
              0.071275964 = queryWeight, product of:
                1.015013 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.020768678 = queryNorm
              0.211321 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.071637966 = weight(abstract_txt:string in 2615) [ClassicSimilarity], result of:
            0.071637966 = score(doc=2615,freq=1.0), product of:
              0.1599944 = queryWeight, product of:
                1.0753193 = boost
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.020768678 = queryNorm
              0.44775298 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.16398509 = weight(abstract_txt:variants in 2615) [ClassicSimilarity], result of:
            0.16398509 = score(doc=2615,freq=4.0), product of:
              0.17506212 = queryWeight, product of:
                1.1248151 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.020768678 = queryNorm
              0.93672514 = fieldWeight in 2615, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.08585369 = weight(abstract_txt:spelling in 2615) [ClassicSimilarity], result of:
            0.08585369 = score(doc=2615,freq=1.0), product of:
              0.18051581 = queryWeight, product of:
                1.1422014 = boost
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.020768678 = queryNorm
              0.47560206 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.14268115 = weight(abstract_txt:misspellings in 2615) [ClassicSimilarity], result of:
            0.14268115 = score(doc=2615,freq=1.0), product of:
              0.2532719 = queryWeight, product of:
                1.3529401 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.020768678 = queryNorm
              0.5633517 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.12965952 = weight(abstract_txt:forms in 2615) [ClassicSimilarity], result of:
            0.12965952 = score(doc=2615,freq=4.0), product of:
              0.18859732 = queryWeight, product of:
                1.6510789 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.020768678 = queryNorm
              0.6874939 = fieldWeight in 2615, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.0866323 = weight(abstract_txt:matching in 2615) [ClassicSimilarity], result of:
            0.0866323 = score(doc=2615,freq=1.0), product of:
              0.22880867 = queryWeight, product of:
                1.8185962 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.020768678 = queryNorm
              0.3786233 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.2771927 = weight(abstract_txt:variant in 2615) [ClassicSimilarity], result of:
            0.2771927 = score(doc=2615,freq=3.0), product of:
              0.34448212 = queryWeight, product of:
                2.2314308 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.020768678 = queryNorm
              0.80466497 = fieldWeight in 2615, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
          0.18821159 = weight(abstract_txt:approximate in 2615) [ClassicSimilarity], result of:
            0.18821159 = score(doc=2615,freq=1.0), product of:
              0.3838105 = queryWeight, product of:
                2.3553665 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.020768678 = queryNorm
              0.49037635 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.0625 = fieldNorm(doc=2615)
        0.36 = coord(9/25)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 0.32
    0.321148 = sum of:
      0.321148 = product of:
        0.89207774 = sum of:
          0.081992544 = weight(abstract_txt:variants in 1451) [ClassicSimilarity], result of:
            0.081992544 = score(doc=1451,freq=1.0), product of:
              0.17506212 = queryWeight, product of:
                1.1248151 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.020768678 = queryNorm
              0.46836257 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.08585369 = weight(abstract_txt:spelling in 1451) [ClassicSimilarity], result of:
            0.08585369 = score(doc=1451,freq=1.0), product of:
              0.18051581 = queryWeight, product of:
                1.1422014 = boost
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.020768678 = queryNorm
              0.47560206 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.04900543 = weight(abstract_txt:show in 1451) [ClassicSimilarity], result of:
            0.04900543 = score(doc=1451,freq=2.0), product of:
              0.12421444 = queryWeight, product of:
                1.3399423 = boost
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.020768678 = queryNorm
              0.39452282 = fieldWeight in 1451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.14268115 = weight(abstract_txt:misspellings in 1451) [ClassicSimilarity], result of:
            0.14268115 = score(doc=1451,freq=1.0), product of:
              0.2532719 = queryWeight, product of:
                1.3529401 = boost
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.020768678 = queryNorm
              0.5633517 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.013627 = idf(docFreq=13, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.031995095 = weight(abstract_txt:data in 1451) [ClassicSimilarity], result of:
            0.031995095 = score(doc=1451,freq=2.0), product of:
              0.10701105 = queryWeight, product of:
                1.5232108 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.020768678 = queryNorm
              0.2989887 = fieldWeight in 1451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.09168312 = weight(abstract_txt:forms in 1451) [ClassicSimilarity], result of:
            0.09168312 = score(doc=1451,freq=2.0), product of:
              0.18859732 = queryWeight, product of:
                1.6510789 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.020768678 = queryNorm
              0.4861316 = fieldWeight in 1451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.07234102 = weight(abstract_txt:files in 1451) [ClassicSimilarity], result of:
            0.07234102 = score(doc=1451,freq=1.0), product of:
              0.20289701 = queryWeight, product of:
                1.7125288 = boost
                5.704649 = idf(docFreq=382, maxDocs=42306)
                0.020768678 = queryNorm
              0.35654056 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.704649 = idf(docFreq=382, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.16003728 = weight(abstract_txt:variant in 1451) [ClassicSimilarity], result of:
            0.16003728 = score(doc=1451,freq=1.0), product of:
              0.34448212 = queryWeight, product of:
                2.2314308 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.020768678 = queryNorm
              0.46457353 = fieldWeight in 1451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
          0.1764884 = weight(abstract_txt:authority in 1451) [ClassicSimilarity], result of:
            0.1764884 = score(doc=1451,freq=4.0), product of:
              0.26515946 = queryWeight, product of:
                2.3977244 = boost
                5.3247476 = idf(docFreq=559, maxDocs=42306)
                0.020768678 = queryNorm
              0.66559345 = fieldWeight in 1451, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3247476 = idf(docFreq=559, maxDocs=42306)
                0.0625 = fieldNorm(doc=1451)
        0.36 = coord(9/25)
    
  3. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.32
    0.31829938 = sum of:
      0.31829938 = product of:
        0.9946856 = sum of:
          0.015062108 = weight(abstract_txt:used in 142) [ClassicSimilarity], result of:
            0.015062108 = score(doc=142,freq=1.0), product of:
              0.071275964 = queryWeight, product of:
                1.015013 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.020768678 = queryNorm
              0.211321 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.015837932 = weight(abstract_txt:more in 142) [ClassicSimilarity], result of:
            0.015837932 = score(doc=142,freq=1.0), product of:
              0.07370296 = queryWeight, product of:
                1.0321493 = boost
                3.438219 = idf(docFreq=3693, maxDocs=42306)
                0.020768678 = queryNorm
              0.21488869 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.438219 = idf(docFreq=3693, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.016518136 = weight(abstract_txt:using in 142) [ClassicSimilarity], result of:
            0.016518136 = score(doc=142,freq=1.0), product of:
              0.075798385 = queryWeight, product of:
                1.0467188 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.020768678 = queryNorm
              0.217922 = fieldWeight in 142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.14327593 = weight(abstract_txt:string in 142) [ClassicSimilarity], result of:
            0.14327593 = score(doc=142,freq=4.0), product of:
              0.1599944 = queryWeight, product of:
                1.0753193 = boost
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.020768678 = queryNorm
              0.89550596 = fieldWeight in 142, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.14201525 = weight(abstract_txt:variants in 142) [ClassicSimilarity], result of:
            0.14201525 = score(doc=142,freq=3.0), product of:
              0.17506212 = queryWeight, product of:
                1.1248151 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.020768678 = queryNorm
              0.81122774 = fieldWeight in 142, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.11228842 = weight(abstract_txt:forms in 142) [ClassicSimilarity], result of:
            0.11228842 = score(doc=142,freq=3.0), product of:
              0.18859732 = queryWeight, product of:
                1.6510789 = boost
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.020768678 = queryNorm
              0.59538716 = fieldWeight in 142, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4999514 = idf(docFreq=469, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.1732646 = weight(abstract_txt:matching in 142) [ClassicSimilarity], result of:
            0.1732646 = score(doc=142,freq=4.0), product of:
              0.22880867 = queryWeight, product of:
                1.8185962 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.020768678 = queryNorm
              0.7572466 = fieldWeight in 142, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
          0.37642318 = weight(abstract_txt:approximate in 142) [ClassicSimilarity], result of:
            0.37642318 = score(doc=142,freq=4.0), product of:
              0.3838105 = queryWeight, product of:
                2.3553665 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.020768678 = queryNorm
              0.9807527 = fieldWeight in 142, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.0625 = fieldNorm(doc=142)
        0.32 = coord(8/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.27
    0.2732674 = sum of:
      0.2732674 = product of:
        0.85396063 = sum of:
          0.016518136 = weight(abstract_txt:using in 192) [ClassicSimilarity], result of:
            0.016518136 = score(doc=192,freq=1.0), product of:
              0.075798385 = queryWeight, product of:
                1.0467188 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.020768678 = queryNorm
              0.217922 = fieldWeight in 192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.101311386 = weight(abstract_txt:string in 192) [ClassicSimilarity], result of:
            0.101311386 = score(doc=192,freq=2.0), product of:
              0.1599944 = queryWeight, product of:
                1.0753193 = boost
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.020768678 = queryNorm
              0.63321835 = fieldWeight in 192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.081992544 = weight(abstract_txt:variants in 192) [ClassicSimilarity], result of:
            0.081992544 = score(doc=192,freq=1.0), product of:
              0.17506212 = queryWeight, product of:
                1.1248151 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.020768678 = queryNorm
              0.46836257 = fieldWeight in 192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.12141546 = weight(abstract_txt:spelling in 192) [ClassicSimilarity], result of:
            0.12141546 = score(doc=192,freq=2.0), product of:
              0.18051581 = queryWeight, product of:
                1.1422014 = boost
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.020768678 = queryNorm
              0.6726029 = fieldWeight in 192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.609633 = idf(docFreq=56, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.18734293 = weight(abstract_txt:transliteration in 192) [ClassicSimilarity], result of:
            0.18734293 = score(doc=192,freq=3.0), product of:
              0.21056865 = queryWeight, product of:
                1.2336215 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.020768678 = queryNorm
              0.8897 = fieldWeight in 192, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.034652073 = weight(abstract_txt:show in 192) [ClassicSimilarity], result of:
            0.034652073 = score(doc=192,freq=1.0), product of:
              0.12421444 = queryWeight, product of:
                1.3399423 = boost
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.020768678 = queryNorm
              0.27896976 = fieldWeight in 192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.122516565 = weight(abstract_txt:matching in 192) [ClassicSimilarity], result of:
            0.122516565 = score(doc=192,freq=2.0), product of:
              0.22880867 = queryWeight, product of:
                1.8185962 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.020768678 = queryNorm
              0.5354542 = fieldWeight in 192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
          0.18821159 = weight(abstract_txt:approximate in 192) [ClassicSimilarity], result of:
            0.18821159 = score(doc=192,freq=1.0), product of:
              0.3838105 = queryWeight, product of:
                2.3553665 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.020768678 = queryNorm
              0.49037635 = fieldWeight in 192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.0625 = fieldNorm(doc=192)
        0.32 = coord(8/25)
    
  5. Maaten, L. van den: Accelerating t-SNE using Tree-Based Algorithms (2014) 0.25
    0.2524189 = sum of:
      0.2524189 = product of:
        0.90149605 = sum of:
          0.031951554 = weight(abstract_txt:used in 805) [ClassicSimilarity], result of:
            0.031951554 = score(doc=805,freq=2.0), product of:
              0.071275964 = queryWeight, product of:
                1.015013 = boost
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.020768678 = queryNorm
              0.4482795 = fieldWeight in 805, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.381136 = idf(docFreq=3910, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.024777206 = weight(abstract_txt:using in 805) [ClassicSimilarity], result of:
            0.024777206 = score(doc=805,freq=1.0), product of:
              0.075798385 = queryWeight, product of:
                1.0467188 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.020768678 = queryNorm
              0.32688302 = fieldWeight in 805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.12298882 = weight(abstract_txt:variants in 805) [ClassicSimilarity], result of:
            0.12298882 = score(doc=805,freq=1.0), product of:
              0.17506212 = queryWeight, product of:
                1.1248151 = boost
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.020768678 = queryNorm
              0.70254385 = fieldWeight in 805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.493801 = idf(docFreq=63, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.05197811 = weight(abstract_txt:show in 805) [ClassicSimilarity], result of:
            0.05197811 = score(doc=805,freq=1.0), product of:
              0.12421444 = queryWeight, product of:
                1.3399423 = boost
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.020768678 = queryNorm
              0.41845465 = fieldWeight in 805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.463516 = idf(docFreq=1324, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.047992643 = weight(abstract_txt:data in 805) [ClassicSimilarity], result of:
            0.047992643 = score(doc=805,freq=2.0), product of:
              0.10701105 = queryWeight, product of:
                1.5232108 = boost
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.020768678 = queryNorm
              0.44848305 = fieldWeight in 805, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.382671 = idf(docFreq=3904, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.33949032 = weight(abstract_txt:variant in 805) [ClassicSimilarity], result of:
            0.33949032 = score(doc=805,freq=2.0), product of:
              0.34448212 = queryWeight, product of:
                2.2314308 = boost
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.020768678 = queryNorm
              0.9855093 = fieldWeight in 805, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4331765 = idf(docFreq=67, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
          0.28231737 = weight(abstract_txt:approximate in 805) [ClassicSimilarity], result of:
            0.28231737 = score(doc=805,freq=1.0), product of:
              0.3838105 = queryWeight, product of:
                2.3553665 = boost
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.020768678 = queryNorm
              0.73556453 = fieldWeight in 805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8460217 = idf(docFreq=44, maxDocs=42306)
                0.09375 = fieldNorm(doc=805)
        0.28 = coord(7/25)