Document (#40224)

Author
Järvelin, A.
Keskustalo, H.
Sormunen, E.
Saastamoinen, M.
Kettunen, K.
Title
Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach
Source
Journal of the Association for Information Science and Technology. 67(2016) no.12, S.2928-2946
Year
2016
Abstract
The aim of the study was to test whether query expansion by approximate string matching methods is beneficial in retrieval from historical newspaper collections in a language rich with compounds and inflectional forms (Finnish). First, approximate string matching methods were used to generate lists of index words most similar to contemporary query terms in a digitized newspaper collection from the 1800s. Top index word variants were categorized to estimate the appropriate query expansion ranges in the retrieval test. Second, the effectiveness of approximate string matching methods, automatically generated inflectional forms, and their combinations were measured in a Cranfield-style test. Finally, a detailed topic-level analysis of test results was conducted. In the index of historical newspaper collection the occurrences of a word typically spread to many linguistic and historical variants along with optical character recognition (OCR) errors. All query expansion methods improved the baseline results. Extensive expansion of around 30 variants for each query word was required to achieve the highest performance improvement. Query expansion based on approximate string matching was superior to using the inflectional forms of the query words, showing that coverage of the different types of variation is more important than precision in handling one type of variation.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23379/full.
Theme
Computerlinguistik
Semantisches Umfeld in Indexierung u. Retrieval
Form
Zeitungen

Similar documents (author)

  1. Järvelin, K.; Kristensen, J.; Niemi, T.; Sormunen, E.; Keskustalo, H.: ¬A deductive data model for query expansion (1996) 4.86
    4.8553424 = sum of:
      4.8553424 = sum of:
        1.1216575 = weight(author_txt:järvelin in 2230) [ClassicSimilarity], result of:
          1.1216575 = score(doc=2230,freq=1.0), product of:
            0.44960067 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.05631754 = queryNorm
            2.494786 = fieldWeight in 2230, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=2230)
        1.77263 = weight(author_txt:sormunen in 2230) [ClassicSimilarity], result of:
          1.77263 = score(doc=2230,freq=1.0), product of:
            0.6100033 = queryWeight, product of:
              1.1648034 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.05631754 = queryNorm
            2.905935 = fieldWeight in 2230, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.3125 = fieldNorm(doc=2230)
        1.9610548 = weight(author_txt:keskustalo in 2230) [ClassicSimilarity], result of:
          1.9610548 = score(doc=2230,freq=1.0), product of:
            0.65249914 = queryWeight, product of:
              1.2046933 = boost
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.05631754 = queryNorm
            3.005452 = fieldWeight in 2230, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.3125 = fieldNorm(doc=2230)
    
  2. Lehtokangas, R.; Keskustalo, H.; Järvelin, K.: Experiments with transitive dictionary translation and pseudo-relevance feedback using graded relevance assessments (2008) 2.47
    2.4661698 = sum of:
      2.4661698 = product of:
        3.6992545 = sum of:
          1.345989 = weight(author_txt:järvelin in 1349) [ClassicSimilarity], result of:
            1.345989 = score(doc=1349,freq=1.0), product of:
              0.44960067 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.05631754 = queryNorm
              2.9937432 = fieldWeight in 1349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.375 = fieldNorm(doc=1349)
          2.3532655 = weight(author_txt:keskustalo in 1349) [ClassicSimilarity], result of:
            2.3532655 = score(doc=1349,freq=1.0), product of:
              0.65249914 = queryWeight, product of:
                1.2046933 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05631754 = queryNorm
              3.606542 = fieldWeight in 1349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=1349)
        0.6666667 = coord(2/3)
    
  3. Pirkola, A.; Hedlund, T.; Keskustalo, H.; Järvelin, K.: Dictionary-based cross-language information retrieval : problems, methods, and research findings (2001) 2.06
    2.0551414 = sum of:
      2.0551414 = product of:
        3.0827122 = sum of:
          1.1216575 = weight(author_txt:järvelin in 3908) [ClassicSimilarity], result of:
            1.1216575 = score(doc=3908,freq=1.0), product of:
              0.44960067 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.05631754 = queryNorm
              2.494786 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.3125 = fieldNorm(doc=3908)
          1.9610548 = weight(author_txt:keskustalo in 3908) [ClassicSimilarity], result of:
            1.9610548 = score(doc=3908,freq=1.0), product of:
              0.65249914 = queryWeight, product of:
                1.2046933 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05631754 = queryNorm
              3.005452 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=3908)
        0.6666667 = coord(2/3)
    
  4. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 2.06
    2.0551414 = sum of:
      2.0551414 = product of:
        3.0827122 = sum of:
          1.1216575 = weight(author_txt:järvelin in 1052) [ClassicSimilarity], result of:
            1.1216575 = score(doc=1052,freq=1.0), product of:
              0.44960067 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.05631754 = queryNorm
              2.494786 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.3125 = fieldNorm(doc=1052)
          1.9610548 = weight(author_txt:keskustalo in 1052) [ClassicSimilarity], result of:
            1.9610548 = score(doc=1052,freq=1.0), product of:
              0.65249914 = queryWeight, product of:
                1.2046933 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05631754 = queryNorm
              3.005452 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=1052)
        0.6666667 = coord(2/3)
    
  5. Ferro, N.; Silvello, G.; Keskustalo, H.; Pirkola, A.; Järvelin, K.: ¬The twist measure for IR evaluation : taking user's effort into account (2016) 2.06
    2.0551414 = sum of:
      2.0551414 = product of:
        3.0827122 = sum of:
          1.1216575 = weight(author_txt:järvelin in 2771) [ClassicSimilarity], result of:
            1.1216575 = score(doc=2771,freq=1.0), product of:
              0.44960067 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.05631754 = queryNorm
              2.494786 = fieldWeight in 2771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.3125 = fieldNorm(doc=2771)
          1.9610548 = weight(author_txt:keskustalo in 2771) [ClassicSimilarity], result of:
            1.9610548 = score(doc=2771,freq=1.0), product of:
              0.65249914 = queryWeight, product of:
                1.2046933 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05631754 = queryNorm
              3.005452 = fieldWeight in 2771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=2771)
        0.6666667 = coord(2/3)
    

Similar documents (content)

  1. French, J.C.; Powell, A.L.; Schulman, E.: Using clustering strategies for creating authority files (2000) 0.33
    0.3348224 = sum of:
      0.3348224 = product of:
        1.04632 = sum of:
          0.008144573 = weight(abstract_txt:from in 4811) [ClassicSimilarity], result of:
            0.008144573 = score(doc=4811,freq=1.0), product of:
              0.037718873 = queryWeight, product of:
                1.1174394 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012212797 = queryNorm
              0.21592833 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.01618923 = weight(abstract_txt:retrieval in 4811) [ClassicSimilarity], result of:
            0.01618923 = score(doc=4811,freq=1.0), product of:
              0.059629887 = queryWeight, product of:
                1.4050009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012212797 = queryNorm
              0.27149525 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.061944544 = weight(abstract_txt:word in 4811) [ClassicSimilarity], result of:
            0.061944544 = score(doc=4811,freq=1.0), product of:
              0.14587533 = queryWeight, product of:
                2.1975338 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.012212797 = queryNorm
              0.4246403 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.08863263 = weight(abstract_txt:forms in 4811) [ClassicSimilarity], result of:
            0.08863263 = score(doc=4811,freq=2.0), product of:
              0.14701633 = queryWeight, product of:
                2.2061112 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.012212797 = queryNorm
              0.60287607 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.16321091 = weight(abstract_txt:variants in 4811) [ClassicSimilarity], result of:
            0.16321091 = score(doc=4811,freq=1.0), product of:
              0.27827826 = queryWeight, product of:
                3.0351787 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.012212797 = queryNorm
              0.58650255 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.16090885 = weight(abstract_txt:matching in 4811) [ClassicSimilarity], result of:
            0.16090885 = score(doc=4811,freq=2.0), product of:
              0.24080715 = queryWeight, product of:
                3.2602332 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012212797 = queryNorm
              0.6682063 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.1899905 = weight(abstract_txt:string in 4811) [ClassicSimilarity], result of:
            0.1899905 = score(doc=4811,freq=1.0), product of:
              0.3389331 = queryWeight, product of:
                3.8678622 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.012212797 = queryNorm
              0.56055456 = fieldWeight in 4811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
          0.35729867 = weight(abstract_txt:approximate in 4811) [ClassicSimilarity], result of:
            0.35729867 = score(doc=4811,freq=2.0), product of:
              0.4098614 = queryWeight, product of:
                4.2533636 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012212797 = queryNorm
              0.8717549 = fieldWeight in 4811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.078125 = fieldNorm(doc=4811)
        0.32 = coord(8/25)
    
  2. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.33
    0.3308475 = sum of:
      0.3308475 = product of:
        0.9190208 = sum of:
          0.011285452 = weight(abstract_txt:from in 614) [ClassicSimilarity], result of:
            0.011285452 = score(doc=614,freq=3.0), product of:
              0.037718873 = queryWeight, product of:
                1.1174394 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.012212797 = queryNorm
              0.29919907 = fieldWeight in 614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.012951385 = weight(abstract_txt:retrieval in 614) [ClassicSimilarity], result of:
            0.012951385 = score(doc=614,freq=1.0), product of:
              0.059629887 = queryWeight, product of:
                1.4050009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012212797 = queryNorm
              0.21719621 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.046742003 = weight(abstract_txt:index in 614) [ClassicSimilarity], result of:
            0.046742003 = score(doc=614,freq=2.0), product of:
              0.11135628 = queryWeight, product of:
                1.9200034 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.012212797 = queryNorm
              0.41975182 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.100276366 = weight(abstract_txt:forms in 614) [ClassicSimilarity], result of:
            0.100276366 = score(doc=614,freq=4.0), product of:
              0.14701633 = queryWeight, product of:
                2.2061112 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.012212797 = queryNorm
              0.6820764 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.041493244 = weight(abstract_txt:methods in 614) [ClassicSimilarity], result of:
            0.041493244 = score(doc=614,freq=2.0), product of:
              0.11320727 = queryWeight, product of:
                2.235379 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.012212797 = queryNorm
              0.36652455 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.26113746 = weight(abstract_txt:variants in 614) [ClassicSimilarity], result of:
            0.26113746 = score(doc=614,freq=4.0), product of:
              0.27827826 = queryWeight, product of:
                3.0351787 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.012212797 = queryNorm
              0.9384041 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.091023795 = weight(abstract_txt:matching in 614) [ClassicSimilarity], result of:
            0.091023795 = score(doc=614,freq=1.0), product of:
              0.24080715 = queryWeight, product of:
                3.2602332 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012212797 = queryNorm
              0.37799457 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.1519924 = weight(abstract_txt:string in 614) [ClassicSimilarity], result of:
            0.1519924 = score(doc=614,freq=1.0), product of:
              0.3389331 = queryWeight, product of:
                3.8678622 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.012212797 = queryNorm
              0.44844365 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.20211866 = weight(abstract_txt:approximate in 614) [ClassicSimilarity], result of:
            0.20211866 = score(doc=614,freq=1.0), product of:
              0.4098614 = queryWeight, product of:
                4.2533636 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012212797 = queryNorm
              0.49313906 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
        0.36 = coord(9/25)
    
  3. Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003) 0.30
    0.30252573 = sum of:
      0.30252573 = product of:
        0.94539297 = sum of:
          0.10427582 = weight(abstract_txt:finnish in 1074) [ClassicSimilarity], result of:
            0.10427582 = score(doc=1074,freq=3.0), product of:
              0.099240296 = queryWeight, product of:
                1.046473 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.012212797 = queryNorm
              1.0507407 = fieldWeight in 1074, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.028040571 = weight(abstract_txt:retrieval in 1074) [ClassicSimilarity], result of:
            0.028040571 = score(doc=1074,freq=3.0), product of:
              0.059629887 = queryWeight, product of:
                1.4050009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012212797 = queryNorm
              0.47024357 = fieldWeight in 1074, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.03813822 = weight(abstract_txt:were in 1074) [ClassicSimilarity], result of:
            0.03813822 = score(doc=1074,freq=4.0), product of:
              0.06650691 = queryWeight, product of:
                1.483809 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.012212797 = queryNorm
              0.57344747 = fieldWeight in 1074, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.09348933 = weight(abstract_txt:test in 1074) [ClassicSimilarity], result of:
            0.09348933 = score(doc=1074,freq=2.0), product of:
              0.1676708 = queryWeight, product of:
                2.7204623 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.012212797 = queryNorm
              0.55757666 = fieldWeight in 1074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.16321091 = weight(abstract_txt:variants in 1074) [ClassicSimilarity], result of:
            0.16321091 = score(doc=1074,freq=1.0), product of:
              0.27827826 = queryWeight, product of:
                3.0351787 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.012212797 = queryNorm
              0.58650255 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.11377974 = weight(abstract_txt:matching in 1074) [ClassicSimilarity], result of:
            0.11377974 = score(doc=1074,freq=1.0), product of:
              0.24080715 = queryWeight, product of:
                3.2602332 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012212797 = queryNorm
              0.4724932 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.1834436 = weight(abstract_txt:newspaper in 1074) [ClassicSimilarity], result of:
            0.1834436 = score(doc=1074,freq=1.0), product of:
              0.33110148 = queryWeight, product of:
                3.8229141 = boost
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.012212797 = queryNorm
              0.55404043 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0917172 = idf(docFreq=99, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.22101477 = weight(abstract_txt:query in 1074) [ClassicSimilarity], result of:
            0.22101477 = score(doc=1074,freq=4.0), product of:
              0.2975525 = queryWeight, product of:
                5.1252 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.012212797 = queryNorm
              0.74277574 = fieldWeight in 1074, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
        0.32 = coord(8/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.29
    0.28865635 = sum of:
      0.28865635 = product of:
        0.90205115 = sum of:
          0.012951385 = weight(abstract_txt:retrieval in 2372) [ClassicSimilarity], result of:
            0.012951385 = score(doc=2372,freq=1.0), product of:
              0.059629887 = queryWeight, product of:
                1.4050009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012212797 = queryNorm
              0.21719621 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.05465908 = weight(abstract_txt:words in 2372) [ClassicSimilarity], result of:
            0.05465908 = score(doc=2372,freq=3.0), product of:
              0.09432436 = queryWeight, product of:
                1.4428159 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.012212797 = queryNorm
              0.57948 = fieldWeight in 2372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.033051588 = weight(abstract_txt:index in 2372) [ClassicSimilarity], result of:
            0.033051588 = score(doc=2372,freq=1.0), product of:
              0.11135628 = queryWeight, product of:
                1.9200034 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.012212797 = queryNorm
              0.29680938 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.13056873 = weight(abstract_txt:variants in 2372) [ClassicSimilarity], result of:
            0.13056873 = score(doc=2372,freq=1.0), product of:
              0.27827826 = queryWeight, product of:
                3.0351787 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.012212797 = queryNorm
              0.46920204 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.12872708 = weight(abstract_txt:matching in 2372) [ClassicSimilarity], result of:
            0.12872708 = score(doc=2372,freq=2.0), product of:
              0.24080715 = queryWeight, product of:
                3.2602332 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012212797 = queryNorm
              0.53456503 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.21494971 = weight(abstract_txt:string in 2372) [ClassicSimilarity], result of:
            0.21494971 = score(doc=2372,freq=2.0), product of:
              0.3389331 = queryWeight, product of:
                3.8678622 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.012212797 = queryNorm
              0.6341951 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.20211866 = weight(abstract_txt:approximate in 2372) [ClassicSimilarity], result of:
            0.20211866 = score(doc=2372,freq=1.0), product of:
              0.4098614 = queryWeight, product of:
                4.2533636 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012212797 = queryNorm
              0.49313906 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.12502484 = weight(abstract_txt:query in 2372) [ClassicSimilarity], result of:
            0.12502484 = score(doc=2372,freq=2.0), product of:
              0.2975525 = queryWeight, product of:
                5.1252 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.012212797 = queryNorm
              0.4201774 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
        0.32 = coord(8/25)
    
  5. Airio, E.; Kettunen, K.: Does dictionary based bilingual retrieval work in a non-normalized index? (2009) 0.26
    0.25748056 = sum of:
      0.25748056 = product of:
        0.7152238 = sum of:
          0.096325874 = weight(abstract_txt:finnish in 4224) [ClassicSimilarity], result of:
            0.096325874 = score(doc=4224,freq=4.0), product of:
              0.099240296 = queryWeight, product of:
                1.046473 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.012212797 = queryNorm
              0.9706327 = fieldWeight in 4224, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.022432458 = weight(abstract_txt:retrieval in 4224) [ClassicSimilarity], result of:
            0.022432458 = score(doc=4224,freq=3.0), product of:
              0.059629887 = queryWeight, product of:
                1.4050009 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012212797 = queryNorm
              0.37619486 = fieldWeight in 4224, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.015255286 = weight(abstract_txt:were in 4224) [ClassicSimilarity], result of:
            0.015255286 = score(doc=4224,freq=1.0), product of:
              0.06650691 = queryWeight, product of:
                1.483809 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.012212797 = queryNorm
              0.22937898 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.033051588 = weight(abstract_txt:index in 4224) [ClassicSimilarity], result of:
            0.033051588 = score(doc=4224,freq=1.0), product of:
              0.11135628 = queryWeight, product of:
                1.9200034 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.012212797 = queryNorm
              0.29680938 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.050138183 = weight(abstract_txt:forms in 4224) [ClassicSimilarity], result of:
            0.050138183 = score(doc=4224,freq=1.0), product of:
              0.14701633 = queryWeight, product of:
                2.2061112 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.012212797 = queryNorm
              0.3410382 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.05288555 = weight(abstract_txt:test in 4224) [ClassicSimilarity], result of:
            0.05288555 = score(doc=4224,freq=1.0), product of:
              0.1676708 = queryWeight, product of:
                2.7204623 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.012212797 = queryNorm
              0.315413 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.091023795 = weight(abstract_txt:matching in 4224) [ClassicSimilarity], result of:
            0.091023795 = score(doc=4224,freq=1.0), product of:
              0.24080715 = queryWeight, product of:
                3.2602332 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012212797 = queryNorm
              0.37799457 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.1519924 = weight(abstract_txt:string in 4224) [ClassicSimilarity], result of:
            0.1519924 = score(doc=4224,freq=1.0), product of:
              0.3389331 = queryWeight, product of:
                3.8678622 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.012212797 = queryNorm
              0.44844365 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
          0.20211866 = weight(abstract_txt:approximate in 4224) [ClassicSimilarity], result of:
            0.20211866 = score(doc=4224,freq=1.0), product of:
              0.4098614 = queryWeight, product of:
                4.2533636 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012212797 = queryNorm
              0.49313906 = fieldWeight in 4224, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=4224)
        0.36 = coord(9/25)