Document (#36103)

Author
Dolamic, L.
Savoy, J.
Title
Retrieval effectiveness of machine translated queries
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.11, S.2266-2273
Year
2010
Abstract
This article describes and evaluates various information retrieval models used to search document collections written in English through submitting queries written in various other languages, either members of the Indo-European family (English, French, German, and Spanish) or radically different language groups such as Chinese. This evaluation method involves searching a rather large number of topics (around 300) and using two commercial machine translation systems to translate across the language barriers. In this study, mean average precision is used to measure variances in retrieval effectiveness when a query language differs from the document language. Although performance differences are rather large for certain languages pairs, this does not mean that bilingual search methods are not commercially viable. Causes of the difficulties incurred when searching or during translation are analyzed and the results of concrete examples are explained.
Theme
Computerlinguistik

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 4650) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 4650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=4650)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 6511) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 6511, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=6511)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 7292) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 7292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=7292)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 192) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 192, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=192)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 757) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 757, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=757)
    

Similar documents (content)

  1. Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003) 0.42
    0.41704893 = sum of:
      0.41704893 = product of:
        1.0426223 = sum of:
          0.15989438 = weight(abstract_txt:translated in 1074) [ClassicSimilarity], result of:
            0.15989438 = score(doc=1074,freq=2.0), product of:
              0.19277394 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.025241176 = queryNorm
              0.8294398 = fieldWeight in 1074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.045332007 = weight(abstract_txt:various in 1074) [ClassicSimilarity], result of:
            0.045332007 = score(doc=1074,freq=1.0), product of:
              0.13206351 = queryWeight, product of:
                1.1908063 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.025241176 = queryNorm
              0.34325916 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.015018274 = weight(abstract_txt:this in 1074) [ClassicSimilarity], result of:
            0.015018274 = score(doc=1074,freq=1.0), product of:
              0.07966536 = queryWeight, product of:
                1.307975 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.025241176 = queryNorm
              0.18851699 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.07082796 = weight(abstract_txt:effectiveness in 1074) [ClassicSimilarity], result of:
            0.07082796 = score(doc=1074,freq=1.0), product of:
              0.17782085 = queryWeight, product of:
                1.3817868 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.025241176 = queryNorm
              0.39831078 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.15914227 = weight(abstract_txt:queries in 1074) [ClassicSimilarity], result of:
            0.15914227 = score(doc=1074,freq=5.0), product of:
              0.17839384 = queryWeight, product of:
                1.3840113 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.025241176 = queryNorm
              0.8920839 = fieldWeight in 1074, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.07463424 = weight(abstract_txt:languages in 1074) [ClassicSimilarity], result of:
            0.07463424 = score(doc=1074,freq=1.0), product of:
              0.18413581 = queryWeight, product of:
                1.4061085 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.025241176 = queryNorm
              0.40532172 = fieldWeight in 1074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.05827431 = weight(abstract_txt:retrieval in 1074) [ClassicSimilarity], result of:
            0.05827431 = score(doc=1074,freq=3.0), product of:
              0.123923674 = queryWeight, product of:
                1.4127733 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.025241176 = queryNorm
              0.47024357 = fieldWeight in 1074, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.13092321 = weight(abstract_txt:english in 1074) [ClassicSimilarity], result of:
            0.13092321 = score(doc=1074,freq=2.0), product of:
              0.21257585 = queryWeight, product of:
                1.5107989 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.025241176 = queryNorm
              0.6158894 = fieldWeight in 1074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.21800683 = weight(abstract_txt:translation in 1074) [ClassicSimilarity], result of:
            0.21800683 = score(doc=1074,freq=3.0), product of:
              0.26088703 = queryWeight, product of:
                1.6736935 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.025241176 = queryNorm
              0.8356369 = fieldWeight in 1074, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
          0.11056888 = weight(abstract_txt:language in 1074) [ClassicSimilarity], result of:
            0.11056888 = score(doc=1074,freq=2.0), product of:
              0.23929565 = queryWeight, product of:
                2.2668986 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.025241176 = queryNorm
              0.46205974 = fieldWeight in 1074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1074)
        0.4 = coord(10/25)
    
  2. Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000) 0.42
    0.41648895 = sum of:
      0.41648895 = product of:
        1.0412223 = sum of:
          0.19582984 = weight(abstract_txt:translated in 4436) [ClassicSimilarity], result of:
            0.19582984 = score(doc=4436,freq=3.0), product of:
              0.19277394 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.025241176 = queryNorm
              1.0158522 = fieldWeight in 4436, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.119816326 = weight(abstract_txt:bilingual in 4436) [ClassicSimilarity], result of:
            0.119816326 = score(doc=4436,freq=1.0), product of:
              0.20037651 = queryWeight, product of:
                1.0371895 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.025241176 = queryNorm
              0.59795594 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.059455644 = weight(abstract_txt:searching in 4436) [ClassicSimilarity], result of:
            0.059455644 = score(doc=4436,freq=2.0), product of:
              0.12559284 = queryWeight, product of:
                1.1612672 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.025241176 = queryNorm
              0.47339994 = fieldWeight in 4436, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.05978433 = weight(abstract_txt:document in 4436) [ClassicSimilarity], result of:
            0.05978433 = score(doc=4436,freq=2.0), product of:
              0.12605529 = queryWeight, product of:
                1.1634032 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.025241176 = queryNorm
              0.4742707 = fieldWeight in 4436, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.015018274 = weight(abstract_txt:this in 4436) [ClassicSimilarity], result of:
            0.015018274 = score(doc=4436,freq=1.0), product of:
              0.07966536 = queryWeight, product of:
                1.307975 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.025241176 = queryNorm
              0.18851699 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.03364469 = weight(abstract_txt:retrieval in 4436) [ClassicSimilarity], result of:
            0.03364469 = score(doc=4436,freq=1.0), product of:
              0.123923674 = queryWeight, product of:
                1.4127733 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.025241176 = queryNorm
              0.27149525 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.07860419 = weight(abstract_txt:machine in 4436) [ClassicSimilarity], result of:
            0.07860419 = score(doc=4436,freq=1.0), product of:
              0.19060895 = queryWeight, product of:
                1.4306103 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.025241176 = queryNorm
              0.41238457 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.09257669 = weight(abstract_txt:english in 4436) [ClassicSimilarity], result of:
            0.09257669 = score(doc=4436,freq=1.0), product of:
              0.21257585 = queryWeight, product of:
                1.5107989 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.025241176 = queryNorm
              0.43549955 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.30830824 = weight(abstract_txt:translation in 4436) [ClassicSimilarity], result of:
            0.30830824 = score(doc=4436,freq=6.0), product of:
              0.26088703 = queryWeight, product of:
                1.6736935 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.025241176 = queryNorm
              1.1817691 = fieldWeight in 4436, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
          0.07818401 = weight(abstract_txt:language in 4436) [ClassicSimilarity], result of:
            0.07818401 = score(doc=4436,freq=1.0), product of:
              0.23929565 = queryWeight, product of:
                2.2668986 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.025241176 = queryNorm
              0.32672557 = fieldWeight in 4436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=4436)
        0.4 = coord(10/25)
    
  3. Rosemblat, G.; Tse, T.; Gemoets, D.: Adapting a monolingual consumer health system for Spanish cross-language information retrieval (2004) 0.32
    0.3223664 = sum of:
      0.3223664 = product of:
        0.80591595 = sum of:
          0.09044992 = weight(abstract_txt:translated in 2673) [ClassicSimilarity], result of:
            0.09044992 = score(doc=2673,freq=1.0), product of:
              0.19277394 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.025241176 = queryNorm
              0.46920204 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.09585305 = weight(abstract_txt:bilingual in 2673) [ClassicSimilarity], result of:
            0.09585305 = score(doc=2673,freq=1.0), product of:
              0.20037651 = queryWeight, product of:
                1.0371895 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.025241176 = queryNorm
              0.47836474 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.016991237 = weight(abstract_txt:this in 2673) [ClassicSimilarity], result of:
            0.016991237 = score(doc=2673,freq=2.0), product of:
              0.07966536 = queryWeight, product of:
                1.307975 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.025241176 = queryNorm
              0.21328263 = fieldWeight in 2673, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.05666237 = weight(abstract_txt:effectiveness in 2673) [ClassicSimilarity], result of:
            0.05666237 = score(doc=2673,freq=1.0), product of:
              0.17782085 = queryWeight, product of:
                1.3817868 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.025241176 = queryNorm
              0.31864864 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.12731381 = weight(abstract_txt:queries in 2673) [ClassicSimilarity], result of:
            0.12731381 = score(doc=2673,freq=5.0), product of:
              0.17839384 = queryWeight, product of:
                1.3840113 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.025241176 = queryNorm
              0.7136671 = fieldWeight in 2673, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.026915753 = weight(abstract_txt:retrieval in 2673) [ClassicSimilarity], result of:
            0.026915753 = score(doc=2673,freq=1.0), product of:
              0.123923674 = queryWeight, product of:
                1.4127733 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.025241176 = queryNorm
              0.21719621 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.062883355 = weight(abstract_txt:machine in 2673) [ClassicSimilarity], result of:
            0.062883355 = score(doc=2673,freq=1.0), product of:
              0.19060895 = queryWeight, product of:
                1.4306103 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.025241176 = queryNorm
              0.32990766 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.16560622 = weight(abstract_txt:english in 2673) [ClassicSimilarity], result of:
            0.16560622 = score(doc=2673,freq=5.0), product of:
              0.21257585 = queryWeight, product of:
                1.5107989 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.025241176 = queryNorm
              0.7790453 = fieldWeight in 2673, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.10069304 = weight(abstract_txt:translation in 2673) [ClassicSimilarity], result of:
            0.10069304 = score(doc=2673,freq=1.0), product of:
              0.26088703 = queryWeight, product of:
                1.6736935 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.025241176 = queryNorm
              0.38596416 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
          0.06254721 = weight(abstract_txt:language in 2673) [ClassicSimilarity], result of:
            0.06254721 = score(doc=2673,freq=1.0), product of:
              0.23929565 = queryWeight, product of:
                2.2668986 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.025241176 = queryNorm
              0.26138046 = fieldWeight in 2673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=2673)
        0.4 = coord(10/25)
    
  4. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.32
    0.3194831 = sum of:
      0.3194831 = product of:
        0.7987077 = sum of:
          0.083871424 = weight(abstract_txt:bilingual in 1683) [ClassicSimilarity], result of:
            0.083871424 = score(doc=1683,freq=1.0), product of:
              0.20037651 = queryWeight, product of:
                1.0371895 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.025241176 = queryNorm
              0.41856915 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.088253066 = weight(abstract_txt:translate in 1683) [ClassicSimilarity], result of:
            0.088253066 = score(doc=1683,freq=1.0), product of:
              0.2072959 = queryWeight, product of:
                1.0549456 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.025241176 = queryNorm
              0.42573476 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.033058513 = weight(abstract_txt:large in 1683) [ClassicSimilarity], result of:
            0.033058513 = score(doc=1683,freq=1.0), product of:
              0.13571766 = queryWeight, product of:
                1.2071685 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.025241176 = queryNorm
              0.243583 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.0105127925 = weight(abstract_txt:this in 1683) [ClassicSimilarity], result of:
            0.0105127925 = score(doc=1683,freq=1.0), product of:
              0.07966536 = queryWeight, product of:
                1.307975 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.025241176 = queryNorm
              0.1319619 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.07388413 = weight(abstract_txt:languages in 1683) [ClassicSimilarity], result of:
            0.07388413 = score(doc=1683,freq=2.0), product of:
              0.18413581 = queryWeight, product of:
                1.4061085 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.025241176 = queryNorm
              0.40124804 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.033306543 = weight(abstract_txt:retrieval in 1683) [ClassicSimilarity], result of:
            0.033306543 = score(doc=1683,freq=2.0), product of:
              0.123923674 = queryWeight, product of:
                1.4127733 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.025241176 = queryNorm
              0.26876658 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.055022933 = weight(abstract_txt:machine in 1683) [ClassicSimilarity], result of:
            0.055022933 = score(doc=1683,freq=1.0), product of:
              0.19060895 = queryWeight, product of:
                1.4306103 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.025241176 = queryNorm
              0.2886692 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.15873596 = weight(abstract_txt:english in 1683) [ClassicSimilarity], result of:
            0.15873596 = score(doc=1683,freq=6.0), product of:
              0.21257585 = queryWeight, product of:
                1.5107989 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.025241176 = queryNorm
              0.7467262 = fieldWeight in 1683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.15260479 = weight(abstract_txt:translation in 1683) [ClassicSimilarity], result of:
            0.15260479 = score(doc=1683,freq=3.0), product of:
              0.26088703 = queryWeight, product of:
                1.6736935 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.025241176 = queryNorm
              0.58494586 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.10945761 = weight(abstract_txt:language in 1683) [ClassicSimilarity], result of:
            0.10945761 = score(doc=1683,freq=4.0), product of:
              0.23929565 = queryWeight, product of:
                2.2668986 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.025241176 = queryNorm
              0.45741582 = fieldWeight in 1683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
        0.4 = coord(10/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.31
    0.31290728 = sum of:
      0.31290728 = product of:
        0.7111529 = sum of:
          0.09044992 = weight(abstract_txt:translated in 5601) [ClassicSimilarity], result of:
            0.09044992 = score(doc=5601,freq=1.0), product of:
              0.19277394 = queryWeight, product of:
                1.017323 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.025241176 = queryNorm
              0.46920204 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.030522414 = weight(abstract_txt:when in 5601) [ClassicSimilarity], result of:
            0.030522414 = score(doc=5601,freq=1.0), product of:
              0.11772411 = queryWeight, product of:
                1.1243005 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.025241176 = queryNorm
              0.2592707 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.07562186 = weight(abstract_txt:document in 5601) [ClassicSimilarity], result of:
            0.07562186 = score(doc=5601,freq=5.0), product of:
              0.12605529 = queryWeight, product of:
                1.1634032 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.025241176 = queryNorm
              0.59991026 = fieldWeight in 5601, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.036265608 = weight(abstract_txt:various in 5601) [ClassicSimilarity], result of:
            0.036265608 = score(doc=5601,freq=1.0), product of:
              0.13206351 = queryWeight, product of:
                1.1908063 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.025241176 = queryNorm
              0.27460733 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.01201462 = weight(abstract_txt:this in 5601) [ClassicSimilarity], result of:
            0.01201462 = score(doc=5601,freq=1.0), product of:
              0.07966536 = queryWeight, product of:
                1.307975 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.025241176 = queryNorm
              0.1508136 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.05693647 = weight(abstract_txt:queries in 5601) [ClassicSimilarity], result of:
            0.05693647 = score(doc=5601,freq=1.0), product of:
              0.17839384 = queryWeight, product of:
                1.3840113 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.025241176 = queryNorm
              0.31916162 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.10341624 = weight(abstract_txt:languages in 5601) [ClassicSimilarity], result of:
            0.10341624 = score(doc=5601,freq=3.0), product of:
              0.18413581 = queryWeight, product of:
                1.4061085 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.025241176 = queryNorm
              0.56163025 = fieldWeight in 5601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.026915753 = weight(abstract_txt:retrieval in 5601) [ClassicSimilarity], result of:
            0.026915753 = score(doc=5601,freq=1.0), product of:
              0.123923674 = queryWeight, product of:
                1.4127733 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.025241176 = queryNorm
              0.21719621 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.07406135 = weight(abstract_txt:english in 5601) [ClassicSimilarity], result of:
            0.07406135 = score(doc=5601,freq=1.0), product of:
              0.21257585 = queryWeight, product of:
                1.5107989 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.025241176 = queryNorm
              0.34839964 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.14240146 = weight(abstract_txt:translation in 5601) [ClassicSimilarity], result of:
            0.14240146 = score(doc=5601,freq=2.0), product of:
              0.26088703 = queryWeight, product of:
                1.6736935 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.025241176 = queryNorm
              0.54583573 = fieldWeight in 5601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
          0.06254721 = weight(abstract_txt:language in 5601) [ClassicSimilarity], result of:
            0.06254721 = score(doc=5601,freq=1.0), product of:
              0.23929565 = queryWeight, product of:
                2.2668986 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.025241176 = queryNorm
              0.26138046 = fieldWeight in 5601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=5601)
        0.44 = coord(11/25)