Document (#36214)

Author
Li, Q.
Chen, Y.P.
Myaeng, S.-H.
Jin, Y.
Kang, B.-Y.
Title
Concept unification of terms in different languages via web mining for Information Retrieval
Source
Information processing and management. 45(2009) no.2, pp.246-262
Year
2009
Abstract
For historical and cultural reasons, English phrases, especially proper nouns and new words, frequently appear in Web pages written primarily in East Asian languages such as Chinese, Korean, and Japanese. Although such English terms and their equivalents in these East Asian languages refer to the same concept, they are often erroneously treated as independent index units in traditional Information Retrieval (IR). This paper describes the degree to which the problem arises in IR and proposes a novel technique to solve it. Our method first extracts English terms from native Web documents in an East Asian language, and then unifies the extracted terms and their equivalents in the native language as one index unit. For Cross-Language Information Retrieval (CLIR), one of the major hindrances to achieving retrieval performance at the level of Mono-Lingual Information Retrieval (MLIR) is the translation of terms in search queries that cannot be found in a bilingual dictionary. The Web mining approach proposed in this paper for concept unification of terms in different languages can also be applied to solve this well-known challenge in CLIR. Experimental results based on the NTCIR and KT-Set test collections show that the high translation precision of our approach greatly improves the performance of both Mono-Lingual and Cross-Language Information Retrieval.
Theme
Computational linguistics
Multilingual problems

Similar documents (author)

  1. Khoo, C.; Myaeng, S.H.: Identifying semantic relations in text for information retrieval and information extraction (2002) 1.25
    1.2523949 = sum of:
      1.2523949 = product of:
        3.7571847 = sum of:
          3.7571847 = weight(author_txt:myaeng in 2195) [ClassicSimilarity], result of:
            3.7571847 = score(doc=2195,freq=1.0), product of:
              0.7597914 = queryWeight, product of:
                1.6054374 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.047852296 = queryNorm
              4.9450216 = fieldWeight in 2195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.5 = fieldNorm(doc=2195)
        0.33333334 = coord(1/3)
    
  2. Kang, M.: Dual paths to continuous online knowledge sharing : a repetitive behavior perspective (2020) 1.04
    1.0427682 = sum of:
      1.0427682 = product of:
        3.1283047 = sum of:
          3.1283047 = weight(author_txt:kang in 2271) [ClassicSimilarity], result of:
            3.1283047 = score(doc=2271,freq=1.0), product of:
              0.5794981 = queryWeight, product of:
                1.4020782 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.047852296 = queryNorm
              5.3983 = fieldWeight in 2271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.625 = fieldNorm(doc=2271)
        0.33333334 = coord(1/3)
    
  3. Kang, M.: Motivational affordances and survival of new askers on social Q&A sites : the case of Stack Exchange network (2022) 1.04
    1.0427682 = sum of:
      1.0427682 = product of:
        3.1283047 = sum of:
          3.1283047 = weight(author_txt:kang in 2734) [ClassicSimilarity], result of:
            3.1283047 = score(doc=2734,freq=1.0), product of:
              0.5794981 = queryWeight, product of:
                1.4020782 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.047852296 = queryNorm
              5.3983 = fieldWeight in 2734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.625 = fieldNorm(doc=2734)
        0.33333334 = coord(1/3)
    
  4. Jeong, K.S.; Myaeng, S.-H.; Lee, J.S.; Choi, K.: Automatic identification and back-transliteration of foreign words for information retrieval (1999) 0.78
    0.7827469 = sum of:
      0.7827469 = product of:
        2.3482406 = sum of:
          2.3482406 = weight(author_txt:myaeng in 501) [ClassicSimilarity], result of:
            2.3482406 = score(doc=501,freq=1.0), product of:
              0.7597914 = queryWeight, product of:
                1.6054374 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.047852296 = queryNorm
              3.0906386 = fieldWeight in 501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.3125 = fieldNorm(doc=501)
        0.33333334 = coord(1/3)
    
  5. Kang, I.-H.; Kim, G.C.: Integration of multiple evidences based on a query type for web search (2004) 0.73
    0.7299378 = sum of:
      0.7299378 = product of:
        2.1898134 = sum of:
          2.1898134 = weight(author_txt:kang in 3566) [ClassicSimilarity], result of:
            2.1898134 = score(doc=3566,freq=1.0), product of:
              0.5794981 = queryWeight, product of:
                1.4020782 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.047852296 = queryNorm
              3.7788103 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.4375 = fieldNorm(doc=3566)
        0.33333334 = coord(1/3)
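
The breakdowns in this list follow the layout of a Lucene ClassicSimilarity (TF-IDF) explanation. As a minimal sketch, assuming the standard ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), the figures for the first entry (doc 2195) can be recomputed as shown here. The function name classic_term_score is hypothetical and only used for illustration; all numeric inputs are copied from the listing.

```python
import math


def classic_term_score(boost, doc_freq, max_docs, query_norm, freq, field_norm):
    """Recompute one term's score following the explain tree.

    Assumes the ClassicSimilarity formulas:
      idf = 1 + ln(max_docs / (doc_freq + 1))
      tf  = sqrt(freq)
    """
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    tf = math.sqrt(freq)
    query_weight = boost * idf * query_norm   # "queryWeight" line
    field_weight = tf * idf * field_norm      # "fieldWeight" line
    return query_weight * field_weight


# Values taken from entry 1 (author_txt:myaeng in doc 2195).
term_score = classic_term_score(
    boost=1.6054374,
    doc_freq=5,
    max_docs=43556,
    query_norm=0.047852296,
    freq=1.0,
    field_norm=0.5,
)

# coord(1/3): one of the three query clauses matched.
final_score = term_score * (1 / 3)

print(f"term score:  {term_score:.6f}")
print(f"final score: {final_score:.6f}")
```

The result should agree with the listed 1.2523949 up to small rounding differences, since the listed values appear to be single-precision floats while Python uses double precision.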
    

Similar documents (content)

  1. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.36
    0.35500035 = sum of:
      0.35500035 = product of:
        0.8875009 = sum of:
          0.07257161 = weight(abstract_txt:cross in 3050) [ClassicSimilarity], result of:
            0.07257161 = score(doc=3050,freq=2.0), product of:
              0.11701392 = queryWeight, product of:
                1.4045868 = boost
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.014841054 = queryNorm
              0.6201964 = fieldWeight in 3050, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.0103463475 = weight(abstract_txt:information in 3050) [ClassicSimilarity], result of:
            0.0103463475 = score(doc=3050,freq=1.0), product of:
              0.054606088 = queryWeight, product of:
                1.5171214 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.014841054 = queryNorm
              0.18947242 = fieldWeight in 3050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.09686665 = weight(abstract_txt:translation in 3050) [ClassicSimilarity], result of:
            0.09686665 = score(doc=3050,freq=2.0), product of:
              0.14185432 = queryWeight, product of:
                1.5465041 = boost
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.014841054 = queryNorm
              0.6828601 = fieldWeight in 3050, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.15408418 = weight(abstract_txt:lingual in 3050) [ClassicSimilarity], result of:
            0.15408418 = score(doc=3050,freq=1.0), product of:
              0.24354266 = queryWeight, product of:
                2.0263634 = boost
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.014841054 = queryNorm
              0.6326784 = fieldWeight in 3050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.15910438 = weight(abstract_txt:clir in 3050) [ClassicSimilarity], result of:
            0.15910438 = score(doc=3050,freq=1.0), product of:
              0.24880423 = queryWeight, product of:
                2.0481355 = boost
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.014841054 = queryNorm
              0.6394762 = fieldWeight in 3050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.075852185 = weight(abstract_txt:english in 3050) [ClassicSimilarity], result of:
            0.075852185 = score(doc=3050,freq=1.0), product of:
              0.17381163 = queryWeight, product of:
                2.0965965 = boost
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.014841054 = queryNorm
              0.43640453 = fieldWeight in 3050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.08537488 = weight(abstract_txt:language in 3050) [ClassicSimilarity], result of:
            0.08537488 = score(doc=3050,freq=4.0), product of:
              0.13040064 = queryWeight, product of:
                2.0969336 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014841054 = queryNorm
              0.6547121 = fieldWeight in 3050, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.11478815 = weight(abstract_txt:languages in 3050) [ClassicSimilarity], result of:
            0.11478815 = score(doc=3050,freq=2.0), product of:
              0.20014024 = queryWeight, product of:
                2.5978377 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.014841054 = queryNorm
              0.5735386 = fieldWeight in 3050, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.03648679 = weight(abstract_txt:retrieval in 3050) [ClassicSimilarity], result of:
            0.03648679 = score(doc=3050,freq=1.0), product of:
              0.13444166 = queryWeight, product of:
                2.6076984 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.014841054 = queryNorm
              0.27139497 = fieldWeight in 3050, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
          0.08202565 = weight(abstract_txt:terms in 3050) [ClassicSimilarity], result of:
            0.08202565 = score(doc=3050,freq=2.0), product of:
              0.18311891 = queryWeight, product of:
                3.043386 = boost
                4.0542583 = idf(docFreq=2053, maxDocs=43556)
                0.014841054 = queryNorm
              0.4479365 = fieldWeight in 3050, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0542583 = idf(docFreq=2053, maxDocs=43556)
                0.078125 = fieldNorm(doc=3050)
        0.4 = coord(10/25)
    
  2. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.32
    0.3162054 = sum of:
      0.3162054 = product of:
        0.7905135 = sum of:
          0.020246953 = weight(abstract_txt:performance in 2681) [ClassicSimilarity], result of:
            0.020246953 = score(doc=2681,freq=1.0), product of:
              0.07984433 = queryWeight, product of:
                1.1602508 = boost
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.014841054 = queryNorm
              0.25358033 = fieldWeight in 2681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.07184223 = weight(abstract_txt:cross in 2681) [ClassicSimilarity], result of:
            0.07184223 = score(doc=2681,freq=4.0), product of:
              0.11701392 = queryWeight, product of:
                1.4045868 = boost
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.014841054 = queryNorm
              0.6139631 = fieldWeight in 2681, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.012544279 = weight(abstract_txt:information in 2681) [ClassicSimilarity], result of:
            0.012544279 = score(doc=2681,freq=3.0), product of:
              0.054606088 = queryWeight, product of:
                1.5171214 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.014841054 = queryNorm
              0.22972308 = fieldWeight in 2681, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.08304586 = weight(abstract_txt:translation in 2681) [ClassicSimilarity], result of:
            0.08304586 = score(doc=2681,freq=3.0), product of:
              0.14185432 = queryWeight, product of:
                1.5465041 = boost
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.014841054 = queryNorm
              0.5854306 = fieldWeight in 2681, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.15253556 = weight(abstract_txt:lingual in 2681) [ClassicSimilarity], result of:
            0.15253556 = score(doc=2681,freq=2.0), product of:
              0.24354266 = queryWeight, product of:
                2.0263634 = boost
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.014841054 = queryNorm
              0.62631965 = fieldWeight in 2681, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.1300594 = weight(abstract_txt:english in 2681) [ClassicSimilarity], result of:
            0.1300594 = score(doc=2681,freq=6.0), product of:
              0.17381163 = queryWeight, product of:
                2.0965965 = boost
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.014841054 = queryNorm
              0.7482779 = fieldWeight in 2681, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.059762415 = weight(abstract_txt:language in 2681) [ClassicSimilarity], result of:
            0.059762415 = score(doc=2681,freq=4.0), product of:
              0.13040064 = queryWeight, product of:
                2.0969336 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014841054 = queryNorm
              0.45829847 = fieldWeight in 2681, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.08035171 = weight(abstract_txt:languages in 2681) [ClassicSimilarity], result of:
            0.08035171 = score(doc=2681,freq=2.0), product of:
              0.20014024 = queryWeight, product of:
                2.5978377 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.014841054 = queryNorm
              0.40147704 = fieldWeight in 2681, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.03612008 = weight(abstract_txt:retrieval in 2681) [ClassicSimilarity], result of:
            0.03612008 = score(doc=2681,freq=2.0), product of:
              0.13444166 = queryWeight, product of:
                2.6076984 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.014841054 = queryNorm
              0.2686673 = fieldWeight in 2681, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
          0.14400509 = weight(abstract_txt:asian in 2681) [ClassicSimilarity], result of:
            0.14400509 = score(doc=2681,freq=1.0), product of:
              0.3380285 = queryWeight, product of:
                2.9238298 = boost
                7.7899823 = idf(docFreq=48, maxDocs=43556)
                0.014841054 = queryNorm
              0.42601466 = fieldWeight in 2681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7899823 = idf(docFreq=48, maxDocs=43556)
                0.0546875 = fieldNorm(doc=2681)
        0.4 = coord(10/25)
    
  3. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.30
    0.30234057 = sum of:
      0.30234057 = product of:
        0.83983487 = sum of:
          0.06467652 = weight(abstract_txt:performance in 3018) [ClassicSimilarity], result of:
            0.06467652 = score(doc=3018,freq=5.0), product of:
              0.07984433 = queryWeight, product of:
                1.1602508 = boost
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.014841054 = queryNorm
              0.8100327 = fieldWeight in 3018, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.051315878 = weight(abstract_txt:cross in 3018) [ClassicSimilarity], result of:
            0.051315878 = score(doc=3018,freq=1.0), product of:
              0.11701392 = queryWeight, product of:
                1.4045868 = boost
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.014841054 = queryNorm
              0.43854508 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.0103463475 = weight(abstract_txt:information in 3018) [ClassicSimilarity], result of:
            0.0103463475 = score(doc=3018,freq=1.0), product of:
              0.054606088 = queryWeight, product of:
                1.5171214 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.014841054 = queryNorm
              0.18947242 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.06849507 = weight(abstract_txt:translation in 3018) [ClassicSimilarity], result of:
            0.06849507 = score(doc=3018,freq=1.0), product of:
              0.14185432 = queryWeight, product of:
                1.5465041 = boost
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.014841054 = queryNorm
              0.48285502 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1805444 = idf(docFreq=244, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.15408418 = weight(abstract_txt:lingual in 3018) [ClassicSimilarity], result of:
            0.15408418 = score(doc=3018,freq=1.0), product of:
              0.24354266 = queryWeight, product of:
                2.0263634 = boost
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.014841054 = queryNorm
              0.6326784 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.31820875 = weight(abstract_txt:clir in 3018) [ClassicSimilarity], result of:
            0.31820875 = score(doc=3018,freq=4.0), product of:
              0.24880423 = queryWeight, product of:
                2.0481355 = boost
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.014841054 = queryNorm
              1.2789524 = fieldWeight in 3018, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.075852185 = weight(abstract_txt:english in 3018) [ClassicSimilarity], result of:
            0.075852185 = score(doc=3018,freq=1.0), product of:
              0.17381163 = queryWeight, product of:
                2.0965965 = boost
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.014841054 = queryNorm
              0.43640453 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.060369156 = weight(abstract_txt:language in 3018) [ClassicSimilarity], result of:
            0.060369156 = score(doc=3018,freq=2.0), product of:
              0.13040064 = queryWeight, product of:
                2.0969336 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014841054 = queryNorm
              0.46295136 = fieldWeight in 3018, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
          0.03648679 = weight(abstract_txt:retrieval in 3018) [ClassicSimilarity], result of:
            0.03648679 = score(doc=3018,freq=1.0), product of:
              0.13444166 = queryWeight, product of:
                2.6076984 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.014841054 = queryNorm
              0.27139497 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.078125 = fieldNorm(doc=3018)
        0.36 = coord(9/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.29
    0.29378986 = sum of:
      0.29378986 = product of:
        0.6677042 = sum of:
          0.08380522 = weight(abstract_txt:nouns in 4370) [ClassicSimilarity], result of:
            0.08380522 = score(doc=4370,freq=2.0), product of:
              0.118623406 = queryWeight, product of:
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.014841054 = queryNorm
              0.7064813 = fieldWeight in 4370, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9929233 = idf(docFreq=39, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.032724015 = weight(abstract_txt:performance in 4370) [ClassicSimilarity], result of:
            0.032724015 = score(doc=4370,freq=2.0), product of:
              0.07984433 = queryWeight, product of:
                1.1602508 = boost
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.014841054 = queryNorm
              0.4098477 = fieldWeight in 4370, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6368976 = idf(docFreq=1146, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.02480299 = weight(abstract_txt:index in 4370) [ClassicSimilarity], result of:
            0.02480299 = score(doc=4370,freq=1.0), product of:
              0.083626844 = queryWeight, product of:
                1.1874154 = boost
                4.74546 = idf(docFreq=1028, maxDocs=43556)
                0.014841054 = queryNorm
              0.29659125 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74546 = idf(docFreq=1028, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.041052703 = weight(abstract_txt:cross in 4370) [ClassicSimilarity], result of:
            0.041052703 = score(doc=4370,freq=1.0), product of:
              0.11701392 = queryWeight, product of:
                1.4045868 = boost
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.014841054 = queryNorm
              0.35083607 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.008277078 = weight(abstract_txt:information in 4370) [ClassicSimilarity], result of:
            0.008277078 = score(doc=4370,freq=1.0), product of:
              0.054606088 = queryWeight, product of:
                1.5171214 = boost
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.014841054 = queryNorm
              0.15157793 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.425247 = idf(docFreq=10472, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.18000606 = weight(abstract_txt:clir in 4370) [ClassicSimilarity], result of:
            0.18000606 = score(doc=4370,freq=2.0), product of:
              0.24880423 = queryWeight, product of:
                2.0481355 = boost
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.014841054 = queryNorm
              0.7234847 = fieldWeight in 4370, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.060681745 = weight(abstract_txt:english in 4370) [ClassicSimilarity], result of:
            0.060681745 = score(doc=4370,freq=1.0), product of:
              0.17381163 = queryWeight, product of:
                2.0965965 = boost
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.014841054 = queryNorm
              0.34912363 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.585978 = idf(docFreq=443, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.048295323 = weight(abstract_txt:language in 4370) [ClassicSimilarity], result of:
            0.048295323 = score(doc=4370,freq=2.0), product of:
              0.13040064 = queryWeight, product of:
                2.0969336 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014841054 = queryNorm
              0.3703611 = fieldWeight in 4370, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.112468965 = weight(abstract_txt:languages in 4370) [ClassicSimilarity], result of:
            0.112468965 = score(doc=4370,freq=3.0), product of:
              0.20014024 = queryWeight, product of:
                2.5978377 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.014841054 = queryNorm
              0.5619508 = fieldWeight in 4370, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.029189432 = weight(abstract_txt:retrieval in 4370) [ClassicSimilarity], result of:
            0.029189432 = score(doc=4370,freq=1.0), product of:
              0.13444166 = queryWeight, product of:
                2.6076984 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.014841054 = queryNorm
              0.21711598 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
          0.04640071 = weight(abstract_txt:terms in 4370) [ClassicSimilarity], result of:
            0.04640071 = score(doc=4370,freq=1.0), product of:
              0.18311891 = queryWeight, product of:
                3.043386 = boost
                4.0542583 = idf(docFreq=2053, maxDocs=43556)
                0.014841054 = queryNorm
              0.25339115 = fieldWeight in 4370, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0542583 = idf(docFreq=2053, maxDocs=43556)
                0.0625 = fieldNorm(doc=4370)
        0.44 = coord(11/25)
    
  5. Pirkola, A.: Morphological typology of languages for IR (2001) 0.29
    0.2856747 = sum of:
      0.2856747 = product of:
        0.8927334 = sum of:
          0.043845907 = weight(abstract_txt:index in 474) [ClassicSimilarity], result of:
            0.043845907 = score(doc=474,freq=2.0), product of:
              0.083626844 = queryWeight, product of:
                1.1874154 = boost
                4.74546 = idf(docFreq=1028, maxDocs=43556)
                0.014841054 = queryNorm
              0.5243042 = fieldWeight in 474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74546 = idf(docFreq=1028, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.07257161 = weight(abstract_txt:cross in 474) [ClassicSimilarity], result of:
            0.07257161 = score(doc=474,freq=2.0), product of:
              0.11701392 = queryWeight, product of:
                1.4045868 = boost
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.014841054 = queryNorm
              0.6201964 = fieldWeight in 474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.613377 = idf(docFreq=431, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.15408418 = weight(abstract_txt:lingual in 474) [ClassicSimilarity], result of:
            0.15408418 = score(doc=474,freq=1.0), product of:
              0.24354266 = queryWeight, product of:
                2.0263634 = boost
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.014841054 = queryNorm
              0.6326784 = fieldWeight in 474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.098284 = idf(docFreq=35, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.15910438 = weight(abstract_txt:clir in 474) [ClassicSimilarity], result of:
            0.15910438 = score(doc=474,freq=1.0), product of:
              0.24880423 = queryWeight, product of:
                2.0481355 = boost
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.014841054 = queryNorm
              0.6394762 = fieldWeight in 474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.185295 = idf(docFreq=32, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.060369156 = weight(abstract_txt:language in 474) [ClassicSimilarity], result of:
            0.060369156 = score(doc=474,freq=2.0), product of:
              0.13040064 = queryWeight, product of:
                2.0969336 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.014841054 = queryNorm
              0.46295136 = fieldWeight in 474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.22568516 = weight(abstract_txt:mono in 474) [ClassicSimilarity], result of:
            0.22568516 = score(doc=474,freq=1.0), product of:
              0.3141027 = queryWeight, product of:
                2.3012598 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.014841054 = queryNorm
              0.7185075 = fieldWeight in 474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.1405862 = weight(abstract_txt:languages in 474) [ClassicSimilarity], result of:
            0.1405862 = score(doc=474,freq=3.0), product of:
              0.20014024 = queryWeight, product of:
                2.5978377 = boost
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.014841054 = queryNorm
              0.7024385 = fieldWeight in 474, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.191079 = idf(docFreq=658, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
          0.03648679 = weight(abstract_txt:retrieval in 474) [ClassicSimilarity], result of:
            0.03648679 = score(doc=474,freq=1.0), product of:
              0.13444166 = queryWeight, product of:
                2.6076984 = boost
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.014841054 = queryNorm
              0.27139497 = fieldWeight in 474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4738557 = idf(docFreq=3669, maxDocs=43556)
                0.078125 = fieldNorm(doc=474)
        0.32 = coord(8/25)
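
For the content-based matches, each document score is the sum of the per-term scores multiplied by a coordination factor, coord(matched/total). The following is a minimal sketch that recomputes the score of the last entry (doc 474), assuming the standard ClassicSimilarity formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The names MATCHED_TERMS and term_score are hypothetical; the boosts, document frequencies, and term frequencies are copied from the breakdown.

```python
import math

MAX_DOCS = 43556
QUERY_NORM = 0.014841054
FIELD_NORM = 0.078125      # fieldNorm(doc=474)
QUERY_TERM_COUNT = 25      # denominator of coord(8/25)

# (term, boost, docFreq, freq) as listed for doc 474.
MATCHED_TERMS = [
    ("index",     1.1874154, 1028, 2.0),
    ("cross",     1.4045868,  431, 2.0),
    ("lingual",   2.0263634,   35, 1.0),
    ("clir",      2.0481355,   32, 1.0),
    ("language",  2.0969336, 1792, 2.0),
    ("mono",      2.3012598,   11, 1.0),
    ("languages", 2.5978377,  658, 3.0),
    ("retrieval", 2.6076984, 3669, 1.0),
]


def term_score(boost, doc_freq, freq):
    """Score of one matched term: queryWeight * fieldWeight."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    tf = math.sqrt(freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = tf * idf * FIELD_NORM
    return query_weight * field_weight


# Sum over matched terms, then scale by the fraction of query terms matched.
raw_sum = sum(term_score(b, df, f) for _, b, df, f in MATCHED_TERMS)
coord = len(MATCHED_TERMS) / QUERY_TERM_COUNT
doc_score = raw_sum * coord

print(f"sum of term scores: {raw_sum:.6f}")
print(f"coord:              {coord:.2f}")
print(f"document score:     {doc_score:.6f}")
```

The coordination factor explains why a document matching 11 of 25 query terms (entry 4) can rank close to one matching 10 terms with a higher raw sum: the raw sum rewards strong individual matches, while coord rewards breadth of overlap with the query.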