Document (#36217)

Author
Li, Q.
Chen, Y.P.
Myaeng, S.-H.
Jin, Y.
Kang, B.-Y.
Title
Concept unification of terms in different languages via web mining for Information Retrieval
Source
Information processing and management. 45(2009) no.2, S.246-262
Year
2009
Abstract
For historical and cultural reasons, English phrases, especially proper nouns and new words, frequently appear in Web pages written primarily in East Asian languages such as Chinese, Korean, and Japanese. Although such English terms and their equivalences in these East Asian languages refer to the same concept, they are often erroneously treated as independent index units in traditional Information Retrieval (IR). This paper describes the degree to which the problem arises in IR and proposes a novel technique to solve it. Our method first extracts English terms from native Web documents in an East Asian language, and then unifies the extracted terms and their equivalences in the native language as one index unit. For Cross-Language Information Retrieval (CLIR), one of the major hindrances to achieving retrieval performance at the level of Mono-Lingual Information Retrieval (MLIR) is the translation of terms in search queries which can not be found in a bilingual dictionary. The Web mining approach proposed in this paper for concept unification of terms in different languages can also be applied to solve this well-known challenge in CLIR. Experimental results based on NTCIR and KT-Set test collections show that the high translation precision of our approach greatly improves performance of both Mono-Lingual and Cross-Language Information Retrieval.
Theme
Computerlinguistik
Multilinguale Probleme

Similar documents (author)

  1. Khoo, C.; Myaeng, S.H.: Identifying semantic relations in text for information retrieval and information extraction (2002) 1.25
    1.2466464 = sum of:
      1.2466464 = product of:
        3.7399392 = sum of:
          3.7399392 = weight(author_txt:myaeng in 3198) [ClassicSimilarity], result of:
            3.7399392 = score(doc=3198,freq=1.0), product of:
              0.75683635 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.04774165 = queryNorm
              4.9415426 = fieldWeight in 3198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.5 = fieldNorm(doc=3198)
        0.33333334 = coord(1/3)
    
  2. Kang, M.: Dual paths to continuous online knowledge sharing : a repetitive behavior perspective (2020) 1.06
    1.0553623 = sum of:
      1.0553623 = product of:
        3.166087 = sum of:
          3.166087 = weight(author_txt:kang in 986) [ClassicSimilarity], result of:
            3.166087 = score(doc=986,freq=1.0), product of:
              0.5836702 = queryWeight, product of:
                1.408623 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.04774165 = queryNorm
              5.424445 = fieldWeight in 986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.625 = fieldNorm(doc=986)
        0.33333334 = coord(1/3)
    
  3. Jeong, K.S.; Myaeng, S.-H.; Lee, J.S.; Choi, K.: Automatic identification and back-transliteration of foreign words for information retrieval (1999) 0.78
    0.779154 = sum of:
      0.779154 = product of:
        2.337462 = sum of:
          2.337462 = weight(author_txt:myaeng in 1504) [ClassicSimilarity], result of:
            2.337462 = score(doc=1504,freq=1.0), product of:
              0.75683635 = queryWeight, product of:
                1.6040282 = boost
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.04774165 = queryNorm
              3.0884643 = fieldWeight in 1504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.883085 = idf(docFreq=5, maxDocs=43254)
                0.3125 = fieldNorm(doc=1504)
        0.33333334 = coord(1/3)
    
  4. Kang, I.-H.; Kim, G.C.: Integration of multiple evidences based on a query type for web search (2004) 0.74
    0.7387537 = sum of:
      0.7387537 = product of:
        2.216261 = sum of:
          2.216261 = weight(author_txt:kang in 4569) [ClassicSimilarity], result of:
            2.216261 = score(doc=4569,freq=1.0), product of:
              0.5836702 = queryWeight, product of:
                1.408623 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.04774165 = queryNorm
              3.7971117 = fieldWeight in 4569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.4375 = fieldNorm(doc=4569)
        0.33333334 = coord(1/3)
    
  5. Kang, M.; Kim, Y.-G.: ¬A multilevel view on interpersonal knowledge transfer (2010) 0.74
    0.7387537 = sum of:
      0.7387537 = product of:
        2.216261 = sum of:
          2.216261 = weight(author_txt:kang in 417) [ClassicSimilarity], result of:
            2.216261 = score(doc=417,freq=1.0), product of:
              0.5836702 = queryWeight, product of:
                1.408623 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.04774165 = queryNorm
              3.7971117 = fieldWeight in 417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.4375 = fieldNorm(doc=417)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.35
    0.35493836 = sum of:
      0.35493836 = product of:
        0.88734585 = sum of:
          0.07267324 = weight(abstract_txt:cross in 3053) [ClassicSimilarity], result of:
            0.07267324 = score(doc=3053,freq=2.0), product of:
              0.117128864 = queryWeight, product of:
                1.4063977 = boost
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.014830309 = queryNorm
              0.62045544 = fieldWeight in 3053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.010369292 = weight(abstract_txt:information in 3053) [ClassicSimilarity], result of:
            0.010369292 = score(doc=3053,freq=1.0), product of:
              0.054689463 = queryWeight, product of:
                1.5194906 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.014830309 = queryNorm
              0.18960312 = fieldWeight in 3053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.09655404 = weight(abstract_txt:translation in 3053) [ClassicSimilarity], result of:
            0.09655404 = score(doc=3053,freq=2.0), product of:
              0.14155586 = queryWeight, product of:
                1.5461091 = boost
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.014830309 = queryNorm
              0.6820914 = fieldWeight in 3053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.15370989 = weight(abstract_txt:lingual in 3053) [ClassicSimilarity], result of:
            0.15370989 = score(doc=3053,freq=1.0), product of:
              0.24315998 = queryWeight, product of:
                2.0263863 = boost
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.014830309 = queryNorm
              0.6321348 = fieldWeight in 3053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.15872224 = weight(abstract_txt:clir in 3053) [ClassicSimilarity], result of:
            0.15872224 = score(doc=3053,freq=1.0), product of:
              0.24841782 = queryWeight, product of:
                2.0481772 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.014830309 = queryNorm
              0.6389326 = fieldWeight in 3053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.08551006 = weight(abstract_txt:language in 3053) [ClassicSimilarity], result of:
            0.08551006 = score(doc=3053,freq=4.0), product of:
              0.13054463 = queryWeight, product of:
                2.099765 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.014830309 = queryNorm
              0.6550255 = fieldWeight in 3053, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.07632155 = weight(abstract_txt:english in 3053) [ClassicSimilarity], result of:
            0.07632155 = score(doc=3053,freq=1.0), product of:
              0.17453644 = queryWeight, product of:
                2.1026397 = boost
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.014830309 = queryNorm
              0.43728146 = fieldWeight in 3053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.11484868 = weight(abstract_txt:languages in 3053) [ClassicSimilarity], result of:
            0.11484868 = score(doc=3053,freq=2.0), product of:
              0.20022036 = queryWeight, product of:
                2.6004307 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.014830309 = queryNorm
              0.5736114 = fieldWeight in 3053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.036367614 = weight(abstract_txt:retrieval in 3053) [ClassicSimilarity], result of:
            0.036367614 = score(doc=3053,freq=1.0), product of:
              0.1341553 = queryWeight, product of:
                2.6069982 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.014830309 = queryNorm
              0.27108592 = fieldWeight in 3053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
          0.08226923 = weight(abstract_txt:terms in 3053) [ClassicSimilarity], result of:
            0.08226923 = score(doc=3053,freq=2.0), product of:
              0.18349023 = queryWeight, product of:
                3.0489006 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.014830309 = queryNorm
              0.44835752 = fieldWeight in 3053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.078125 = fieldNorm(doc=3053)
        0.4 = coord(10/25)
    
  2. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.32
    0.31664902 = sum of:
      0.31664902 = product of:
        0.7916225 = sum of:
          0.020261975 = weight(abstract_txt:performance in 3684) [ClassicSimilarity], result of:
            0.020261975 = score(doc=3684,freq=1.0), product of:
              0.079887725 = queryWeight, product of:
                1.161492 = boost
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.014830309 = queryNorm
              0.25363064 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.071942836 = weight(abstract_txt:cross in 3684) [ClassicSimilarity], result of:
            0.071942836 = score(doc=3684,freq=4.0), product of:
              0.117128864 = queryWeight, product of:
                1.4063977 = boost
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.014830309 = queryNorm
              0.61421955 = fieldWeight in 3684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.0125720985 = weight(abstract_txt:information in 3684) [ClassicSimilarity], result of:
            0.0125720985 = score(doc=3684,freq=3.0), product of:
              0.054689463 = queryWeight, product of:
                1.5194906 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.014830309 = queryNorm
              0.22988155 = fieldWeight in 3684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.08277784 = weight(abstract_txt:translation in 3684) [ClassicSimilarity], result of:
            0.08277784 = score(doc=3684,freq=3.0), product of:
              0.14155586 = queryWeight, product of:
                1.5461091 = boost
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.014830309 = queryNorm
              0.5847716 = fieldWeight in 3684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.15216503 = weight(abstract_txt:lingual in 3684) [ClassicSimilarity], result of:
            0.15216503 = score(doc=3684,freq=2.0), product of:
              0.24315998 = queryWeight, product of:
                2.0263863 = boost
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.014830309 = queryNorm
              0.62578154 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.05985704 = weight(abstract_txt:language in 3684) [ClassicSimilarity], result of:
            0.05985704 = score(doc=3684,freq=4.0), product of:
              0.13054463 = queryWeight, product of:
                2.099765 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.014830309 = queryNorm
              0.45851782 = fieldWeight in 3684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.1308642 = weight(abstract_txt:english in 3684) [ClassicSimilarity], result of:
            0.1308642 = score(doc=3684,freq=6.0), product of:
              0.17453644 = queryWeight, product of:
                2.1026397 = boost
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.014830309 = queryNorm
              0.74978155 = fieldWeight in 3684, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.080394074 = weight(abstract_txt:languages in 3684) [ClassicSimilarity], result of:
            0.080394074 = score(doc=3684,freq=2.0), product of:
              0.20022036 = queryWeight, product of:
                2.6004307 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.014830309 = queryNorm
              0.40152797 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.0360021 = weight(abstract_txt:retrieval in 3684) [ClassicSimilarity], result of:
            0.0360021 = score(doc=3684,freq=2.0), product of:
              0.1341553 = queryWeight, product of:
                2.6069982 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.014830309 = queryNorm
              0.26836136 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
          0.1447853 = weight(abstract_txt:asian in 3684) [ClassicSimilarity], result of:
            0.1447853 = score(doc=3684,freq=1.0), product of:
              0.3392649 = queryWeight, product of:
                2.9315093 = boost
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.014830309 = queryNorm
              0.4267618 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.0546875 = fieldNorm(doc=3684)
        0.4 = coord(10/25)
    
  3. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.30
    0.30206296 = sum of:
      0.30206296 = product of:
        0.83906376 = sum of:
          0.064724505 = weight(abstract_txt:performance in 3021) [ClassicSimilarity], result of:
            0.064724505 = score(doc=3021,freq=5.0), product of:
              0.079887725 = queryWeight, product of:
                1.161492 = boost
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.014830309 = queryNorm
              0.81019336 = fieldWeight in 3021, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.051387746 = weight(abstract_txt:cross in 3021) [ClassicSimilarity], result of:
            0.051387746 = score(doc=3021,freq=1.0), product of:
              0.117128864 = queryWeight, product of:
                1.4063977 = boost
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.014830309 = queryNorm
              0.43872827 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.010369292 = weight(abstract_txt:information in 3021) [ClassicSimilarity], result of:
            0.010369292 = score(doc=3021,freq=1.0), product of:
              0.054689463 = queryWeight, product of:
                1.5194906 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.014830309 = queryNorm
              0.18960312 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.06827402 = weight(abstract_txt:translation in 3021) [ClassicSimilarity], result of:
            0.06827402 = score(doc=3021,freq=1.0), product of:
              0.14155586 = queryWeight, product of:
                1.5461091 = boost
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.014830309 = queryNorm
              0.4823115 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.173587 = idf(docFreq=244, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.15370989 = weight(abstract_txt:lingual in 3021) [ClassicSimilarity], result of:
            0.15370989 = score(doc=3021,freq=1.0), product of:
              0.24315998 = queryWeight, product of:
                2.0263863 = boost
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.014830309 = queryNorm
              0.6321348 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.31744447 = weight(abstract_txt:clir in 3021) [ClassicSimilarity], result of:
            0.31744447 = score(doc=3021,freq=4.0), product of:
              0.24841782 = queryWeight, product of:
                2.0481772 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.014830309 = queryNorm
              1.2778652 = fieldWeight in 3021, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.06046474 = weight(abstract_txt:language in 3021) [ClassicSimilarity], result of:
            0.06046474 = score(doc=3021,freq=2.0), product of:
              0.13054463 = queryWeight, product of:
                2.099765 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.014830309 = queryNorm
              0.46317294 = fieldWeight in 3021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.07632155 = weight(abstract_txt:english in 3021) [ClassicSimilarity], result of:
            0.07632155 = score(doc=3021,freq=1.0), product of:
              0.17453644 = queryWeight, product of:
                2.1026397 = boost
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.014830309 = queryNorm
              0.43728146 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
          0.036367614 = weight(abstract_txt:retrieval in 3021) [ClassicSimilarity], result of:
            0.036367614 = score(doc=3021,freq=1.0), product of:
              0.1341553 = queryWeight, product of:
                2.6069982 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.014830309 = queryNorm
              0.27108592 = fieldWeight in 3021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=3021)
        0.36 = coord(9/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.29
    0.293777 = sum of:
      0.293777 = product of:
        0.66767496 = sum of:
          0.08359881 = weight(abstract_txt:nouns in 4373) [ClassicSimilarity], result of:
            0.08359881 = score(doc=4373,freq=2.0), product of:
              0.11843434 = queryWeight, product of:
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.014830309 = queryNorm
              0.7058663 = fieldWeight in 4373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9859657 = idf(docFreq=39, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.032748297 = weight(abstract_txt:performance in 4373) [ClassicSimilarity], result of:
            0.032748297 = score(doc=4373,freq=2.0), product of:
              0.079887725 = queryWeight, product of:
                1.161492 = boost
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.014830309 = queryNorm
              0.409929 = fieldWeight in 4373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6378174 = idf(docFreq=1137, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.024758628 = weight(abstract_txt:index in 4373) [ClassicSimilarity], result of:
            0.024758628 = score(doc=4373,freq=1.0), product of:
              0.08353118 = queryWeight, product of:
                1.1876829 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.014830309 = queryNorm
              0.29639983 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.041110195 = weight(abstract_txt:cross in 4373) [ClassicSimilarity], result of:
            0.041110195 = score(doc=4373,freq=1.0), product of:
              0.117128864 = queryWeight, product of:
                1.4063977 = boost
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.014830309 = queryNorm
              0.3509826 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.0082954345 = weight(abstract_txt:information in 4373) [ClassicSimilarity], result of:
            0.0082954345 = score(doc=4373,freq=1.0), product of:
              0.054689463 = queryWeight, product of:
                1.5194906 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.014830309 = queryNorm
              0.1516825 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.17957371 = weight(abstract_txt:clir in 4373) [ClassicSimilarity], result of:
            0.17957371 = score(doc=4373,freq=2.0), product of:
              0.24841782 = queryWeight, product of:
                2.0481772 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.014830309 = queryNorm
              0.7228697 = fieldWeight in 4373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.04837179 = weight(abstract_txt:language in 4373) [ClassicSimilarity], result of:
            0.04837179 = score(doc=4373,freq=2.0), product of:
              0.13054463 = queryWeight, product of:
                2.099765 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.014830309 = queryNorm
              0.37053835 = fieldWeight in 4373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.06105724 = weight(abstract_txt:english in 4373) [ClassicSimilarity], result of:
            0.06105724 = score(doc=4373,freq=1.0), product of:
              0.17453644 = queryWeight, product of:
                2.1026397 = boost
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.014830309 = queryNorm
              0.34982517 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.597203 = idf(docFreq=435, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.112528265 = weight(abstract_txt:languages in 4373) [ClassicSimilarity], result of:
            0.112528265 = score(doc=4373,freq=3.0), product of:
              0.20022036 = queryWeight, product of:
                2.6004307 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.014830309 = queryNorm
              0.5620221 = fieldWeight in 4373, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.029094093 = weight(abstract_txt:retrieval in 4373) [ClassicSimilarity], result of:
            0.029094093 = score(doc=4373,freq=1.0), product of:
              0.1341553 = queryWeight, product of:
                2.6069982 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.014830309 = queryNorm
              0.21686874 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
          0.046538506 = weight(abstract_txt:terms in 4373) [ClassicSimilarity], result of:
            0.046538506 = score(doc=4373,freq=1.0), product of:
              0.18349023 = queryWeight, product of:
                3.0489006 = boost
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.014830309 = queryNorm
              0.25362933 = fieldWeight in 4373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.058069 = idf(docFreq=2031, maxDocs=43254)
                0.0625 = fieldNorm(doc=4373)
        0.44 = coord(11/25)
    
  5. Pirkola, A.: Morphological typology of languages for IR (2001) 0.29
    0.285303 = sum of:
      0.285303 = product of:
        0.8915719 = sum of:
          0.043767482 = weight(abstract_txt:index in 477) [ClassicSimilarity], result of:
            0.043767482 = score(doc=477,freq=2.0), product of:
              0.08353118 = queryWeight, product of:
                1.1876829 = boost
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.014830309 = queryNorm
              0.52396584 = fieldWeight in 477, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7423973 = idf(docFreq=1024, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.07267324 = weight(abstract_txt:cross in 477) [ClassicSimilarity], result of:
            0.07267324 = score(doc=477,freq=2.0), product of:
              0.117128864 = queryWeight, product of:
                1.4063977 = boost
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.014830309 = queryNorm
              0.62045544 = fieldWeight in 477, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6157217 = idf(docFreq=427, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.15370989 = weight(abstract_txt:lingual in 477) [ClassicSimilarity], result of:
            0.15370989 = score(doc=477,freq=1.0), product of:
              0.24315998 = queryWeight, product of:
                2.0263863 = boost
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.014830309 = queryNorm
              0.6321348 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.091326 = idf(docFreq=35, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.15872224 = weight(abstract_txt:clir in 477) [ClassicSimilarity], result of:
            0.15872224 = score(doc=477,freq=1.0), product of:
              0.24841782 = queryWeight, product of:
                2.0481772 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.014830309 = queryNorm
              0.6389326 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.06046474 = weight(abstract_txt:language in 477) [ClassicSimilarity], result of:
            0.06046474 = score(doc=477,freq=2.0), product of:
              0.13054463 = queryWeight, product of:
                2.099765 = boost
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.014830309 = queryNorm
              0.46317294 = fieldWeight in 477, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.22520635 = weight(abstract_txt:mono in 477) [ClassicSimilarity], result of:
            0.22520635 = score(doc=477,freq=1.0), product of:
              0.31367362 = queryWeight, product of:
                2.3015223 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.014830309 = queryNorm
              0.71796393 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.14066035 = weight(abstract_txt:languages in 477) [ClassicSimilarity], result of:
            0.14066035 = score(doc=477,freq=3.0), product of:
              0.20022036 = queryWeight, product of:
                2.6004307 = boost
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.014830309 = queryNorm
              0.70252764 = fieldWeight in 477, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1917377 = idf(docFreq=653, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.036367614 = weight(abstract_txt:retrieval in 477) [ClassicSimilarity], result of:
            0.036367614 = score(doc=477,freq=1.0), product of:
              0.1341553 = queryWeight, product of:
                2.6069982 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.014830309 = queryNorm
              0.27108592 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
        0.32 = coord(8/25)