Document (#36216)

Author
Li, Q.
Chen, Y.P.
Myaeng, S.-H.
Jin, Y.
Kang, B.-Y.
Title
Concept unification of terms in different languages via web mining for Information Retrieval
Source
Information processing and management. 45(2009) no.2, S.246-262
Year
2009
Abstract
For historical and cultural reasons, English phrases, especially proper nouns and new words, frequently appear in Web pages written primarily in East Asian languages such as Chinese, Korean, and Japanese. Although such English terms and their equivalences in these East Asian languages refer to the same concept, they are often erroneously treated as independent index units in traditional Information Retrieval (IR). This paper describes the degree to which the problem arises in IR and proposes a novel technique to solve it. Our method first extracts English terms from native Web documents in an East Asian language, and then unifies the extracted terms and their equivalences in the native language as one index unit. For Cross-Language Information Retrieval (CLIR), one of the major hindrances to achieving retrieval performance at the level of Mono-Lingual Information Retrieval (MLIR) is the translation of terms in search queries which can not be found in a bilingual dictionary. The Web mining approach proposed in this paper for concept unification of terms in different languages can also be applied to solve this well-known challenge in CLIR. Experimental results based on NTCIR and KT-Set test collections show that the high translation precision of our approach greatly improves performance of both Mono-Lingual and Cross-Language Information Retrieval.
Theme
Computerlinguistik
Multilinguale Probleme

Similar documents (author)

  1. Khoo, C.; Myaeng, S.H.: Identifying semantic relations in text for information retrieval and information extraction (2002) 1.26
    1.2635815 = sum of:
      1.2635815 = product of:
        3.7907443 = sum of:
          3.7907443 = weight(author_txt:myaeng in 1197) [ClassicSimilarity], result of:
            3.7907443 = score(doc=1197,freq=1.0), product of:
              0.7654105 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04799214 = queryNorm
              4.952564 = fieldWeight in 1197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=1197)
        0.33333334 = coord(1/3)
    
  2. Kang, M.: Dual paths to continuous online knowledge sharing : a repetitive behavior perspective (2020) 1.02
    1.0199206 = sum of:
      1.0199206 = product of:
        3.0597618 = sum of:
          3.0597618 = weight(author_txt:kang in 5985) [ClassicSimilarity], result of:
            3.0597618 = score(doc=5985,freq=1.0), product of:
              0.571825 = queryWeight, product of:
                1.3917096 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04799214 = queryNorm
              5.3508706 = fieldWeight in 5985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.625 = fieldNorm(doc=5985)
        0.33333334 = coord(1/3)
    
  3. Kang, M.: Motivational affordances and survival of new askers on social Q&A sites : the case of Stack Exchange network (2022) 1.02
    1.0199206 = sum of:
      1.0199206 = product of:
        3.0597618 = sum of:
          3.0597618 = weight(author_txt:kang in 447) [ClassicSimilarity], result of:
            3.0597618 = score(doc=447,freq=1.0), product of:
              0.571825 = queryWeight, product of:
                1.3917096 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04799214 = queryNorm
              5.3508706 = fieldWeight in 447, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.625 = fieldNorm(doc=447)
        0.33333334 = coord(1/3)
    
  4. Jeong, K.S.; Myaeng, S.-H.; Lee, J.S.; Choi, K.: Automatic identification and back-transliteration of foreign words for information retrieval (1999) 0.79
    0.7897384 = sum of:
      0.7897384 = product of:
        2.3692153 = sum of:
          2.3692153 = weight(author_txt:myaeng in 503) [ClassicSimilarity], result of:
            2.3692153 = score(doc=503,freq=1.0), product of:
              0.7654105 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04799214 = queryNorm
              3.0953524 = fieldWeight in 503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.3125 = fieldNorm(doc=503)
        0.33333334 = coord(1/3)
    
  5. Kang, I.-H.; Kim, G.C.: Integration of multiple evidences based on a query type for web search (2004) 0.71
    0.7139444 = sum of:
      0.7139444 = product of:
        2.141833 = sum of:
          2.141833 = weight(author_txt:kang in 2568) [ClassicSimilarity], result of:
            2.141833 = score(doc=2568,freq=1.0), product of:
              0.571825 = queryWeight, product of:
                1.3917096 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04799214 = queryNorm
              3.7456093 = fieldWeight in 2568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.4375 = fieldNorm(doc=2568)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.35
    0.3548971 = sum of:
      0.3548971 = product of:
        0.88724273 = sum of:
          0.07213718 = weight(abstract_txt:cross in 1052) [ClassicSimilarity], result of:
            0.07213718 = score(doc=1052,freq=2.0), product of:
              0.11656917 = queryWeight, product of:
                1.3988655 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.014877752 = queryNorm
              0.6188358 = fieldWeight in 1052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.010297418 = weight(abstract_txt:information in 1052) [ClassicSimilarity], result of:
            0.010297418 = score(doc=1052,freq=1.0), product of:
              0.05444439 = queryWeight, product of:
                1.5115783 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.014877752 = queryNorm
              0.18913643 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.09668269 = weight(abstract_txt:translation in 1052) [ClassicSimilarity], result of:
            0.09668269 = score(doc=1052,freq=2.0), product of:
              0.14170225 = queryWeight, product of:
                1.5423127 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.014877752 = queryNorm
              0.68229467 = fieldWeight in 1052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.15503737 = weight(abstract_txt:lingual in 1052) [ClassicSimilarity], result of:
            0.15503737 = score(doc=1052,freq=1.0), product of:
              0.24459367 = queryWeight, product of:
                2.0263138 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.014877752 = queryNorm
              0.6338569 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.1600791 = weight(abstract_txt:clir in 1052) [ClassicSimilarity], result of:
            0.1600791 = score(doc=1052,freq=1.0), product of:
              0.24986802 = queryWeight, product of:
                2.0480447 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.014877752 = queryNorm
              0.6406546 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.07542531 = weight(abstract_txt:english in 1052) [ClassicSimilarity], result of:
            0.07542531 = score(doc=1052,freq=1.0), product of:
              0.17319264 = queryWeight, product of:
                2.0883074 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.014877752 = queryNorm
              0.43549955 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.08493216 = weight(abstract_txt:language in 1052) [ClassicSimilarity], result of:
            0.08493216 = score(doc=1052,freq=4.0), product of:
              0.12997477 = queryWeight, product of:
                2.0889528 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.014877752 = queryNorm
              0.65345114 = fieldWeight in 1052, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.11465879 = weight(abstract_txt:languages in 1052) [ClassicSimilarity], result of:
            0.11465879 = score(doc=1052,freq=2.0), product of:
              0.20002879 = queryWeight, product of:
                2.591465 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.014877752 = queryNorm
              0.57321143 = fieldWeight in 1052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.0365486 = weight(abstract_txt:retrieval in 1052) [ClassicSimilarity], result of:
            0.0365486 = score(doc=1052,freq=1.0), product of:
              0.13461967 = queryWeight, product of:
                2.603748 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014877752 = queryNorm
              0.27149525 = fieldWeight in 1052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
          0.081444085 = weight(abstract_txt:terms in 1052) [ClassicSimilarity], result of:
            0.081444085 = score(doc=1052,freq=2.0), product of:
              0.18228784 = queryWeight, product of:
                3.029867 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.014877752 = queryNorm
              0.44678837 = fieldWeight in 1052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1052)
        0.4 = coord(10/25)
    
  2. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.32
    0.31578654 = sum of:
      0.31578654 = product of:
        0.7894663 = sum of:
          0.020174004 = weight(abstract_txt:performance in 1683) [ClassicSimilarity], result of:
            0.020174004 = score(doc=1683,freq=1.0), product of:
              0.07966795 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.014877752 = queryNorm
              0.2532261 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.07141217 = weight(abstract_txt:cross in 1683) [ClassicSimilarity], result of:
            0.07141217 = score(doc=1683,freq=4.0), product of:
              0.11656917 = queryWeight, product of:
                1.3988655 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.014877752 = queryNorm
              0.61261624 = fieldWeight in 1683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.012484956 = weight(abstract_txt:information in 1683) [ClassicSimilarity], result of:
            0.012484956 = score(doc=1683,freq=3.0), product of:
              0.05444439 = queryWeight, product of:
                1.5115783 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.014877752 = queryNorm
              0.22931573 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.08288814 = weight(abstract_txt:translation in 1683) [ClassicSimilarity], result of:
            0.08288814 = score(doc=1683,freq=3.0), product of:
              0.14170225 = queryWeight, product of:
                1.5423127 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.014877752 = queryNorm
              0.58494586 = fieldWeight in 1683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.15347917 = weight(abstract_txt:lingual in 1683) [ClassicSimilarity], result of:
            0.15347917 = score(doc=1683,freq=2.0), product of:
              0.24459367 = queryWeight, product of:
                2.0263138 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.014877752 = queryNorm
              0.6274863 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.12932748 = weight(abstract_txt:english in 1683) [ClassicSimilarity], result of:
            0.12932748 = score(doc=1683,freq=6.0), product of:
              0.17319264 = queryWeight, product of:
                2.0883074 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.014877752 = queryNorm
              0.7467262 = fieldWeight in 1683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.059452515 = weight(abstract_txt:language in 1683) [ClassicSimilarity], result of:
            0.059452515 = score(doc=1683,freq=4.0), product of:
              0.12997477 = queryWeight, product of:
                2.0889528 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.014877752 = queryNorm
              0.45741582 = fieldWeight in 1683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.08026116 = weight(abstract_txt:languages in 1683) [ClassicSimilarity], result of:
            0.08026116 = score(doc=1683,freq=2.0), product of:
              0.20002879 = queryWeight, product of:
                2.591465 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.014877752 = queryNorm
              0.40124804 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.036181267 = weight(abstract_txt:retrieval in 1683) [ClassicSimilarity], result of:
            0.036181267 = score(doc=1683,freq=2.0), product of:
              0.13461967 = queryWeight, product of:
                2.603748 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014877752 = queryNorm
              0.26876658 = fieldWeight in 1683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
          0.14380543 = weight(abstract_txt:asian in 1683) [ClassicSimilarity], result of:
            0.14380543 = score(doc=1683,freq=1.0), product of:
              0.33778176 = queryWeight, product of:
                2.9164047 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.014877752 = queryNorm
              0.42573476 = fieldWeight in 1683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1683)
        0.4 = coord(10/25)
    
  3. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.30
    0.3028825 = sum of:
      0.3028825 = product of:
        0.84134024 = sum of:
          0.06444349 = weight(abstract_txt:performance in 1020) [ClassicSimilarity], result of:
            0.06444349 = score(doc=1020,freq=5.0), product of:
              0.07966795 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.014877752 = queryNorm
              0.80890113 = fieldWeight in 1020, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.05100869 = weight(abstract_txt:cross in 1020) [ClassicSimilarity], result of:
            0.05100869 = score(doc=1020,freq=1.0), product of:
              0.11656917 = queryWeight, product of:
                1.3988655 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.014877752 = queryNorm
              0.43758303 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.010297418 = weight(abstract_txt:information in 1020) [ClassicSimilarity], result of:
            0.010297418 = score(doc=1020,freq=1.0), product of:
              0.05444439 = queryWeight, product of:
                1.5115783 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.014877752 = queryNorm
              0.18913643 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.068364985 = weight(abstract_txt:translation in 1020) [ClassicSimilarity], result of:
            0.068364985 = score(doc=1020,freq=1.0), product of:
              0.14170225 = queryWeight, product of:
                1.5423127 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.014877752 = queryNorm
              0.4824552 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.15503737 = weight(abstract_txt:lingual in 1020) [ClassicSimilarity], result of:
            0.15503737 = score(doc=1020,freq=1.0), product of:
              0.24459367 = queryWeight, product of:
                2.0263138 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.014877752 = queryNorm
              0.6338569 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.3201582 = weight(abstract_txt:clir in 1020) [ClassicSimilarity], result of:
            0.3201582 = score(doc=1020,freq=4.0), product of:
              0.24986802 = queryWeight, product of:
                2.0480447 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.014877752 = queryNorm
              1.2813092 = fieldWeight in 1020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.07542531 = weight(abstract_txt:english in 1020) [ClassicSimilarity], result of:
            0.07542531 = score(doc=1020,freq=1.0), product of:
              0.17319264 = queryWeight, product of:
                2.0883074 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.014877752 = queryNorm
              0.43549955 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.060056105 = weight(abstract_txt:language in 1020) [ClassicSimilarity], result of:
            0.060056105 = score(doc=1020,freq=2.0), product of:
              0.12997477 = queryWeight, product of:
                2.0889528 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.014877752 = queryNorm
              0.46205974 = fieldWeight in 1020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.0365486 = weight(abstract_txt:retrieval in 1020) [ClassicSimilarity], result of:
            0.0365486 = score(doc=1020,freq=1.0), product of:
              0.13461967 = queryWeight, product of:
                2.603748 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014877752 = queryNorm
              0.27149525 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
        0.36 = coord(9/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.29
    0.29391995 = sum of:
      0.29391995 = product of:
        0.6679999 = sum of:
          0.08432985 = weight(abstract_txt:nouns in 2372) [ClassicSimilarity], result of:
            0.08432985 = score(doc=2372,freq=2.0), product of:
              0.11914116 = queryWeight, product of:
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.014877752 = queryNorm
              0.7078146 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.032606114 = weight(abstract_txt:performance in 2372) [ClassicSimilarity], result of:
            0.032606114 = score(doc=2372,freq=2.0), product of:
              0.07966795 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.014877752 = queryNorm
              0.40927517 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.02487228 = weight(abstract_txt:index in 2372) [ClassicSimilarity], result of:
            0.02487228 = score(doc=2372,freq=1.0), product of:
              0.08379884 = queryWeight, product of:
                1.1860503 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.014877752 = queryNorm
              0.29680938 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.040806953 = weight(abstract_txt:cross in 2372) [ClassicSimilarity], result of:
            0.040806953 = score(doc=2372,freq=1.0), product of:
              0.11656917 = queryWeight, product of:
                1.3988655 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.014877752 = queryNorm
              0.35006642 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.008237935 = weight(abstract_txt:information in 2372) [ClassicSimilarity], result of:
            0.008237935 = score(doc=2372,freq=1.0), product of:
              0.05444439 = queryWeight, product of:
                1.5115783 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.014877752 = queryNorm
              0.15130915 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.18110883 = weight(abstract_txt:clir in 2372) [ClassicSimilarity], result of:
            0.18110883 = score(doc=2372,freq=2.0), product of:
              0.24986802 = queryWeight, product of:
                2.0480447 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.014877752 = queryNorm
              0.724818 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.06034025 = weight(abstract_txt:english in 2372) [ClassicSimilarity], result of:
            0.06034025 = score(doc=2372,freq=1.0), product of:
              0.17319264 = queryWeight, product of:
                2.0883074 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.014877752 = queryNorm
              0.34839964 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.048044886 = weight(abstract_txt:language in 2372) [ClassicSimilarity], result of:
            0.048044886 = score(doc=2372,freq=2.0), product of:
              0.12997477 = queryWeight, product of:
                2.0889528 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.014877752 = queryNorm
              0.3696478 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.11234222 = weight(abstract_txt:languages in 2372) [ClassicSimilarity], result of:
            0.11234222 = score(doc=2372,freq=3.0), product of:
              0.20002879 = queryWeight, product of:
                2.591465 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.014877752 = queryNorm
              0.56163025 = fieldWeight in 2372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.029238882 = weight(abstract_txt:retrieval in 2372) [ClassicSimilarity], result of:
            0.029238882 = score(doc=2372,freq=1.0), product of:
              0.13461967 = queryWeight, product of:
                2.603748 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014877752 = queryNorm
              0.21719621 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.046071734 = weight(abstract_txt:terms in 2372) [ClassicSimilarity], result of:
            0.046071734 = score(doc=2372,freq=1.0), product of:
              0.18228784 = queryWeight, product of:
                3.029867 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.014877752 = queryNorm
              0.25274166 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
        0.44 = coord(11/25)
    
  5. Pirkola, A.: Morphological typology of languages for IR (2001) 0.29
    0.28645906 = sum of:
      0.28645906 = product of:
        0.8951846 = sum of:
          0.043968398 = weight(abstract_txt:index in 4476) [ClassicSimilarity], result of:
            0.043968398 = score(doc=4476,freq=2.0), product of:
              0.08379884 = queryWeight, product of:
                1.1860503 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.014877752 = queryNorm
              0.5246898 = fieldWeight in 4476, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.07213718 = weight(abstract_txt:cross in 4476) [ClassicSimilarity], result of:
            0.07213718 = score(doc=4476,freq=2.0), product of:
              0.11656917 = queryWeight, product of:
                1.3988655 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.014877752 = queryNorm
              0.6188358 = fieldWeight in 4476, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.15503737 = weight(abstract_txt:lingual in 4476) [ClassicSimilarity], result of:
            0.15503737 = score(doc=4476,freq=1.0), product of:
              0.24459367 = queryWeight, product of:
                2.0263138 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.014877752 = queryNorm
              0.6338569 = fieldWeight in 4476, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.1600791 = weight(abstract_txt:clir in 4476) [ClassicSimilarity], result of:
            0.1600791 = score(doc=4476,freq=1.0), product of:
              0.24986802 = queryWeight, product of:
                2.0480447 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.014877752 = queryNorm
              0.6406546 = fieldWeight in 4476, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.060056105 = weight(abstract_txt:language in 4476) [ClassicSimilarity], result of:
            0.060056105 = score(doc=4476,freq=2.0), product of:
              0.12997477 = queryWeight, product of:
                2.0889528 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.014877752 = queryNorm
              0.46205974 = fieldWeight in 4476, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.22693004 = weight(abstract_txt:mono in 4476) [ClassicSimilarity], result of:
            0.22693004 = score(doc=4476,freq=1.0), product of:
              0.3153181 = queryWeight, product of:
                2.3006923 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.014877752 = queryNorm
              0.71968603 = fieldWeight in 4476, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.14042777 = weight(abstract_txt:languages in 4476) [ClassicSimilarity], result of:
            0.14042777 = score(doc=4476,freq=3.0), product of:
              0.20002879 = queryWeight, product of:
                2.591465 = boost
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.014877752 = queryNorm
              0.7020378 = fieldWeight in 4476, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.188118 = idf(docFreq=670, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
          0.0365486 = weight(abstract_txt:retrieval in 4476) [ClassicSimilarity], result of:
            0.0365486 = score(doc=4476,freq=1.0), product of:
              0.13461967 = queryWeight, product of:
                2.603748 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014877752 = queryNorm
              0.27149525 = fieldWeight in 4476, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4476)
        0.32 = coord(8/25)