Document (#33020)

Author
Gey, F.C.
Kando, N.
Peters, C.
Title
Cross-Language Information Retrieval : the way ahead
Source
Information processing and management. 41(2005) no.3, S.415-432
Year
2005
Abstract
This introductory paper covers not only the research content of the articles in this special issue of IP&M but attempts to characterize the state-of-the-art in the Cross-Language Information Retrieval (CLIR) domain. We present our view of some major directions for CLIR research in the future. In particular, we find that insufficient attention has been given to the Web as a resource for multilingual research, and to languages which are spoken by hundreds of millions of people in the world but have been mainly neglected by the CLIR research community. In addition, we find that most CLIR evaluation has focussed narrowly on the news genre to the exclusion of other important genres such as scientific and technical literature. The paper concludes by describing an ambitious 5-year research plan proposed by James Mayfield and Paul McNamee.
Theme
Multilinguale Probleme

Similar documents (author)

  1. Kando, N.: Information concepts reexamined (1994) 2.60
    2.5968575 = sum of:
      2.5968575 = product of:
        5.193715 = sum of:
          5.193715 = weight(author_txt:kando in 2195) [ClassicSimilarity], result of:
            5.193715 = score(doc=2195,freq=1.0), product of:
              0.8524839 = queryWeight, product of:
                1.277011 = boost
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.0684825 = queryNorm
              6.092449 = fieldWeight in 2195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.625 = fieldNorm(doc=2195)
        0.5 = coord(1/2)
    
  2. Hu, X.; Kando, N.: Task complexity and difficulty in music information retrieval (2017) 2.08
    2.077486 = sum of:
      2.077486 = product of:
        4.154972 = sum of:
          4.154972 = weight(author_txt:kando in 4691) [ClassicSimilarity], result of:
            4.154972 = score(doc=4691,freq=1.0), product of:
              0.8524839 = queryWeight, product of:
                1.277011 = boost
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.0684825 = queryNorm
              4.8739595 = fieldWeight in 4691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.5 = fieldNorm(doc=4691)
        0.5 = coord(1/2)
    
  3. Fujii, A.; Iwayama, M.; Kando, N.: Introduction to the special issue on patent processing (2007) 1.56
    1.5581145 = sum of:
      1.5581145 = product of:
        3.116229 = sum of:
          3.116229 = weight(author_txt:kando in 1930) [ClassicSimilarity], result of:
            3.116229 = score(doc=1930,freq=1.0), product of:
              0.8524839 = queryWeight, product of:
                1.277011 = boost
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.0684825 = queryNorm
              3.6554697 = fieldWeight in 1930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.375 = fieldNorm(doc=1930)
        0.5 = coord(1/2)
    
  4. Kuriyama, K.; Kando, N.; Nozue, T.; Eguchi, K.: Pooling for a large-scale test collection : an analysis of the search results from the First NTCIR Workshop (2002) 1.30
    1.2984288 = sum of:
      1.2984288 = product of:
        2.5968575 = sum of:
          2.5968575 = weight(author_txt:kando in 4831) [ClassicSimilarity], result of:
            2.5968575 = score(doc=4831,freq=1.0), product of:
              0.8524839 = queryWeight, product of:
                1.277011 = boost
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.0684825 = queryNorm
              3.0462246 = fieldWeight in 4831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.3125 = fieldNorm(doc=4831)
        0.5 = coord(1/2)
    
  5. Rodrigo, A.; Peñas, A.; Miyao, Y.; Kando, N.: Do systems pass university entrance exams? (2018) 1.30
    1.2984288 = sum of:
      1.2984288 = product of:
        2.5968575 = sum of:
          2.5968575 = weight(author_txt:kando in 55) [ClassicSimilarity], result of:
            2.5968575 = score(doc=55,freq=1.0), product of:
              0.8524839 = queryWeight, product of:
                1.277011 = boost
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.0684825 = queryNorm
              3.0462246 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.747919 = idf(docFreq=6, maxDocs=44083)
                0.3125 = fieldNorm(doc=55)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Kishida, K.: Technical issues of cross-language information retrieval : a review (2005) 0.28
    0.2780772 = sum of:
      0.2780772 = product of:
        1.1586549 = sum of:
          0.023750277 = weight(abstract_txt:paper in 2020) [ClassicSimilarity], result of:
            0.023750277 = score(doc=2020,freq=1.0), product of:
              0.06252103 = queryWeight, product of:
                1.0648394 = boost
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.016905094 = queryNorm
              0.37987658 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
          0.023783432 = weight(abstract_txt:retrieval in 2020) [ClassicSimilarity], result of:
            0.023783432 = score(doc=2020,freq=1.0), product of:
              0.06257921 = queryWeight, product of:
                1.0653347 = boost
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.016905094 = queryNorm
              0.38005328 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
          0.08300668 = weight(abstract_txt:language in 2020) [ClassicSimilarity], result of:
            0.08300668 = score(doc=2020,freq=4.0), product of:
              0.09070594 = queryWeight, product of:
                1.282593 = boost
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.016905094 = queryNorm
              0.91511846 = fieldWeight in 2020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
          0.09956801 = weight(abstract_txt:cross in 2020) [ClassicSimilarity], result of:
            0.09956801 = score(doc=2020,freq=1.0), product of:
              0.16255246 = queryWeight, product of:
                1.7169901 = boost
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.016905094 = queryNorm
              0.61252844 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
          0.04535237 = weight(abstract_txt:research in 2020) [ClassicSimilarity], result of:
            0.04535237 = score(doc=2020,freq=1.0), product of:
              0.13060516 = queryWeight, product of:
                2.433443 = boost
                3.1748378 = idf(docFreq=5008, maxDocs=44083)
                0.016905094 = queryNorm
              0.3472479 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1748378 = idf(docFreq=5008, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
          0.8831942 = weight(abstract_txt:clir in 2020) [ClassicSimilarity], result of:
            0.8831942 = score(doc=2020,freq=2.0), product of:
              0.69654816 = queryWeight, product of:
                5.0264525 = boost
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.016905094 = queryNorm
              1.2679585 = fieldWeight in 2020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.109375 = fieldNorm(doc=2020)
        0.24 = coord(6/25)
    
  2. Oard, D.W.: Multilingual information access (2009) 0.26
    0.26202992 = sum of:
      0.26202992 = product of:
        1.0917914 = sum of:
          0.023783432 = weight(abstract_txt:retrieval in 4851) [ClassicSimilarity], result of:
            0.023783432 = score(doc=4851,freq=1.0), product of:
              0.06257921 = queryWeight, product of:
                1.0653347 = boost
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.016905094 = queryNorm
              0.38005328 = fieldWeight in 4851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
          0.0718859 = weight(abstract_txt:language in 4851) [ClassicSimilarity], result of:
            0.0718859 = score(doc=4851,freq=3.0), product of:
              0.09070594 = queryWeight, product of:
                1.282593 = boost
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.016905094 = queryNorm
              0.7925159 = fieldWeight in 4851, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
          0.20565183 = weight(abstract_txt:narrowly in 4851) [ClassicSimilarity], result of:
            0.20565183 = score(doc=4851,freq=1.0), product of:
              0.20924675 = queryWeight, product of:
                1.3774803 = boost
                8.98578 = idf(docFreq=14, maxDocs=44083)
                0.016905094 = queryNorm
              0.9828197 = fieldWeight in 4851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.98578 = idf(docFreq=14, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
          0.06638963 = weight(abstract_txt:find in 4851) [ClassicSimilarity], result of:
            0.06638963 = score(doc=4851,freq=1.0), product of:
              0.1240644 = queryWeight, product of:
                1.5000116 = boost
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.016905094 = queryNorm
              0.5351223 = fieldWeight in 4851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
          0.09956801 = weight(abstract_txt:cross in 4851) [ClassicSimilarity], result of:
            0.09956801 = score(doc=4851,freq=1.0), product of:
              0.16255246 = queryWeight, product of:
                1.7169901 = boost
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.016905094 = queryNorm
              0.61252844 = fieldWeight in 4851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
          0.6245126 = weight(abstract_txt:clir in 4851) [ClassicSimilarity], result of:
            0.6245126 = score(doc=4851,freq=1.0), product of:
              0.69654816 = queryWeight, product of:
                5.0264525 = boost
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.016905094 = queryNorm
              0.89658207 = fieldWeight in 4851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.109375 = fieldNorm(doc=4851)
        0.24 = coord(6/25)
    
  3. Oard, D.W.; He, D.; Wang, J.: User-assisted query translation for interactive cross-language information retrieval (2008) 0.25
    0.25497106 = sum of:
      0.25497106 = product of:
        0.9106109 = sum of:
          0.0239914 = weight(abstract_txt:paper in 3031) [ClassicSimilarity], result of:
            0.0239914 = score(doc=3031,freq=2.0), product of:
              0.06252103 = queryWeight, product of:
                1.0648394 = boost
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.016905094 = queryNorm
              0.38373327 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.024024894 = weight(abstract_txt:retrieval in 3031) [ClassicSimilarity], result of:
            0.024024894 = score(doc=3031,freq=2.0), product of:
              0.06257921 = queryWeight, product of:
                1.0653347 = boost
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.016905094 = queryNorm
              0.3839118 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.051347066 = weight(abstract_txt:language in 3031) [ClassicSimilarity], result of:
            0.051347066 = score(doc=3031,freq=3.0), product of:
              0.09070594 = queryWeight, product of:
                1.282593 = boost
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.016905094 = queryNorm
              0.5660827 = fieldWeight in 3031, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.047421165 = weight(abstract_txt:find in 3031) [ClassicSimilarity], result of:
            0.047421165 = score(doc=3031,freq=1.0), product of:
              0.1240644 = queryWeight, product of:
                1.5000116 = boost
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.016905094 = queryNorm
              0.38223022 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.10057887 = weight(abstract_txt:cross in 3031) [ClassicSimilarity], result of:
            0.10057887 = score(doc=3031,freq=2.0), product of:
              0.16255246 = queryWeight, product of:
                1.7169901 = boost
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.016905094 = queryNorm
              0.6187471 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.032394547 = weight(abstract_txt:research in 3031) [ClassicSimilarity], result of:
            0.032394547 = score(doc=3031,freq=1.0), product of:
              0.13060516 = queryWeight, product of:
                2.433443 = boost
                3.1748378 = idf(docFreq=5008, maxDocs=44083)
                0.016905094 = queryNorm
              0.24803421 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1748378 = idf(docFreq=5008, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
          0.630853 = weight(abstract_txt:clir in 3031) [ClassicSimilarity], result of:
            0.630853 = score(doc=3031,freq=2.0), product of:
              0.69654816 = queryWeight, product of:
                5.0264525 = boost
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.016905094 = queryNorm
              0.9056847 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.078125 = fieldNorm(doc=3031)
        0.28 = coord(7/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.21
    0.20783165 = sum of:
      0.20783165 = product of:
        1.0391582 = sum of:
          0.016964484 = weight(abstract_txt:paper in 2021) [ClassicSimilarity], result of:
            0.016964484 = score(doc=2021,freq=1.0), product of:
              0.06252103 = queryWeight, product of:
                1.0648394 = boost
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.016905094 = queryNorm
              0.27134043 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4731574 = idf(docFreq=3716, maxDocs=44083)
                0.078125 = fieldNorm(doc=2021)
          0.016988168 = weight(abstract_txt:retrieval in 2021) [ClassicSimilarity], result of:
            0.016988168 = score(doc=2021,freq=1.0), product of:
              0.06257921 = queryWeight, product of:
                1.0653347 = boost
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.016905094 = queryNorm
              0.27146664 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.078125 = fieldNorm(doc=2021)
          0.041924704 = weight(abstract_txt:language in 2021) [ClassicSimilarity], result of:
            0.041924704 = score(doc=2021,freq=2.0), product of:
              0.09070594 = queryWeight, product of:
                1.282593 = boost
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.016905094 = queryNorm
              0.46220464 = fieldWeight in 2021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.078125 = fieldNorm(doc=2021)
          0.07112 = weight(abstract_txt:cross in 2021) [ClassicSimilarity], result of:
            0.07112 = score(doc=2021,freq=1.0), product of:
              0.16255246 = queryWeight, product of:
                1.7169901 = boost
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.016905094 = queryNorm
              0.4375203 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.078125 = fieldNorm(doc=2021)
          0.8921609 = weight(abstract_txt:clir in 2021) [ClassicSimilarity], result of:
            0.8921609 = score(doc=2021,freq=4.0), product of:
              0.69654816 = queryWeight, product of:
                5.0264525 = boost
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.016905094 = queryNorm
              1.2808316 = fieldWeight in 2021, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.078125 = fieldNorm(doc=2021)
        0.2 = coord(5/25)
    
  5. Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.20
    0.1986137 = sum of:
      0.1986137 = product of:
        0.8275571 = sum of:
          0.016988168 = weight(abstract_txt:retrieval in 2026) [ClassicSimilarity], result of:
            0.016988168 = score(doc=2026,freq=1.0), product of:
              0.06257921 = queryWeight, product of:
                1.0653347 = boost
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.016905094 = queryNorm
              0.27146664 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.474773 = idf(docFreq=3710, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
          0.019250073 = weight(abstract_txt:been in 2026) [ClassicSimilarity], result of:
            0.019250073 = score(doc=2026,freq=1.0), product of:
              0.06801749 = queryWeight, product of:
                1.1106604 = boost
                3.622611 = idf(docFreq=3200, maxDocs=44083)
                0.016905094 = queryNorm
              0.2830165 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.622611 = idf(docFreq=3200, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
          0.041924704 = weight(abstract_txt:language in 2026) [ClassicSimilarity], result of:
            0.041924704 = score(doc=2026,freq=2.0), product of:
              0.09070594 = queryWeight, product of:
                1.282593 = boost
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.016905094 = queryNorm
              0.46220464 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1833987 = idf(docFreq=1826, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
          0.047421165 = weight(abstract_txt:find in 2026) [ClassicSimilarity], result of:
            0.047421165 = score(doc=2026,freq=1.0), product of:
              0.1240644 = queryWeight, product of:
                1.5000116 = boost
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.016905094 = queryNorm
              0.38223022 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8925467 = idf(docFreq=898, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
          0.07112 = weight(abstract_txt:cross in 2026) [ClassicSimilarity], result of:
            0.07112 = score(doc=2026,freq=1.0), product of:
              0.16255246 = queryWeight, product of:
                1.7169901 = boost
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.016905094 = queryNorm
              0.4375203 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.60026 = idf(docFreq=442, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
          0.630853 = weight(abstract_txt:clir in 2026) [ClassicSimilarity], result of:
            0.630853 = score(doc=2026,freq=2.0), product of:
              0.69654816 = queryWeight, product of:
                5.0264525 = boost
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.016905094 = queryNorm
              0.9056847 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.197322 = idf(docFreq=32, maxDocs=44083)
                0.078125 = fieldNorm(doc=2026)
        0.24 = coord(6/25)