Document (#33019)

Author
Gey, F.C.
Kando, N.
Peters, C.
Title
Cross-Language Information Retrieval : the way ahead
Source
Information processing and management. 41(2005) no.3, S.415-432
Year
2005
Abstract
This introductory paper covers not only the research content of the articles in this special issue of IP&M but attempts to characterize the state-of-the-art in the Cross-Language Information Retrieval (CLIR) domain. We present our view of some major directions for CLIR research in the future. In particular, we find that insufficient attention has been given to the Web as a resource for multilingual research, and to languages which are spoken by hundreds of millions of people in the world but have been mainly neglected by the CLIR research community. In addition, we find that most CLIR evaluation has focussed narrowly on the news genre to the exclusion of other important genres such as scientific and technical literature. The paper concludes by describing an ambitious 5-year research plan proposed by James Mayfield and Paul McNamee.
Theme
Multilinguale Probleme

Similar documents (author)

  1. Kando, N.: Information concepts reexamined (1994) 2.60
    2.600723 = sum of:
      2.600723 = product of:
        5.201446 = sum of:
          5.201446 = weight(author_txt:kando in 2126) [ClassicSimilarity], result of:
            5.201446 = score(doc=2126,freq=1.0), product of:
              0.8534851 = queryWeight, product of:
                1.279765 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.06839393 = queryNorm
              6.094361 = fieldWeight in 2126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.625 = fieldNorm(doc=2126)
        0.5 = coord(1/2)
    
  2. Hu, X.; Kando, N.: Task complexity and difficulty in music information retrieval (2017) 2.08
    2.0805786 = sum of:
      2.0805786 = product of:
        4.161157 = sum of:
          4.161157 = weight(author_txt:kando in 3690) [ClassicSimilarity], result of:
            4.161157 = score(doc=3690,freq=1.0), product of:
              0.8534851 = queryWeight, product of:
                1.279765 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.06839393 = queryNorm
              4.8754888 = fieldWeight in 3690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.5 = fieldNorm(doc=3690)
        0.5 = coord(1/2)
    
  3. Fujii, A.; Iwayama, M.; Kando, N.: Introduction to the special issue on patent processing (2007) 1.56
    1.560434 = sum of:
      1.560434 = product of:
        3.120868 = sum of:
          3.120868 = weight(author_txt:kando in 929) [ClassicSimilarity], result of:
            3.120868 = score(doc=929,freq=1.0), product of:
              0.8534851 = queryWeight, product of:
                1.279765 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.06839393 = queryNorm
              3.6566167 = fieldWeight in 929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=929)
        0.5 = coord(1/2)
    
  4. Kuriyama, K.; Kando, N.; Nozue, T.; Eguchi, K.: Pooling for a large-scale test collection : an analysis of the search results from the First NTCIR Workshop (2002) 1.30
    1.3003615 = sum of:
      1.3003615 = product of:
        2.600723 = sum of:
          2.600723 = weight(author_txt:kando in 3830) [ClassicSimilarity], result of:
            2.600723 = score(doc=3830,freq=1.0), product of:
              0.8534851 = queryWeight, product of:
                1.279765 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.06839393 = queryNorm
              3.0471804 = fieldWeight in 3830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.3125 = fieldNorm(doc=3830)
        0.5 = coord(1/2)
    
  5. Rodrigo, A.; Peñas, A.; Miyao, Y.; Kando, N.: Do systems pass university entrance exams? (2018) 1.30
    1.3003615 = sum of:
      1.3003615 = product of:
        2.600723 = sum of:
          2.600723 = weight(author_txt:kando in 5054) [ClassicSimilarity], result of:
            2.600723 = score(doc=5054,freq=1.0), product of:
              0.8534851 = queryWeight, product of:
                1.279765 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.06839393 = queryNorm
              3.0471804 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.3125 = fieldNorm(doc=5054)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Kishida, K.: Technical issues of cross-language information retrieval : a review (2005) 0.28
    0.27832684 = sum of:
      0.27832684 = product of:
        1.1596951 = sum of:
          0.023639824 = weight(abstract_txt:paper in 1019) [ClassicSimilarity], result of:
            0.023639824 = score(doc=1019,freq=1.0), product of:
              0.06233404 = queryWeight, product of:
                1.0625687 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016918713 = queryNorm
              0.37924424 = fieldWeight in 1019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
          0.023798969 = weight(abstract_txt:retrieval in 1019) [ClassicSimilarity], result of:
            0.023798969 = score(doc=1019,freq=1.0), product of:
              0.06261348 = queryWeight, product of:
                1.0649477 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016918713 = queryNorm
              0.38009337 = fieldWeight in 1019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
          0.08295657 = weight(abstract_txt:language in 1019) [ClassicSimilarity], result of:
            0.08295657 = score(doc=1019,freq=4.0), product of:
              0.0906796 = queryWeight, product of:
                1.2815902 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.016918713 = queryNorm
              0.91483164 = fieldWeight in 1019, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
          0.09964437 = weight(abstract_txt:cross in 1019) [ClassicSimilarity], result of:
            0.09964437 = score(doc=1019,freq=1.0), product of:
              0.16265382 = queryWeight, product of:
                1.7164316 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.016918713 = queryNorm
              0.61261624 = fieldWeight in 1019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
          0.045174994 = weight(abstract_txt:research in 1019) [ClassicSimilarity], result of:
            0.045174994 = score(doc=1019,freq=1.0), product of:
              0.13027902 = queryWeight, product of:
                2.4288552 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.016918713 = queryNorm
              0.3467557 = fieldWeight in 1019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
          0.8844805 = weight(abstract_txt:clir in 1019) [ClassicSimilarity], result of:
            0.8844805 = score(doc=1019,freq=2.0), product of:
              0.6973026 = queryWeight, product of:
                5.0259714 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016918713 = queryNorm
              1.2684314 = fieldWeight in 1019, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.109375 = fieldNorm(doc=1019)
        0.24 = coord(6/25)
    
  2. Oard, D.W.: Multilingual information access (2009) 0.26
    0.2622864 = sum of:
      0.2622864 = product of:
        1.09286 = sum of:
          0.023798969 = weight(abstract_txt:retrieval in 3850) [ClassicSimilarity], result of:
            0.023798969 = score(doc=3850,freq=1.0), product of:
              0.06261348 = queryWeight, product of:
                1.0649477 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016918713 = queryNorm
              0.38009337 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
          0.0718425 = weight(abstract_txt:language in 3850) [ClassicSimilarity], result of:
            0.0718425 = score(doc=3850,freq=3.0), product of:
              0.0906796 = queryWeight, product of:
                1.2815902 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.016918713 = queryNorm
              0.79226744 = fieldWeight in 3850, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
          0.20593114 = weight(abstract_txt:narrowly in 3850) [ClassicSimilarity], result of:
            0.20593114 = score(doc=3850,freq=1.0), product of:
              0.20945969 = queryWeight, product of:
                1.3773034 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016918713 = queryNorm
              0.98315406 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
          0.06622082 = weight(abstract_txt:find in 3850) [ClassicSimilarity], result of:
            0.06622082 = score(doc=3850,freq=1.0), product of:
              0.12386791 = queryWeight, product of:
                1.4978688 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.016918713 = queryNorm
              0.53460836 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
          0.09964437 = weight(abstract_txt:cross in 3850) [ClassicSimilarity], result of:
            0.09964437 = score(doc=3850,freq=1.0), product of:
              0.16265382 = queryWeight, product of:
                1.7164316 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.016918713 = queryNorm
              0.61261624 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
          0.6254222 = weight(abstract_txt:clir in 3850) [ClassicSimilarity], result of:
            0.6254222 = score(doc=3850,freq=1.0), product of:
              0.6973026 = queryWeight, product of:
                5.0259714 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016918713 = queryNorm
              0.8969165 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.109375 = fieldNorm(doc=3850)
        0.24 = coord(6/25)
    
  3. Oard, D.W.; He, D.; Wang, J.: User-assisted query translation for interactive cross-language information retrieval (2008) 0.26
    0.25514516 = sum of:
      0.25514516 = product of:
        0.9112327 = sum of:
          0.02387983 = weight(abstract_txt:paper in 2030) [ClassicSimilarity], result of:
            0.02387983 = score(doc=2030,freq=2.0), product of:
              0.06233404 = queryWeight, product of:
                1.0625687 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016918713 = queryNorm
              0.38309455 = fieldWeight in 2030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.024040587 = weight(abstract_txt:retrieval in 2030) [ClassicSimilarity], result of:
            0.024040587 = score(doc=2030,freq=2.0), product of:
              0.06261348 = queryWeight, product of:
                1.0649477 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016918713 = queryNorm
              0.38395226 = fieldWeight in 2030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.05131607 = weight(abstract_txt:language in 2030) [ClassicSimilarity], result of:
            0.05131607 = score(doc=2030,freq=3.0), product of:
              0.0906796 = queryWeight, product of:
                1.2815902 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.016918713 = queryNorm
              0.56590533 = fieldWeight in 2030, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.047300585 = weight(abstract_txt:find in 2030) [ClassicSimilarity], result of:
            0.047300585 = score(doc=2030,freq=1.0), product of:
              0.12386791 = queryWeight, product of:
                1.4978688 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.016918713 = queryNorm
              0.38186312 = fieldWeight in 2030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.10065601 = weight(abstract_txt:cross in 2030) [ClassicSimilarity], result of:
            0.10065601 = score(doc=2030,freq=2.0), product of:
              0.16265382 = queryWeight, product of:
                1.7164316 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.016918713 = queryNorm
              0.6188358 = fieldWeight in 2030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.032267854 = weight(abstract_txt:research in 2030) [ClassicSimilarity], result of:
            0.032267854 = score(doc=2030,freq=1.0), product of:
              0.13027902 = queryWeight, product of:
                2.4288552 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.016918713 = queryNorm
              0.24768265 = fieldWeight in 2030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
          0.6317718 = weight(abstract_txt:clir in 2030) [ClassicSimilarity], result of:
            0.6317718 = score(doc=2030,freq=2.0), product of:
              0.6973026 = queryWeight, product of:
                5.0259714 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016918713 = queryNorm
              0.9060225 = fieldWeight in 2030, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=2030)
        0.28 = coord(7/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.21
    0.2080838 = sum of:
      0.2080838 = product of:
        1.040419 = sum of:
          0.01688559 = weight(abstract_txt:paper in 1020) [ClassicSimilarity], result of:
            0.01688559 = score(doc=1020,freq=1.0), product of:
              0.06233404 = queryWeight, product of:
                1.0625687 = boost
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.016918713 = queryNorm
              0.27088875 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.467376 = idf(docFreq=3749, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.016999263 = weight(abstract_txt:retrieval in 1020) [ClassicSimilarity], result of:
            0.016999263 = score(doc=1020,freq=1.0), product of:
              0.06261348 = queryWeight, product of:
                1.0649477 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016918713 = queryNorm
              0.27149525 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.041899394 = weight(abstract_txt:language in 1020) [ClassicSimilarity], result of:
            0.041899394 = score(doc=1020,freq=2.0), product of:
              0.0906796 = queryWeight, product of:
                1.2815902 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.016918713 = queryNorm
              0.46205974 = fieldWeight in 1020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.07117455 = weight(abstract_txt:cross in 1020) [ClassicSimilarity], result of:
            0.07117455 = score(doc=1020,freq=1.0), product of:
              0.16265382 = queryWeight, product of:
                1.7164316 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.016918713 = queryNorm
              0.43758303 = fieldWeight in 1020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
          0.8934602 = weight(abstract_txt:clir in 1020) [ClassicSimilarity], result of:
            0.8934602 = score(doc=1020,freq=4.0), product of:
              0.6973026 = queryWeight, product of:
                5.0259714 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016918713 = queryNorm
              1.2813092 = fieldWeight in 1020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=1020)
        0.2 = coord(5/25)
    
  5. Levow, G.-A.; Oard, D.W.; Resnik, P.: Dictionary-based techniques for cross-language information retrieval (2005) 0.20
    0.19879727 = sum of:
      0.19879727 = product of:
        0.828322 = sum of:
          0.016999263 = weight(abstract_txt:retrieval in 1025) [ClassicSimilarity], result of:
            0.016999263 = score(doc=1025,freq=1.0), product of:
              0.06261348 = queryWeight, product of:
                1.0649477 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016918713 = queryNorm
              0.27149525 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.019176418 = weight(abstract_txt:been in 1025) [ClassicSimilarity], result of:
            0.019176418 = score(doc=1025,freq=1.0), product of:
              0.0678515 = queryWeight, product of:
                1.108598 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016918713 = queryNorm
              0.28262335 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.041899394 = weight(abstract_txt:language in 1025) [ClassicSimilarity], result of:
            0.041899394 = score(doc=1025,freq=2.0), product of:
              0.0906796 = queryWeight, product of:
                1.2815902 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.016918713 = queryNorm
              0.46205974 = fieldWeight in 1025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.047300585 = weight(abstract_txt:find in 1025) [ClassicSimilarity], result of:
            0.047300585 = score(doc=1025,freq=1.0), product of:
              0.12386791 = queryWeight, product of:
                1.4978688 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.016918713 = queryNorm
              0.38186312 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.07117455 = weight(abstract_txt:cross in 1025) [ClassicSimilarity], result of:
            0.07117455 = score(doc=1025,freq=1.0), product of:
              0.16265382 = queryWeight, product of:
                1.7164316 = boost
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.016918713 = queryNorm
              0.43758303 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.601063 = idf(docFreq=443, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
          0.6317718 = weight(abstract_txt:clir in 1025) [ClassicSimilarity], result of:
            0.6317718 = score(doc=1025,freq=2.0), product of:
              0.6973026 = queryWeight, product of:
                5.0259714 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.016918713 = queryNorm
              0.9060225 = fieldWeight in 1025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.078125 = fieldNorm(doc=1025)
        0.24 = coord(6/25)