Search (1 results, page 1 of 1)

Chen, H.-H.; Lin, W.-C.; Yang, C.; Lin, W.-H.: Translating-transliterating named entities for multilingual information access (2006) 0.03

0.029877722 = product of:
  0.059755445 = sum of:
    0.059755445 = sum of:
      0.009447212 = weight(_text_:a in 1080) [ClassicSimilarity], result of:
        0.009447212 = score(doc=1080,freq=6.0), product of:
          0.06116359 = queryWeight, product of:
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.053045183 = queryNorm
          0.1544581 = fieldWeight in 1080, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1080)
      0.05030823 = weight(_text_:22 in 1080) [ClassicSimilarity], result of:
        0.05030823 = score(doc=1080,freq=2.0), product of:
          0.1857552 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.053045183 = queryNorm
          0.2708308 = fieldWeight in 1080, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1080)
  0.5 = coord(1/2)

Abstract: Named entities are major constituents of a document but are usually unknown words. This work proposes a systematic way of dealing with formulation, transformation, translation, and transliteration of multilingual-named entities. The rules and similarity matrices for translation and transliteration are learned automatically from parallel-named-entity corpora. The results are applied in cross-language access to collections of images with captions. Experimental results demonstrate that the similarity-based transliteration of named entities is effective, and runs in which transliteration is considered outperform the runs in which it is neglected.
Date: 4. 6.2006 19:52:22
Type: a