Chen, H.-H.; Lin, W.-C.; Yang, C.; Lin, W.-H.: Translating-transliterating named entities for multilingual information access (2006)
0.00
0.0017546645 = product of:
0.017546644 = sum of:
0.0053462577 = weight(_text_:in in 1080) [ClassicSimilarity], result of:
0.0053462577 = score(doc=1080,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.1822149 = fieldWeight in 1080, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.0546875 = fieldNorm(doc=1080)
0.0019719584 = weight(_text_:s in 1080) [ClassicSimilarity], result of:
0.0019719584 = score(doc=1080,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.08408674 = fieldWeight in 1080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0546875 = fieldNorm(doc=1080)
0.010228428 = product of:
0.020456856 = sum of:
0.020456856 = weight(_text_:22 in 1080) [ClassicSimilarity], result of:
0.020456856 = score(doc=1080,freq=2.0), product of:
0.07553371 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.021569785 = queryNorm
0.2708308 = fieldWeight in 1080, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=1080)
0.5 = coord(1/2)
0.1 = coord(3/30)
- Abstract
- Named entities are major constituents of a document but are usually unknown words. This work proposes a systematic way of dealing with formulation, transformation, translation, and transliteration of multilingual-named entities. The rules and similarity matrices for translation and transliteration are learned automatically from parallel-named-entity corpora. The results are applied in cross-language access to collections of images with captions. Experimental results demonstrate that the similarity-based transliteration of named entities is effective, and runs in which transliteration is considered outperform the runs in which it is neglected.
- Date
- 4. 6.2006 19:52:22
- Source
- Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.645-659
Lin, W.-C.; Chang, Y.-C.; Chen, H.-H.: Integrating textual and visual information for cross-language image retrieval : a trans-media dictionary approach (2007)
0.00
3.6212345E-4 = product of:
0.0054318514 = sum of:
0.003741601 = weight(_text_:in in 904) [ClassicSimilarity], result of:
0.003741601 = score(doc=904,freq=4.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.12752387 = fieldWeight in 904, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.046875 = fieldNorm(doc=904)
0.0016902501 = weight(_text_:s in 904) [ClassicSimilarity], result of:
0.0016902501 = score(doc=904,freq=2.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.072074346 = fieldWeight in 904, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=904)
0.06666667 = coord(2/30)
- Footnote
- Beitrag in: Special issue on AIRS2005: Information Retrieval Research in Asia
- Source
- Information processing and management. 43(2007) no.2, S.488-502