-
Chen, H.-H.; Lin, W.-C.; Yang, C.; Lin, W.-H.: Translating-transliterating named entities for multilingual information access (2006)
0.03
0.034623183 = product of:
  0.13849273 = sum of:
    0.031475607 = product of:
      0.062951215 = sum of:
        0.062951215 = weight(_text_:rules in 1080) [ClassicSimilarity], result of:
          0.062951215 = score(doc=1080,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.38950738 = fieldWeight in 1080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1080)
      0.5 = coord(1/2)
    0.028848568 = weight(_text_:american in 1080) [ClassicSimilarity], result of:
      0.028848568 = score(doc=1080,freq=2.0), product of:
        0.10940785 = queryWeight, product of:
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.032090448 = queryNorm
        0.26367915 = fieldWeight in 1080, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1080)
    0.062951215 = weight(_text_:rules in 1080) [ClassicSimilarity], result of:
      0.062951215 = score(doc=1080,freq=2.0), product of:
        0.16161752 = queryWeight, product of:
          5.036312 = idf(docFreq=780, maxDocs=44218)
          0.032090448 = queryNorm
        0.38950738 = fieldWeight in 1080, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.036312 = idf(docFreq=780, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1080)
    0.015217344 = product of:
      0.030434689 = sum of:
        0.030434689 = weight(_text_:22 in 1080) [ClassicSimilarity], result of:
          0.030434689 = score(doc=1080,freq=2.0), product of:
            0.11237528 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.032090448 = queryNorm
            0.2708308 = fieldWeight in 1080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1080)
      0.5 = coord(1/2)
  0.25 = coord(4/16)
- Abstract
- Named entities are major constituents of a document but are usually unknown words. This work proposes a systematic way of dealing with formulation, transformation, translation, and transliteration of multilingual-named entities. The rules and similarity matrices for translation and transliteration are learned automatically from parallel-named-entity corpora. The results are applied in cross-language access to collections of images with captions. Experimental results demonstrate that the similarity-based transliteration of named entities is effective, and runs in which transliteration is considered outperform the runs in which it is neglected.
- Date
- 4. 6.2006 19:52:22
- Source
- Journal of the American Society for Information Science and Technology. 57(2006) no.5, pp. 645-659
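
The score explanation for doc 1080 can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names `idf` and `term_score` are hypothetical and chosen only for illustration, while all numeric inputs are copied from the explanation. Lucene computes in single precision, so the last digits may differ slightly.

```python
import math

def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency (assumed standard formula).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explanation tree.
    tf = math.sqrt(freq)                        # tf(freq=2.0) = 1.4142135
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm        # idf * queryNorm
    field_weight = tf * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Values taken from the explanation for doc 1080.
MAX_DOCS = 44218
QUERY_NORM = 0.032090448
FIELD_NORM = 0.0546875

rules = term_score(2.0, 780, MAX_DOCS, QUERY_NORM, FIELD_NORM)
american = term_score(2.0, 3973, MAX_DOCS, QUERY_NORM, FIELD_NORM)
term_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Combine as in the tree: two nested clauses carry coord(1/2),
# and the outer sum is scaled by coord(4/16).
total = (rules * 0.5 + american + rules + term_22 * 0.5) * (4 / 16)

print(f"rules={rules:.6f} american={american:.6f} 22={term_22:.6f} total={total:.6f}")
```

The same two functions apply to the other records; only `docFreq`, `fieldNorm`, and the coord factors change.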
-
Chen, H.-H.; Kuo, J.-J.; Huang, S.-J.; Lin, C.-J.; Wung, H.-C.: A summarization system for Chinese news from multiple sources (2003)
0.01
0.006407313 = product of:
  0.051258504 = sum of:
    0.026531162 = weight(_text_:26 in 2115) [ClassicSimilarity], result of:
      0.026531162 = score(doc=2115,freq=2.0), product of:
        0.113328174 = queryWeight, product of:
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.032090448 = queryNorm
        0.23410915 = fieldWeight in 2115, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5315237 = idf(docFreq=3516, maxDocs=44218)
          0.046875 = fieldNorm(doc=2115)
    0.024727343 = weight(_text_:american in 2115) [ClassicSimilarity], result of:
      0.024727343 = score(doc=2115,freq=2.0), product of:
        0.10940785 = queryWeight, product of:
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.032090448 = queryNorm
        0.22601068 = fieldWeight in 2115, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.046875 = fieldNorm(doc=2115)
  0.125 = coord(2/16)
- Date
- 24. 1.2004 18:26:52
- Source
- Journal of the American Society for Information Science and Technology. 54(2003) no.13, pp. 1224-1236
-
Bian, G.-W.; Chen, H.-H.: Cross-language information access to multilingual collections on the Internet (2000)
0.00
0.0047213477 = product of:
  0.03777078 = sum of:
    0.024727343 = weight(_text_:american in 4436) [ClassicSimilarity], result of:
      0.024727343 = score(doc=4436,freq=2.0), product of:
        0.10940785 = queryWeight, product of:
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.032090448 = queryNorm
        0.22601068 = fieldWeight in 4436, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4093587 = idf(docFreq=3973, maxDocs=44218)
          0.046875 = fieldNorm(doc=4436)
    0.013043438 = product of:
      0.026086876 = sum of:
        0.026086876 = weight(_text_:22 in 4436) [ClassicSimilarity], result of:
          0.026086876 = score(doc=4436,freq=2.0), product of:
            0.11237528 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.032090448 = queryNorm
            0.23214069 = fieldWeight in 4436, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4436)
      0.5 = coord(1/2)
  0.125 = coord(2/16)
- Date
- 16. 2.2000 14:22:39
- Source
- Journal of the American Society for Information Science. 51(2000) no.3, pp. 281-296