Wang, J.; Oard, D.W.: Matching meaning for cross-language information retrieval (2012)
0.01
0.008009522 = product of:
0.05606665 = sum of:
0.014125523 = weight(_text_:information in 7430) [ClassicSimilarity], result of:
0.014125523 = score(doc=7430,freq=8.0), product of:
0.052020688 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.029633347 = queryNorm
0.27153665 = fieldWeight in 7430, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=7430)
0.04194113 = weight(_text_:retrieval in 7430) [ClassicSimilarity], result of:
0.04194113 = score(doc=7430,freq=8.0), product of:
0.08963835 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.029633347 = queryNorm
0.46789268 = fieldWeight in 7430, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=7430)
0.14285715 = coord(2/14)
- Abstract
- This article describes a framework for cross-language information retrieval that efficiently leverages statistical estimation of translation probabilities. The framework provides a unified perspective into which some earlier work on techniques for cross-language information retrieval based on translation probabilities can be cast. Modeling synonymy and filtering translation probabilities using bidirectional evidence are shown to yield a balance between retrieval effectiveness and query-time (or indexing-time) efficiency that seems well suited large-scale applications. Evaluations with six test collections show consistent improvements over strong baselines.
- Source
- Information processing and management. 48(2012) no.4, S.631-653