Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003)
- Abstract
- We explore ways to apply query structuring in cross-language information retrieval. In the first test, English queries were translated into Finnish using an electronic dictionary and were run against a Finnish newspaper database of 55,000 articles. Queries were structured by combining the Finnish translation equivalents of the same English query key using the syn-operator of the InQuery retrieval system. Structured queries performed markedly better than unstructured queries. In the second test, we examined the effects of compound-based structuring, in which a proximity operator is applied to the translation equivalents of the components of query-language compounds. The method was not useful in syn-based queries; on the contrary, it resulted in a decrease in retrieval effectiveness. Proper names often appear as non-identical spelling variants in different languages. This allows n-gram based translation of names that are not included in a dictionary. In the third test, a query structuring method in which the Boolean and-operator was used to assign more weight to keys translated through n-gram matching gave good results.
- Source
- Information processing and management. 39(2003) no.3, pp.391-402
- Type
- a
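
The n-gram matching of proper names mentioned in the abstract can be illustrated with a minimal sketch. The record does not say which n-gram length or similarity measure the authors used, so the character digrams, the Dice coefficient, the function names, and the sample index terms below are all assumptions chosen for illustration, not the paper's method.

```python
def char_ngrams(word, n=2):
    """Return the set of character n-grams of a lowercased word."""
    w = word.lower()
    return {w[i:i + n] for i in range(len(w) - n + 1)}


def dice(a, b, n=2):
    """Dice coefficient over the n-gram sets of two words (assumed measure)."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga or not gb:
        return 0.0  # words shorter than n have no n-grams
    return 2 * len(ga & gb) / (len(ga) + len(gb))


def best_matches(key, index_terms, n=2, top=3):
    """Rank target-language index terms by n-gram similarity to a source key.

    Terms that share no n-gram with the key are dropped.
    """
    scored = sorted(((dice(key, t, n), t) for t in index_terms), reverse=True)
    return [term for score, term in scored[:top] if score > 0]


# Hypothetical index terms; a real index would hold the database vocabulary.
terms = ["Gorbatshov", "Helsinki", "sanomalehti"]
print(best_matches("Gorbachev", terms, top=1))
```

In this sketch, a source-language name missing from the dictionary is replaced by the index terms that share the most digrams with it, which is one way a spelling variant can be recovered without a dictionary entry.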