-
Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003)
0.02
0.019709876 = product of:
0.03941975 = sum of:
0.03941975 = product of:
0.0788395 = sum of:
0.0788395 = weight(_text_:n in 1074) [ClassicSimilarity], result of:
0.0788395 = score(doc=1074,freq=4.0), product of:
0.19504215 = queryWeight, product of:
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.045236014 = queryNorm
0.40421778 = fieldWeight in 1074, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.046875 = fieldNorm(doc=1074)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- We will explore various ways to apply query structuring in cross-language information retrieval. In the first test, English queries were translated into Finnish using an electronic dictionary, and were run in a Finnish newspaper database of 55,000 articles. Queries were structured by combining the Finnish translation equivalents of the same English query key using the syn-operator of the InQuery retrieval system. Structured queries performed markedly better than unstructured queries. Second, the effects of compound-based structuring using a proximity operator for the translation equivalents of query language compound components were tested. The method was not useful in syn-based queries but resulted in decrease in retrieval effectiveness. Proper names are often non-identical spelling variants in different languages. This allows n-gram based translation of names not included in a dictionary. In the third test, a query structuring method where the Boolean and-operator was used to assign more weight to keys translated through n-gram matching gave good results.
-
Pharo, N.; Järvelin, K.: "Irrational" searchers and IR-rational researchers (2006)
0.01
0.013936987 = product of:
0.027873974 = sum of:
0.027873974 = product of:
0.05574795 = sum of:
0.05574795 = weight(_text_:n in 4922) [ClassicSimilarity], result of:
0.05574795 = score(doc=4922,freq=2.0), product of:
0.19504215 = queryWeight, product of:
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.045236014 = queryNorm
0.28582513 = fieldWeight in 4922, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.046875 = fieldNorm(doc=4922)
0.5 = coord(1/2)
0.5 = coord(1/2)
-
Pharo, N.; Järvelin, K.: ¬The SST method : a tool for analysing Web information search processes (2004)
0.01
0.011614156 = product of:
0.023228312 = sum of:
0.023228312 = product of:
0.046456624 = sum of:
0.046456624 = weight(_text_:n in 2533) [ClassicSimilarity], result of:
0.046456624 = score(doc=2533,freq=2.0), product of:
0.19504215 = queryWeight, product of:
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.045236014 = queryNorm
0.23818761 = fieldWeight in 2533, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.3116565 = idf(docFreq=1611, maxDocs=44218)
0.0390625 = fieldNorm(doc=2533)
0.5 = coord(1/2)
0.5 = coord(1/2)
-
Näppilä, T.; Järvelin, K.; Niemi, T.: ¬A tool for data cube construction from structurally heterogeneous XML documents (2008)
0.01
0.0076610697 = product of:
0.0153221395 = sum of:
0.0153221395 = product of:
0.030644279 = sum of:
0.030644279 = weight(_text_:22 in 1369) [ClassicSimilarity], result of:
0.030644279 = score(doc=1369,freq=2.0), product of:
0.15840882 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.045236014 = queryNorm
0.19345059 = fieldWeight in 1369, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=1369)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 9. 2.2008 17:22:42