Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998)
0.01
0.008150326 = product of:
  0.020375814 = sum of:
    0.009437811 = weight(_text_:a in 4715) [ClassicSimilarity], result of:
      0.009437811 = score(doc=4715,freq=6.0), product of:
        0.053464882 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046368346 = queryNorm
        0.17652355 = fieldWeight in 4715, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=4715)
    0.010938003 = product of:
      0.021876005 = sum of:
        0.021876005 = weight(_text_:information in 4715) [ClassicSimilarity], result of:
          0.021876005 = score(doc=4715,freq=6.0), product of:
            0.08139861 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046368346 = queryNorm
            0.2687516 = fieldWeight in 4715, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=4715)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
- Abstract
- Provides an introduction to the use of n-grams in textual information systems, where an n-gram is a string of n, usually adjacent, characters extracted from a section of continuous text. Applications that can be implemented efficiently and effectively using sets of n-grams include spelling error detection and correction, query expansion, information retrieval with serial, inverted and signature files, dictionary look-up, text compression, and language identification.
- Type
- a
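The n-gram definition in the abstract above can be illustrated with a minimal Python sketch. It extracts overlapping runs of n adjacent characters and compares two words by the n-grams they share. The function names `char_ngrams` and `dice_similarity` are hypothetical, and the Dice coefficient is an assumption chosen here as one common way to compare n-gram sets; the record does not say which measure the authors use.

```python
def char_ngrams(text: str, n: int = 2) -> list[str]:
    """Return every run of n adjacent characters in text, in order."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A string shorter than n yields no n-grams at all.
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def dice_similarity(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient over the sets of n-grams of a and b (assumed measure)."""
    grams_a = set(char_ngrams(a, n))
    grams_b = set(char_ngrams(b, n))
    if not grams_a and not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


if __name__ == "__main__":
    print(char_ngrams("text", 2))
    print(dice_similarity("night", "nacht"))
```

A misspelling usually shares most of its n-grams with the intended word, which is why set overlap of this kind is a plausible basis for the spelling correction and query expansion applications the abstract lists.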
Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992)
0.01
0.0063011474 = product of:
  0.015752869 = sum of:
    0.009437811 = weight(_text_:a in 5689) [ClassicSimilarity], result of:
      0.009437811 = score(doc=5689,freq=6.0), product of:
        0.053464882 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046368346 = queryNorm
        0.17652355 = fieldWeight in 5689, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=5689)
    0.006315058 = product of:
      0.012630116 = sum of:
        0.012630116 = weight(_text_:information in 5689) [ClassicSimilarity], result of:
          0.012630116 = score(doc=5689,freq=2.0), product of:
            0.08139861 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046368346 = queryNorm
            0.1551638 = fieldWeight in 5689, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=5689)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
- Abstract
- Reports an evaluation of three methods for the expansion of natural language queries in ranked-output retrieval systems. The methods are based on term co-occurrence data, on Soundex codes, and on a string similarity measure. Searches for 110 queries in a database of 26,280 titles and abstracts suggest that there is no significant difference in retrieval effectiveness between any of these methods and unexpanded searches.
- Source
- Journal of information science. 18(1992) no.2, pp. 139-147
- Type
- a
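The score breakdown listed for the first record (doc 4715) can be re-derived with a short Python sketch. The constants are copied from that breakdown; the helper names `idf` and `term_score` are hypothetical. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions, chosen because they reproduce the tf and idf values shown in the listing, not taken from the catalogue's own configuration.

```python
import math

# Values copied from the score explanation of doc 4715.
QUERY_NORM = 0.046368346
MAX_DOCS = 44218
FIELD_NORM = 0.0625


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed formula, matches the listed values)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int) -> float:
    """Score of one term: queryWeight * fieldWeight, as in the breakdown."""
    tf = math.sqrt(freq)                      # 6.0 -> 2.4494898
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = term_idf * QUERY_NORM      # idf * queryNorm
    field_weight = tf * term_idf * FIELD_NORM  # tf * idf * fieldNorm
    return query_weight * field_weight


score_a = term_score(6.0, 37942)                   # term "a"
score_information = term_score(6.0, 20772) * 0.5   # term "information", coord(1/2)
total = (score_a + score_information) * 0.4        # coord(2/5)

if __name__ == "__main__":
    print(round(total, 9))
```

The second record differs only in the frequency of "information" (2.0 instead of 6.0), which lowers its tf and therefore its total score.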