Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998)
0.06
0.056120045 = product of:
0.16836013 = sum of:
0.098698735 = weight(_text_:applications in 4715) [ClassicSimilarity], result of:
0.098698735 = score(doc=4715,freq=4.0), product of:
0.17934994 = queryWeight, product of:
4.4025097 = idf(docFreq=1471, maxDocs=44218)
0.040738113 = queryNorm
0.5503137 = fieldWeight in 4715, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.4025097 = idf(docFreq=1471, maxDocs=44218)
0.0625 = fieldNorm(doc=4715)
0.021568017 = weight(_text_:of in 4715) [ClassicSimilarity], result of:
0.021568017 = score(doc=4715,freq=12.0), product of:
0.06370452 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.040738113 = queryNorm
0.33856338 = fieldWeight in 4715, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=4715)
0.048093382 = weight(_text_:systems in 4715) [ClassicSimilarity], result of:
0.048093382 = score(doc=4715,freq=4.0), product of:
0.12519532 = queryWeight, product of:
3.0731742 = idf(docFreq=5561, maxDocs=44218)
0.040738113 = queryNorm
0.38414678 = fieldWeight in 4715, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.0731742 = idf(docFreq=5561, maxDocs=44218)
0.0625 = fieldNorm(doc=4715)
0.33333334 = coord(3/9)
- Abstract
- Provides an introduction to the use of n-grams in textual information systems, where an n-gram is a string of n, usually adjacent, characters, extracted from a section of continuous text. Applications that can be implemented efficiently and effectively using sets of n-grams include spelling errors detection and correction, query expansion, information retrieval with serial, inverted and signature files, dictionary look up, text compression, and language identification
- Source
- Journal of documentation. 54(1998) no.1, S.48-69
Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992)
0.02
0.015480312 = product of:
0.0696614 = sum of:
0.021568017 = weight(_text_:of in 5689) [ClassicSimilarity], result of:
0.021568017 = score(doc=5689,freq=12.0), product of:
0.06370452 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.040738113 = queryNorm
0.33856338 = fieldWeight in 5689, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0625 = fieldNorm(doc=5689)
0.048093382 = weight(_text_:systems in 5689) [ClassicSimilarity], result of:
0.048093382 = score(doc=5689,freq=4.0), product of:
0.12519532 = queryWeight, product of:
3.0731742 = idf(docFreq=5561, maxDocs=44218)
0.040738113 = queryNorm
0.38414678 = fieldWeight in 5689, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.0731742 = idf(docFreq=5561, maxDocs=44218)
0.0625 = fieldNorm(doc=5689)
0.22222222 = coord(2/9)
- Abstract
- Reports an evaluation of 3 methods for the expansion of natural language queries in ranked output retrieval systems. The methods are based on term co-occurrence data, on Soundex codes, and on a string similarity measure. Searches for 110 queries in a data base of 26.280 titles and abstracts suggest that there is no significant difference in retrieval effectiveness between any of these methods and unexpanded searches
- Source
- Journal of information science. 18(1992) no.2, S.139-147