Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998)
0.01
0.008151309 = product of:
0.024453925 = sum of:
0.024453925 = weight(_text_:information in 4715) [ClassicSimilarity], result of:
0.024453925 = score(doc=4715,freq=6.0), product of:
0.09099081 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0518325 = queryNorm
0.2687516 = fieldWeight in 4715, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=4715)
0.33333334 = coord(1/3)
- Abstract
- Provides an introduction to the use of n-grams in textual information systems, where an n-gram is a string of n, usually adjacent, characters, extracted from a section of continuous text. Applications that can be implemented efficiently and effectively using sets of n-grams include spelling errors detection and correction, query expansion, information retrieval with serial, inverted and signature files, dictionary look up, text compression, and language identification