Frakes, W.B.: Stemming algorithms (1992)
0.02
0.017567426 = product of:
0.052702274 = sum of:
0.052702274 = product of:
0.10540455 = sum of:
0.10540455 = weight(_text_:indexing in 3503) [ClassicSimilarity], result of:
0.10540455 = score(doc=3503,freq=4.0), product of:
0.2202888 = queryWeight, product of:
3.8278677 = idf(docFreq=2614, maxDocs=44218)
0.057548698 = queryNorm
0.47848347 = fieldWeight in 3503, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.8278677 = idf(docFreq=2614, maxDocs=44218)
0.0625 = fieldNorm(doc=3503)
0.5 = coord(1/2)
0.33333334 = coord(1/3)
- Abstract
- Desribes stemming algorithms - programs that relate morphologically similar indexing and search terms. Stemming is used to improve retrieval effectiveness and to reduce the size of indexing files. Several approaches to stemming are describes - table lookup, affix removal, successor variety, and n-gram. empirical studies of stemming are summarized. The Porter stemmer is described in detail, and a full implementation in C is presented