Frakes, W.B.: Stemming algorithms (1992)
0.01
0.011975727 = product of:
0.023951454 = sum of:
0.023951454 = product of:
0.04790291 = sum of:
0.04790291 = weight(_text_:c in 3503) [ClassicSimilarity], result of:
0.04790291 = score(doc=3503,freq=2.0), product of:
0.15711682 = queryWeight, product of:
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.045548957 = queryNorm
0.3048872 = fieldWeight in 3503, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4494052 = idf(docFreq=3817, maxDocs=44218)
0.0625 = fieldNorm(doc=3503)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Desribes stemming algorithms - programs that relate morphologically similar indexing and search terms. Stemming is used to improve retrieval effectiveness and to reduce the size of indexing files. Several approaches to stemming are describes - table lookup, affix removal, successor variety, and n-gram. empirical studies of stemming are summarized. The Porter stemmer is described in detail, and a full implementation in C is presented