Frakes, W.B.: Stemming algorithms (1992)
0.01
0.01204295 = product of:
0.0240859 = sum of:
0.0240859 = product of:
0.0481718 = sum of:
0.0481718 = weight(_text_:c in 4503) [ClassicSimilarity], result of:
0.0481718 = score(doc=4503,freq=2.0), product of:
0.158076 = queryWeight, product of:
3.4477193 = idf(docFreq=3841, maxDocs=44421)
0.045849442 = queryNorm
0.30473822 = fieldWeight in 4503, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.4477193 = idf(docFreq=3841, maxDocs=44421)
0.0625 = fieldNorm(doc=4503)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Desribes stemming algorithms - programs that relate morphologically similar indexing and search terms. Stemming is used to improve retrieval effectiveness and to reduce the size of indexing files. Several approaches to stemming are describes - table lookup, affix removal, successor variety, and n-gram. empirical studies of stemming are summarized. The Porter stemmer is described in detail, and a full implementation in C is presented