Search (1 results, page 1 of 1)

Did you mean:
themes%3a%22Klassifikationssysteme im online-retrieval%22 1

Hafer, M.A.; Weiss, S.F.: Word segmentation by letter successor varieties (1974) 0.01
```
0.006948821 = product of:
  0.020846462 = sum of:
    0.020846462 = product of:
      0.062539384 = sum of:
        0.062539384 = weight(_text_:retrieval in 4997) [ClassicSimilarity], result of:
          0.062539384 = score(doc=4997,freq=6.0), product of:
            0.15433937 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.051022716 = queryNorm
            0.40520695 = fieldWeight in 4997, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4997)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

This paper describes a method for automatically segmenting words into their stems and affixes. The process uses certain statistical properties of corpus (successor and predecessor letter variety counts) to indicate where words should be divided. Consequently, this process is less reliant on human intervention than are other methods for automated stemming. The segmentation system is used to construct stem dictionariesfor documnet classification. Information retrieval experiments are then performed using documents and queries so classified. Results show not only that this method is capable of high quality word segmentation, but also that its use in information retrieval produce results that are at least as good as thosse obtained using the more traditional stemming process.

Source

Information storage and retrieval. 10(1974) H.11/12, S.371-385