Search (1 results, page 1 of 1)

Did you mean:
rvk_ss%3a%2200 76400 allgemeines %2f buch- und bibliothekswesen%2c informationswissenschaft %2f bibliothekswesen %2f bibliotheksbenutzung %2f auskunft%2c information%22 1
rvk_ss%3a%2200 73400 allgemeines %2f buch- und bibliothekswesen%2c informationswissenschaft %2f bibliothekswesen %2f bibliotheksbenutzung %2f auskunft%2c information%22 1
rvk_ss%3a%2200 76400 allgemeinen %2f buch- und bibliothekswesen%2c informationswissenschaft %2f bibliothekswesen %2f bibliotheksbenutzung %2f auskunft%2c information%22 1
rvk_ss%3a%2200 76400 allgemeines %2f buch- und bibliothekswesen%2c informationswissenschaft %2f bibliothekswesen %2f bibliotheksbenutzung %2f auskunfts%2c information%22 1
rvk_ss%3a%2223 76400 allgemeines %2f buch- und bibliothekswesen%2c informationswissenschaft %2f bibliothekswesen %2f bibliotheksbenutzung %2f auskunft%2c information%22 1

Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.00
```
3.4178712E-4 = product of:
  0.0051268064 = sum of:
    0.0051268064 = product of:
      0.010253613 = sum of:
        0.010253613 = weight(_text_:information in 2950) [ClassicSimilarity], result of:
          0.010253613 = score(doc=2950,freq=6.0), product of:
            0.050870337 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.028978055 = queryNorm
            0.20156369 = fieldWeight in 2950, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2950)
      0.5 = coord(1/2)
  0.06666667 = coord(1/15)
```
Abstract

It is important in information retrieval (IR), information extraction, or classification tasks that morphologically related forms are conflated under the same stem (using stemmer) or lemma (using morphological analyzer). To achieve this for the English language, algorithmic stemming or various morphological analysis approaches have been suggested. Based on Cross-Language Evaluation Forum test collections containing 284 queries and various IR models, this article evaluates these word-normalization proposals. Stemming improves the mean average precision significantly by around 7% while performance differences are not significant when comparing various algorithmic stemmers or algorithmic stemmers and morphological analysis. Accounting for thesaurus class numbers during indexing does not modify overall retrieval performances. Finally, we demonstrate that including a stop word list, even one containing only around 10 terms, might significantly improve retrieval performance, depending on the IR model.

Source

Journal of the American Society for Information Science and Technology. 60(2009) no.8, S.1616-1624