Eckert, K.; Pfeffer, M.; Stuckenschmidt, H.: Assessing thesaurus-based annotations for semantic search applications (2008)
0.05
0.05267809 = sum of:
0.036523234 = product of:
0.1095697 = sum of:
0.1095697 = weight(_text_:basis in 1528) [ClassicSimilarity], result of:
0.1095697 = score(doc=1528,freq=4.0), product of:
0.22523694 = queryWeight, product of:
4.4476724 = idf(docFreq=1406, maxDocs=44218)
0.05064153 = queryNorm
0.48646417 = fieldWeight in 1528, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.4476724 = idf(docFreq=1406, maxDocs=44218)
0.0546875 = fieldNorm(doc=1528)
0.33333334 = coord(1/3)
0.016154855 = product of:
0.048464566 = sum of:
0.048464566 = weight(_text_:29 in 1528) [ClassicSimilarity], result of:
0.048464566 = score(doc=1528,freq=2.0), product of:
0.17814107 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.05064153 = queryNorm
0.27205724 = fieldWeight in 1528, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=1528)
0.33333334 = coord(1/3)
- Abstract
- Statistical methods for automated document indexing are becoming an alternative to the manual assignment of keywords. We argue that the quality of the thesaurus used as a basis for indexing in regard to its ability to adequately cover the contents to be indexed and as a basis for the specific indexing method used is of crucial importance in automatic indexing. We present an interactive tool for thesaurus evaluation that is based on a combination of statistical measures and appropriate visualisation techniques that supports the detection of potential problems in a thesaurus. We describe the methods used and show that the tool supports the detection and correction of errors, leading to a better indexing result.
- Date
- 25. 2.2012 13:51:29