Froissart, C.; Lallich-Boidin, G.: Towards structuring of indexing vocabulary for large technical documents (1998)
0.01
0.0076240813 = product of:
0.045744486 = sum of:
0.045744486 = weight(_text_:world in 50) [ClassicSimilarity], result of:
0.045744486 = score(doc=50,freq=2.0), product of:
0.1538826 = queryWeight, product of:
3.8436708 = idf(docFreq=2573, maxDocs=44218)
0.04003532 = queryNorm
0.29726875 = fieldWeight in 50, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.8436708 = idf(docFreq=2573, maxDocs=44218)
0.0546875 = fieldNorm(doc=50)
0.16666667 = coord(1/6)
- Abstract
- This paper deals with indexing of large textual and structured documents. We limit our area to technical documents like maintenance and users manuals. This firstly implies, that the document describes a closed world, and then that they are used by experts in this area. We suggest a methodology to extract the indexing vocabulary from the text with linguistic and numeric tools and then to structure the vocabulary, as a thesaurus might. We aim at assisting the user in order that he retrieves quickly the only text passages he needs