Jones, S.: ¬A thesaurus data model for an intelligent retrieval system (1993)
0.01
0.007908144 = product of:
0.03954072 = sum of:
0.03954072 = weight(_text_:system in 5279) [ClassicSimilarity], result of:
0.03954072 = score(doc=5279,freq=4.0), product of:
0.13391352 = queryWeight, product of:
3.1495528 = idf(docFreq=5152, maxDocs=44218)
0.04251826 = queryNorm
0.29527056 = fieldWeight in 5279, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.1495528 = idf(docFreq=5152, maxDocs=44218)
0.046875 = fieldNorm(doc=5279)
0.2 = coord(1/5)
- Abstract
- This paper demonstrates the application of conventional database design techniques to thesaurus representation. The thesaurus is considered as a printed document, as a semantic net, and as a relational database to be used in conjunction with an intelligent information retrieval system. Some issues raised by analysis of two standard thesauri include: the prevalence of compound terms and the representation of term structure; thesaurus redundancy and the extent to which it can be eliminated in machine-readable versions; the difficulty of exploiting thesaurus knowledge originally designed for human rather than automatic interpretation; deriving 'strength of association' measures between terms in a thesaurus considered as a semantic net; facet representation and the need for variations in the data model to cater for structural differences between thesauri. A complete schema of database tables is presented, with an outline suggestion for using the stored information when matching one or more thesaurus terms with a user's query