Jones, S.: A thesaurus data model for an intelligent retrieval system (1993)
- Abstract
- This paper demonstrates the application of conventional database design techniques to thesaurus representation. The thesaurus is considered as a printed document, as a semantic net, and as a relational database to be used in conjunction with an intelligent information retrieval system. Issues raised by the analysis of two standard thesauri include: the prevalence of compound terms and the representation of term structure; thesaurus redundancy and the extent to which it can be eliminated in machine-readable versions; the difficulty of exploiting thesaurus knowledge originally designed for human rather than automatic interpretation; the derivation of 'strength of association' measures between terms in a thesaurus considered as a semantic net; and facet representation and the need for variations in the data model to cater for structural differences between thesauri. A complete schema of database tables is presented, with an outline suggestion for using the stored information when matching one or more thesaurus terms with a user's query.