Schwarz, C.: THESYS: Thesaurus Syntax System : a fully automatic thesaurus building aid (1988)
0.02
0.023882624 = product of:
  0.047765248 = sum of:
    0.047765248 = sum of:
      0.008511645 = weight(_text_:a in 1361) [ClassicSimilarity], result of:
        0.008511645 = score(doc=1361,freq=8.0), product of:
          0.04772363 = queryWeight, product of:
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.041389145 = queryNorm
          0.17835285 = fieldWeight in 1361, product of:
            2.828427 = tf(freq=8.0), with freq of:
              8.0 = termFreq=8.0
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1361)
      0.039253604 = weight(_text_:22 in 1361) [ClassicSimilarity], result of:
        0.039253604 = score(doc=1361,freq=2.0), product of:
          0.14493774 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.041389145 = queryNorm
          0.2708308 = fieldWeight in 1361, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1361)
  0.5 = coord(1/2)
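The score breakdown above can be re-derived by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are consistent with the idf and tf values listed. The function and variable names are illustrative only and are not part of any library; queryNorm and fieldNorm are copied from the listing rather than recomputed.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed classic TF-IDF inverse document frequency:
    # 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, following the listing:
    #   queryWeight = idf * queryNorm
    #   fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the listing for doc 1361.
MAX_DOCS = 44218
QUERY_NORM = 0.041389145
FIELD_NORM = 0.0546875

score_a = term_score(8.0, 37942, MAX_DOCS, QUERY_NORM, FIELD_NORM)   # term "a"
score_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)   # term "22"

# coord(1/2) is listed as a factor of 0.5 on the summed term scores.
total = 0.5 * (score_a + score_22)

print(f"a: {score_a:.9f}  22: {score_22:.9f}  total: {total:.9f}")
```

Because this sketch uses double precision while the listing shows single-precision values, the results should agree with the listed numbers only to roughly six decimal places.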
- Abstract
- THESYS is based on natural language processing of free-text databases. It yields statistically evaluated correlations between words in the database, and these correlations correspond to traditional thesaurus relations. THESYS thus assists the person building a thesaurus by proposing candidate relations. It is being tested on commercial databases under real-world conditions. It is part of a text processing project at Siemens called TINA (Text-Inhalts-Analyse). Software from TINA is currently being applied and evaluated by the US Department of Commerce for patent search and indexing (REALIST: REtrieval Aids by Linguistics and STatistics).
- Date
- 6. 1.1999 10:22:07
- Type
- a