-
Holzner, S.: FIZ Wirtschaft - das Portal der Wirtschaftswissenschaften : Standard Thesaurus Wirtschaft als Basis für Wortschatzsynchronisierung (2001)
0.00
9.3004614E-4 = product of:
0.007440369 = sum of:
0.007440369 = product of:
0.022321107 = sum of:
0.022321107 = weight(_text_:29 in 5886) [ClassicSimilarity], result of:
0.022321107 = score(doc=5886,freq=2.0), product of:
0.11486387 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.032653235 = queryNorm
0.19432661 = fieldWeight in 5886, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5886)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 17. 5.2001 20:29:55
-
Schmitz-Esser, W.: EXPO-INFO 2000 : Visuelles Besucherinformationssystem für Weltausstellungen (2000)
0.00
9.3004614E-4 = product of:
0.007440369 = sum of:
0.007440369 = product of:
0.022321107 = sum of:
0.022321107 = weight(_text_:29 in 1404) [ClassicSimilarity], result of:
0.022321107 = score(doc=1404,freq=2.0), product of:
0.11486387 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.032653235 = queryNorm
0.19432661 = fieldWeight in 1404, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=1404)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Footnote
- Rez.in: KO 29(2002) no.2, S.103-104 (G.J.A. Riesthuis)
-
Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002)
0.00
9.3004614E-4 = product of:
0.007440369 = sum of:
0.007440369 = product of:
0.022321107 = sum of:
0.022321107 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
0.022321107 = score(doc=5226,freq=2.0), product of:
0.11486387 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.032653235 = queryNorm
0.19432661 = fieldWeight in 5226, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5226)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Abstract
- Tseng constructs a word co-occurrence based thesaurus by means of the automatic analysis of Chinese text. Words are identified by a longest dictionary match supplemented by a key word extraction algorithm that merges back nearby tokens and accepts shorter strings of characters if they occur more often than the longest string. Single character auxiliary words are a major source of error but this can be greatly reduced with the use of a 70-character 2680 word stop list. Extracted terms with their associate document weights are sorted by decreasing frequency and the top of this list is associated using a Dice coefficient modified to account for longer documents on the weights of term pairs. Co-occurrence is not in the document as a whole but in paragraph or sentence size sections in order to reduce computation time. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles and judges asked to review the top 50 terms associated with each of 30 single word query terms. They determined 69% to be relevant.
-
Müller, T.: Wissensrepräsentation mit semantischen Netzen im Bereich Luftfahrt (2006)
0.00
9.216798E-4 = product of:
0.007373438 = sum of:
0.007373438 = product of:
0.022120314 = sum of:
0.022120314 = weight(_text_:22 in 1670) [ClassicSimilarity], result of:
0.022120314 = score(doc=1670,freq=2.0), product of:
0.114346065 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.032653235 = queryNorm
0.19345059 = fieldWeight in 1670, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=1670)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 26. 9.2006 21:00:22
-
Burkart, M.: Thesaurus (2004)
0.00
7.373438E-4 = product of:
0.0058987504 = sum of:
0.0058987504 = product of:
0.01769625 = sum of:
0.01769625 = weight(_text_:22 in 2913) [ClassicSimilarity], result of:
0.01769625 = score(doc=2913,freq=2.0), product of:
0.114346065 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.032653235 = queryNorm
0.15476047 = fieldWeight in 2913, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=2913)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 5. 4.2013 10:18:22
-
Brühl, B.: Thesauri und Klassifikationen : Naturwissenschaften - Technik - Wirtschaft (2005)
0.00
7.373438E-4 = product of:
0.0058987504 = sum of:
0.0058987504 = product of:
0.01769625 = sum of:
0.01769625 = weight(_text_:22 in 3487) [ClassicSimilarity], result of:
0.01769625 = score(doc=3487,freq=2.0), product of:
0.114346065 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.032653235 = queryNorm
0.15476047 = fieldWeight in 3487, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=3487)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Series
- Materialien zur Information und Dokumentation; Bd.22