-
Eastman, C.M.: Overlaps in postings to thesaurus terms : a preliminary study (1988)
0.00
0.0012378087 = product of:
0.011140279 = sum of:
0.011140279 = product of:
0.033420835 = sum of:
0.033420835 = weight(_text_:22 in 3555) [ClassicSimilarity], result of:
0.033420835 = score(doc=3555,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.2708308 = fieldWeight in 3555, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=3555)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 25.12.1995 22:52:34
-
Busch, J.A.: Building and accessing vocabulary resources for networked resource discovery and navigation (1998)
0.00
0.0012378087 = product of:
0.011140279 = sum of:
0.011140279 = product of:
0.033420835 = sum of:
0.033420835 = weight(_text_:22 in 2346) [ClassicSimilarity], result of:
0.033420835 = score(doc=2346,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.2708308 = fieldWeight in 2346, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2346)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 22. 9.1997 19:16:05
-
Nielsen, M.L.: Thesaurus construction : key issues and selected readings (2004)
0.00
0.0012378087 = product of:
0.011140279 = sum of:
0.011140279 = product of:
0.033420835 = sum of:
0.033420835 = weight(_text_:22 in 5006) [ClassicSimilarity], result of:
0.033420835 = score(doc=5006,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.2708308 = fieldWeight in 5006, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=5006)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 18. 5.2006 20:06:22
-
Schneider, J.W.; Borlund, P.: ¬A bibliometric-based semiautomatic approach to identification of candidate thesaurus terms : parsing and filtering of noun phrases from citation contexts (2005)
0.00
0.0012378087 = product of:
0.011140279 = sum of:
0.011140279 = product of:
0.033420835 = sum of:
0.033420835 = weight(_text_:22 in 156) [ClassicSimilarity], result of:
0.033420835 = score(doc=156,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.2708308 = fieldWeight in 156, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=156)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 8. 3.2007 19:55:22
-
Huckstorf, A.; Petras, V.: Mind the lexical gap : EuroVoc Building Block of the Semantic Web (2011)
0.00
0.0010706098 = product of:
0.009635488 = sum of:
0.009635488 = product of:
0.028906463 = sum of:
0.028906463 = weight(_text_:29 in 2782) [ClassicSimilarity], result of:
0.028906463 = score(doc=2782,freq=2.0), product of:
0.123959966 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.035239052 = queryNorm
0.23319192 = fieldWeight in 2782, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=2782)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 29. 3.2013 17:46:08
-
Assem, M. van; Gangemi, A.; Schreiber, G.: Conversion of WordNet to a standard RDF/OWL representation (2006)
0.00
0.0010706098 = product of:
0.009635488 = sum of:
0.009635488 = product of:
0.028906463 = sum of:
0.028906463 = weight(_text_:29 in 4641) [ClassicSimilarity], result of:
0.028906463 = score(doc=4641,freq=2.0), product of:
0.123959966 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.035239052 = queryNorm
0.23319192 = fieldWeight in 4641, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=4641)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 29. 7.2011 14:44:56
-
Aitchison, J.; Dextre Clarke, S.G.: ¬The Thesaurus : a historical viewpoint, with a look to the future (2004)
0.00
0.001060979 = product of:
0.00954881 = sum of:
0.00954881 = product of:
0.02864643 = sum of:
0.02864643 = weight(_text_:22 in 5005) [ClassicSimilarity], result of:
0.02864643 = score(doc=5005,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.23214069 = fieldWeight in 5005, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=5005)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 22. 9.2007 15:46:13
-
Bagheri, M.: Development of thesauri in Iran (2006)
0.00
0.001060979 = product of:
0.00954881 = sum of:
0.00954881 = product of:
0.02864643 = sum of:
0.02864643 = weight(_text_:22 in 260) [ClassicSimilarity], result of:
0.02864643 = score(doc=260,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.23214069 = fieldWeight in 260, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=260)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Source
- Indexer. 25(2006) no.1, S.19-22
-
Keyser, P. de: Indexing : from thesauri to the Semantic Web (2012)
0.00
0.001060979 = product of:
0.00954881 = sum of:
0.00954881 = product of:
0.02864643 = sum of:
0.02864643 = weight(_text_:22 in 3197) [ClassicSimilarity], result of:
0.02864643 = score(doc=3197,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.23214069 = fieldWeight in 3197, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=3197)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 24. 8.2016 14:03:22
-
Cheti, A.; Viti, E.: Functionality and merits of a faceted thesaurus : the case of the Nuovo soggettario (2023)
0.00
0.001060979 = product of:
0.00954881 = sum of:
0.00954881 = product of:
0.02864643 = sum of:
0.02864643 = weight(_text_:22 in 1181) [ClassicSimilarity], result of:
0.02864643 = score(doc=1181,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.23214069 = fieldWeight in 1181, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=1181)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 26.11.2023 18:59:22
-
Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002)
0.00
8.9217484E-4 = product of:
0.008029574 = sum of:
0.008029574 = product of:
0.02408872 = sum of:
0.02408872 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
0.02408872 = score(doc=5226,freq=2.0), product of:
0.123959966 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.035239052 = queryNorm
0.19432661 = fieldWeight in 5226, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5226)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Abstract
- Tseng constructs a word co-occurrence based thesaurus by means of the automatic analysis of Chinese text. Words are identified by a longest dictionary match supplemented by a key word extraction algorithm that merges back nearby tokens and accepts shorter strings of characters if they occur more often than the longest string. Single character auxiliary words are a major source of error but this can be greatly reduced with the use of a 70-character 2680 word stop list. Extracted terms with their associate document weights are sorted by decreasing frequency and the top of this list is associated using a Dice coefficient modified to account for longer documents on the weights of term pairs. Co-occurrence is not in the document as a whole but in paragraph or sentence size sections in order to reduce computation time. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles and judges asked to review the top 50 terms associated with each of 30 single word query terms. They determined 69% to be relevant.
-
Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010)
0.00
7.1373984E-4 = product of:
0.0064236587 = sum of:
0.0064236587 = product of:
0.019270975 = sum of:
0.019270975 = weight(_text_:29 in 4639) [ClassicSimilarity], result of:
0.019270975 = score(doc=4639,freq=2.0), product of:
0.123959966 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.035239052 = queryNorm
0.15546128 = fieldWeight in 4639, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03125 = fieldNorm(doc=4639)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 29. 7.2011 14:44:56
-
Burkart, M.: Thesaurus (2004)
0.00
7.073193E-4 = product of:
0.006365874 = sum of:
0.006365874 = product of:
0.01909762 = sum of:
0.01909762 = weight(_text_:22 in 2913) [ClassicSimilarity], result of:
0.01909762 = score(doc=2913,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.15476047 = fieldWeight in 2913, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=2913)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Date
- 5. 4.2013 10:18:22
-
Brühl, B.: Thesauri und Klassifikationen : Naturwissenschaften - Technik - Wirtschaft (2005)
0.00
7.073193E-4 = product of:
0.006365874 = sum of:
0.006365874 = product of:
0.01909762 = sum of:
0.01909762 = weight(_text_:22 in 3487) [ClassicSimilarity], result of:
0.01909762 = score(doc=3487,freq=2.0), product of:
0.12340116 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.035239052 = queryNorm
0.15476047 = fieldWeight in 3487, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03125 = fieldNorm(doc=3487)
0.33333334 = coord(1/3)
0.11111111 = coord(1/9)
- Series
- Materialien zur Information und Dokumentation; Bd.22