-
Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010)
0.00
8.1174844E-4 = product of:
0.0048704906 = sum of:
0.0048704906 = product of:
0.024352452 = sum of:
0.024352452 = weight(_text_:28 in 5854) [ClassicSimilarity], result of:
0.024352452 = score(doc=5854,freq=2.0), product of:
0.12305808 = queryWeight, product of:
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03435205 = queryNorm
0.19789396 = fieldWeight in 5854, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.0390625 = fieldNorm(doc=5854)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
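The explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the usual Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are hypothetical, and queryNorm and fieldNorm are taken directly from the listing rather than recomputed.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity is assumed to define it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq: float, doc_freq: int, max_docs: int,
                       query_norm: float, field_norm: float) -> float:
    """Score of one term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # 1.4142135 for freq=2.0
    query_weight = idf * query_norm           # idf * queryNorm
    field_weight = tf * idf * field_norm      # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the explain output for doc 5854, term "28".
term_score = classic_term_score(freq=2.0, doc_freq=3342, max_docs=44218,
                                query_norm=0.03435205, field_norm=0.0390625)

# Two coordination factors are applied on the way up: coord(1/5) and coord(1/6).
final_score = term_score * (1 / 5) * (1 / 6)
print(f"{term_score:.9f} {final_score:.6e}")
```

Lucene computes with 32-bit floats, so the last digits may differ slightly from the listing. The same function applies to the other records by swapping in their docFreq and fieldNorm values.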
- Date
- 6. 1.1997 18:30:28
-
Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002)
0.00
7.827461E-4 = product of:
0.0046964767 = sum of:
0.0046964767 = product of:
0.023482382 = sum of:
0.023482382 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
0.023482382 = score(doc=5226,freq=2.0), product of:
0.12083977 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03435205 = queryNorm
0.19432661 = fieldWeight in 5226, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=5226)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Abstract
- Tseng constructs a thesaurus based on word co-occurrence by automatically analyzing Chinese text. Words are identified by a longest-dictionary-match procedure supplemented by a keyword extraction algorithm that merges nearby tokens back together and accepts shorter character strings if they occur more often than the longest string. Single-character auxiliary words are a major source of error, but this error can be greatly reduced by using a stop list of 70 characters and 2,680 words. Extracted terms, with their associated document weights, are sorted by decreasing frequency, and the terms at the top of this list are associated using a Dice coefficient, applied to the weights of term pairs and modified to account for longer documents. To reduce computation time, co-occurrence is measured not over the document as a whole but within paragraph- or sentence-sized sections. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles, and judges were asked to review the top 50 terms associated with each of 30 single-word query terms. They judged 69% to be relevant.
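As a rough illustration of the association step in the abstract above, here is a minimal sketch of the plain Dice coefficient computed over text sections. This is the textbook, unweighted form and not Tseng's modified version, which works on term weights and corrects for document length; the function name and the section ids are hypothetical.

```python
def dice_coefficient(sections_a: set, sections_b: set) -> float:
    """Plain Dice coefficient: 2 * |A and B| / (|A| + |B|).

    Each argument is the set of section ids (e.g. sentences or fixed-size
    windows) in which a term occurs.
    """
    total = len(sections_a) + len(sections_b)
    if total == 0:
        return 0.0  # neither term occurs anywhere
    return 2.0 * len(sections_a & sections_b) / total


# Hypothetical example: term A occurs in sections 1-4, term B in sections 3-5.
dice = dice_coefficient({1, 2, 3, 4}, {3, 4, 5})
print(dice)
```

Counting co-occurrence per section rather than per document keeps the sets small, which is consistent with the abstract's remark about reducing computation time.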
-
Mu, X.; Lu, K.; Ryu, H.: Explicitly integrating MeSH thesaurus help into health information retrieval systems : an empirical user study (2014)
0.00
7.827461E-4 = product of:
0.0046964767 = sum of:
0.0046964767 = product of:
0.023482382 = sum of:
0.023482382 = weight(_text_:29 in 2703) [ClassicSimilarity], result of:
0.023482382 = score(doc=2703,freq=2.0), product of:
0.12083977 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03435205 = queryNorm
0.19432661 = fieldWeight in 2703, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=2703)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 25. 1.2016 18:43:29
-
Dextre Clarke, S.G.; Gilchrist, A.; Will, L.: Revision and extension of thesaurus standards (2004)
0.00
6.493987E-4 = product of:
0.0038963922 = sum of:
0.0038963922 = product of:
0.01948196 = sum of:
0.01948196 = weight(_text_:28 in 2615) [ClassicSimilarity], result of:
0.01948196 = score(doc=2615,freq=2.0), product of:
0.12305808 = queryWeight, product of:
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03435205 = queryNorm
0.15831517 = fieldWeight in 2615, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03125 = fieldNorm(doc=2615)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 6. 1.1997 18:30:28
-
Shiri, A.A.; Revie, C.; Chowdhury, G.: Assessing the impact of user interaction with thesaural knowledge structures : a quantitative analysis framework (2003)
0.00
6.493987E-4 = product of:
0.0038963922 = sum of:
0.0038963922 = product of:
0.01948196 = sum of:
0.01948196 = weight(_text_:28 in 2766) [ClassicSimilarity], result of:
0.01948196 = score(doc=2766,freq=2.0), product of:
0.12305808 = queryWeight, product of:
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03435205 = queryNorm
0.15831517 = fieldWeight in 2766, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03125 = fieldNorm(doc=2766)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 6. 1.1997 18:30:28
-
Willis, C.; Losee, R.M.: A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013)
0.00
6.493987E-4 = product of:
0.0038963922 = sum of:
0.0038963922 = product of:
0.01948196 = sum of:
0.01948196 = weight(_text_:28 in 1016) [ClassicSimilarity], result of:
0.01948196 = score(doc=1016,freq=2.0), product of:
0.12305808 = queryWeight, product of:
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03435205 = queryNorm
0.15831517 = fieldWeight in 1016, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03125 = fieldNorm(doc=1016)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 28. 7.2013 14:20:39
-
Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010)
0.00
6.2619685E-4 = product of:
0.003757181 = sum of:
0.003757181 = product of:
0.018785905 = sum of:
0.018785905 = weight(_text_:29 in 4639) [ClassicSimilarity], result of:
0.018785905 = score(doc=4639,freq=2.0), product of:
0.12083977 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03435205 = queryNorm
0.15546128 = fieldWeight in 4639, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03125 = fieldNorm(doc=4639)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 29. 7.2011 14:44:56
-
Mooers, C.N.: The indexing language of an information retrieval system (1985)
0.00
5.4299337E-4 = product of:
0.00325796 = sum of:
0.00325796 = product of:
0.0162898 = sum of:
0.0162898 = weight(_text_:22 in 3644) [ClassicSimilarity], result of:
0.0162898 = score(doc=3644,freq=2.0), product of:
0.120295025 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03435205 = queryNorm
0.1354154 = fieldWeight in 3644, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.02734375 = fieldNorm(doc=3644)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Footnote
- Original in: Information retrieval today: papers presented at an Institute conducted by the Library School and the Center for Continuation Study, University of Minnesota, Sept. 19-22, 1962. Ed. by Wesley Simonton. Minneapolis, Minn.: The Center, 1963. pp. 21-36.
-
Moreira, A.; Alvarenga, L.; Paiva Oliveira, A. de: "Thesaurus" and "Ontology" : a study of the definitions found in the computer and information science literature (2004)
0.00
4.87049E-4 = product of:
0.002922294 = sum of:
0.002922294 = product of:
0.01461147 = sum of:
0.01461147 = weight(_text_:28 in 3726) [ClassicSimilarity], result of:
0.01461147 = score(doc=3726,freq=2.0), product of:
0.12305808 = queryWeight, product of:
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.03435205 = queryNorm
0.11873637 = fieldWeight in 3726, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5822632 = idf(docFreq=3342, maxDocs=44218)
0.0234375 = fieldNorm(doc=3726)
0.2 = coord(1/5)
0.16666667 = coord(1/6)
- Date
- 6. 1.1997 18:30:28