Search (2 results, page 1 of 1)
- Did you mean:
- rvk_ss%3a%2200 15860 allgemeines %2f medien- und kommunikationswissenschaften%2c kommunikationsdesign %2f formen der kommunikation und des kommunikationsdesigns %2f kommunikationsdesign in elektronischen medien%22 2
- rvk_ss%3a%2200 15860 allgemeinen %2f medien- und kommunikationswissenschaften%2c kommunikationsdesign %2f formen der kommunikation und des kommunikationsdesigns %2f kommunikationsdesign in elektronischen medien%22 2
- rvk_ss%3a%2200 15860 allgemeines %2f medien- und kommunikationswissenschaften%2c kommunikationsdesign %2f firmen der kommunikation und des kommunikationsdesigns %2f kommunikationsdesign in elektronischen medien%22 2
- rvk_ss%3a%2200 15860 allgemeines %2f medien- und kommunikationswissenschaften%2c kommunikationsdesign %2f formen der kommunikation und des kommunikationsdesign %2f kommunikationsdesign in elektronischen medien%22 2
- rvk_ss%3a%2200 15860 allgemeines %2f medien- und kommunikationswissenschaften%2c kommunikationsdesigns %2f formen der kommunikation und des kommunikationsdesigns %2f kommunikationsdesign in elektronischen medien%22 2
-
Chen, H.; Martinez, J.; Kirchhoff, A.; Ng, T.D.; Schatz, B.R.: Alleviating search uncertainty through concept associations : automatic indexing, co-occurence analysis, and parallel computing (1998)
0.01
0.0085469205 = product of: 0.0427346 = sum of: 0.009376213 = weight(_text_:und in 5202) [ClassicSimilarity], result of: 0.009376213 = score(doc=5202,freq=2.0), product of: 0.06381599 = queryWeight, product of: 2.216367 = idf(docFreq=13101, maxDocs=44218) 0.02879306 = queryNorm 0.14692576 = fieldWeight in 5202, product of: 1.4142135 = tf(freq=2.0), with freq of: 2.0 = termFreq=2.0 2.216367 = idf(docFreq=13101, maxDocs=44218) 0.046875 = fieldNorm(doc=5202) 0.009376213 = weight(_text_:und in 5202) [ClassicSimilarity], result of: 0.009376213 = score(doc=5202,freq=2.0), product of: 0.06381599 = queryWeight, product of: 2.216367 = idf(docFreq=13101, maxDocs=44218) 0.02879306 = queryNorm 0.14692576 = fieldWeight in 5202, product of: 1.4142135 = tf(freq=2.0), with freq of: 2.0 = termFreq=2.0 2.216367 = idf(docFreq=13101, maxDocs=44218) 0.046875 = fieldNorm(doc=5202) 0.014638159 = weight(_text_:des in 5202) [ClassicSimilarity], result of: 0.014638159 = score(doc=5202,freq=2.0), product of: 0.079736836 = queryWeight, product of: 2.7693076 = idf(docFreq=7536, maxDocs=44218) 0.02879306 = queryNorm 0.18358089 = fieldWeight in 5202, product of: 1.4142135 = tf(freq=2.0), with freq of: 2.0 = termFreq=2.0 2.7693076 = idf(docFreq=7536, maxDocs=44218) 0.046875 = fieldNorm(doc=5202) 0.009344013 = weight(_text_:in in 5202) [ClassicSimilarity], result of: 0.009344013 = score(doc=5202,freq=14.0), product of: 0.039165888 = queryWeight, product of: 1.3602545 = idf(docFreq=30841, maxDocs=44218) 0.02879306 = queryNorm 0.23857531 = fieldWeight in 5202, product of: 3.7416575 = tf(freq=14.0), with freq of: 14.0 = termFreq=14.0 1.3602545 = idf(docFreq=30841, maxDocs=44218) 0.046875 = fieldNorm(doc=5202) 0.2 = coord(4/20)
- Abstract
- In this article, we report research on an algorithmic approach to alleviating search uncertainty in a large information space. Grounded on object filtering, automatic indexing, and co-occurence analysis, we performed a large-scale experiment using a parallel supercomputer (SGI Power Challenge) to analyze 400.000+ abstracts in an INSPEC computer engineering collection. Two system-generated thesauri, one based on a combined object filtering and automatic indexing method, and the other based on automatic indexing only, were compaed with the human-generated INSPEC subject thesaurus. Our user evaluation revealed that the system-generated thesauri were better than the INSPEC thesaurus in 'concept recall', but in 'concept precision' the 3 thesauri were comparable. Our analysis also revealed that the terms suggested by the 3 thesauri were complementary and could be used to significantly increase 'variety' in search terms the thereby reduce search uncertainty
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
Semantisches Umfeld in Indexierung u. Retrieval
-
Chen, H.; Ng, T.D.; Martinez, J.; Schatz, B.R.: ¬A concept space approach to addressing the vocabulary problem in scientific information retrieval : an experiment on the Worm Community System (1997)
0.00
3.8933393E-4 = product of: 0.0077866786 = sum of: 0.0077866786 = weight(_text_:in in 6492) [ClassicSimilarity], result of: 0.0077866786 = score(doc=6492,freq=14.0), product of: 0.039165888 = queryWeight, product of: 1.3602545 = idf(docFreq=30841, maxDocs=44218) 0.02879306 = queryNorm 0.19881277 = fieldWeight in 6492, product of: 3.7416575 = tf(freq=14.0), with freq of: 14.0 = termFreq=14.0 1.3602545 = idf(docFreq=30841, maxDocs=44218) 0.0390625 = fieldNorm(doc=6492) 0.05 = coord(1/20)
- Abstract
- This research presents an algorithmic approach to addressing the vocabulary problem in scientific information retrieval and information sharing, using the molecular biology domain as an example. We first present a literature review of cognitive studies related to the vocabulary problem and vocabulary-based search aids (thesauri) and then discuss techniques for building robust and domain-specific thesauri to assist in cross-domain scientific information retrieval. Using a variation of the automatic thesaurus generation techniques, which we refer to as the concept space approach, we recently conducted an experiment in the molecular biology domain in which we created a C. elegans worm thesaurus of 7.657 worm-specific terms and a Drosophila fly thesaurus of 15.626 terms. About 30% of these terms overlapped, which created vocabulary paths from one subject domain to the other. Based on a cognitve study of term association involving 4 biologists, we found that a large percentage (59,6-85,6%) of the terms suggested by the subjects were identified in the cojoined fly-worm thesaurus. However, we found only a small percentage (8,4-18,1%) of the associations suggested by the subjects in the thesaurus