Chen, H.; Martinez, J.; Kirchhoff, A.; Ng, T.D.; Schatz, B.R.: Alleviating search uncertainty through concept associations : automatic indexing, co-occurrence analysis, and parallel computing (1998)
0.23
0.22779256 = product of:
  0.27335107 = sum of:
    0.015386774 = weight(_text_:und in 5202) [ClassicSimilarity], result of:
      0.015386774 = score(doc=5202,freq=2.0), product of:
        0.104724824 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.04725067 = queryNorm
        0.14692576 = fieldWeight in 5202, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=5202)
    0.07342099 = weight(_text_:anwendung in 5202) [ClassicSimilarity], result of:
      0.07342099 = score(doc=5202,freq=2.0), product of:
        0.22876309 = queryWeight, product of:
          4.8414783 = idf(docFreq=948, maxDocs=44218)
          0.04725067 = queryNorm
        0.3209477 = fieldWeight in 5202, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8414783 = idf(docFreq=948, maxDocs=44218)
          0.046875 = fieldNorm(doc=5202)
    0.02402186 = weight(_text_:des in 5202) [ClassicSimilarity], result of:
      0.02402186 = score(doc=5202,freq=2.0), product of:
        0.13085164 = queryWeight, product of:
          2.7693076 = idf(docFreq=7536, maxDocs=44218)
          0.04725067 = queryNorm
        0.18358089 = fieldWeight in 5202, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.7693076 = idf(docFreq=7536, maxDocs=44218)
          0.046875 = fieldNorm(doc=5202)
    0.10259437 = weight(_text_:prinzips in 5202) [ClassicSimilarity], result of:
      0.10259437 = score(doc=5202,freq=2.0), product of:
        0.27041927 = queryWeight, product of:
          5.723078 = idf(docFreq=392, maxDocs=44218)
          0.04725067 = queryNorm
        0.37939 = fieldWeight in 5202, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.723078 = idf(docFreq=392, maxDocs=44218)
          0.046875 = fieldNorm(doc=5202)
    0.057927076 = product of:
      0.11585415 = sum of:
        0.11585415 = weight(_text_:thesaurus in 5202) [ClassicSimilarity], result of:
          0.11585415 = score(doc=5202,freq=6.0), product of:
            0.21834905 = queryWeight, product of:
              4.6210785 = idf(docFreq=1182, maxDocs=44218)
              0.04725067 = queryNorm
            0.5305915 = fieldWeight in 5202, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.6210785 = idf(docFreq=1182, maxDocs=44218)
              0.046875 = fieldNorm(doc=5202)
      0.5 = coord(1/2)
  0.8333333 = coord(5/6)
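The score breakdown above can be re-derived by hand. The following is a minimal Python sketch, not the search engine's own code: it assumes the classic TF-IDF formulas that are consistent with the numbers shown (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), and the function and variable names are illustrative only. The constants queryNorm, fieldNorm, and the coord factors are copied from the explanation rather than computed.

```python
import math

# Constants copied from the score explanation above (not recomputed here).
MAX_DOCS = 44218
QUERY_NORM = 0.04725067
FIELD_NORM = 0.046875


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces the idf values listed in the explanation.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq):
    # score = queryWeight * fieldWeight
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq)
    query_weight = term_idf * QUERY_NORM
    field_weight = tf * term_idf * FIELD_NORM
    return query_weight * field_weight


# (termFreq, docFreq) per query term, as listed in the explanation.
top_level_terms = {
    "und": (2.0, 13101),
    "anwendung": (2.0, 948),
    "des": (2.0, 7536),
    "prinzips": (2.0, 392),
}

top_level_sum = sum(term_score(f, d) for f, d in top_level_terms.values())

# The "thesaurus" clause sits in a nested query and is scaled by coord(1/2).
thesaurus_part = term_score(6.0, 1182) * 0.5

# The outer query matched 5 of 6 clauses, hence coord(5/6).
total = (top_level_sum + thesaurus_part) * (5.0 / 6.0)

print(round(total, 4))
```

Small differences in the last digits are expected, since the explanation was presumably produced with single-precision floats while Python uses double precision.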
- Abstract
- In this article, we report research on an algorithmic approach to alleviating search uncertainty in a large information space. Grounded on object filtering, automatic indexing, and co-occurrence analysis, we performed a large-scale experiment using a parallel supercomputer (SGI Power Challenge) to analyze 400,000+ abstracts in an INSPEC computer engineering collection. Two system-generated thesauri, one based on a combined object filtering and automatic indexing method, and the other based on automatic indexing only, were compared with the human-generated INSPEC subject thesaurus. Our user evaluation revealed that the system-generated thesauri were better than the INSPEC thesaurus in 'concept recall', but in 'concept precision' the three thesauri were comparable. Our analysis also revealed that the terms suggested by the three thesauri were complementary and could be used to significantly increase 'variety' in search terms and thereby reduce search uncertainty.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus (Design and application of the thesaurus principle)