-
Chen, H.; Ng, T.D.; Martinez, J.; Schatz, B.R.: A concept space approach to addressing the vocabulary problem in scientific information retrieval : an experiment on the Worm Community System (1997)
0.02
0.015060814 = product of:
  0.030121628 = sum of:
    0.030121628 = product of:
      0.04518244 = sum of:
        0.029749434 = weight(_text_:c in 6492) [ClassicSimilarity], result of:
          0.029749434 = score(doc=6492,freq=2.0), product of:
            0.15612034 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.045260075 = queryNorm
            0.1905545 = fieldWeight in 6492, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.0390625 = fieldNorm(doc=6492)
        0.015433006 = weight(_text_:h in 6492) [ClassicSimilarity], result of:
          0.015433006 = score(doc=6492,freq=2.0), product of:
            0.11244635 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045260075 = queryNorm
            0.13724773 = fieldWeight in 6492, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=6492)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
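The score explanation above can be re-derived by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names `idf` and `term_score` are hypothetical helpers introduced for illustration, and `queryNorm` and `fieldNorm` are copied from the listing rather than computed.

```python
import math

# Values copied from the explanation for doc 6492; not computed here.
QUERY_NORM = 0.045260075
MAX_DOCS = 44218
FIELD_NORM = 0.0390625


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """Score of one query term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                        # tf(freq=2.0) -> 1.4142135
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm        # "queryWeight" line
    field_weight = tf * term_idf * field_norm   # "fieldWeight" line
    return query_weight * field_weight


# The two matching terms, "c" and "h", each occur twice in the document.
score_c = term_score(2.0, 3817, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_h = term_score(2.0, 10020, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Sum the term scores, then apply coord(2/3) and coord(1/2).
total = (score_c + score_h) * (2 / 3) * (1 / 2)

print(f"c: {score_c:.9f}  h: {score_h:.9f}  total: {total:.9f}")
```

Lucene computes these values in single precision, so the sketch should agree with the listed numbers only to about six or seven significant digits.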
- Abstract
- This research presents an algorithmic approach to addressing the vocabulary problem in scientific information retrieval and information sharing, using the molecular biology domain as an example. We first present a literature review of cognitive studies related to the vocabulary problem and vocabulary-based search aids (thesauri) and then discuss techniques for building robust and domain-specific thesauri to assist in cross-domain scientific information retrieval. Using a variation of the automatic thesaurus generation techniques, which we refer to as the concept space approach, we recently conducted an experiment in the molecular biology domain in which we created a C. elegans worm thesaurus of 7,657 worm-specific terms and a Drosophila fly thesaurus of 15,626 terms. About 30% of these terms overlapped, which created vocabulary paths from one subject domain to the other. Based on a cognitive study of term association involving 4 biologists, we found that a large percentage (59.6-85.6%) of the terms suggested by the subjects were identified in the conjoined fly-worm thesaurus. However, we found only a small percentage (8.4-18.1%) of the associations suggested by the subjects in the thesaurus.
-
Schatz, B.R.; Johnson, E.H.; Cochrane, P.A.; Chen, H.: Interactive term suggestion for users of digital libraries : using thesauri and co-occurrence lists for information retrieval (1996)
0.01
0.0051443353 = product of:
  0.010288671 = sum of:
    0.010288671 = product of:
      0.030866012 = sum of:
        0.030866012 = weight(_text_:h in 6417) [ClassicSimilarity], result of:
          0.030866012 = score(doc=6417,freq=2.0), product of:
            0.11244635 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045260075 = queryNorm
            0.27449545 = fieldWeight in 6417, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.078125 = fieldNorm(doc=6417)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)
-
Ramsey, M.C.; Chen, H.; Zhu, B.; Schatz, B.R.: A collection of visual thesauri for browsing large collections of geographic images (1999)
0.00
0.0036010346 = product of:
  0.0072020693 = sum of:
    0.0072020693 = product of:
      0.021606207 = sum of:
        0.021606207 = weight(_text_:h in 3922) [ClassicSimilarity], result of:
          0.021606207 = score(doc=3922,freq=2.0), product of:
            0.11244635 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045260075 = queryNorm
            0.19214681 = fieldWeight in 3922, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3922)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)
-
Chen, H.; Martinez, J.; Kirchhoff, A.; Ng, T.D.; Schatz, B.R.: Alleviating search uncertainty through concept associations : automatic indexing, co-occurrence analysis, and parallel computing (1998)
0.00
0.0030866012 = product of:
  0.0061732023 = sum of:
    0.0061732023 = product of:
      0.018519606 = sum of:
        0.018519606 = weight(_text_:h in 5202) [ClassicSimilarity], result of:
          0.018519606 = score(doc=5202,freq=2.0), product of:
            0.11244635 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045260075 = queryNorm
            0.16469726 = fieldWeight in 5202, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.046875 = fieldNorm(doc=5202)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)
-
Chen, H.; Houston, A.L.; Sewell, R.R.; Schatz, B.R.: Internet browsing and searching : user evaluations of category map and concept space techniques (1998)
0.00
0.0020577342 = product of:
  0.0041154684 = sum of:
    0.0041154684 = product of:
      0.012346405 = sum of:
        0.012346405 = weight(_text_:h in 869) [ClassicSimilarity], result of:
          0.012346405 = score(doc=869,freq=2.0), product of:
            0.11244635 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045260075 = queryNorm
            0.10979818 = fieldWeight in 869, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.03125 = fieldNorm(doc=869)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)