-
Chen, H.; Ng, T.D.; Martinez, J.; Schatz, B.R.: ¬A concept space approach to addressing the vocabulary problem in scientific information retrieval : an experiment on the Worm Community System (1997)
0.00
0.0022137975 = product of:
0.01771038 = sum of:
0.01771038 = product of:
0.05313114 = sum of:
0.05313114 = weight(_text_:problem in 6492) [ClassicSimilarity], result of:
0.05313114 = score(doc=6492,freq=6.0), product of:
0.13082431 = queryWeight, product of:
4.244485 = idf(docFreq=1723, maxDocs=44218)
0.030822188 = queryNorm
0.4061259 = fieldWeight in 6492, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.244485 = idf(docFreq=1723, maxDocs=44218)
0.0390625 = fieldNorm(doc=6492)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Abstract
- This research presents an algorithmic approach to addressing the vocabulary problem in scientific information retrieval and information sharing, using the molecular biology domain as an example. We first present a literature review of cognitive studies related to the vocabulary problem and vocabulary-based search aids (thesauri) and then discuss techniques for building robust and domain-specific thesauri to assist in cross-domain scientific information retrieval. Using a variation of the automatic thesaurus generation techniques, which we refer to as the concept space approach, we recently conducted an experiment in the molecular biology domain in which we created a C. elegans worm thesaurus of 7.657 worm-specific terms and a Drosophila fly thesaurus of 15.626 terms. About 30% of these terms overlapped, which created vocabulary paths from one subject domain to the other. Based on a cognitve study of term association involving 4 biologists, we found that a large percentage (59,6-85,6%) of the terms suggested by the subjects were identified in the cojoined fly-worm thesaurus. However, we found only a small percentage (8,4-18,1%) of the associations suggested by the subjects in the thesaurus
-
Schatz, B.R.: Information analysis in the net : the interspace of the twenty-first century (1998)
0.00
0.0013919937 = product of:
0.01113595 = sum of:
0.01113595 = product of:
0.03340785 = sum of:
0.03340785 = weight(_text_:22 in 2344) [ClassicSimilarity], result of:
0.03340785 = score(doc=2344,freq=2.0), product of:
0.10793405 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.030822188 = queryNorm
0.30952093 = fieldWeight in 2344, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=2344)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 22. 9.1997 19:16:05
-
Ramsey, M.C.; Chen, H.; Zhu, B.; Schatz, B.R.: ¬A collection of visual thesauri for browsing large collections of geographic images (1999)
0.00
0.0012290506 = product of:
0.009832405 = sum of:
0.009832405 = product of:
0.029497212 = sum of:
0.029497212 = weight(_text_:29 in 3922) [ClassicSimilarity], result of:
0.029497212 = score(doc=3922,freq=2.0), product of:
0.108422816 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.030822188 = queryNorm
0.27205724 = fieldWeight in 3922, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=3922)
0.33333334 = coord(1/3)
0.125 = coord(1/8)
- Date
- 21. 7.1999 13:48:29