Wright, L.W.; Nardini, H.K.G.; Aronson, A.R.; Rindflesch, T.C.: Hierarchical concept indexing of full-text documents in the Unified Medical Language System Information sources Map (1999)
0.04
0.03752559 = product of:
0.09006142 = sum of:
0.01898392 = weight(_text_:information in 2111) [ClassicSimilarity], result of:
0.01898392 = score(doc=2111,freq=14.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.3078936 = fieldWeight in 2111, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=2111)
0.016415559 = weight(_text_:for in 2111) [ClassicSimilarity], result of:
0.016415559 = score(doc=2111,freq=8.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.24892932 = fieldWeight in 2111, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.046875 = fieldNorm(doc=2111)
0.018328644 = weight(_text_:the in 2111) [ClassicSimilarity], result of:
0.018328644 = score(doc=2111,freq=20.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3307489 = fieldWeight in 2111, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=2111)
0.018004656 = weight(_text_:of in 2111) [ClassicSimilarity], result of:
0.018004656 = score(doc=2111,freq=20.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.32781258 = fieldWeight in 2111, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.046875 = fieldNorm(doc=2111)
0.018328644 = weight(_text_:the in 2111) [ClassicSimilarity], result of:
0.018328644 = score(doc=2111,freq=20.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.3307489 = fieldWeight in 2111, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.046875 = fieldNorm(doc=2111)
0.41666666 = coord(5/12)
- Abstract
- Full-text documents are a vital and rapidly growing part of online biomedical information. A single large document can contain as much information as a small database, but normally lacks the tight structure and consistent indexing of a database. Retrieval systems will often miss highly relevant parts of a document if the document as a whole appears irrelevant. Access to full-text information is further complicated by the need to search separately many disparate information resources. This research explores how these problems can be addressed by the combined use of 2 techniques: 1) natural language processing for automatic concept-based indexing of full text, and 2) methods for exploiting the structure and hierarchy of full-text documents. We describe methods for applying these techniques to a large collection of full-text documents drawn from the Health Services / Technology Assessment Text (HSTAT) database at the NLM and examine how this hierarchical concept indexing can assist both document- and source-level retrieval in the context of NLM's Information Source Map project
- Source
- Journal of the American Society for Information Science. 50(1999) no.6, S.514-523