Damerau, F.J.: An experiment in automatic indexing (1965)
- Source
- American documentation. 16(1965), pp.283-289
- Type
- a
Damerau, F.J.: Generating and evaluating domain-oriented multi-word terms from texts (1993)
- Abstract
- Examines techniques for automatically generating domain vocabularies from large text collections. Focuses on the problem of generating multi-word vocabulary terms (specifically pairs). Discusses statistical issues associated with word co-occurrences likely to be of use in a natural language interface. Aims to provide a more objective evaluation of the selection procedures. In the absence of substantial experimentation with subjects using a working query system, all evaluation is necessarily subjective. Uses a surrogate for such experimentation, relying on pre-existing dictionaries as indicators of domain relevance.
- Source
- Information processing and management. 29(1993) no.4, pp.433-447
- Type
- a