Damerau, F.J.: Generating and evaluating domain-oriented multi-word terms from texts (1993)
- Abstract
- Examines techniques for automatically generating domain vocabularies from large text collections. Focuses on the problem of generating multi-word vocabulary terms (specifically, word pairs). Discusses statistical issues associated with word co-occurrences that are likely to be of use in a natural language interface. Without substantial experimentation with subjects using a working query system, all evaluation is necessarily subjective. To provide a more objective evaluation of the selection procedures, uses a surrogate for such experimentation: pre-existing dictionaries serve as indicators of domain relevance.
- Source
- Information processing and management. 29(1993) no.4, pp.433-447
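- Illustration
- The abstract describes selecting word pairs by co-occurrence statistics and checking them against pre-existing dictionaries. The sketch below is one way such a pipeline might look. It is not the paper's procedure: pointwise mutual information is a stand-in association measure, and the function names, the `min_count` threshold, and the toy data are all hypothetical.

```python
import math
from collections import Counter


def score_pairs(tokens, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    PMI is an assumed stand-in; the record does not name the statistic
    used in the paper.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)
    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # rare pairs give unreliable estimates
        p_pair = count / n_bi
        p1 = unigrams[w1] / n_uni
        p2 = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_pair / (p1 * p2))
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def dictionary_precision(ranked, known_terms, k):
    """Fraction of the top-k pairs found in a pre-existing term list.

    This mirrors the surrogate evaluation idea: the dictionary acts as
    a proxy for domain relevance in place of user experiments.
    """
    top = [pair for pair, _ in ranked[:k]]
    if not top:
        return 0.0
    return sum(pair in known_terms for pair in top) / len(top)


if __name__ == "__main__":
    # Toy corpus and dictionary, invented for illustration only.
    tokens = "interest rate the interest rate the bank the loan".split()
    ranked = score_pairs(tokens)
    print(ranked)
    print(dictionary_precision(ranked, {("interest", "rate")}, k=2))
```

On real collections, a stop-word filter and a more robust statistic than PMI would probably be needed, since PMI tends to overrate pairs of rare words.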