Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation (1996)
- Abstract
- Provides a linguistic approach to text windowing through an extraction of term variants with the help of a partial parser. The syntactic grounding of the method ensures that words observed within restricted spans are lexically related and that spurious word co-occurrences are ruled out with a good level of confidence. The system is computationally tractable on large corpora and large lists of terms. Gives illustrative examples of term variation from a large medical corpus. An experimental evaluation of the method shows that only a small proportion of co-occurring words are lexically related and motivates the call for natural language parsing techniques in text windowing.