Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation (1996)
0.00
0.0031324127 = product of:
0.0062648254 = sum of:
0.0062648254 = product of:
0.012529651 = sum of:
0.012529651 = weight(_text_:a in 5578) [ClassicSimilarity], result of:
0.012529651 = score(doc=5578,freq=14.0), product of:
0.053105544 = queryWeight, product of:
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.046056706 = queryNorm
0.23593865 = fieldWeight in 5578, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
1.153047 = idf(docFreq=37942, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Provides a linguistic approach to text windowing through an extraction of term variants with the help of a partial parser. The syntactic grounding of the method ensures ehat words observed within restricted spans are lexically related and that spurious word cooccurrences are rules out with a good level of confidence. The system is computationally tractable on large corpora and large lists of terms. Gives illustrative examples of term variation from a large medical corpus. An experimental evaluation of the method shows that only a small proportion of co-occuring words are lexically related and motivates the call for natural language parsing techniques in text windowing
- Type
- a