Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation (1996)
0.03
0.031244382 = product of:
0.07498652 = sum of:
0.008371122 = weight(_text_:information in 5578) [ClassicSimilarity], result of:
0.008371122 = score(doc=5578,freq=2.0), product of:
0.0616574 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.035122856 = queryNorm
0.13576832 = fieldWeight in 5578, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.009575742 = weight(_text_:for in 5578) [ClassicSimilarity], result of:
0.009575742 = score(doc=5578,freq=2.0), product of:
0.06594466 = queryWeight, product of:
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.035122856 = queryNorm
0.14520876 = fieldWeight in 5578, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.8775425 = idf(docFreq=18385, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.01912591 = weight(_text_:the in 5578) [ClassicSimilarity], result of:
0.01912591 = score(doc=5578,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.34513593 = fieldWeight in 5578, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.01878783 = weight(_text_:of in 5578) [ClassicSimilarity], result of:
0.01878783 = score(doc=5578,freq=16.0), product of:
0.054923624 = queryWeight, product of:
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.035122856 = queryNorm
0.34207192 = fieldWeight in 5578, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5637573 = idf(docFreq=25162, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.01912591 = weight(_text_:the in 5578) [ClassicSimilarity], result of:
0.01912591 = score(doc=5578,freq=16.0), product of:
0.05541559 = queryWeight, product of:
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.035122856 = queryNorm
0.34513593 = fieldWeight in 5578, product of:
4.0 = tf(freq=16.0), with freq of:
16.0 = termFreq=16.0
1.5777643 = idf(docFreq=24812, maxDocs=44218)
0.0546875 = fieldNorm(doc=5578)
0.41666666 = coord(5/12)
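The arithmetic in the score explanation above can be reproduced with a short sketch. It assumes the classic Lucene TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), which agree with the values listed; the function and variable names are illustrative and not part of any library.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed classic formula; it matches the idf values in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, idf: float, query_norm: float, field_norm: float) -> float:
    # score = queryWeight * fieldWeight
    query_weight = idf * query_norm                     # idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Constants copied from the explanation for doc 5578.
QUERY_NORM = 0.035122856
FIELD_NORM = 0.0546875
MAX_DOCS = 44218

# (term, termFreq, docFreq) for each matching clause; "the" matches twice.
matched = [
    ("information", 2.0, 20772),
    ("for", 2.0, 18385),
    ("the", 16.0, 24812),
    ("of", 16.0, 25162),
    ("the", 16.0, 24812),
]

term_scores = [
    term_score(freq, classic_idf(df, MAX_DOCS), QUERY_NORM, FIELD_NORM)
    for _, freq, df in matched
]

coord = 5 / 12  # 5 of 12 query clauses matched
total = sum(term_scores) * coord
print(f"{total:.9f}")
```

The result should agree with the listed score up to single-precision rounding, since the listing reports 32-bit float values.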
- Abstract
- Provides a linguistic approach to text windowing through the extraction of term variants with the help of a partial parser. The syntactic grounding of the method ensures that words observed within restricted spans are lexically related and that spurious word co-occurrences are ruled out with a good level of confidence. The system is computationally tractable on large corpora and large lists of terms. Gives illustrative examples of term variation from a large medical corpus. An experimental evaluation of the method shows that only a small proportion of co-occurring words are lexically related, which motivates the call for natural language parsing techniques in text windowing.
- Source
- Information processing and management. 32(1996) no.4, pp.445-458