-
Zajic, D.; Dorr, B.J.; Lin, J.; Schwartz, R.: Multi-candidate reduction : sentence compression as a tool for document summarization tasks (2007)
0.09
0.08835813 = product of:
  0.17671625 = sum of:
    0.17671625 = product of:
      0.3534325 = sum of:
        0.3534325 = weight(_text_:compression in 944) [ClassicSimilarity], result of:
          0.3534325 = score(doc=944,freq=6.0), product of:
            0.36069217 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.049309507 = queryNorm
            0.97987294 = fieldWeight in 944, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.0546875 = fieldNorm(doc=944)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
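The score breakdown above can be reproduced by hand. The following is a minimal sketch, not the catalog's actual code: it assumes the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the figures shown, and it takes queryNorm, fieldNorm, and the two coord factors directly from the listing rather than deriving them. The function names `classic_idf` and `classic_score` are hypothetical.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces idf(docFreq=79, maxDocs=44218) = 7.314861.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=()):
    # tf is the square root of the raw term frequency.
    tf = math.sqrt(freq)
    idf = classic_idf(doc_freq, max_docs)
    # queryWeight = idf * queryNorm
    query_weight = idf * query_norm
    # fieldWeight = tf * idf * fieldNorm
    field_weight = tf * idf * field_norm
    score = query_weight * field_weight
    # Each coord factor (here 1/2, applied twice) scales the score down.
    for c in coords:
        score *= c
    return score


# Term "compression" in doc 944, values copied from the listing.
score_944 = classic_score(6.0, 79, 44218, 0.049309507, 0.0546875, coords=(0.5, 0.5))

# Term "22" in doc 536, values copied from the listing.
score_536 = classic_score(2.0, 3622, 44218, 0.049309507, 0.109375, coords=(0.5, 0.5))

print(f"{score_944:.6f}  {score_536:.6f}")
```

The results agree with the listed totals (0.08835813 and 0.02338265) to within floating-point rounding; the search engine works in single precision, so the last digits may differ slightly.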
- Abstract
- This article examines the application of two single-document sentence compression techniques to the problem of multi-document summarization: a "parse-and-trim" approach and a statistical noisy-channel approach. We introduce the multi-candidate reduction (MCR) framework for multi-document summarization, in which many compressed candidates are generated for each source sentence. These candidates are then selected for inclusion in the final summary based on a combination of static and dynamic features. Evaluations demonstrate that sentence compression is a valuable component of a larger multi-document summarization framework.
-
Zajic, D.M.; Dorr, B.J.; Lin, J.: Single-document and multi-document summarization techniques for email threads using sentence compression (2008)
0.09
0.08835813 = product of:
  0.17671625 = sum of:
    0.17671625 = product of:
      0.3534325 = sum of:
        0.3534325 = weight(_text_:compression in 2105) [ClassicSimilarity], result of:
          0.3534325 = score(doc=2105,freq=6.0), product of:
            0.36069217 = queryWeight, product of:
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.049309507 = queryNorm
            0.97987294 = fieldWeight in 2105, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              7.314861 = idf(docFreq=79, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2105)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
- Abstract
- We present two approaches to email thread summarization: collective message summarization (CMS) applies a multi-document summarization approach, while individual message summarization (IMS) treats the problem as a sequence of single-document summarization tasks. Both approaches are implemented in our general framework driven by sentence compression. Instead of a purely extractive approach, we employ linguistic and statistical methods to generate multiple compressions, and then select from those candidates to produce a final summary. We demonstrate these ideas on the Enron email collection, a very challenging corpus because of its highly technical language. Experimental results point to two findings: that CMS represents a better approach to email thread summarization, and that current sentence compression techniques do not improve summarization performance in this genre.
-
Dorr, B.J.; Olsen, M.B.: Multilingual generation : the role of telicity in lexical choice and syntactic realization (1996)
0.02
0.02338265 = product of:
  0.0467653 = sum of:
    0.0467653 = product of:
      0.0935306 = sum of:
        0.0935306 = weight(_text_:22 in 536) [ClassicSimilarity], result of:
          0.0935306 = score(doc=536,freq=2.0), product of:
            0.1726735 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049309507 = queryNorm
            0.5416616 = fieldWeight in 536, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=536)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
- Date
- 31. 7.1996 9:22:19
-
Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997)
0.01
0.010021136 = product of:
  0.020042272 = sum of:
    0.020042272 = product of:
      0.040084545 = sum of:
        0.040084545 = weight(_text_:22 in 3244) [ClassicSimilarity], result of:
          0.040084545 = score(doc=3244,freq=2.0), product of:
            0.1726735 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049309507 = queryNorm
            0.23214069 = fieldWeight in 3244, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3244)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
- Date
- 31. 7.1996 9:22:19