Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006)
0.09
0.08851941 = product of:
0.11802588 = sum of:
0.033937775 = weight(_text_:web in 657) [ClassicSimilarity], result of:
0.033937775 = score(doc=657,freq=2.0), product of:
0.18824495 = queryWeight, product of:
3.2635105 = idf(docFreq=4597, maxDocs=44218)
0.057681736 = queryNorm
0.18028519 = fieldWeight in 657, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.2635105 = idf(docFreq=4597, maxDocs=44218)
0.0390625 = fieldNorm(doc=657)
0.038493924 = weight(_text_:search in 657) [ClassicSimilarity], result of:
0.038493924 = score(doc=657,freq=2.0), product of:
0.20048308 = queryWeight, product of:
3.475677 = idf(docFreq=3718, maxDocs=44218)
0.057681736 = queryNorm
0.19200584 = fieldWeight in 657, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.475677 = idf(docFreq=3718, maxDocs=44218)
0.0390625 = fieldNorm(doc=657)
0.04559418 = product of:
0.09118836 = sum of:
0.09118836 = weight(_text_:engine in 657) [ClassicSimilarity], result of:
0.09118836 = score(doc=657,freq=2.0), product of:
0.30856833 = queryWeight, product of:
5.349498 = idf(docFreq=570, maxDocs=44218)
0.057681736 = queryNorm
0.29552078 = fieldWeight in 657, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.349498 = idf(docFreq=570, maxDocs=44218)
0.0390625 = fieldNorm(doc=657)
0.5 = coord(1/2)
0.75 = coord(3/4)
- Abstract
- Purpose - The purpose of this research is to develop a method for automatic construction of multi-document summaries of sets of news articles that might be retrieved by a web search engine in response to a user query. Design/methodology/approach - Based on the cross-document discourse analysis, an event-based framework is proposed for integrating and organizing information extracted from different news articles. It has a hierarchical structure in which the summarized information is presented at the top level and more detailed information given at the lower levels. A tree-view interface was implemented for displaying a multi-document summary based on the framework. A preliminary user evaluation was performed by comparing the framework-based summaries against the sentence-based summaries. Findings - In a small evaluation, all the human subjects preferred the framework-based summaries to the sentence-based summaries. It indicates that the event-based framework is an effective way to summarize a set of news articles reporting an event or a series of relevant events. Research limitations/implications - Limited to event-based news articles only, not applicable to news critiques and other kinds of news articles. A summarization system based on the event-based framework is being implemented. Practical implications - Multi-document summarization of news articles can adopt the proposed event-based framework. Originality/value - An event-based framework for summarizing sets of news articles was developed and evaluated using a tree-view interface for displaying such summaries.