Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006)
0.00
3.8738234E-4 = product of:
0.005810735 = sum of:
0.0038187557 = weight(_text_:in in 657) [ClassicSimilarity], result of:
0.0038187557 = score(doc=657,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.1301535 = fieldWeight in 657, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.0390625 = fieldNorm(doc=657)
0.001991979 = weight(_text_:s in 657) [ClassicSimilarity], result of:
0.001991979 = score(doc=657,freq=4.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.08494043 = fieldWeight in 657, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=657)
0.06666667 = coord(2/30)
- Abstract
- Purpose - The purpose of this research is to develop a method for automatic construction of multi-document summaries of sets of news articles that might be retrieved by a web search engine in response to a user query. Design/methodology/approach - Based on the cross-document discourse analysis, an event-based framework is proposed for integrating and organizing information extracted from different news articles. It has a hierarchical structure in which the summarized information is presented at the top level and more detailed information given at the lower levels. A tree-view interface was implemented for displaying a multi-document summary based on the framework. A preliminary user evaluation was performed by comparing the framework-based summaries against the sentence-based summaries. Findings - In a small evaluation, all the human subjects preferred the framework-based summaries to the sentence-based summaries. It indicates that the event-based framework is an effective way to summarize a set of news articles reporting an event or a series of relevant events. Research limitations/implications - Limited to event-based news articles only, not applicable to news critiques and other kinds of news articles. A summarization system based on the event-based framework is being implemented. Practical implications - Multi-document summarization of news articles can adopt the proposed event-based framework. Originality/value - An event-based framework for summarizing sets of news articles was developed and evaluated using a tree-view interface for displaying such summaries.
- Source
- Aslib proceedings. 58(2006) no.3, S.276-291
Ou, S.; Khoo, S.G.; Goh, D.H.: Automatic multidocument summarization of research abstracts : design and user evaluation (2007)
0.00
3.8738234E-4 = product of:
0.005810735 = sum of:
0.0038187557 = weight(_text_:in in 522) [ClassicSimilarity], result of:
0.0038187557 = score(doc=522,freq=6.0), product of:
0.029340398 = queryWeight, product of:
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.021569785 = queryNorm
0.1301535 = fieldWeight in 522, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.3602545 = idf(docFreq=30841, maxDocs=44218)
0.0390625 = fieldNorm(doc=522)
0.001991979 = weight(_text_:s in 522) [ClassicSimilarity], result of:
0.001991979 = score(doc=522,freq=4.0), product of:
0.023451481 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.021569785 = queryNorm
0.08494043 = fieldWeight in 522, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.0390625 = fieldNorm(doc=522)
0.06666667 = coord(2/30)
- Abstract
- The purpose of this study was to develop a method for automatic construction of multidocument summaries of sets of research abstracts that may be retrieved by a digital library or search engine in response to a user query. Sociology dissertation abstracts were selected as the sample domain in this study. A variable-based framework was proposed for integrating and organizing research concepts and relationships as well as research methods and contextual relations extracted from different dissertation abstracts. Based on the framework, a new summarization method was developed, which parses the discourse structure of abstracts, extracts research concepts and relationships, integrates the information across different abstracts, and organizes and presents them in a Web-based interface. The focus of this article is on the user evaluation that was performed to assess the overall quality and usefulness of the summaries. Two types of variable-based summaries generated using the summarization method-with or without the use of a taxonomy-were compared against a sentence-based summary that lists only the research-objective sentences extracted from each abstract and another sentence-based summary generated using the MEAD system that extracts important sentences. The evaluation results indicate that the majority of sociological researchers (70%) and general users (64%) preferred the variable-based summaries generated with the use of the taxonomy.
- Source
- Journal of the American Society for Information Science and Technology. 58(2007) no.10, S.1419-1435