Search (5 results, page 1 of 1)

Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.06
```
0.057997324 = product of:
  0.11599465 = sum of:
    0.11599465 = product of:
      0.2319893 = sum of:
        0.2319893 = weight(_text_:news in 657) [ClassicSimilarity], result of:
          0.2319893 = score(doc=657,freq=18.0), product of:
            0.26705483 = queryWeight, product of:
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.05094824 = queryNorm
            0.8686954 = fieldWeight in 657, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.0390625 = fieldNorm(doc=657)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Purpose - The purpose of this research is to develop a method for automatic construction of multi-document summaries of sets of news articles that might be retrieved by a web search engine in response to a user query. Design/methodology/approach - Based on the cross-document discourse analysis, an event-based framework is proposed for integrating and organizing information extracted from different news articles. It has a hierarchical structure in which the summarized information is presented at the top level and more detailed information given at the lower levels. A tree-view interface was implemented for displaying a multi-document summary based on the framework. A preliminary user evaluation was performed by comparing the framework-based summaries against the sentence-based summaries. Findings - In a small evaluation, all the human subjects preferred the framework-based summaries to the sentence-based summaries. It indicates that the event-based framework is an effective way to summarize a set of news articles reporting an event or a series of relevant events. Research limitations/implications - Limited to event-based news articles only, not applicable to news critiques and other kinds of news articles. A summarization system based on the event-based framework is being implemented. Practical implications - Multi-document summarization of news articles can adopt the proposed event-based framework. Originality/value - An event-based framework for summarizing sets of news articles was developed and evaluated using a tree-view interface for displaying such summaries.
Chen, H.-H.; Kuo, J.-J.; Huang, S.-J.; Lin, C.-J.; Wung, H.-C.: ¬A summarization system for Chinese news from multiple sources (2003) 0.05
```
0.046397857 = product of:
  0.092795715 = sum of:
    0.092795715 = product of:
      0.18559143 = sum of:
        0.18559143 = weight(_text_:news in 2115) [ClassicSimilarity], result of:
          0.18559143 = score(doc=2115,freq=8.0), product of:
            0.26705483 = queryWeight, product of:
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.05094824 = queryNorm
            0.6949563 = fieldWeight in 2115, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.046875 = fieldNorm(doc=2115)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This article proposes a summarization system for multiple documents. It employs not only named entities and other signatures to cluster news from different sources, but also employs punctuation marks, linking elements, and topic chains to identify the meaningful units (MUs). Using nouns and verbs to identify the similar MUs, focusing and browsing models are applied to represent the summarization results. To reduce information loss during summarization, informative words in a document are introduced. For the evaluation, a question answering system (QA system) is proposed to substitute the human assessors. In large-scale experiments containing 140 questions to 17,877 documents, the results show that those models using informative words outperform pure heuristic voting-only strategy by news reporters. This model can be easily further applied to summarize multilingual news from multiple sources.
Moens, M.F.; Dumortier, J.: Use of a text grammar for generating highlight abstracts of magazine articles (2000) 0.03
```
0.027065417 = product of:
  0.054130834 = sum of:
    0.054130834 = product of:
      0.10826167 = sum of:
        0.10826167 = weight(_text_:news in 4540) [ClassicSimilarity], result of:
          0.10826167 = score(doc=4540,freq=2.0), product of:
            0.26705483 = queryWeight, product of:
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.05094824 = queryNorm
            0.40539116 = fieldWeight in 4540, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2416887 = idf(docFreq=635, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4540)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Browsing a database of article abstracts is one way to select and buy relevant magazine articles online. Our research contributes to the design and development of text grammars for abstracting texts in unlimited subject domains. We developed a system that parses texts based on the text grammar of a specific text type and that extracts sentences and statements which are relevant for inclusion in the abstracts. The system employs knowledge of the discourse patterns that are typical of news stories. The results are encouraging and demonstrate the importance of discourse structures in text summarisation.
Vanderwende, L.; Suzuki, H.; Brockett, J.M.; Nenkova, A.: Beyond SumBasic : task-focused summarization with sentence simplification and lexical expansion (2007) 0.01
```
0.010354174 = product of:
  0.020708349 = sum of:
    0.020708349 = product of:
      0.041416697 = sum of:
        0.041416697 = weight(_text_:22 in 948) [ClassicSimilarity], result of:
          0.041416697 = score(doc=948,freq=2.0), product of:
            0.17841205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05094824 = queryNorm
            0.23214069 = fieldWeight in 948, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=948)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

In recent years, there has been increased interest in topic-focused multi-document summarization. In this task, automatic summaries are produced in response to a specific information request, or topic, stated by the user. The system we have designed to accomplish this task comprises four main components: a generic extractive summarization system, a topic-focusing component, sentence simplification, and lexical expansion of topic words. This paper details each of these components, together with experiments designed to quantify their individual contributions. We include an analysis of our results on two large datasets commonly used to evaluate task-focused summarization, the DUC2005 and DUC2006 datasets, using automatic metrics. Additionally, we include an analysis of our results on the DUC2006 task according to human evaluation metrics. In the human evaluation of system summaries compared to human summaries, i.e., the Pyramid method, our system ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, our system ranked third out of 35 systems.

Wu, Y.-f.B.; Li, Q.; Bot, R.S.; Chen, X.: Finding nuggets in documents : a machine learning approach (2006) 0.01

0.008628479 = product of:
  0.017256958 = sum of:
    0.017256958 = product of:
      0.034513917 = sum of:
        0.034513917 = weight(_text_:22 in 5290) [ClassicSimilarity], result of:
          0.034513917 = score(doc=5290,freq=2.0), product of:
            0.17841205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05094824 = queryNorm
            0.19345059 = fieldWeight in 5290, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5290)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 7.2006 17:25:48

Search (5 results, page 1 of 1)

Authors

Themes