Search (22 results, page 1 of 2)

Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.01

0.010471511 = product of:
  0.052357554 = sum of:
    0.052357554 = product of:
      0.15707266 = sum of:
        0.15707266 = weight(_text_:themes in 934) [ClassicSimilarity], result of:
          0.15707266 = score(doc=934,freq=4.0), product of:
            0.19545428 = queryWeight, product of:
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.030401835 = queryNorm
            0.8036286 = fieldWeight in 934, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.0625 = fieldNorm(doc=934)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Abstract: Recent years have seen increased interest in text summarization with emphasis on evaluation of prototype systems. Many factors can affect the design of such evaluations, requiring choices among competing alternatives. This paper examines several major themes running through three evaluations: SUMMAC, NTCIR, and DUC, with a concentration on DUC. The themes are extrinsic and intrinsic evaluation, evaluation procedures and methods, generic versus focused summaries, single- and multi-document summaries, length and compression issues, extracts versus abstracts, and issues with genre.
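
The score breakdowns in this listing all follow the same arithmetic, which can be reproduced by hand. The sketch below is illustrative only: the function name `classic_similarity_score` and its parameters are hypothetical, and it assumes the default ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The `queryNorm`, `fieldNorm`, and `coord` values are copied from the explain output rather than derived.

```python
import math


def classic_similarity_score(freq, doc_freq, max_docs, query_norm, field_norm, coords):
    """Recompute one term's contribution from the values shown in an explain tree.

    coords is a list of (matched, total) pairs, one per coord(...) line.
    """
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))  # e.g. idf(docFreq=193, maxDocs=44218)
    tf = math.sqrt(freq)                             # e.g. tf(freq=4.0) = 2.0
    query_weight = idf * query_norm                  # "queryWeight" line
    field_weight = tf * idf * field_norm             # "fieldWeight" line
    score = query_weight * field_weight              # "weight(_text_:...)" line
    for matched, total in coords:                    # apply each coord factor in turn
        score *= matched / total
    return score


# Values taken from the breakdown for doc 934 (term "themes").
score = classic_similarity_score(
    freq=4.0,
    doc_freq=193,
    max_docs=44218,
    query_norm=0.030401835,
    field_norm=0.0625,
    coords=[(1, 3), (1, 5)],
)
print(score)
```

The printed value should agree with the listed 0.010471511 up to floating-point rounding, since the search engine reports single-precision values while Python computes in double precision.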

Cai, X.; Li, W.: Enhancing sentence-level clustering with integrated and interactive frameworks for theme-based summarization (2011) 0.00
```
0.0046277978 = product of:
  0.023138989 = sum of:
    0.023138989 = product of:
      0.06941696 = sum of:
        0.06941696 = weight(_text_:themes in 4770) [ClassicSimilarity], result of:
          0.06941696 = score(doc=4770,freq=2.0), product of:
            0.19545428 = queryWeight, product of:
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.030401835 = queryNorm
            0.35515702 = fieldWeight in 4770, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.429029 = idf(docFreq=193, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4770)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)
```
Abstract

Sentence clustering plays a pivotal role in theme-based summarization, which discovers topic themes defined as the clusters of highly related sentences to avoid redundancy and cover more diverse information. Because sentences are short and their content is limited, the bag-of-words cosine similarity traditionally used for document clustering is no longer suitable. Special treatment for measuring sentence similarity is necessary. In this article, we study the sentence-level clustering problem. After exploiting concept- and context-enriched sentence vector representations, we develop two co-clustering frameworks to enhance sentence-level clustering for theme-based summarization (integrated clustering and interactive clustering), both allowing word and document to play an explicit role in sentence clustering as independent text objects rather than using word or concept as features of a sentence in a document set. In each framework, we experiment with two-level co-clustering (i.e., sentence-word co-clustering or sentence-document co-clustering) and three-level co-clustering (i.e., document-sentence-word co-clustering). Compared against concept- and context-oriented sentence-representation reformation, co-clustering shows a clear advantage in both intrinsic clustering quality evaluation and extrinsic summarization evaluation conducted on the Document Understanding Conferences (DUC) datasets.

Johnson, F.: Automatic abstracting research (1995) 0.00

0.002845978 = product of:
  0.014229889 = sum of:
    0.014229889 = product of:
      0.042689666 = sum of:
        0.042689666 = weight(_text_:f in 3847) [ClassicSimilarity], result of:
          0.042689666 = score(doc=3847,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.35229704 = fieldWeight in 3847, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0625 = fieldNorm(doc=3847)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Harabagiu, S.; Hickl, A.; Lacatusu, F.: Satisfying information needs with multi-document summaries (2007) 0.00

0.002845978 = product of:
  0.014229889 = sum of:
    0.014229889 = product of:
      0.042689666 = sum of:
        0.042689666 = weight(_text_:f in 939) [ClassicSimilarity], result of:
          0.042689666 = score(doc=939,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.35229704 = fieldWeight in 939, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0625 = fieldNorm(doc=939)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Uyttendaele, C.; Moens, M.-F.; Dumortier, J.: SALOMON: automatic abstracting of legal cases for effective access to court decisions (1998) 0.00

0.0024902306 = product of:
  0.012451152 = sum of:
    0.012451152 = product of:
      0.037353456 = sum of:
        0.037353456 = weight(_text_:f in 495) [ClassicSimilarity], result of:
          0.037353456 = score(doc=495,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.3082599 = fieldWeight in 495, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0546875 = fieldNorm(doc=495)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Moens, M.-F.: Summarizing court decisions (2007) 0.00

0.0024902306 = product of:
  0.012451152 = sum of:
    0.012451152 = product of:
      0.037353456 = sum of:
        0.037353456 = weight(_text_:f in 954) [ClassicSimilarity], result of:
          0.037353456 = score(doc=954,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.3082599 = fieldWeight in 954, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0546875 = fieldNorm(doc=954)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.00

0.0021344833 = product of:
  0.010672417 = sum of:
    0.010672417 = product of:
      0.03201725 = sum of:
        0.03201725 = weight(_text_:f in 2256) [ClassicSimilarity], result of:
          0.03201725 = score(doc=2256,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.26422277 = fieldWeight in 2256, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=2256)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Moens, M.-F.; Uyttendaele, C.; Dumortier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 0.00

0.0021344833 = product of:
  0.010672417 = sum of:
    0.010672417 = product of:
      0.03201725 = sum of:
        0.03201725 = weight(_text_:f in 2944) [ClassicSimilarity], result of:
          0.03201725 = score(doc=2944,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.26422277 = fieldWeight in 2944, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=2944)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Liang, S.-F.; Devlin, S.; Tait, J.: Investigating sentence weighting components for automatic summarisation (2007) 0.00

0.0021344833 = product of:
  0.010672417 = sum of:
    0.010672417 = product of:
      0.03201725 = sum of:
        0.03201725 = weight(_text_:f in 899) [ClassicSimilarity], result of:
          0.03201725 = score(doc=899,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.26422277 = fieldWeight in 899, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=899)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Martinez-Romo, J.; Araujo, L.; Fernandez, A.D.: SemGraph : extracting keyphrases following a novel semantic graph-based approach (2016) 0.00
```
0.0021344833 = product of:
  0.010672417 = sum of:
    0.010672417 = product of:
      0.03201725 = sum of:
        0.03201725 = weight(_text_:f in 2832) [ClassicSimilarity], result of:
          0.03201725 = score(doc=2832,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.26422277 = fieldWeight in 2832, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=2832)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)
```
Abstract

Keyphrases represent the main topics a text is about. In this article, we introduce SemGraph, an unsupervised algorithm for extracting keyphrases from a collection of texts based on a semantic relationship graph. The main novelty of this algorithm is its ability to identify semantic relationships between words whose presence is statistically significant. Our method constructs a co-occurrence graph in which words appearing in the same document are linked, provided their presence in the collection is statistically significant with respect to a null model. Furthermore, the graph obtained is enriched with information from WordNet. We have used the most recent and standardized benchmark to evaluate the system's ability to detect the keyphrases that are part of the text. The result is a method that achieves an improvement of 5.3% and 7.28% in F measure over the two labeled sets of keyphrases used in the evaluation of SemEval-2010.

Yeh, J.-Y.; Ke, H.-R.; Yang, W.-P.; Meng, I.-H.: Text summarization using a trainable summarizer and latent semantic analysis (2005) 0.00
```
0.0017787361 = product of:
  0.008893681 = sum of:
    0.008893681 = product of:
      0.026681041 = sum of:
        0.026681041 = weight(_text_:f in 1003) [ClassicSimilarity], result of:
          0.026681041 = score(doc=1003,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.22018565 = fieldWeight in 1003, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1003)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)
```
Abstract

This paper proposes two approaches to address text summarization: a modified corpus-based approach (MCBA) and an LSA-based T.R.M. approach (LSA + T.R.M.). The first is a trainable summarizer, which takes into account several features, including position, positive keyword, negative keyword, centrality, and the resemblance to the title, to generate summaries. Two new ideas are exploited: (1) sentence positions are ranked to emphasize the significance of different sentence positions, and (2) the score function is trained by the genetic algorithm (GA) to obtain a suitable combination of feature weights. The second uses latent semantic analysis (LSA) to derive the semantic matrix of a document or a corpus and uses semantic sentence representation to construct a semantic text relationship map. We evaluate LSA + T.R.M. both with single documents and at the corpus level to investigate the competence of LSA in text summarization. The two novel approaches were measured at several compression rates on a data corpus composed of 100 political articles. When the compression rate was 30%, average f-measures of 49% for MCBA, 52% for MCBA + GA, and 44% and 40% for LSA + T.R.M. at the single-document and corpus levels, respectively, were achieved.

Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.00

0.0017787361 = product of:
  0.008893681 = sum of:
    0.008893681 = product of:
      0.026681041 = sum of:
        0.026681041 = weight(_text_:f in 2054) [ClassicSimilarity], result of:
          0.026681041 = score(doc=2054,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.22018565 = fieldWeight in 2054, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2054)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Wei, F.; Li, W.; Lu, Q.; He, Y.: Applying two-level reinforcement ranking in query-oriented multidocument summarization (2009) 0.00

0.0017787361 = product of:
  0.008893681 = sum of:
    0.008893681 = product of:
      0.026681041 = sum of:
        0.026681041 = weight(_text_:f in 3120) [ClassicSimilarity], result of:
          0.026681041 = score(doc=3120,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.22018565 = fieldWeight in 3120, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3120)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Galgani, F.; Compton, P.; Hoffmann, A.: Summarization based on bi-directional citation analysis (2015) 0.00

0.0017787361 = product of:
  0.008893681 = sum of:
    0.008893681 = product of:
      0.026681041 = sum of:
        0.026681041 = weight(_text_:f in 2685) [ClassicSimilarity], result of:
          0.026681041 = score(doc=2685,freq=2.0), product of:
            0.12117521 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.030401835 = queryNorm
            0.22018565 = fieldWeight in 2685, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2685)
      0.33333334 = coord(1/3)
  0.2 = coord(1/5)

Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.00

0.0016476115 = product of:
  0.008238058 = sum of:
    0.008238058 = product of:
      0.03295223 = sum of:
        0.03295223 = weight(_text_:22 in 6599) [ClassicSimilarity], result of:
          0.03295223 = score(doc=6599,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.30952093 = fieldWeight in 6599, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6599)
      0.25 = coord(1/4)
  0.2 = coord(1/5)

Date: 26. 2.1997 10:22:43

Robin, J.; McKeown, K.: Empirically designing and evaluating a new revision-based model for summary generation (1996) 0.00

0.0016476115 = product of:
  0.008238058 = sum of:
    0.008238058 = product of:
      0.03295223 = sum of:
        0.03295223 = weight(_text_:22 in 6751) [ClassicSimilarity], result of:
          0.03295223 = score(doc=6751,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.30952093 = fieldWeight in 6751, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6751)
      0.25 = coord(1/4)
  0.2 = coord(1/5)

Date: 6. 3.1997 16:22:15

Jones, P.A.; Bradbeer, P.V.G.: Discovery of optimal weights in a concept selection system (1996) 0.00

0.0016476115 = product of:
  0.008238058 = sum of:
    0.008238058 = product of:
      0.03295223 = sum of:
        0.03295223 = weight(_text_:22 in 6974) [ClassicSimilarity], result of:
          0.03295223 = score(doc=6974,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.30952093 = fieldWeight in 6974, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6974)
      0.25 = coord(1/4)
  0.2 = coord(1/5)

Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Vanderwende, L.; Suzuki, H.; Brockett, J.M.; Nenkova, A.: Beyond SumBasic : task-focused summarization with sentence simplification and lexical expansion (2007) 0.00
```
0.0012357087 = product of:
  0.0061785434 = sum of:
    0.0061785434 = product of:
      0.024714174 = sum of:
        0.024714174 = weight(_text_:22 in 948) [ClassicSimilarity], result of:
          0.024714174 = score(doc=948,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.23214069 = fieldWeight in 948, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=948)
      0.25 = coord(1/4)
  0.2 = coord(1/5)
```
Abstract

In recent years, there has been increased interest in topic-focused multi-document summarization. In this task, automatic summaries are produced in response to a specific information request, or topic, stated by the user. The system we have designed to accomplish this task comprises four main components: a generic extractive summarization system, a topic-focusing component, sentence simplification, and lexical expansion of topic words. This paper details each of these components, together with experiments designed to quantify their individual contributions. We include an analysis of our results on two large datasets commonly used to evaluate task-focused summarization, the DUC2005 and DUC2006 datasets, using automatic metrics. Additionally, we include an analysis of our results on the DUC2006 task according to human evaluation metrics. In the human evaluation of system summaries compared to human summaries, i.e., the Pyramid method, our system ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, our system ranked third out of 35 systems.

Wu, Y.-f.B.; Li, Q.; Bot, R.S.; Chen, X.: Finding nuggets in documents : a machine learning approach (2006) 0.00

0.0010297572 = product of:
  0.005148786 = sum of:
    0.005148786 = product of:
      0.020595144 = sum of:
        0.020595144 = weight(_text_:22 in 5290) [ClassicSimilarity], result of:
          0.020595144 = score(doc=5290,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.19345059 = fieldWeight in 5290, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5290)
      0.25 = coord(1/4)
  0.2 = coord(1/5)

Date: 22. 7.2006 17:25:48

Kim, H.H.; Kim, Y.H.: Generic speech summarization of transcribed lecture videos : using tags and their semantic relations (2016) 0.00

0.0010297572 = product of:
  0.005148786 = sum of:
    0.005148786 = product of:
      0.020595144 = sum of:
        0.020595144 = weight(_text_:22 in 2640) [ClassicSimilarity], result of:
          0.020595144 = score(doc=2640,freq=2.0), product of:
            0.10646205 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.030401835 = queryNorm
            0.19345059 = fieldWeight in 2640, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2640)
      0.25 = coord(1/4)
  0.2 = coord(1/5)

Date: 22. 1.2016 12:29:41
