Search (36 results, page 1 of 2)

Jones, P.A.; Bradbeer, P.V.G.: Discovery of optimal weights in a concept selection system (1996) 0.02

0.01634812 = product of:
  0.09808872 = sum of:
    0.09808872 = sum of:
      0.05577461 = weight(_text_:methods in 6974) [ClassicSimilarity], result of:
        0.05577461 = score(doc=6974,freq=2.0), product of:
          0.15695344 = queryWeight, product of:
            4.0204134 = idf(docFreq=2156, maxDocs=44218)
            0.03903913 = queryNorm
          0.35535768 = fieldWeight in 6974, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.0204134 = idf(docFreq=2156, maxDocs=44218)
            0.0625 = fieldNorm(doc=6974)
      0.04231411 = weight(_text_:22 in 6974) [ClassicSimilarity], result of:
        0.04231411 = score(doc=6974,freq=2.0), product of:
          0.1367084 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03903913 = queryNorm
          0.30952093 = fieldWeight in 6974, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=6974)
  0.16666667 = coord(1/6)

Abstract: Describes the application of weighting strategies to model uncertainties and probabilities in automatic abstracting systems, particularly in the concept selection phase. The weights were originally assigned in an ad hoc manner and were then refined by manual analysis of the results. The new method attempts to derive a more systematic methods and performs this using a genetic algorithm
Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Oh, H.; Nam, S.; Zhu, Y.: Structured abstract summarization of scientific articles : summarization using full-text section information (2023) 0.01
```
0.010217575 = product of:
  0.06130545 = sum of:
    0.06130545 = sum of:
      0.034859132 = weight(_text_:methods in 889) [ClassicSimilarity], result of:
        0.034859132 = score(doc=889,freq=2.0), product of:
          0.15695344 = queryWeight, product of:
            4.0204134 = idf(docFreq=2156, maxDocs=44218)
            0.03903913 = queryNorm
          0.22209854 = fieldWeight in 889, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.0204134 = idf(docFreq=2156, maxDocs=44218)
            0.0390625 = fieldNorm(doc=889)
      0.026446318 = weight(_text_:22 in 889) [ClassicSimilarity], result of:
        0.026446318 = score(doc=889,freq=2.0), product of:
          0.1367084 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.03903913 = queryNorm
          0.19345059 = fieldWeight in 889, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=889)
  0.16666667 = coord(1/6)
```
Abstract

The automatic summarization of scientific articles differs from other text genres because of the structured format and longer text length. Previous approaches have focused on tackling the lengthy nature of scientific articles, aiming to improve the computational efficiency of summarizing long text using a flat, unstructured abstract. However, the structured format of scientific articles and characteristics of each section have not been fully explored, despite their importance. The lack of a sufficient investigation and discussion of various characteristics for each section and their influence on summarization results has hindered the practical use of automatic summarization for scientific articles. To provide a balanced abstract proportionally emphasizing each section of a scientific article, the community introduced the structured abstract, an abstract with distinct, labeled sections. Using this information, in this study, we aim to understand tasks ranging from data preparation to model evaluation from diverse viewpoints. Specifically, we provide a preprocessed large-scale dataset and propose a summarization method applying the introduction, methods, results, and discussion (IMRaD) format reflecting the characteristics of each section. We also discuss the objective benchmarks and perspectives of state-of-the-art algorithms and present the challenges and research directions in this area.

Date

22. 1.2023 18:57:12

Kim, H.H.; Kim, Y.H.: Generic speech summarization of transcribed lecture videos : using tags and their semantic relations (2016) 0.01

0.00885545 = product of:
  0.026566349 = sum of:
    0.01334319 = product of:
      0.02668638 = sum of:
        0.02668638 = weight(_text_:29 in 2640) [ClassicSimilarity], result of:
          0.02668638 = score(doc=2640,freq=2.0), product of:
            0.13732746 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03903913 = queryNorm
            0.19432661 = fieldWeight in 2640, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2640)
      0.5 = coord(1/2)
    0.013223159 = product of:
      0.026446318 = sum of:
        0.026446318 = weight(_text_:22 in 2640) [ClassicSimilarity], result of:
          0.026446318 = score(doc=2640,freq=2.0), product of:
            0.1367084 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03903913 = queryNorm
            0.19345059 = fieldWeight in 2640, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2640)
      0.5 = coord(1/2)
  0.33333334 = coord(2/6)

Date: 22. 1.2016 12:29:41

Pinto, M.: Engineering the production of meta-information : the abstracting concern (2003) 0.01

0.008806056 = product of:
  0.052836336 = sum of:
    0.052836336 = product of:
      0.10567267 = sum of:
        0.10567267 = weight(_text_:29 in 4667) [ClassicSimilarity], result of:
          0.10567267 = score(doc=4667,freq=4.0), product of:
            0.13732746 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03903913 = queryNorm
            0.7694941 = fieldWeight in 4667, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.109375 = fieldNorm(doc=4667)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 27.11.2005 18:29:55
Source: Journal of information science. 29(2003) no.5, S.405-418

Ling, X.; Jiang, J.; He, X.; Mei, Q.; Zhai, C.; Schatz, B.: Generating gene summaries from biomedical literature : a study of semi-structured summarization (2007) 0.01
```
0.0076857167 = product of:
  0.0461143 = sum of:
    0.0461143 = product of:
      0.0922286 = sum of:
        0.0922286 = weight(_text_:methods in 946) [ClassicSimilarity], result of:
          0.0922286 = score(doc=946,freq=14.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.5876176 = fieldWeight in 946, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0390625 = fieldNorm(doc=946)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Most knowledge accumulated through scientific discoveries in genomics and related biomedical disciplines is buried in the vast amount of biomedical literature. Since understanding gene regulations is fundamental to biomedical research, summarizing all the existing knowledge about a gene based on literature is highly desirable to help biologists digest the literature. In this paper, we present a study of methods for automatically generating gene summaries from biomedical literature. Unlike most existing work on automatic text summarization, in which the generated summary is often a list of extracted sentences, we propose to generate a semi-structured summary which consists of sentences covering specific semantic aspects of a gene. Such a semi-structured summary is more appropriate for describing genes and poses special challenges for automatic text summarization. We propose a two-stage approach to generate such a summary for a given gene - first retrieving articles about a gene and then extracting sentences for each specified semantic aspect. We address the issue of gene name variation in the first stage and propose several different methods for sentence extraction in the second stage. We evaluate the proposed methods using a test set with 20 genes. Experiment results show that the proposed methods can generate useful semi-structured gene summaries automatically from biomedical literature, and our proposed methods outperform general purpose summarization methods. Among all the proposed methods for sentence extraction, a probabilistic language modeling approach that models gene context performs the best.

Xianghao, G.; Yixin, Z.; Li, Y.: ¬A new method of news test understanding and abstracting based on speech acts theory (1998) 0.01

0.0070320163 = product of:
  0.042192098 = sum of:
    0.042192098 = product of:
      0.084384196 = sum of:
        0.084384196 = weight(_text_:theory in 3532) [ClassicSimilarity], result of:
          0.084384196 = score(doc=3532,freq=4.0), product of:
            0.16234003 = queryWeight, product of:
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03903913 = queryNorm
            0.51979905 = fieldWeight in 3532, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.0625 = fieldNorm(doc=3532)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Abstract: Presents a method for the automated analysis and comprehension of foreign affairs news produced by a Chinese news agency. Notes that the development of the method was prededed by a study of the structuring rules of the news. Describes how an abstract of the news story is produced automatically from the analysis. Stresses the main aim of the work which is to use specch act theory to analyse and classify sentences

Galgani, F.; Compton, P.; Hoffmann, A.: Summarization based on bi-directional citation analysis (2015) 0.01
```
0.0064956164 = product of:
  0.038973697 = sum of:
    0.038973697 = product of:
      0.07794739 = sum of:
        0.07794739 = weight(_text_:methods in 2685) [ClassicSimilarity], result of:
          0.07794739 = score(doc=2685,freq=10.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.4966275 = fieldWeight in 2685, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2685)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Automatic document summarization using citations is based on summarizing what others explicitly say about the document, by extracting a summary from text around the citations (citances). While this technique works quite well for summarizing the impact of scientific articles, other genres of documents as well as other types of summaries require different approaches. In this paper, we introduce a new family of methods that we developed for legal documents summarization to generate catchphrases for legal cases (where catchphrases are a form of legal summary). Our methods use both incoming and outgoing citations, and we show how citances can be combined with other elements of cited and citing documents, including the full text of the target document, and catchphrases of cited and citing cases. On a legal summarization corpus, our methods outperform competitive baselines. The combination of full text sentences and catchphrases from cited and citing cases is particularly successful. We also apply and evaluate the methods on scientific paper summarization, where they perform at the level of state-of-the-art techniques. Our family of citation-based summarization methods is powerful and flexible enough to target successfully a range of different domains and summarization tasks.
Hirao, T.; Okumura, M.; Yasuda, N.; Isozaki, H.: Supervised automatic evaluation for summarization with voted regression model (2007) 0.01
```
0.006037779 = product of:
  0.036226675 = sum of:
    0.036226675 = product of:
      0.07245335 = sum of:
        0.07245335 = weight(_text_:methods in 942) [ClassicSimilarity], result of:
          0.07245335 = score(doc=942,freq=6.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.4616232 = fieldWeight in 942, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.046875 = fieldNorm(doc=942)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

The high quality evaluation of generated summaries is needed if we are to improve automatic summarization systems. Although human evaluation provides better results than automatic evaluation methods, its cost is huge and it is difficult to reproduce the results. Therefore, we need an automatic method that simulates human evaluation if we are to improve our summarization system efficiently. Although automatic evaluation methods have been proposed, they are unreliable when used for individual summaries. To solve this problem, we propose a supervised automatic evaluation method based on a new regression model called the voted regression model (VRM). VRM has two characteristics: (1) model selection based on 'corrected AIC' to avoid multicollinearity, (2) voting by the selected models to alleviate the problem of overfitting. Evaluation results obtained for TSC3 and DUC2004 show that our method achieved error reductions of about 17-51% compared with conventional automatic evaluation methods. Moreover, our method obtained the highest correlation coefficients in several different experiments.

Su, H.: Automatic abstracting (1996) 0.01

0.0058098556 = product of:
  0.034859132 = sum of:
    0.034859132 = product of:
      0.069718264 = sum of:
        0.069718264 = weight(_text_:methods in 150) [ClassicSimilarity], result of:
          0.069718264 = score(doc=150,freq=2.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.4441971 = fieldWeight in 150, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.078125 = fieldNorm(doc=150)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Abstract: Presents an introductory overview of research into the automatic construction of abstracts from the texts of documents. Discusses the origin and definition of automatic abstracting; reasons for using automatic abstracting; methods of automatic abstracting; and evaluation problems

Reeve, L.H.; Han, H.; Brooks, A.D.: ¬The use of domain-specific concepts in biomedical text summarization (2007) 0.01
```
0.0058098556 = product of:
  0.034859132 = sum of:
    0.034859132 = product of:
      0.069718264 = sum of:
        0.069718264 = weight(_text_:methods in 955) [ClassicSimilarity], result of:
          0.069718264 = score(doc=955,freq=8.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.4441971 = fieldWeight in 955, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0390625 = fieldNorm(doc=955)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Text summarization is a method for data reduction. The use of text summarization enables users to reduce the amount of text that must be read while still assimilating the core information. The data reduction offered by text summarization is particularly useful in the biomedical domain, where physicians must continuously find clinical trial study information to incorporate into their patient treatment efforts. Such efforts are often hampered by the high-volume of publications. This paper presents two independent methods (BioChain and FreqDist) for identifying salient sentences in biomedical texts using concepts derived from domain-specific resources. Our semantic-based method (BioChain) is effective at identifying thematic sentences, while our frequency-distribution method (FreqDist) removes information redundancy. The two methods are then combined to form a hybrid method (ChainFreq). An evaluation of each method is performed using the ROUGE system to compare system-generated summaries against a set of manually-generated summaries. The BioChain and FreqDist methods outperform some common summarization systems, while the ChainFreq method improves upon the base approaches. Our work shows that the best performance is achieved when the two methods are combined. The paper also presents a brief physician's evaluation of three randomly-selected papers from an evaluation corpus to show that the author's abstract does not always reflect the entire contents of the full-text.
Salton, G.: Automatic text structuring and summarization (1997) 0.01
```
0.005751464 = product of:
  0.034508783 = sum of:
    0.034508783 = product of:
      0.06901757 = sum of:
        0.06901757 = weight(_text_:methods in 145) [ClassicSimilarity], result of:
          0.06901757 = score(doc=145,freq=4.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.43973273 = fieldWeight in 145, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0546875 = fieldNorm(doc=145)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Applies the ideas from the automatic link generation research to automatic text summarisation. Using techniques for inter-document link generation, generates intra-document links between passages of a document. Based on the intra-document linkage pattern of a text, characterises the structure of the text. Applies the knowledge of text structure to do automatic text summarisation by passage extraction. Evaluates a set of 50 summaries generated using these techniques by comparing the to paragraph extracts constructed by humans. The automatic summarisation methods perform well, especially in view of the fact that the summaries generates by 2 humans for the same article are surprisingly dissimilar

Footnote

Contribution to a special issue on methods and tools for the automatic construction of hypertext
Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.01
```
0.0053827665 = product of:
  0.032296598 = sum of:
    0.032296598 = product of:
      0.064593196 = sum of:
        0.064593196 = weight(_text_:theory in 1719) [ClassicSimilarity], result of:
          0.064593196 = score(doc=1719,freq=6.0), product of:
            0.16234003 = queryWeight, product of:
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03903913 = queryNorm
            0.39788827 = fieldWeight in 1719, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1719)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

Many automatic text summarization models have been developed in the last decades. Related research in information science has shown that human abstractors extract sentences for summaries based on the hierarchical structure of documents; however, the existing automatic summarization models do not take into account the human abstractor's behavior of sentence extraction and only consider the document as a sequence of sentences during the process of extraction of sentences as a summary. In general, a document exhibits a well-defined hierarchical structure that can be described as fractals - mathematical objects with a high degree of redundancy. In this article, we introduce the fractal summarization model based on the fractal theory. The important information is captured from the source document by exploring the hierarchical structure and salient features of the document. A condensed version of the document that is informatively close to the source document is produced iteratively using the contractive transformation in the fractal theory. The fractal summarization model is the first attempt to apply fractal theory to document summarization. It significantly improves the divergence of information coverage of summary and the precision of summary. User evaluations have been conducted. Results have indicated that fractal summarization is promising and outperforms current summarization techniques that do not consider the hierarchical structure of documents.
Dorr, B.J.; Gaasterland, T.: Exploiting aspectual features and connecting words for summarization-inspired temporal-relation extraction (2007) 0.01
```
0.005274012 = product of:
  0.031644072 = sum of:
    0.031644072 = product of:
      0.063288145 = sum of:
        0.063288145 = weight(_text_:theory in 950) [ClassicSimilarity], result of:
          0.063288145 = score(doc=950,freq=4.0), product of:
            0.16234003 = queryWeight, product of:
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.03903913 = queryNorm
            0.3898493 = fieldWeight in 950, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.1583924 = idf(docFreq=1878, maxDocs=44218)
              0.046875 = fieldNorm(doc=950)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

This paper presents a model that incorporates contemporary theories of tense and aspect and develops a new framework for extracting temporal relations between two sentence-internal events, given their tense, aspect, and a temporal connecting word relating the two events. A linguistic constraint on event combination has been implemented to detect incorrect parser analyses and potentially apply syntactic reanalysis or semantic reinterpretation - in preparation for subsequent processing for multi-document summarization. An important contribution of this work is the extension of two different existing theoretical frameworks - Hornstein's 1990 theory of tense analysis and Allen's 1984 theory on event ordering - and the combination of both into a unified system for representing and constraining combinations of different event types (points, closed intervals, and open-ended intervals). We show that our theoretical results have been verified in a large-scale corpus analysis. The framework is designed to inform a temporally motivated sentence-ordering module in an implemented multi-document summarization system.

Craven, T.C.: ¬A computer-aided abstracting tool kit (1993) 0.00

0.0046478845 = product of:
  0.027887305 = sum of:
    0.027887305 = product of:
      0.05577461 = sum of:
        0.05577461 = weight(_text_:methods in 6506) [ClassicSimilarity], result of:
          0.05577461 = score(doc=6506,freq=2.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.35535768 = fieldWeight in 6506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0625 = fieldNorm(doc=6506)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Abstract: Describes the abstracting assistance features being prototyped in the TEXNET text network management system. Sentence weighting methods include: weithing negatively or positively on the stems in a selected passage; weighting on general lists of cue words, adjusting weights of selected segments; and weighting of occurrence of frequent stems. The user may adjust a number of parameters: the minimum strength of extracts; the threshold for frequent word/stems and the amount sentence weight is to be adjusted for each weighting type

Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.00

0.0046478845 = product of:
  0.027887305 = sum of:
    0.027887305 = product of:
      0.05577461 = sum of:
        0.05577461 = weight(_text_:methods in 934) [ClassicSimilarity], result of:
          0.05577461 = score(doc=934,freq=2.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.35535768 = fieldWeight in 934, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0625 = fieldNorm(doc=934)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Abstract: Recent years have seen increased interest in text summarization with emphasis on evaluation of prototype systems. Many factors can affect the design of such evaluations, requiring choices among competing alternatives. This paper examines several major themes running through three evaluations: SUMMAC, NTCIR, and DUC, with a concentration on DUC. The themes are extrinsic and intrinsic evaluation, evaluation procedures and methods, generic versus focused summaries, single- and multi-document summaries, length and compression issues, extracts versus abstracts, and issues with genre.

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.00

0.0044477303 = product of:
  0.02668638 = sum of:
    0.02668638 = product of:
      0.05337276 = sum of:
        0.05337276 = weight(_text_:29 in 1949) [ClassicSimilarity], result of:
          0.05337276 = score(doc=1949,freq=2.0), product of:
            0.13732746 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03903913 = queryNorm
            0.38865322 = fieldWeight in 1949, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.078125 = fieldNorm(doc=1949)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 16. 8.1998 12:30:29

Lee, J.-H.; Park, S.; Ahn, C.-M.; Kim, D.: Automatic generic document summarization based on non-negative matrix factorization (2009) 0.00
```
0.004066899 = product of:
  0.024401393 = sum of:
    0.024401393 = product of:
      0.048802786 = sum of:
        0.048802786 = weight(_text_:methods in 2448) [ClassicSimilarity], result of:
          0.048802786 = score(doc=2448,freq=2.0), product of:
            0.15695344 = queryWeight, product of:
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.03903913 = queryNorm
            0.31093797 = fieldWeight in 2448, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.0204134 = idf(docFreq=2156, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2448)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)
```
Abstract

In existing unsupervised methods, Latent Semantic Analysis (LSA) is used for sentence selection. However, the obtained results are less meaningful, because singular vectors are used as the bases for sentence selection from given documents, and singular vector components can have negative values. We propose a new unsupervised method using Non-negative Matrix Factorization (NMF) to select sentences for automatic generic document summarization. The proposed method uses non-negative constraints, which are more similar to the human cognition process. As a result, the method selects more meaningful sentences for generic document summarization than those selected using LSA.

Craven, T.C.: ¬A phrase flipper for the assistance of writers of abstracts and other text (1995) 0.00

0.003558184 = product of:
  0.021349104 = sum of:
    0.021349104 = product of:
      0.04269821 = sum of:
        0.04269821 = weight(_text_:29 in 4897) [ClassicSimilarity], result of:
          0.04269821 = score(doc=4897,freq=2.0), product of:
            0.13732746 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03903913 = queryNorm
            0.31092256 = fieldWeight in 4897, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=4897)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 17. 8.1996 10:29:59

Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.00

0.0035261759 = product of:
  0.021157054 = sum of:
    0.021157054 = product of:
      0.04231411 = sum of:
        0.04231411 = weight(_text_:22 in 6599) [ClassicSimilarity], result of:
          0.04231411 = score(doc=6599,freq=2.0), product of:
            0.1367084 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03903913 = queryNorm
            0.30952093 = fieldWeight in 6599, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6599)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 26. 2.1997 10:22:43

Robin, J.; McKeown, K.: Empirically designing and evaluating a new revision-based model for summary generation (1996) 0.00

0.0035261759 = product of:
  0.021157054 = sum of:
    0.021157054 = product of:
      0.04231411 = sum of:
        0.04231411 = weight(_text_:22 in 6751) [ClassicSimilarity], result of:
          0.04231411 = score(doc=6751,freq=2.0), product of:
            0.1367084 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03903913 = queryNorm
            0.30952093 = fieldWeight in 6751, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6751)
      0.5 = coord(1/2)
  0.16666667 = coord(1/6)

Date: 6. 3.1997 16:22:15

Search (36 results, page 1 of 2)

Authors

Years

Languages

Types

Themes