Search (23 results, page 1 of 2)

  • × theme_ss:"Automatisches Abstracting"
  1. Craven, T.C.: Abstracts produced using computer assistance (2000) 0.02
    0.01884501 = product of:
      0.03769002 = sum of:
        0.03769002 = product of:
          0.15076008 = sum of:
            0.15076008 = weight(_text_:author's in 4809) [ClassicSimilarity], result of:
              0.15076008 = score(doc=4809,freq=2.0), product of:
                0.338416 = queryWeight, product of:
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.050358377 = queryNorm
                0.44548744 = fieldWeight in 4809, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4809)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Abstract
    Experimental subjects wrote abstracts using a simplified version of the TEXNET abstracting assistance software. In addition to the full text, subjects were presented with either keywords or phrases extracted automatically. The resulting abstracts, and the times taken, were recorded automatically; some additional information was gathered by oral questionnaire. Selected abstracts produced were evaluated on various criteria by independent raters. Results showed considerable variation among subjects, but 37% found the keywords or phrases 'quite' or 'very' useful in writing their abstracts. Statistical analysis failed to support several hypothesized relations: phrases were not viewed as significantly more helpful than keywords; and abstracting experience did not correlate with originality of wording, approximation of the author abstract, or greater conciseness. Requiring further study are some unanticipated strong correlations including the following: Windows experience and writing an abstract like the author's; experience reading abstracts and thinking one had written a good abstract; gender and abstract length; gender and use of words and phrases from the original text. Results have also suggested possible modifications to the TEXNET software
  2. Wang, W.; Hwang, D.: Abstraction Assistant : an automatic text abstraction system (2010) 0.02
    0.01884501 = product of:
      0.03769002 = sum of:
        0.03769002 = product of:
          0.15076008 = sum of:
            0.15076008 = weight(_text_:author's in 3981) [ClassicSimilarity], result of:
              0.15076008 = score(doc=3981,freq=2.0), product of:
                0.338416 = queryWeight, product of:
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.050358377 = queryNorm
                0.44548744 = fieldWeight in 3981, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3981)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Abstract
    In the interest of standardization and quality assurance, it is desirable for authors and staff of access services to follow the American National Standards Institute (ANSI) guidelines in preparing abstracts. Using the statistical approach an extraction system (the Abstraction Assistant) was developed to generate informative abstracts to meet the ANSI guidelines for structural content elements. The system performance is evaluated by comparing the system-generated abstracts with the author's original abstracts and the manually enhanced system abstracts on three criteria: balance (satisfaction of the ANSI standards), fluency (text coherence), and understandability (clarity). The results suggest that it is possible to use the system output directly without manual modification, but there are issues that need to be addressed in further studies to make the system a better tool.
  3. Johnson, F.: Automatic abstracting research (1995) 0.02
    0.017678065 = product of:
      0.03535613 = sum of:
        0.03535613 = product of:
          0.07071226 = sum of:
            0.07071226 = weight(_text_:f in 3847) [ClassicSimilarity], result of:
              0.07071226 = score(doc=3847,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.35229704 = fieldWeight in 3847, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3847)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Harabagiu, S.; Hickl, A.; Lacatusu, F.: Satisfying information needs with multi-document summaries (2007) 0.02
    0.017678065 = product of:
      0.03535613 = sum of:
        0.03535613 = product of:
          0.07071226 = sum of:
            0.07071226 = weight(_text_:f in 939) [ClassicSimilarity], result of:
              0.07071226 = score(doc=939,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.35229704 = fieldWeight in 939, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0625 = fieldNorm(doc=939)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Reeve, L.H.; Han, H.; Brooks, A.D.: ¬The use of domain-specific concepts in biomedical text summarization (2007) 0.02
    0.015704174 = product of:
      0.031408347 = sum of:
        0.031408347 = product of:
          0.12563339 = sum of:
            0.12563339 = weight(_text_:author's in 955) [ClassicSimilarity], result of:
              0.12563339 = score(doc=955,freq=2.0), product of:
                0.338416 = queryWeight, product of:
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.050358377 = queryNorm
                0.3712395 = fieldWeight in 955, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.7201533 = idf(docFreq=144, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=955)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Abstract
    Text summarization is a method for data reduction. The use of text summarization enables users to reduce the amount of text that must be read while still assimilating the core information. The data reduction offered by text summarization is particularly useful in the biomedical domain, where physicians must continuously find clinical trial study information to incorporate into their patient treatment efforts. Such efforts are often hampered by the high-volume of publications. This paper presents two independent methods (BioChain and FreqDist) for identifying salient sentences in biomedical texts using concepts derived from domain-specific resources. Our semantic-based method (BioChain) is effective at identifying thematic sentences, while our frequency-distribution method (FreqDist) removes information redundancy. The two methods are then combined to form a hybrid method (ChainFreq). An evaluation of each method is performed using the ROUGE system to compare system-generated summaries against a set of manually-generated summaries. The BioChain and FreqDist methods outperform some common summarization systems, while the ChainFreq method improves upon the base approaches. Our work shows that the best performance is achieved when the two methods are combined. The paper also presents a brief physician's evaluation of three randomly-selected papers from an evaluation corpus to show that the author's abstract does not always reflect the entire contents of the full-text.
  6. Uyttendaele, C.; Moens, M.-F.; Dumortier, J.: SALOMON: automatic abstracting of legal cases for effective access to court decisions (1998) 0.02
    0.015468306 = product of:
      0.030936612 = sum of:
        0.030936612 = product of:
          0.061873224 = sum of:
            0.061873224 = weight(_text_:f in 495) [ClassicSimilarity], result of:
              0.061873224 = score(doc=495,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.3082599 = fieldWeight in 495, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=495)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  7. Moens, M.-F.: Summarizing court decisions (2007) 0.02
    0.015468306 = product of:
      0.030936612 = sum of:
        0.030936612 = product of:
          0.061873224 = sum of:
            0.061873224 = weight(_text_:f in 954) [ClassicSimilarity], result of:
              0.061873224 = score(doc=954,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.3082599 = fieldWeight in 954, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=954)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  8. Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.01
    0.013645729 = product of:
      0.027291458 = sum of:
        0.027291458 = product of:
          0.054582916 = sum of:
            0.054582916 = weight(_text_:22 in 6599) [ClassicSimilarity], result of:
              0.054582916 = score(doc=6599,freq=2.0), product of:
                0.17634645 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050358377 = queryNorm
                0.30952093 = fieldWeight in 6599, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6599)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    26. 2.1997 10:22:43
  9. Robin, J.; McKeown, K.: Empirically designing and evaluating a new revision-based model for summary generation (1996) 0.01
    0.013645729 = product of:
      0.027291458 = sum of:
        0.027291458 = product of:
          0.054582916 = sum of:
            0.054582916 = weight(_text_:22 in 6751) [ClassicSimilarity], result of:
              0.054582916 = score(doc=6751,freq=2.0), product of:
                0.17634645 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050358377 = queryNorm
                0.30952093 = fieldWeight in 6751, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6751)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 3.1997 16:22:15
  10. Jones, P.A.; Bradbeer, P.V.G.: Discovery of optimal weights in a concept selection system (1996) 0.01
    0.013645729 = product of:
      0.027291458 = sum of:
        0.027291458 = product of:
          0.054582916 = sum of:
            0.054582916 = weight(_text_:22 in 6974) [ClassicSimilarity], result of:
              0.054582916 = score(doc=6974,freq=2.0), product of:
                0.17634645 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050358377 = queryNorm
                0.30952093 = fieldWeight in 6974, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6974)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon
  11. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.01
    0.0132585475 = product of:
      0.026517095 = sum of:
        0.026517095 = product of:
          0.05303419 = sum of:
            0.05303419 = weight(_text_:f in 2256) [ClassicSimilarity], result of:
              0.05303419 = score(doc=2256,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.26422277 = fieldWeight in 2256, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2256)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Moens, M.-F.; Uyttendaele, C.; Dumotier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 0.01
    0.0132585475 = product of:
      0.026517095 = sum of:
        0.026517095 = product of:
          0.05303419 = sum of:
            0.05303419 = weight(_text_:f in 2944) [ClassicSimilarity], result of:
              0.05303419 = score(doc=2944,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.26422277 = fieldWeight in 2944, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2944)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  13. Liang, S.-F.; Devlin, S.; Tait, J.: Investigating sentence weighting components for automatic summarisation (2007) 0.01
    0.0132585475 = product of:
      0.026517095 = sum of:
        0.026517095 = product of:
          0.05303419 = sum of:
            0.05303419 = weight(_text_:f in 899) [ClassicSimilarity], result of:
              0.05303419 = score(doc=899,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.26422277 = fieldWeight in 899, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=899)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  14. Martinez-Romo, J.; Araujo, L.; Fernandez, A.D.: SemGraph : extracting keyphrases following a novel semantic graph-based approach (2016) 0.01
    0.0132585475 = product of:
      0.026517095 = sum of:
        0.026517095 = product of:
          0.05303419 = sum of:
            0.05303419 = weight(_text_:f in 2832) [ClassicSimilarity], result of:
              0.05303419 = score(doc=2832,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.26422277 = fieldWeight in 2832, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2832)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Keyphrases represent the main topics a text is about. In this article, we introduce SemGraph, an unsupervised algorithm for extracting keyphrases from a collection of texts based on a semantic relationship graph. The main novelty of this algorithm is its ability to identify semantic relationships between words whose presence is statistically significant. Our method constructs a co-occurrence graph in which words appearing in the same document are linked, provided their presence in the collection is statistically significant with respect to a null model. Furthermore, the graph obtained is enriched with information from WordNet. We have used the most recent and standardized benchmark to evaluate the system ability to detect the keyphrases that are part of the text. The result is a method that achieves an improvement of 5.3% and 7.28% in F measure over the two labeled sets of keyphrases used in the evaluation of SemEval-2010.
  15. Yeh, J.-Y.; Ke, H.-R.; Yang, W.-P.; Meng, I.-H.: Text summarization using a trainable summarizer and latent semantic analysis (2005) 0.01
    0.01104879 = product of:
      0.02209758 = sum of:
        0.02209758 = product of:
          0.04419516 = sum of:
            0.04419516 = weight(_text_:f in 1003) [ClassicSimilarity], result of:
              0.04419516 = score(doc=1003,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.22018565 = fieldWeight in 1003, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1003)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This paper proposes two approaches to address text summarization: modified corpus-based approach (MCBA) and LSA-based T.R.M. approach (LSA + T.R.M.). The first is a trainable summarizer, which takes into account several features, including position, positive keyword, negative keyword, centrality, and the resemblance to the title, to generate summaries. Two new ideas are exploited: (1) sentence positions are ranked to emphasize the significances of different sentence positions, and (2) the score function is trained by the genetic algorithm (GA) to obtain a suitable combination of feature weights. The second uses latent semantic analysis (LSA) to derive the semantic matrix of a document or a corpus and uses semantic sentence representation to construct a semantic text relationship map. We evaluate LSA + T.R.M. both with single documents and at the corpus level to investigate the competence of LSA in text summarization. The two novel approaches were measured at several compression rates on a data corpus composed of 100 political articles. When the compression rate was 30%, an average f-measure of 49% for MCBA, 52% for MCBA + GA, 44% and 40% for LSA + T.R.M. in single-document and corpus level were achieved respectively.
  16. Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.01
    0.01104879 = product of:
      0.02209758 = sum of:
        0.02209758 = product of:
          0.04419516 = sum of:
            0.04419516 = weight(_text_:f in 2054) [ClassicSimilarity], result of:
              0.04419516 = score(doc=2054,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.22018565 = fieldWeight in 2054, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2054)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  17. Wei, F.; Li, W.; Lu, Q.; He, Y.: Applying two-level reinforcement ranking in query-oriented multidocument summarization (2009) 0.01
    0.01104879 = product of:
      0.02209758 = sum of:
        0.02209758 = product of:
          0.04419516 = sum of:
            0.04419516 = weight(_text_:f in 3120) [ClassicSimilarity], result of:
              0.04419516 = score(doc=3120,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.22018565 = fieldWeight in 3120, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3120)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Galgani, F.; Compton, P.; Hoffmann, A.: Summarization based on bi-directional citation analysis (2015) 0.01
    0.01104879 = product of:
      0.02209758 = sum of:
        0.02209758 = product of:
          0.04419516 = sum of:
            0.04419516 = weight(_text_:f in 2685) [ClassicSimilarity], result of:
              0.04419516 = score(doc=2685,freq=2.0), product of:
                0.20071772 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.050358377 = queryNorm
                0.22018565 = fieldWeight in 2685, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2685)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  19. Vanderwende, L.; Suzuki, H.; Brockett, J.M.; Nenkova, A.: Beyond SumBasic : task-focused summarization with sentence simplification and lexical expansion (2007) 0.01
    0.010234296 = product of:
      0.020468593 = sum of:
        0.020468593 = product of:
          0.040937185 = sum of:
            0.040937185 = weight(_text_:22 in 948) [ClassicSimilarity], result of:
              0.040937185 = score(doc=948,freq=2.0), product of:
                0.17634645 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050358377 = queryNorm
                0.23214069 = fieldWeight in 948, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=948)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In recent years, there has been increased interest in topic-focused multi-document summarization. In this task, automatic summaries are produced in response to a specific information request, or topic, stated by the user. The system we have designed to accomplish this task comprises four main components: a generic extractive summarization system, a topic-focusing component, sentence simplification, and lexical expansion of topic words. This paper details each of these components, together with experiments designed to quantify their individual contributions. We include an analysis of our results on two large datasets commonly used to evaluate task-focused summarization, the DUC2005 and DUC2006 datasets, using automatic metrics. Additionally, we include an analysis of our results on the DUC2006 task according to human evaluation metrics. In the human evaluation of system summaries compared to human summaries, i.e., the Pyramid method, our system ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, our system ranked third out of 35 systems.
  20. Wu, Y.-f.B.; Li, Q.; Bot, R.S.; Chen, X.: Finding nuggets in documents : a machine learning approach (2006) 0.01
    0.008528581 = product of:
      0.017057162 = sum of:
        0.017057162 = product of:
          0.034114324 = sum of:
            0.034114324 = weight(_text_:22 in 5290) [ClassicSimilarity], result of:
              0.034114324 = score(doc=5290,freq=2.0), product of:
                0.17634645 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050358377 = queryNorm
                0.19345059 = fieldWeight in 5290, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5290)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 17:25:48