Search (98 results, page 2 of 5)

  • Filter: language_ss:"e"
  • Filter: theme_ss:"Automatisches Abstracting"
  1. Bateman, J.; Teich, E.: Selective information presentation in an integrated publication system : an application of genre-driven text generation (1995) 0.00
    Score 6.7448284E-4 = coord(2/30) × [w("in") + w("s")] (Lucene ClassicSimilarity),
    where w(t) = (idf(t) × queryNorm) × (√termFreq(t) × idf(t) × fieldNorm):
    w("in") = (1.3602545 × 0.021569785) × (1.4142135 × 1.3602545 × 0.109375) = 0.0061733
    w("s")  = (1.0872376 × 0.021569785) × (1.4142135 × 1.0872376 × 0.109375) = 0.0039439
    Source
    Information processing and management. 31(1995) no.5, S.753-767
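The score breakdowns in this listing all follow Lucene's ClassicSimilarity (TF-IDF) formula shown above. As a minimal sketch, using only the constants from that breakdown (function and variable names are illustrative), result 1's score can be reproduced like this:

```python
import math

# Minimal sketch of Lucene ClassicSimilarity scoring, reproducing the
# breakdown shown for result 1 (doc 2928). Names are illustrative.
def classic_score(terms, overlap, max_overlap, query_norm, field_norm):
    total = 0.0
    for idf, freq in terms:
        query_weight = idf * query_norm           # idf(t) x queryNorm
        tf = math.sqrt(freq)                      # tf(t) = sqrt(termFreq)
        field_weight = tf * idf * field_norm      # tf x idf x fieldNorm
        total += query_weight * field_weight
    return (overlap / max_overlap) * total        # coord(2/30) x sum

# Terms "in" and "s", each with termFreq 2 in doc 2928:
terms = [(1.3602545, 2.0), (1.0872376, 2.0)]
print(classic_score(terms, 2, 30, 0.021569785, 0.109375))
# -> ~6.7448284e-04, matching the score above
```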
  2. Edmundson, H.P.; Wyllis, R.E.: Problems in automatic abstracting (1964) 0.00
    Score 6.7448284E-4 = coord(2/30) × [w("in", tf=2) 0.0061733 + w("s", tf=2) 0.0039439]; fieldNorm 0.109375
    Source
    Communications of the Association for Computing Machinery. 7(1964) no.1, S.259-263
  3. Johnson, F.: Automatic abstracting research (1995) 0.00
    Score 6.2059314E-4 = coord(2/30) × [w("in", tf=8) 0.0070552 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    Discusses the attraction for researchers of the prospect of automatically generating abstracts but notes that the promise of superseding the human effort has yet to be realized. Notes ways in which progress in automatic abstracting research may come about and suggests a shift in the aim from reproducing the conventional benefits of abstracts to accentuating the advantages to users of the computerized representation of information in large textual databases
    Source
    Library review. 44(1995) no.8, S.28-36
  4. Xiong, S.; Ji, D.: Query-focused multi-document summarization using hypergraph-based ranking (2016) 0.00
    Score 5.974731E-4 = coord(2/30) × [w("in", tf=8) 0.0061733 + w("s", tf=4) 0.0027888]; fieldNorm 0.0546875
    Abstract
    General graph-based random walks have been successfully applied to multi-document summarization, but processing documents this way has limitations. In this paper, we propose a novel hypergraph-based vertex-reinforced random walk framework for multi-document summarization. The framework first exploits the Hierarchical Dirichlet Process (HDP) topic model to learn a word-topic probability distribution over sentences. The hypergraph is then used to capture both cluster relationships, based on the word-topic probability distribution, and pairwise similarity among sentences. Finally, a time-variant random walk algorithm for hypergraphs is developed to rank sentences, ensuring sentence diversity in summaries through vertex reinforcement. Experimental results on a publicly available dataset demonstrate the effectiveness of our framework.
    Source
    Information processing and management. 52(2016) no.4, S.670-681
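The abstract above centres on a vertex-reinforced random walk. A much-reduced sketch of that ranking idea — a reinforced walk over plain pairwise sentence similarities, omitting the paper's hypergraph and HDP topic model entirely — might look like this:

```python
import numpy as np

# Much-simplified sketch of vertex-reinforced random-walk ranking:
# transition probabilities are reinforced by the current importance of
# the target sentence, so important sentences attract more probability
# mass. The paper's hypergraph and topic model are omitted.
def reinforced_rank(sim, damping=0.85, iters=100):
    n = sim.shape[0]
    score = np.full(n, 1.0 / n)
    for _ in range(iters):
        w = sim * score[np.newaxis, :]                   # reinforce by target score
        w = w / (w.sum(axis=1, keepdims=True) + 1e-12)   # row-stochastic
        score = (1 - damping) / n + damping * (w.T @ score)
        score = score / score.sum()
    return score

# Toy pairwise sentence-similarity matrix (symmetric, zero diagonal):
sim = np.array([[0, .5, .1], [.5, 0, .4], [.1, .4, 0]], dtype=float)
print(reinforced_rank(sim))  # higher score = better summary candidate
```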
  5. Soricut, R.; Marcu, D.: Abstractive headline generation using WIDL-expressions (2007) 0.00
    Score 5.9159653E-4 = coord(2/30) × [w("in", tf=10) 0.0069020 + w("s", tf=2) 0.0019720]; fieldNorm 0.0546875
    Abstract
    We present a new paradigm for the automatic creation of document headlines that is based on direct transformation of relevant textual information into well-formed textual output. Starting from an input document, we automatically create compact representations of weighted finite sets of strings, called WIDL-expressions, which encode the most important topics in the document. A generic natural language generation engine performs the headline generation task, driven by both statistical knowledge encapsulated in WIDL-expressions (representing topic biases induced by the input document) and statistical knowledge encapsulated in language models (representing biases induced by the target language). Our evaluation shows quality comparable to a state-of-the-art extractive approach to headline generation, and significant quality improvements over previously proposed solutions to abstractive headline generation.
    Source
    Information processing and management. 43(2007) no.6, S.1536-1548
  6. Atanassova, I.; Bertin, M.; Larivière, V.: On the composition of scientific abstracts (2016) 0.00
    Score 5.7375047E-4 = coord(2/30) × [w("in", tf=18) 0.0066143 + w("s", tf=4) 0.0019920]; fieldNorm 0.0390625
    Abstract
    Purpose - Scientific abstracts reproduce only part of the information and the complexity of argumentation in a scientific article. This paper provides a first analysis of the similarity between the text of scientific abstracts and the body of articles, using sentences as the basic textual unit. It contributes to the understanding of the structure of abstracts. Design/methodology/approach - Using sentence-based similarity metrics, the authors quantify the phenomenon of text re-use in abstracts and examine the positions of the sentences that are similar to sentences in abstracts within the introduction, methods, results and discussion structure, using a corpus of over 85,000 research articles published in the seven Public Library of Science journals. Findings - The authors provide evidence that 84 percent of abstracts have at least one sentence in common with the body of the paper. Studying the distributions of sentences in the body of the articles that are re-used in abstracts, the authors show that there exists a strong relation between the rhetorical structure of articles and the zones that authors re-use when writing abstracts, with sentences mainly coming from the beginning of the introduction and the end of the conclusion. Originality/value - Scientific abstracts contain what is considered by the author(s) to be the information that best describes the documents' content. This is a first study that examines the relation between the contents of abstracts and the rhetorical structure of scientific articles. The work might provide new insight for improving automatic abstracting tools as well as information retrieval approaches in which text organization and structure are important features.
    Source
    Journal of documentation. 72(2016) no.4, S.636-647
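A rough sketch of the sentence-level re-use measurement described above, assuming a simple token-overlap cosine similarity as a stand-in for the authors' metrics:

```python
# Rough sketch of sentence re-use detection between an abstract and the
# article body. Token-overlap cosine similarity is a stand-in for the
# authors' exact metrics; sentences are given as pre-split strings.
def cosine(a, b):
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / (len(ta) ** 0.5 * len(tb) ** 0.5)

def reuse_positions(abstract_sents, body_sents, threshold=0.6):
    """For each abstract sentence, report the most similar body sentence's
    relative position in the article (0 = start, 1 = end)."""
    hits = []
    for s in abstract_sents:
        sims = [cosine(s, b) for b in body_sents]
        best = max(range(len(sims)), key=sims.__getitem__)
        if sims[best] >= threshold:
            hits.append((s, best / max(len(body_sents) - 1, 1), sims[best]))
    return hits

body = ["We propose a new method.", "Results show gains.",
        "We conclude with future work."]
abstract = ["We propose a new method.", "The paper concludes with future work."]
print(reuse_positions(abstract, body, threshold=0.5))
# positions 0.0 and 1.0: re-use from the beginning and the end, as the paper finds
```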
  7. Ahmad, K.: Text summarisation : the role of lexical cohesion analysis (1995) 0.00
    Score 5.575784E-4 = coord(2/30) × [w("in", tf=6) 0.0061100 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    Work on automatic text summarisation focuses mainly on computational models of texts. The artificial intelligence work related to text summarisation deals mainly with narrative texts such as newspaper reports and stories. Presents a study on the summarisation of non-narrative texts such as those in scientific and technical communication. Discusses syntactic cohesion; lexical cohesion; complex lexical repetition; simple and complex paraphrase; bonds and links; and Tele-pattan, an architecture for a cohesion-based text analysis and summarisation system working on SGML.
    Source
    New review of document and text management. 1995, no.1, S.321-335
  8. Ouyang, Y.; Li, W.; Li, S.; Lu, Q.: Intertopic information mining for query-based summarization (2010) 0.00
    Score 5.48532E-4 = coord(2/30) × [w("in", tf=16) 0.0062360 + w("s", tf=4) 0.0019920]; fieldNorm 0.0390625
    Abstract
    In this article, the authors address the problem of sentence ranking in summarization. Although most existing summarization approaches are concerned with the information embodied in a particular topic (including a set of documents and an associated query) for sentence ranking, the authors propose a novel ranking approach that incorporates intertopic information mining. Intertopic information, in contrast to intratopic information, is able to reveal pairwise topic relationships and thus can be considered a bridge across different topics. In this article, the intertopic information is used for transferring word importance learned from known topics to unknown topics under a learning-based summarization framework. To mine this information, the authors model the topic relationship by clustering all the words in both known and unknown topics according to various kinds of word conceptual labels, which indicate the roles of the words in the topic. Based on the mined relationships, the authors develop a probabilistic model using manually generated summaries provided for known topics to predict ranking scores for sentences in unknown topics. A series of experiments has been conducted on the Document Understanding Conference (DUC) 2006 data set. The evaluation results show that intertopic information is indeed effective for sentence ranking and that the resultant summarization system performs comparably to the best-performing DUC participating systems on the same data set.
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.5, S.1062-1072
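A heavily simplified sketch of the transfer idea in the abstract above: word importance observed in manual summaries of known topics is averaged per word cluster and then used to score sentences in an unknown topic. The word-to-cluster map here stands in for the paper's conceptual-label clustering:

```python
from collections import defaultdict

# Heavily simplified intertopic transfer: words seen in manual summaries
# of known topics get importance 1, others 0; a word in an unknown topic
# inherits the mean importance of its cluster.
def transfer_scores(summary_words, all_known_words, cluster_of):
    clusters = defaultdict(list)
    for w in all_known_words:
        clusters[cluster_of[w]].append(1.0 if w in summary_words else 0.0)
    return {c: sum(v) / len(v) for c, v in clusters.items()}

def rank_sentences(sentences, cluster_of, cluster_score):
    def score(sent):
        ws = sent.lower().split()
        return sum(cluster_score.get(cluster_of.get(w), 0.0) for w in ws) / max(len(ws), 1)
    return sorted(sentences, key=score, reverse=True)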
  9. Moens, M.-F.: Summarizing court decisions (2007) 0.00
    Score 5.43019E-4 = coord(2/30) × [w("in", tf=8) 0.0061733 + w("s", tf=2) 0.0019720]; fieldNorm 0.0546875
    Abstract
    In the field of law there is an absolute need for summarizing the texts of court decisions in order to make the content of the cases easily accessible for legal professionals. During the SALOMON and MOSAIC projects we investigated the summarization and retrieval of legal cases. This article presents some of the main findings while integrating the research results of experiments on legal document summarization by other research groups. In addition, we propose novel avenues of research for automatic text summarization, which we currently exploit when summarizing court decisions in the ACILA project. Techniques for automated concept learning and argument recognition are the most challenging here.
    Source
    Information processing and management. 43(2007) no.6, S.1748-1764
  10. Moens, M.-F.; Uyttendaele, C.; Dumotier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 0.00
    Score 5.0708273E-4 = coord(2/30) × [w("in", tf=10) 0.0059160 + w("s", tf=2) 0.0016903]; fieldNorm 0.046875
    Abstract
    The SALOMON project automatically summarizes Belgian criminal cases in order to improve access to the large number of existing and future court decisions. SALOMON extracts text units from the case text to form a case summary. Such a case summary facilitates the rapid determination of the relevance of the case or may be employed in text search. An important part of the research concerns the development of techniques for automatic recognition of representative text paragraphs (or sentences) in texts of unrestricted domains. These techniques are employed to eliminate redundant material in the case texts and to identify informative text paragraphs which are relevant to include in the case summary. An evaluation on a test set of 700 criminal cases demonstrates that the algorithms have application potential for automatic indexing, abstracting, and text linkage.
    Source
    Journal of the American Society for Information Science. 50(1999) no.2, S.151-161
  11. Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.00
    Score 4.928343E-4 = coord(2/30) × [w("in", tf=12) 0.0054005 + w("s", tf=4) 0.0019920]; fieldNorm 0.0390625
    Abstract
    The paper presents a study investigating the effects of incorporating novelty detection in automatic text summarisation. By condensing a textual document, automatic text summarisation can reduce the need to refer to the source document. It also offers a means to deliver device-friendly content when accessing information in non-traditional environments. An effective method of summarisation could be to produce a summary that includes only novel information. However, focusing exclusively on novel parts may result in a loss of context, which may have an impact on the correct interpretation of the summary with respect to the source document. In this study we compare two strategies to produce summaries that incorporate novelty in different ways: a constant-length summary, which contains only novel sentences, and an incremental summary, containing additional sentences that provide context. The aim is to establish whether a summary that contains only novel sentences provides a sufficient basis to determine the relevance of a document, or if indeed we need to include additional sentences to provide context. Findings from the study suggest that there is only a minimal difference in performance for the tasks we set our users and that the presence of contextual information is not so important. However, for the case of mobile information access, a summary that contains only novel information does offer benefits, given bandwidth constraints.
    Source
    Information processing and management. 44(2008) no.2, S.663-686
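A minimal sketch of the constant-length, novelty-only strategy compared in this study: sentences are kept only if they are sufficiently dissimilar from everything already selected. Token-overlap (Jaccard) similarity is a stand-in for the authors' novelty detector:

```python
# Minimal sketch of novelty-filtered extraction: walk the document in
# order and keep a sentence only if it is dissimilar enough to everything
# already selected.
def jaccard(a, b):
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

def novelty_summary(sentences, max_sents=5, max_sim=0.4):
    picked = []
    for s in sentences:
        if all(jaccard(s, p) < max_sim for p in picked):
            picked.append(s)
        if len(picked) == max_sents:
            break
    return picked
```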
  12. Saggion, H.; Lapalme, G.: Selective analysis for the automatic generation of summaries (2000) 0.00
    Score 4.8788113E-4 = coord(2/30) × [w("in", tf=6) 0.0053463 + w("s", tf=2) 0.0019720]; fieldNorm 0.0546875
    Abstract
    Selective Analysis is a new method for text summarization of technical articles whose design is based on the study of a corpus of professional abstracts and technical documents. The method emphasizes the selection of particular types of information and their elaboration, exploring the issue of dynamic summarization. A computer prototype was developed to demonstrate the viability of the approach, and the automatic abstracts were evaluated using human informants. The results so far obtained indicate that the summaries are acceptable in content and text quality.
    Pages
    S.176-181
    Series
    Advances in knowledge organization; vol.7
    Source
    Dynamism and stability in knowledge organization: Proceedings of the 6th International ISKO-Conference, 10-13 July 2000, Toronto, Canada. Ed.: C. Beghtol et al
  13. Moens, M.F.; Dumortier, J.: Use of a text grammar for generating highlight abstracts of magazine articles (2000) 0.00
    Score 4.8788113E-4 = coord(2/30) × [w("in", tf=6) 0.0053463 + w("s", tf=2) 0.0019720]; fieldNorm 0.0546875
    Abstract
    Browsing a database of article abstracts is one way to select and buy relevant magazine articles online. Our research contributes to the design and development of text grammars for abstracting texts in unlimited subject domains. We developed a system that parses texts based on the text grammar of a specific text type and that extracts sentences and statements which are relevant for inclusion in the abstracts. The system employs knowledge of the discourse patterns that are typical of news stories. The results are encouraging and demonstrate the importance of discourse structures in text summarisation.
    Source
    Journal of documentation. 56(2000) no.5, S.520-539
  14. Ercan, G.; Cicekli, I.: Using lexical chains for keyword extraction (2007) 0.00
    Score 4.8788113E-4 = coord(2/30) × [w("in", tf=6) 0.0053463 + w("s", tf=2) 0.0019720]; fieldNorm 0.0546875
    Abstract
    Keywords can be considered condensed versions of documents and short forms of their summaries. In this paper, the problem of automatic extraction of keywords from documents is treated as a supervised learning task. A lexical chain holds a set of semantically related words of a text, and it can be said that a lexical chain represents the semantic content of a portion of the text. Although lexical chains have been extensively used in text summarization, their use for the keyword extraction problem has not been fully investigated. In this paper, a keyword extraction technique that uses lexical chains is described, and encouraging results are obtained.
    Source
    Information processing and management. 43(2007) no.6, S.1705-1714
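A rough sketch of lexical chaining with WordNet (via NLTK; assumes `pip install nltk` and `nltk.download('wordnet')` have been run). Words join a chain when they share a synset or a direct hypernym with a chain member, and chain size then serves as a crude keyword score; the paper's chaining and scoring are more elaborate:

```python
# Rough sketch of lexical chaining over nouns using WordNet relatedness.
from nltk.corpus import wordnet as wn

def related(w1, w2):
    s1 = set(wn.synsets(w1, pos=wn.NOUN))
    s2 = set(wn.synsets(w2, pos=wn.NOUN))
    h1 = {h for s in s1 for h in s.hypernyms()}
    h2 = {h for s in s2 for h in s.hypernyms()}
    # related if the words share a sense, or a sense/hypernym overlap
    return bool(s1 & s2 or s1 & h2 or s2 & h1 or h1 & h2)

def lexical_chains(words):
    chains = []
    for w in words:
        for chain in chains:
            if any(related(w, member) for member in chain):
                chain.append(w)
                break
        else:
            chains.append([w])
    return sorted(chains, key=len, reverse=True)  # longest chain ~ key topic
```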
  15. Craven, T.C.: ¬A computer-aided abstracting tool kit (1993) 0.00
    Score 4.8283124E-4 = coord(2/30) × [w("in", tf=4) 0.0049888 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    Describes the abstracting assistance features being prototyped in the TEXNET text network management system. Sentence weighting methods include: weighting negatively or positively on the stems in a selected passage; weighting on general lists of cue words; adjusting weights of selected segments; and weighting on the occurrence of frequent stems. The user may adjust a number of parameters: the minimum strength of extracts; the threshold for frequent words/stems; and the amount by which the sentence weight is adjusted for each weighting type.
    Source
    Canadian journal of information and library science. 18(1993) no.2, S.20-31
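An illustrative sketch of the kinds of weighting parameters described above; the cue lists and the crude suffix-stripping stemmer are placeholders, not TEXNET's actual data:

```python
from collections import Counter
import re

# Illustrative sentence weighting: bonus/stigma cue words plus frequent
# stems, with user-adjustable weights; extraction then applies a minimum
# strength threshold, as the abstract describes.
BONUS_CUES = {"significant", "conclude", "results", "propose"}
STIGMA_CUES = {"perhaps", "possibly", "hardly"}

def stem(word):
    return re.sub(r"(ing|ed|es|s)$", "", word.lower())  # crude placeholder stemmer

def weight_sentences(sentences, top_stems=10, cue_weight=1.0, stem_weight=0.5):
    freq = Counter(stem(w) for s in sentences for w in s.split())
    frequent = {w for w, _ in freq.most_common(top_stems)}
    scores = []
    for s in sentences:
        words = s.split()
        score = sum(cue_weight for w in words if w.lower() in BONUS_CUES)
        score -= sum(cue_weight for w in words if w.lower() in STIGMA_CUES)
        score += sum(stem_weight for w in words if stem(w) in frequent)
        scores.append(score / max(len(words), 1))
    return scores  # extract sentences whose score exceeds a chosen threshold
```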
  16. Craven, T.C.: ¬A phrase flipper for the assistance of writers of abstracts and other text (1995) 0.00
    Score 4.8283124E-4 = coord(2/30) × [w("in", tf=4) 0.0049888 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    Describes computerized tools for computer assisted abstracting. FlipPhr is a Microsoft Windows application program that rearranges (flips) phrases or other expressions in accordance with rules in a grammar. The flipping may be invoked with a single keystroke from within various Windows application programs that allow cutting and pasting of text. The user may modify the grammar to provide for different kinds of flipping
    Source
    Canadian journal of information and library science. 20(1995) nos.3/4, S.41-49
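A toy sketch of grammar-driven phrase flipping in the spirit of FlipPhr: each rule is a pattern/replacement pair over the selected text, and the rule set is meant to be user-editable. The rules below are invented examples, not FlipPhr's grammar:

```python
import re

# Toy grammar-driven phrase flipper: the first matching rule rearranges
# the phrase; the rule list is the user-modifiable "grammar".
RULES = [
    (r"^(\w+) of (\w+)$", r"\2 \1"),   # "extraction of keywords" -> "keywords extraction"
    (r"^(\w+) for (\w+)$", r"\2 \1"),  # "tool for abstracting" -> "abstracting tool"
]

def flip(phrase):
    for pattern, repl in RULES:
        if re.match(pattern, phrase):
            return re.sub(pattern, repl, phrase)
    return phrase  # no rule applies; leave unchanged

print(flip("extraction of keywords"))  # -> "keywords extraction"
```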
  17. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.00
    Score 4.8283124E-4 = coord(2/30) × [w("in", tf=4) 0.0049888 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    Recent years have seen increased interest in text summarization with emphasis on evaluation of prototype systems. Many factors can affect the design of such evaluations, requiring choices among competing alternatives. This paper examines several major themes running through three evaluations: SUMMAC, NTCIR, and DUC, with a concentration on DUC. The themes are extrinsic and intrinsic evaluation, evaluation procedures and methods, generic versus focused summaries, single- and multi-document summaries, length and compression issues, extracts versus abstracts, and issues with genre.
    Source
    Information processing and management. 43(2007) no.6, S.1506-1520
  18. Gomez, J.; Allen, K.; Matney, M.; Awopetu, T.; Shafer, S.: Experimenting with a machine generated annotations pipeline (2020) 0.00
    Score 4.8283124E-4 = coord(2/30) × [w("in", tf=4) 0.0049888 + w("s", tf=2) 0.0022537]; fieldNorm 0.0625
    Abstract
    The UCLA Library reorganized its software developers into focused subteams with one, the Labs Team, dedicated to conducting experiments. In this article we describe our first attempt at conducting a software development experiment, in which we tried to improve our digital library's search results with metadata from cloud-based image-tagging services. We explore the findings and discuss the lessons learned from our first attempt at running an experiment.
  19. Wei, F.; Li, W.; Lu, Q.; He, Y.: Applying two-level reinforcement ranking in query-oriented multidocument summarization (2009) 0.00
    Score 4.8278586E-4 = coord(2/30) × [w("in", tf=14) 0.0058332 + w("s", tf=2) 0.0014085]; fieldNorm 0.0390625
    Abstract
    Sentence ranking is the issue of most concern in document summarization today. While traditional feature-based approaches evaluate sentence significance and rank the sentences relying on features that are particularly designed to characterize the different aspects of the individual sentences, the newly emerging graph-based ranking algorithms (such as the PageRank-like algorithms) recursively compute sentence significance using the global information in a text graph that links sentences together. In general, the existing PageRank-like algorithms can model well the phenomenon that a sentence is important if it is linked by many other important sentences; that is, they are capable of modeling the mutual reinforcement among the sentences in the text graph. However, when dealing with multidocument summarization these algorithms often assemble a set of documents into one large file, and the document dimension is totally ignored. In this article we present a framework to model the two-level mutual reinforcement among sentences as well as documents. Under this framework we design and develop a novel ranking algorithm such that document reinforcement is taken into account in the process of sentence ranking. The convergence issue is examined. We also explore an interesting and important property of the proposed algorithm. When evaluated on the DUC 2005 and 2006 query-oriented multidocument summarization datasets, significant results are achieved.
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.10, S.2119-2131
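A simplified sketch of the two-level idea in the abstract above: sentence scores and document scores reinforce each other, so sentence links are weighted by the importance of the documents involved. The paper's algorithm and its convergence analysis are richer than this:

```python
import numpy as np

# Simplified two-level mutual reinforcement. A is the sentence-sentence
# similarity matrix; M[i, d] = 1 if sentence i belongs to document d.
# Documents inherit scores from their sentences, and sentence links are
# boosted by their document's score on the next pass.
def two_level_rank(A, M, damping=0.85, iters=100):
    n = M.shape[0]
    s = np.full(n, 1.0 / n)
    for _ in range(iters):
        d = M.T @ s                            # document scores from sentences
        d = d / d.sum()
        boost = M @ d                          # each sentence's document score
        W = A * boost[np.newaxis, :]           # document-weighted sentence links
        W = W / (W.sum(axis=1, keepdims=True) + 1e-12)
        s = (1 - damping) / n + damping * (W.T @ s)
        s = s / s.sum()
    return s
```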
  20. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.00
    Score 4.6544487E-4 = coord(2/30) × [w("in", tf=8) 0.0052914 + w("s", tf=2) 0.0016903]; fieldNorm 0.046875
    Abstract
    The SALOMON system automatically summarizes Belgian criminal cases in order to improve access to the large number of existing and future court decisions. SALOMON extracts relevant text units from the case text to form a case summary. Such a case profile facilitates the rapid determination of the relevance of the case or may be employed in text search. In a first important abstracting step SALOMON performs an initial categorization of legal criminal cases and structures the case text into separate legally relevant and irrelevant components. A text grammar represented as a semantic network is used to automatically determine the category of the case and its components, extract general data from the case, and identify text portions relevant for further abstracting. Prior knowledge of the text structure and its indicative cues may support automatic abstracting. A text grammar is a promising form for representing the knowledge involved.
    Source
    Information processing and management. 33(1997) no.6, S.727-737
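A toy sketch of the first abstracting step described above: assigning case-text segments to legally relevant categories from indicative cues. The cue patterns are invented stand-ins for SALOMON's text grammar:

```python
import re

# Toy cue-based categorization of court-decision segments; segments that
# match no category are treated as legally irrelevant, as in the abstract.
CATEGORY_CUES = {
    "charges":   re.compile(r"\b(charged with|accused of)\b", re.I),
    "reasoning": re.compile(r"\b(whereas|considering that)\b", re.I),
    "verdict":   re.compile(r"\b(sentences?|acquits?|convicts?)\b", re.I),
}

def categorize_segments(segments):
    labelled = []
    for seg in segments:
        for category, cue in CATEGORY_CUES.items():
            if cue.search(seg):
                labelled.append((category, seg))
                break
        else:
            labelled.append(("irrelevant", seg))  # excluded from the summary
    return labelled
```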

Types

  • a 92
  • m 4
  • s 2
  • el 1
  • r 1