Document (#11419)

Author
Maybury, M.T.
Title
Generating summaries from event data
Source
Information processing and management. 31(1995) no.5, S.735-751
Year
1995
Abstract
Summarization entails analysis of source material, selection of key information, condensation of this, and generation of a compct summary form. While there habe been many investigations into the automatic summarization of text, relatively little attention has been given to the summarization of information from structured information sources such as data of knowledge bases, despite this being a desirable capability for a number of application areas including report generation from databases (e.g. weather, financial, medical) and simulation (e.g. military, manufacturing, aconomic). After a brief introduction indicating the main elements of summarization and referring to some illustrative approaches to it, considers pecific issues in the generation of text summaries of event data, describes a system, SumGen, which selects key information from an event database by reasoning about event frequencies, frequencies of relations between events, and domain specific importance measures. Describes how Sum Gen then aggregates similar information and plans a summary presentations tailored to stereotypical users
Theme
Automatisches Abstracting

Similar documents (content)

  1. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.28
    0.28489038 = sum of:
      0.28489038 = product of:
        1.1870433 = sum of:
          0.012603358 = weight(abstract_txt:from in 1783) [ClassicSimilarity], result of:
            0.012603358 = score(doc=1783,freq=1.0), product of:
              0.07188087 = queryWeight, product of:
                1.5355401 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.01668627 = queryNorm
              0.17533675 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.01794448 = weight(abstract_txt:information in 1783) [ClassicSimilarity], result of:
            0.01794448 = score(doc=1783,freq=3.0), product of:
              0.06794737 = queryWeight, product of:
                1.6691518 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.01668627 = queryNorm
              0.2640938 = fieldWeight in 1783, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.07765136 = weight(abstract_txt:summary in 1783) [ClassicSimilarity], result of:
            0.07765136 = score(doc=1783,freq=1.0), product of:
              0.19173962 = queryWeight, product of:
                1.7733539 = boost
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.01668627 = queryNorm
              0.40498337 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.24080658 = weight(abstract_txt:summaries in 1783) [ClassicSimilarity], result of:
            0.24080658 = score(doc=1783,freq=6.0), product of:
              0.22439213 = queryWeight, product of:
                1.9184183 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.01668627 = queryNorm
              1.0731508 = fieldWeight in 1783, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.5408245 = weight(abstract_txt:event in 1783) [ClassicSimilarity], result of:
            0.5408245 = score(doc=1783,freq=7.0), product of:
              0.4605683 = queryWeight, product of:
                3.8868835 = boost
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.01668627 = queryNorm
              1.1742547 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.2972131 = weight(abstract_txt:summarization in 1783) [ClassicSimilarity], result of:
            0.2972131 = score(doc=1783,freq=2.0), product of:
              0.46916378 = queryWeight, product of:
                3.9229858 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.01668627 = queryNorm
              0.63349545 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
        0.24 = coord(6/25)
    
  2. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.27
    0.26558346 = sum of:
      0.26558346 = product of:
        0.8299483 = sum of:
          0.011690685 = weight(abstract_txt:describes in 4677) [ClassicSimilarity], result of:
            0.011690685 = score(doc=4677,freq=1.0), product of:
              0.06573524 = queryWeight, product of:
                1.0383377 = boost
                3.7940266 = idf(docFreq=2566, maxDocs=41962)
                0.01668627 = queryNorm
              0.177845 = fieldWeight in 4677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7940266 = idf(docFreq=2566, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.024733959 = weight(abstract_txt:text in 4677) [ClassicSimilarity], result of:
            0.024733959 = score(doc=4677,freq=3.0), product of:
              0.075115055 = queryWeight, product of:
                1.1099489 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.01668627 = queryNorm
              0.32928097 = fieldWeight in 4677, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.016372243 = weight(abstract_txt:from in 4677) [ClassicSimilarity], result of:
            0.016372243 = score(doc=4677,freq=3.0), product of:
              0.07188087 = queryWeight, product of:
                1.5355401 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.01668627 = queryNorm
              0.22776912 = fieldWeight in 4677, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.01345836 = weight(abstract_txt:information in 4677) [ClassicSimilarity], result of:
            0.01345836 = score(doc=4677,freq=3.0), product of:
              0.06794737 = queryWeight, product of:
                1.6691518 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.01668627 = queryNorm
              0.19807035 = fieldWeight in 4677, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.13022529 = weight(abstract_txt:summary in 4677) [ClassicSimilarity], result of:
            0.13022529 = score(doc=4677,freq=5.0), product of:
              0.19173962 = queryWeight, product of:
                1.7733539 = boost
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.01668627 = queryNorm
              0.67917776 = fieldWeight in 4677, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.12770697 = weight(abstract_txt:summaries in 4677) [ClassicSimilarity], result of:
            0.12770697 = score(doc=4677,freq=3.0), product of:
              0.22439213 = queryWeight, product of:
                1.9184183 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.01668627 = queryNorm
              0.5691241 = fieldWeight in 4677, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.15330933 = weight(abstract_txt:event in 4677) [ClassicSimilarity], result of:
            0.15330933 = score(doc=4677,freq=1.0), product of:
              0.4605683 = queryWeight, product of:
                3.8868835 = boost
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.01668627 = queryNorm
              0.33286992 = fieldWeight in 4677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.101225 = idf(docFreq=93, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.3524514 = weight(abstract_txt:summarization in 4677) [ClassicSimilarity], result of:
            0.3524514 = score(doc=4677,freq=5.0), product of:
              0.46916378 = queryWeight, product of:
                3.9229858 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.01668627 = queryNorm
              0.7512332 = fieldWeight in 4677, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
        0.32 = coord(8/25)
    
  3. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.24
    0.24027339 = sum of:
      0.24027339 = product of:
        0.85811925 = sum of:
          0.019691452 = weight(abstract_txt:been in 3720) [ClassicSimilarity], result of:
            0.019691452 = score(doc=3720,freq=2.0), product of:
              0.060970675 = queryWeight, product of:
                3.6539428 = idf(docFreq=2952, maxDocs=41962)
                0.01668627 = queryNorm
              0.32296595 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6539428 = idf(docFreq=2952, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.01904021 = weight(abstract_txt:text in 3720) [ClassicSimilarity], result of:
            0.01904021 = score(doc=3720,freq=1.0), product of:
              0.075115055 = queryWeight, product of:
                1.1099489 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.01668627 = queryNorm
              0.2534806 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.012603358 = weight(abstract_txt:from in 3720) [ClassicSimilarity], result of:
            0.012603358 = score(doc=3720,freq=1.0), product of:
              0.07188087 = queryWeight, product of:
                1.5355401 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.01668627 = queryNorm
              0.17533675 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.01794448 = weight(abstract_txt:information in 3720) [ClassicSimilarity], result of:
            0.01794448 = score(doc=3720,freq=3.0), product of:
              0.06794737 = queryWeight, product of:
                1.6691518 = boost
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.01668627 = queryNorm
              0.2640938 = fieldWeight in 3720, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.439594 = idf(docFreq=9945, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.1344961 = weight(abstract_txt:summary in 3720) [ClassicSimilarity], result of:
            0.1344961 = score(doc=3720,freq=3.0), product of:
              0.19173962 = queryWeight, product of:
                1.7733539 = boost
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.01668627 = queryNorm
              0.7014518 = fieldWeight in 3720, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.09830887 = weight(abstract_txt:summaries in 3720) [ClassicSimilarity], result of:
            0.09830887 = score(doc=3720,freq=1.0), product of:
              0.22439213 = queryWeight, product of:
                1.9184183 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.01668627 = queryNorm
              0.43811193 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.5560348 = weight(abstract_txt:summarization in 3720) [ClassicSimilarity], result of:
            0.5560348 = score(doc=3720,freq=7.0), product of:
              0.46916378 = queryWeight, product of:
                3.9229858 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.01668627 = queryNorm
              1.1851615 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
        0.28 = coord(7/25)
    
  4. Kannan, R.; Ghinea, G.; Swaminathan, S.: What do you wish to see? : A summarization system for movies based on user preferences (2015) 0.21
    0.20754386 = sum of:
      0.20754386 = product of:
        0.8647661 = sum of:
          0.012183465 = weight(abstract_txt:been in 4684) [ClassicSimilarity], result of:
            0.012183465 = score(doc=4684,freq=1.0), product of:
              0.060970675 = queryWeight, product of:
                3.6539428 = idf(docFreq=2952, maxDocs=41962)
                0.01668627 = queryNorm
              0.199825 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6539428 = idf(docFreq=2952, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
          0.054103296 = weight(abstract_txt:tailored in 4684) [ClassicSimilarity], result of:
            0.054103296 = score(doc=4684,freq=1.0), product of:
              0.13074134 = queryWeight, product of:
                1.0354544 = boost
                7.5669823 = idf(docFreq=58, maxDocs=41962)
                0.01668627 = queryNorm
              0.41381934 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5669823 = idf(docFreq=58, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
          0.011027939 = weight(abstract_txt:from in 4684) [ClassicSimilarity], result of:
            0.011027939 = score(doc=4684,freq=1.0), product of:
              0.07188087 = queryWeight, product of:
                1.5355401 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.01668627 = queryNorm
              0.15341966 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
          0.1519295 = weight(abstract_txt:summary in 4684) [ClassicSimilarity], result of:
            0.1519295 = score(doc=4684,freq=5.0), product of:
              0.19173962 = queryWeight, product of:
                1.7733539 = boost
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.01668627 = queryNorm
              0.7923741 = fieldWeight in 4684, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
          0.14899147 = weight(abstract_txt:summaries in 4684) [ClassicSimilarity], result of:
            0.14899147 = score(doc=4684,freq=3.0), product of:
              0.22439213 = queryWeight, product of:
                1.9184183 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.01668627 = queryNorm
              0.6639781 = fieldWeight in 4684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
          0.48653048 = weight(abstract_txt:summarization in 4684) [ClassicSimilarity], result of:
            0.48653048 = score(doc=4684,freq=7.0), product of:
              0.46916378 = queryWeight, product of:
                3.9229858 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.01668627 = queryNorm
              1.0370163 = fieldWeight in 4684, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0546875 = fieldNorm(doc=4684)
        0.24 = coord(6/25)
    
  5. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.19
    0.18857878 = sum of:
      0.18857878 = product of:
        0.78574497 = sum of:
          0.026926924 = weight(abstract_txt:text in 3727) [ClassicSimilarity], result of:
            0.026926924 = score(doc=3727,freq=2.0), product of:
              0.075115055 = queryWeight, product of:
                1.1099489 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.01668627 = queryNorm
              0.35847571 = fieldWeight in 3727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
          0.012603358 = weight(abstract_txt:from in 3727) [ClassicSimilarity], result of:
            0.012603358 = score(doc=3727,freq=1.0), product of:
              0.07188087 = queryWeight, product of:
                1.5355401 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.01668627 = queryNorm
              0.17533675 = fieldWeight in 3727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
          0.07765136 = weight(abstract_txt:summary in 3727) [ClassicSimilarity], result of:
            0.07765136 = score(doc=3727,freq=1.0), product of:
              0.19173962 = queryWeight, product of:
                1.7733539 = boost
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.01668627 = queryNorm
              0.40498337 = fieldWeight in 3727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.479734 = idf(docFreq=174, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
          0.17027596 = weight(abstract_txt:summaries in 3727) [ClassicSimilarity], result of:
            0.17027596 = score(doc=3727,freq=3.0), product of:
              0.22439213 = queryWeight, product of:
                1.9184183 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.01668627 = queryNorm
              0.7588321 = fieldWeight in 3727, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
          0.077964544 = weight(abstract_txt:generation in 3727) [ClassicSimilarity], result of:
            0.077964544 = score(doc=3727,freq=1.0), product of:
              0.22007684 = queryWeight, product of:
                2.326871 = boost
                5.668169 = idf(docFreq=393, maxDocs=41962)
                0.01668627 = queryNorm
              0.35426056 = fieldWeight in 3727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.668169 = idf(docFreq=393, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
          0.42032284 = weight(abstract_txt:summarization in 3727) [ClassicSimilarity], result of:
            0.42032284 = score(doc=3727,freq=4.0), product of:
              0.46916378 = queryWeight, product of:
                3.9229858 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.01668627 = queryNorm
              0.89589787 = fieldWeight in 3727, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=3727)
        0.24 = coord(6/25)