Document (#11419)

Author
Maybury, M.T.
Title
Generating summaries from event data
Source
Information processing and management. 31(1995) no.5, p.735-751
Year
1995
Abstract
Summarization entails analysis of source material, selection of key information, condensation of this, and generation of a compact summary form. While there have been many investigations into the automatic summarization of text, relatively little attention has been given to the summarization of information from structured information sources such as data or knowledge bases, despite this being a desirable capability for a number of application areas including report generation from databases (e.g. weather, financial, medical) and simulation (e.g. military, manufacturing, economic). After a brief introduction indicating the main elements of summarization and referring to some illustrative approaches to it, considers specific issues in the generation of text summaries of event data and describes a system, SumGen, which selects key information from an event database by reasoning about event frequencies, frequencies of relations between events, and domain-specific importance measures. Describes how SumGen then aggregates similar information and plans a summary presentation tailored to stereotypical users.
Theme
Automatic abstracting

Similar documents (content)

  1. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.28
    0.28149572 = sum of:
      0.28149572 = product of:
        1.1728989 = sum of:
          0.0121614365 = weight(abstract_txt:from in 657) [ClassicSimilarity], result of:
            0.0121614365 = score(doc=657,freq=1.0), product of:
              0.07040204 = queryWeight, product of:
                1.528029 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01666994 = queryNorm
              0.17274266 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.017695064 = weight(abstract_txt:information in 657) [ClassicSimilarity], result of:
            0.017695064 = score(doc=657,freq=3.0), product of:
              0.067519054 = queryWeight, product of:
                1.6730431 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01666994 = queryNorm
              0.26207513 = fieldWeight in 657, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.07804381 = weight(abstract_txt:summary in 657) [ClassicSimilarity], result of:
            0.07804381 = score(doc=657,freq=1.0), product of:
              0.1929646 = queryWeight, product of:
                1.7888042 = boost
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.01666994 = queryNorm
              0.40444627 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.24545763 = weight(abstract_txt:summaries in 657) [ClassicSimilarity], result of:
            0.24545763 = score(doc=657,freq=6.0), product of:
              0.22795683 = queryWeight, product of:
                1.9442418 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01666994 = queryNorm
              1.0767725 = fieldWeight in 657, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.523962 = weight(abstract_txt:event in 657) [ClassicSimilarity], result of:
            0.523962 = score(doc=657,freq=7.0), product of:
              0.45230272 = queryWeight, product of:
                3.873054 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.01666994 = queryNorm
              1.1584321 = fieldWeight in 657, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.29557893 = weight(abstract_txt:summarization in 657) [ClassicSimilarity], result of:
            0.29557893 = score(doc=657,freq=2.0), product of:
              0.4688504 = queryWeight, product of:
                3.9432664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01666994 = queryNorm
              0.6304333 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
        0.24 = coord(6/25)
    
  2. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.26
    0.26431635 = sum of:
      0.26431635 = product of:
        0.82598865 = sum of:
          0.012078928 = weight(abstract_txt:describes in 2676) [ClassicSimilarity], result of:
            0.012078928 = score(doc=2676,freq=1.0), product of:
              0.067385025 = queryWeight, product of:
                1.0570747 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.01666994 = queryNorm
              0.1792524 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.024740493 = weight(abstract_txt:text in 2676) [ClassicSimilarity], result of:
            0.024740493 = score(doc=2676,freq=3.0), product of:
              0.075354576 = queryWeight, product of:
                1.1178378 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01666994 = queryNorm
              0.32832104 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.015798168 = weight(abstract_txt:from in 2676) [ClassicSimilarity], result of:
            0.015798168 = score(doc=2676,freq=3.0), product of:
              0.07040204 = queryWeight, product of:
                1.528029 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01666994 = queryNorm
              0.2243993 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.013271298 = weight(abstract_txt:information in 2676) [ClassicSimilarity], result of:
            0.013271298 = score(doc=2676,freq=3.0), product of:
              0.067519054 = queryWeight, product of:
                1.6730431 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01666994 = queryNorm
              0.19655634 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.13088346 = weight(abstract_txt:summary in 2676) [ClassicSimilarity], result of:
            0.13088346 = score(doc=2676,freq=5.0), product of:
              0.1929646 = queryWeight, product of:
                1.7888042 = boost
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.01666994 = queryNorm
              0.678277 = fieldWeight in 2676, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.13017356 = weight(abstract_txt:summaries in 2676) [ClassicSimilarity], result of:
            0.13017356 = score(doc=2676,freq=3.0), product of:
              0.22795683 = queryWeight, product of:
                1.9442418 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01666994 = queryNorm
              0.5710448 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.14852928 = weight(abstract_txt:event in 2676) [ClassicSimilarity], result of:
            0.14852928 = score(doc=2676,freq=1.0), product of:
              0.45230272 = queryWeight, product of:
                3.873054 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.01666994 = queryNorm
              0.32838467 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.3505135 = weight(abstract_txt:summarization in 2676) [ClassicSimilarity], result of:
            0.3505135 = score(doc=2676,freq=5.0), product of:
              0.4688504 = queryWeight, product of:
                3.9432664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01666994 = queryNorm
              0.747602 = fieldWeight in 2676, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
        0.32 = coord(8/25)
    
  3. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.24
    0.2398327 = sum of:
      0.2398327 = product of:
        0.8565453 = sum of:
          0.019282578 = weight(abstract_txt:been in 1719) [ClassicSimilarity], result of:
            0.019282578 = score(doc=1719,freq=2.0), product of:
              0.060304824 = queryWeight, product of:
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.01666994 = queryNorm
              0.31975183 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.019045241 = weight(abstract_txt:text in 1719) [ClassicSimilarity], result of:
            0.019045241 = score(doc=1719,freq=1.0), product of:
              0.075354576 = queryWeight, product of:
                1.1178378 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01666994 = queryNorm
              0.25274166 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.0121614365 = weight(abstract_txt:from in 1719) [ClassicSimilarity], result of:
            0.0121614365 = score(doc=1719,freq=1.0), product of:
              0.07040204 = queryWeight, product of:
                1.528029 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01666994 = queryNorm
              0.17274266 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.017695064 = weight(abstract_txt:information in 1719) [ClassicSimilarity], result of:
            0.017695064 = score(doc=1719,freq=3.0), product of:
              0.067519054 = queryWeight, product of:
                1.6730431 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.01666994 = queryNorm
              0.26207513 = fieldWeight in 1719, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.13517584 = weight(abstract_txt:summary in 1719) [ClassicSimilarity], result of:
            0.13517584 = score(doc=1719,freq=3.0), product of:
              0.1929646 = queryWeight, product of:
                1.7888042 = boost
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.01666994 = queryNorm
              0.70052147 = fieldWeight in 1719, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.100207664 = weight(abstract_txt:summaries in 1719) [ClassicSimilarity], result of:
            0.100207664 = score(doc=1719,freq=1.0), product of:
              0.22795683 = queryWeight, product of:
                1.9442418 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01666994 = queryNorm
              0.4395905 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.5529775 = weight(abstract_txt:summarization in 1719) [ClassicSimilarity], result of:
            0.5529775 = score(doc=1719,freq=7.0), product of:
              0.4688504 = queryWeight, product of:
                3.9432664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01666994 = queryNorm
              1.1794327 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
        0.28 = coord(7/25)
    
  4. Kannan, R.; Ghinea, G.; Swaminathan, S.: What do you wish to see? : A summarization system for movies based on user preferences (2015) 0.21
    0.20767233 = sum of:
      0.20767233 = product of:
        0.8653014 = sum of:
          0.011930486 = weight(abstract_txt:been in 2683) [ClassicSimilarity], result of:
            0.011930486 = score(doc=2683,freq=1.0), product of:
              0.060304824 = queryWeight, product of:
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.01666994 = queryNorm
              0.19783635 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
          0.05430782 = weight(abstract_txt:tailored in 2683) [ClassicSimilarity], result of:
            0.05430782 = score(doc=2683,freq=1.0), product of:
              0.13146542 = queryWeight, product of:
                1.0440342 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.01666994 = queryNorm
              0.41309583 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
          0.010641257 = weight(abstract_txt:from in 2683) [ClassicSimilarity], result of:
            0.010641257 = score(doc=2683,freq=1.0), product of:
              0.07040204 = queryWeight, product of:
                1.528029 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01666994 = queryNorm
              0.15114984 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
          0.15269735 = weight(abstract_txt:summary in 2683) [ClassicSimilarity], result of:
            0.15269735 = score(doc=2683,freq=5.0), product of:
              0.1929646 = queryWeight, product of:
                1.7888042 = boost
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.01666994 = queryNorm
              0.7913232 = fieldWeight in 2683, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
          0.15186916 = weight(abstract_txt:summaries in 2683) [ClassicSimilarity], result of:
            0.15186916 = score(doc=2683,freq=3.0), product of:
              0.22795683 = queryWeight, product of:
                1.9442418 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01666994 = queryNorm
              0.66621894 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
          0.48385534 = weight(abstract_txt:summarization in 2683) [ClassicSimilarity], result of:
            0.48385534 = score(doc=2683,freq=7.0), product of:
              0.4688504 = queryWeight, product of:
                3.9432664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01666994 = queryNorm
              1.0320036 = fieldWeight in 2683, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2683)
        0.24 = coord(6/25)
    
  5. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.19
    0.18855606 = sum of:
      0.18855606 = product of:
        0.78565025 = sum of:
          0.026934039 = weight(abstract_txt:text in 1726) [ClassicSimilarity], result of:
            0.026934039 = score(doc=1726,freq=2.0), product of:
              0.075354576 = queryWeight, product of:
                1.1178378 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01666994 = queryNorm
              0.3574307 = fieldWeight in 1726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.0121614365 = weight(abstract_txt:from in 1726) [ClassicSimilarity], result of:
            0.0121614365 = score(doc=1726,freq=1.0), product of:
              0.07040204 = queryWeight, product of:
                1.528029 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01666994 = queryNorm
              0.17274266 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.07804381 = weight(abstract_txt:summary in 1726) [ClassicSimilarity], result of:
            0.07804381 = score(doc=1726,freq=1.0), product of:
              0.1929646 = queryWeight, product of:
                1.7888042 = boost
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.01666994 = queryNorm
              0.40444627 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4711404 = idf(docFreq=185, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.17356475 = weight(abstract_txt:summaries in 1726) [ClassicSimilarity], result of:
            0.17356475 = score(doc=1726,freq=3.0), product of:
              0.22795683 = queryWeight, product of:
                1.9442418 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01666994 = queryNorm
              0.7613931 = fieldWeight in 1726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.076934494 = weight(abstract_txt:generation in 1726) [ClassicSimilarity], result of:
            0.076934494 = score(doc=1726,freq=1.0), product of:
              0.21879119 = queryWeight, product of:
                2.3328376 = boost
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.01666994 = queryNorm
              0.35163435 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.41801172 = weight(abstract_txt:summarization in 1726) [ClassicSimilarity], result of:
            0.41801172 = score(doc=1726,freq=4.0), product of:
              0.4688504 = queryWeight, product of:
                3.9432664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01666994 = queryNorm
              0.89156735 = fieldWeight in 1726, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
        0.24 = coord(6/25)
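
The score trees above all follow the same arithmetic, which can be reproduced by hand. The sketch below is a minimal illustration, not the catalog's actual code: it recomputes the score of the first result (doc 657) from the values printed in its tree, assuming the standard ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function and constant names (`idf`, `term_score`, `document_score`, `DOC_657_TERMS`) are hypothetical and chosen only for this example.

```python
import math

# Collection-wide constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.01666994


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm,
               query_norm=QUERY_NORM, max_docs=MAX_DOCS):
    """Score of one query term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, query_terms):
    """Sum of the term scores, scaled by coord(matched/query_terms)."""
    raw = sum(term_score(freq, doc_freq, boost, field_norm)
              for _term, freq, doc_freq, boost in terms)
    return raw * (matched / query_terms)


# (term, freq, docFreq, boost) for doc 657, copied from its explain tree.
DOC_657_TERMS = [
    ("from", 1.0, 7577, 1.528029),
    ("information", 3.0, 10677, 1.6730431),
    ("summary", 1.0, 185, 1.7888042),
    ("summaries", 6.0, 105, 1.9442418),
    ("event", 7.0, 108, 3.873054),
    ("summarization", 2.0, 95, 3.9432664),
]

score_657 = document_score(DOC_657_TERMS, field_norm=0.0625,
                           matched=6, query_terms=25)
print(round(score_657, 4))  # 0.2815
```

The result agrees with the 0.28149572 shown for doc 657 up to rounding, since the engine works in single precision while this sketch uses Python doubles. The same functions apply to the other four results once their term lists, fieldNorm and coord values are substituted.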