Document (#11999)

Author
Brandow, R.
Mitze, K.
Rau, L.F.
Title
Automatic condensation of electronic publications by sentence selection
Source
Information processing and management. 31(1995) no.5, p.675-685
Year
1995
Abstract
Description of a system that performs domain-independent automatic condensation of news from a large commercial news service encompassing 41 different publications. This system was evaluated against a system that condensed the same articles using only the first portions of the texts (the lead), up to the target length of the summaries. Three lengths of articles were evaluated for 250 documents by both systems, totalling 1,500 suitability judgements in all. The lead-based summaries significantly outperformed the 'intelligent' summaries, achieving acceptability ratings of over 90%, compared to 74.7%.
Theme
Automatic abstracting
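
The lead baseline described in the abstract (keep only the first portion of the text, up to the target length) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `lead_summary`, the word-count target, and the naive regex sentence splitter are all assumptions, since the record does not say how length was measured or how sentences were delimited.

```python
import re


def lead_summary(text: str, target_words: int) -> str:
    """Hypothetical lead baseline: keep leading sentences up to a word budget.

    Assumptions (not stated in the record): length is counted in words,
    sentences end at '.', '!' or '?', and the first sentence is always kept.
    """
    # Naive sentence split on terminal punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chosen = []
    count = 0
    for sentence in sentences:
        n = len(sentence.split())
        # Stop once adding the next sentence would exceed the budget.
        if chosen and count + n > target_words:
            break
        chosen.append(sentence)
        count += n
    return " ".join(chosen)


article = (
    "A fire broke out downtown. Two people were hurt. "
    "Officials are investigating the cause."
)
print(lead_summary(article, 9))
```

Cutting at sentence boundaries rather than mid-sentence is a design choice made here for readability of the output; a strict character or word cut would be an equally plausible reading of the abstract.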

Similar documents (content)

  1. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.26
    0.25977376 = sum of:
      0.25977376 = product of:
        0.9277634 = sum of:
          0.07740536 = weight(abstract_txt:sentence in 1783) [ClassicSimilarity], result of:
            0.07740536 = score(doc=1783,freq=2.0), product of:
              0.12627874 = queryWeight, product of:
                1.1660488 = boost
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.015615927 = queryNorm
              0.61297226 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.09777704 = weight(abstract_txt:articles in 1783) [ClassicSimilarity], result of:
            0.09777704 = score(doc=1783,freq=7.0), product of:
              0.122450754 = queryWeight, product of:
                1.6238552 = boost
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.015615927 = queryNorm
              0.7985009 = fieldWeight in 1783, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.018725762 = weight(abstract_txt:system in 1783) [ClassicSimilarity], result of:
            0.018725762 = score(doc=1783,freq=1.0), product of:
              0.08908946 = queryWeight, product of:
                1.6963887 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.015615927 = queryNorm
              0.21019055 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.04607448 = weight(abstract_txt:automatic in 1783) [ClassicSimilarity], result of:
            0.04607448 = score(doc=1783,freq=1.0), product of:
              0.14184363 = queryWeight, product of:
                1.7477186 = boost
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.015615927 = queryNorm
              0.32482585 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.0625792 = weight(abstract_txt:evaluated in 1783) [ClassicSimilarity], result of:
            0.0625792 = score(doc=1783,freq=1.0), product of:
              0.17396274 = queryWeight, product of:
                1.9355068 = boost
                5.755642 = idf(docFreq=360, maxDocs=41962)
                0.015615927 = queryNorm
              0.35972762 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.755642 = idf(docFreq=360, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.20983568 = weight(abstract_txt:news in 1783) [ClassicSimilarity], result of:
            0.20983568 = score(doc=1783,freq=8.0), product of:
              0.19486138 = queryWeight, product of:
                2.0484693 = boost
                6.0915604 = idf(docFreq=257, maxDocs=41962)
                0.015615927 = queryNorm
              1.0768459 = fieldWeight in 1783, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.0915604 = idf(docFreq=257, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
          0.41536587 = weight(abstract_txt:summaries in 1783) [ClassicSimilarity], result of:
            0.41536587 = score(doc=1783,freq=6.0), product of:
              0.38705269 = queryWeight, product of:
                3.535878 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.015615927 = queryNorm
              1.0731508 = fieldWeight in 1783, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=1783)
        0.28 = coord(7/25)
    
  2. Bando, L.L.; Scholer, F.; Turpin, A.: Query-biased summary generation assisted by query expansion : temporality (2015) 0.14
    0.13615532 = sum of:
      0.13615532 = product of:
        0.6807766 = sum of:
          0.034522794 = weight(abstract_txt:lead in 3821) [ClassicSimilarity], result of:
            0.034522794 = score(doc=3821,freq=1.0), product of:
              0.09287458 = queryWeight, product of:
                5.9474263 = idf(docFreq=297, maxDocs=41962)
                0.015615927 = queryNorm
              0.37171414 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9474263 = idf(docFreq=297, maxDocs=41962)
                0.0625 = fieldNorm(doc=3821)
          0.10946772 = weight(abstract_txt:sentence in 3821) [ClassicSimilarity], result of:
            0.10946772 = score(doc=3821,freq=4.0), product of:
              0.12627874 = queryWeight, product of:
                1.1660488 = boost
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.015615927 = queryNorm
              0.8668737 = fieldWeight in 3821, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.0625 = fieldNorm(doc=3821)
          0.102694415 = weight(abstract_txt:lengths in 3821) [ClassicSimilarity], result of:
            0.102694415 = score(doc=3821,freq=1.0), product of:
              0.19209856 = queryWeight, product of:
                1.4381813 = boost
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.015615927 = queryNorm
              0.53459233 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.0625 = fieldNorm(doc=3821)
          0.018725762 = weight(abstract_txt:system in 3821) [ClassicSimilarity], result of:
            0.018725762 = score(doc=3821,freq=1.0), product of:
              0.08908946 = queryWeight, product of:
                1.6963887 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.015615927 = queryNorm
              0.21019055 = fieldWeight in 3821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.0625 = fieldNorm(doc=3821)
          0.41536587 = weight(abstract_txt:summaries in 3821) [ClassicSimilarity], result of:
            0.41536587 = score(doc=3821,freq=6.0), product of:
              0.38705269 = queryWeight, product of:
                3.535878 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.015615927 = queryNorm
              1.0731508 = fieldWeight in 3821, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=3821)
        0.2 = coord(5/25)
    
  3. Maybury, M.T.: Generating summaries from event data (1995) 0.11
    0.10965904 = sum of:
      0.10965904 = product of:
        0.685369 = sum of:
          0.023407204 = weight(abstract_txt:system in 2418) [ClassicSimilarity], result of:
            0.023407204 = score(doc=2418,freq=1.0), product of:
              0.08908946 = queryWeight, product of:
                1.6963887 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.015615927 = queryNorm
              0.2627382 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.078125 = fieldNorm(doc=2418)
          0.0575931 = weight(abstract_txt:automatic in 2418) [ClassicSimilarity], result of:
            0.0575931 = score(doc=2418,freq=1.0), product of:
              0.14184363 = queryWeight, product of:
                1.7477186 = boost
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.015615927 = queryNorm
              0.40603232 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.078125 = fieldNorm(doc=2418)
          0.3924032 = weight(abstract_txt:condensation in 2418) [ClassicSimilarity], result of:
            0.3924032 = score(doc=2418,freq=1.0), product of:
              0.5097821 = queryWeight, product of:
                3.3132854 = boost
                9.85276 = idf(docFreq=5, maxDocs=41962)
                0.015615927 = queryNorm
              0.7697469 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.85276 = idf(docFreq=5, maxDocs=41962)
                0.078125 = fieldNorm(doc=2418)
          0.2119655 = weight(abstract_txt:summaries in 2418) [ClassicSimilarity], result of:
            0.2119655 = score(doc=2418,freq=1.0), product of:
              0.38705269 = queryWeight, product of:
                3.535878 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.015615927 = queryNorm
              0.5476399 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.078125 = fieldNorm(doc=2418)
        0.16 = coord(4/25)
    
  4. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.11
    0.10823296 = sum of:
      0.10823296 = product of:
        0.45097068 = sum of:
          0.075826235 = weight(abstract_txt:condensed in 4677) [ClassicSimilarity], result of:
            0.075826235 = score(doc=4677,freq=1.0), product of:
              0.19010712 = queryWeight, product of:
                1.4307072 = boost
                8.509026 = idf(docFreq=22, maxDocs=41962)
                0.015615927 = queryNorm
              0.39886057 = fieldWeight in 4677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.509026 = idf(docFreq=22, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.03919802 = weight(abstract_txt:articles in 4677) [ClassicSimilarity], result of:
            0.03919802 = score(doc=4677,freq=2.0), product of:
              0.122450754 = queryWeight, product of:
                1.6238552 = boost
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.015615927 = queryNorm
              0.32011253 = fieldWeight in 4677, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.01986167 = weight(abstract_txt:system in 4677) [ClassicSimilarity], result of:
            0.01986167 = score(doc=4677,freq=2.0), product of:
              0.08908946 = queryWeight, product of:
                1.6963887 = boost
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.015615927 = queryNorm
              0.22294074 = fieldWeight in 4677, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3630488 = idf(docFreq=3949, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.048869364 = weight(abstract_txt:automatic in 4677) [ClassicSimilarity], result of:
            0.048869364 = score(doc=4677,freq=2.0), product of:
              0.14184363 = queryWeight, product of:
                1.7477186 = boost
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.015615927 = queryNorm
              0.34452984 = fieldWeight in 4677, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.046934403 = weight(abstract_txt:evaluated in 4677) [ClassicSimilarity], result of:
            0.046934403 = score(doc=4677,freq=1.0), product of:
              0.17396274 = queryWeight, product of:
                1.9355068 = boost
                5.755642 = idf(docFreq=360, maxDocs=41962)
                0.015615927 = queryNorm
              0.26979572 = fieldWeight in 4677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.755642 = idf(docFreq=360, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
          0.220281 = weight(abstract_txt:summaries in 4677) [ClassicSimilarity], result of:
            0.220281 = score(doc=4677,freq=3.0), product of:
              0.38705269 = queryWeight, product of:
                3.535878 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.015615927 = queryNorm
              0.5691241 = fieldWeight in 4677, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.046875 = fieldNorm(doc=4677)
        0.24 = coord(6/25)
    
  5. Ling, X.; Jiang, J.; He, X.; Mei, Q.; Zhai, C.; Schatz, B.: Generating gene summaries from biomedical literature : a study of semi-structured summarization (2007) 0.10
    0.09595001 = sum of:
      0.09595001 = product of:
        0.47975004 = sum of:
          0.07740536 = weight(abstract_txt:sentence in 2947) [ClassicSimilarity], result of:
            0.07740536 = score(doc=2947,freq=2.0), product of:
              0.12627874 = queryWeight, product of:
                1.1660488 = boost
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.015615927 = queryNorm
              0.61297226 = fieldWeight in 2947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9349895 = idf(docFreq=110, maxDocs=41962)
                0.0625 = fieldNorm(doc=2947)
          0.0604177 = weight(abstract_txt:performs in 2947) [ClassicSimilarity], result of:
            0.0604177 = score(doc=2947,freq=1.0), product of:
              0.13487631 = queryWeight, product of:
                1.2050898 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.015615927 = queryNorm
              0.44794893 = fieldWeight in 2947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=2947)
          0.036956247 = weight(abstract_txt:articles in 2947) [ClassicSimilarity], result of:
            0.036956247 = score(doc=2947,freq=1.0), product of:
              0.122450754 = queryWeight, product of:
                1.6238552 = boost
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.015615927 = queryNorm
              0.301805 = fieldWeight in 2947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.82888 = idf(docFreq=911, maxDocs=41962)
                0.0625 = fieldNorm(doc=2947)
          0.06515915 = weight(abstract_txt:automatic in 2947) [ClassicSimilarity], result of:
            0.06515915 = score(doc=2947,freq=2.0), product of:
              0.14184363 = queryWeight, product of:
                1.7477186 = boost
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.015615927 = queryNorm
              0.45937312 = fieldWeight in 2947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1972136 = idf(docFreq=630, maxDocs=41962)
                0.0625 = fieldNorm(doc=2947)
          0.23981158 = weight(abstract_txt:summaries in 2947) [ClassicSimilarity], result of:
            0.23981158 = score(doc=2947,freq=2.0), product of:
              0.38705269 = queryWeight, product of:
                3.535878 = boost
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.015615927 = queryNorm
              0.61958385 = fieldWeight in 2947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.009791 = idf(docFreq=102, maxDocs=41962)
                0.0625 = fieldNorm(doc=2947)
        0.2 = coord(5/25)
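
The score trees above follow Lucene's ClassicSimilarity (TF-IDF) formula: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), each term score is queryWeight times fieldWeight, and the document score is the sum of term scores multiplied by the coord factor. The sketch below recomputes two of the listed values from their inputs. The function names are illustrative, and results agree with the listing only up to rounding, because Lucene works in 32-bit floats.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores: list[float], matched: int, total: int) -> float:
    """Sum of term scores scaled by coord(matched/total)."""
    return sum(term_scores) * (matched / total)


# Term "sentence" in document 1783; the listing reports 0.07740536.
s = term_score(freq=2.0, doc_freq=110, max_docs=41962,
               boost=1.1660488, query_norm=0.015615927, field_norm=0.0625)
print(round(s, 6))

# Entry 2 (document 3821): five of 25 query terms match; the listing reports 0.13615532.
doc_3821 = [0.034522794, 0.10946772, 0.102694415, 0.018725762, 0.41536587]
print(round(document_score(doc_3821, matched=5, total=25), 6))
```

Terms listed without a boost line, such as "lead" in entry 2, correspond to a boost of 1.0 in this sketch.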