Document (#11999)

Author
Brandow, R.
Mitze, K.
Rau, L.F.
Title
Automatic condensation of electronic publications by sentence selection
Source
Information processing and management. 31(1995) no.5, S.675-685
Year
1995
Abstract
Description of a system that performs domain-independent automatic condensation of news from a large commercial news service encompassing 41 different publications. This system was evaluated against a system that condensed the same articles using only the first portions of the texts (the löead), up to the target length of the summaries. 3 lengths of articles were evaluated for 250 documents by both systems, totalling 1.500 suitability judgements in all. The lead-based summaries outperformed the 'intelligent' summaries significantly, achieving acceptability ratings of over 90%, compared to 74,7%
Theme
Automatisches Abstracting

Similar documents (content)

  1. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.26
    0.25579923 = sum of:
      0.25579923 = product of:
        0.91356874 = sum of:
          0.07459575 = weight(abstract_txt:sentence in 657) [ClassicSimilarity], result of:
            0.07459575 = score(doc=657,freq=2.0), product of:
              0.123297624 = queryWeight, product of:
                1.1617693 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.015504953 = queryNorm
              0.60500556 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.09518234 = weight(abstract_txt:articles in 657) [ClassicSimilarity], result of:
            0.09518234 = score(doc=657,freq=7.0), product of:
              0.120365925 = queryWeight, product of:
                1.6233393 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.015504953 = queryNorm
              0.79077476 = fieldWeight in 657, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.018923834 = weight(abstract_txt:system in 657) [ClassicSimilarity], result of:
            0.018923834 = score(doc=657,freq=1.0), product of:
              0.08978459 = queryWeight, product of:
                1.7171335 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015504953 = queryNorm
              0.21076928 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.046136275 = weight(abstract_txt:automatic in 657) [ClassicSimilarity], result of:
            0.046136275 = score(doc=657,freq=1.0), product of:
              0.14207806 = queryWeight, product of:
                1.763685 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015504953 = queryNorm
              0.32472485 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.0614991 = weight(abstract_txt:evaluated in 657) [ClassicSimilarity], result of:
            0.0614991 = score(doc=657,freq=1.0), product of:
              0.17208558 = queryWeight, product of:
                1.9410189 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.015504953 = queryNorm
              0.3573751 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.19669098 = weight(abstract_txt:news in 657) [ClassicSimilarity], result of:
            0.19669098 = score(doc=657,freq=8.0), product of:
              0.18677767 = queryWeight, product of:
                2.0221808 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.015504953 = queryNorm
              1.0530754 = fieldWeight in 657, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.42054045 = weight(abstract_txt:summaries in 657) [ClassicSimilarity], result of:
            0.42054045 = score(doc=657,freq=6.0), product of:
              0.39055648 = queryWeight, product of:
                3.5813363 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015504953 = queryNorm
              1.0767725 = fieldWeight in 657, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
        0.28 = coord(7/25)
    
  2. Bando, L.L.; Scholer, F.; Turpin, A.: Query-biased summary generation assisted by query expansion : temporality (2015) 0.14
    0.13605598 = sum of:
      0.13605598 = product of:
        0.68027985 = sum of:
          0.03363872 = weight(abstract_txt:lead in 1820) [ClassicSimilarity], result of:
            0.03363872 = score(doc=1820,freq=1.0), product of:
              0.091351345 = queryWeight, product of:
                5.8917522 = idf(docFreq=331, maxDocs=44218)
                0.015504953 = queryNorm
              0.36823452 = fieldWeight in 1820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8917522 = idf(docFreq=331, maxDocs=44218)
                0.0625 = fieldNorm(doc=1820)
          0.10549432 = weight(abstract_txt:sentence in 1820) [ClassicSimilarity], result of:
            0.10549432 = score(doc=1820,freq=4.0), product of:
              0.123297624 = queryWeight, product of:
                1.1617693 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.015504953 = queryNorm
              0.8556071 = fieldWeight in 1820, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=1820)
          0.10168253 = weight(abstract_txt:lengths in 1820) [ClassicSimilarity], result of:
            0.10168253 = score(doc=1820,freq=1.0), product of:
              0.19097926 = queryWeight, product of:
                1.4458913 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.015504953 = queryNorm
              0.5324271 = fieldWeight in 1820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=1820)
          0.018923834 = weight(abstract_txt:system in 1820) [ClassicSimilarity], result of:
            0.018923834 = score(doc=1820,freq=1.0), product of:
              0.08978459 = queryWeight, product of:
                1.7171335 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015504953 = queryNorm
              0.21076928 = fieldWeight in 1820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=1820)
          0.42054045 = weight(abstract_txt:summaries in 1820) [ClassicSimilarity], result of:
            0.42054045 = score(doc=1820,freq=6.0), product of:
              0.39055648 = queryWeight, product of:
                3.5813363 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015504953 = queryNorm
              1.0767725 = fieldWeight in 1820, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1820)
        0.2 = coord(5/25)
    
  3. Maybury, M.T.: Generating summaries from event data (1995) 0.11
    0.11128512 = sum of:
      0.11128512 = product of:
        0.695532 = sum of:
          0.02365479 = weight(abstract_txt:system in 2349) [ClassicSimilarity], result of:
            0.02365479 = score(doc=2349,freq=1.0), product of:
              0.08978459 = queryWeight, product of:
                1.7171335 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015504953 = queryNorm
              0.2634616 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.057670347 = weight(abstract_txt:automatic in 2349) [ClassicSimilarity], result of:
            0.057670347 = score(doc=2349,freq=1.0), product of:
              0.14207806 = queryWeight, product of:
                1.763685 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015504953 = queryNorm
              0.40590608 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.3996007 = weight(abstract_txt:condensation in 2349) [ClassicSimilarity], result of:
            0.3996007 = score(doc=2349,freq=1.0), product of:
              0.516388 = queryWeight, product of:
                3.3623707 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.015504953 = queryNorm
              0.7738381 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.21460615 = weight(abstract_txt:summaries in 2349) [ClassicSimilarity], result of:
            0.21460615 = score(doc=2349,freq=1.0), product of:
              0.39055648 = queryWeight, product of:
                3.5813363 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015504953 = queryNorm
              0.5494881 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
        0.16 = coord(4/25)
    
  4. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.11
    0.108356364 = sum of:
      0.108356364 = product of:
        0.45148486 = sum of:
          0.07517081 = weight(abstract_txt:condensed in 2676) [ClassicSimilarity], result of:
            0.07517081 = score(doc=2676,freq=1.0), product of:
              0.18915331 = queryWeight, product of:
                1.4389626 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.015504953 = queryNorm
              0.39740676 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.038157824 = weight(abstract_txt:articles in 2676) [ClassicSimilarity], result of:
            0.038157824 = score(doc=2676,freq=2.0), product of:
              0.120365925 = queryWeight, product of:
                1.6233393 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.015504953 = queryNorm
              0.31701517 = fieldWeight in 2676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.020071756 = weight(abstract_txt:system in 2676) [ClassicSimilarity], result of:
            0.020071756 = score(doc=2676,freq=2.0), product of:
              0.08978459 = queryWeight, product of:
                1.7171335 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015504953 = queryNorm
              0.22355458 = fieldWeight in 2676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.048934907 = weight(abstract_txt:automatic in 2676) [ClassicSimilarity], result of:
            0.048934907 = score(doc=2676,freq=2.0), product of:
              0.14207806 = queryWeight, product of:
                1.763685 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015504953 = queryNorm
              0.3444227 = fieldWeight in 2676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.046124324 = weight(abstract_txt:evaluated in 2676) [ClassicSimilarity], result of:
            0.046124324 = score(doc=2676,freq=1.0), product of:
              0.17208558 = queryWeight, product of:
                1.9410189 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.015504953 = queryNorm
              0.2680313 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
          0.22302525 = weight(abstract_txt:summaries in 2676) [ClassicSimilarity], result of:
            0.22302525 = score(doc=2676,freq=3.0), product of:
              0.39055648 = queryWeight, product of:
                3.5813363 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015504953 = queryNorm
              0.5710448 = fieldWeight in 2676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.046875 = fieldNorm(doc=2676)
        0.24 = coord(6/25)
    
  5. Ling, X.; Jiang, J.; He, X.; Mei, Q.; Zhai, C.; Schatz, B.: Generating gene summaries from biomedical literature : a study of semi-structured summarization (2007) 0.10
    0.09593022 = sum of:
      0.09593022 = product of:
        0.4796511 = sum of:
          0.07459575 = weight(abstract_txt:sentence in 946) [ClassicSimilarity], result of:
            0.07459575 = score(doc=946,freq=2.0), product of:
              0.123297624 = queryWeight, product of:
                1.1617693 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.015504953 = queryNorm
              0.60500556 = fieldWeight in 946, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=946)
          0.061034117 = weight(abstract_txt:performs in 946) [ClassicSimilarity], result of:
            0.061034117 = score(doc=946,freq=1.0), product of:
              0.13589509 = queryWeight, product of:
                1.2196758 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.015504953 = queryNorm
              0.44912672 = fieldWeight in 946, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=946)
          0.035975542 = weight(abstract_txt:articles in 946) [ClassicSimilarity], result of:
            0.035975542 = score(doc=946,freq=1.0), product of:
              0.120365925 = queryWeight, product of:
                1.6233393 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.015504953 = queryNorm
              0.29888478 = fieldWeight in 946, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=946)
          0.065246545 = weight(abstract_txt:automatic in 946) [ClassicSimilarity], result of:
            0.065246545 = score(doc=946,freq=2.0), product of:
              0.14207806 = queryWeight, product of:
                1.763685 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015504953 = queryNorm
              0.45923027 = fieldWeight in 946, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=946)
          0.24279913 = weight(abstract_txt:summaries in 946) [ClassicSimilarity], result of:
            0.24279913 = score(doc=946,freq=2.0), product of:
              0.39055648 = queryWeight, product of:
                3.5813363 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.015504953 = queryNorm
              0.62167484 = fieldWeight in 946, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=946)
        0.2 = coord(5/25)