Document (#25651)

Author
Haag, M.
Title
Automatic text summarization : Evaluation des Copernic Summarizer und mögliche Einsatzfelder in der Fachinformation der DaimlerChrysler AG
Imprint
Aachen : Shaker Verlag
Year
2002
Pages
211 pp.
ISBN
3-8265-9952-7
Series
Wirtschaftsinformatik
Abstract
This thesis presents an evaluation of the Copernic Summarizer, a software product for automatically summarizing text in various data formats. The aim is to assess whether and how the Copernic Summarizer can reasonably be used in the DaimlerChrysler Information Division to improve the quality of its information services. First, an introduction to automatic text summarization is given and the Copernic Summarizer is presented. Next, various methods for evaluating automatic text summarization systems and software ergonomics are described. Two evaluation forms are then developed with which employees of the Information Division rate the quality and relevance of the extracted keywords and summaries as well as the software's usability. Quality and relevance are assessed by comparing the summaries with the original text. Finally, a recommendation is given on the use of the Copernic Summarizer.
Footnote
Diploma thesis at HBI Stuttgart. - See also: nfd 53(2002) no.4, pp.243-244
Theme
Automatic abstracting
Object
Copernic Summarizer

Similar documents (content)

  1. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.31
    0.30660748 = sum of:
      0.30660748 = product of:
        1.2775313 = sum of:
          0.13540225 = weight(abstract_txt:summaries in 3727) [ClassicSimilarity], result of:
            0.13540225 = score(doc=3727,freq=3.0), product of:
              0.17847319 = queryWeight, product of:
                2.2538464 = boost
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.011298906 = queryNorm
              0.75867 = fieldWeight in 3727, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
          0.035004523 = weight(abstract_txt:quality in 3727) [ClassicSimilarity], result of:
            0.035004523 = score(doc=3727,freq=1.0), product of:
              0.11957563 = queryWeight, product of:
                2.2594607 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.011298906 = queryNorm
              0.2927396 = fieldWeight in 3727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
          0.047786564 = weight(abstract_txt:automatic in 3727) [ClassicSimilarity], result of:
            0.047786564 = score(doc=3727,freq=1.0), product of:
              0.14715119 = queryWeight, product of:
                2.5064864 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.011298906 = queryNorm
              0.32474467 = fieldWeight in 3727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
          0.05340988 = weight(abstract_txt:text in 3727) [ClassicSimilarity], result of:
            0.05340988 = score(doc=3727,freq=2.0), product of:
              0.14913534 = queryWeight, product of:
                3.2576027 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.011298906 = queryNorm
              0.35813028 = fieldWeight in 3727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
          0.24818602 = weight(abstract_txt:summarization in 3727) [ClassicSimilarity], result of:
            0.24818602 = score(doc=3727,freq=4.0), product of:
              0.27800852 = queryWeight, product of:
                3.4451847 = boost
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.011298906 = queryNorm
              0.8927281 = fieldWeight in 3727, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
          0.75774205 = weight(abstract_txt:summarizer in 3727) [ClassicSimilarity], result of:
            0.75774205 = score(doc=3727,freq=3.0), product of:
              0.7635134 = queryWeight, product of:
                7.3708262 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.011298906 = queryNorm
              0.99244106 = fieldWeight in 3727, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.0625 = fieldNorm(doc=3727)
        0.24 = coord(6/25)
    
  2. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.25
    0.2516757 = sum of:
      0.2516757 = product of:
        0.62918925 = sum of:
          0.027311921 = weight(abstract_txt:extracted in 1783) [ClassicSimilarity], result of:
            0.027311921 = score(doc=1783,freq=1.0), product of:
              0.07026747 = queryWeight, product of:
                6.218962 = idf(docFreq=228, maxDocs=42306)
                0.011298906 = queryNorm
              0.38868514 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.218962 = idf(docFreq=228, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.008527858 = weight(abstract_txt:information in 1783) [ClassicSimilarity], result of:
            0.008527858 = score(doc=1783,freq=3.0), product of:
              0.032340456 = queryWeight, product of:
                1.1750505 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.011298906 = queryNorm
              0.2636901 = fieldWeight in 1783, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.053939376 = weight(abstract_txt:summarizing in 1783) [ClassicSimilarity], result of:
            0.053939376 = score(doc=1783,freq=1.0), product of:
              0.110608906 = queryWeight, product of:
                1.2546364 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.011298906 = queryNorm
              0.48765853 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.021177027 = weight(abstract_txt:being in 1783) [ClassicSimilarity], result of:
            0.021177027 = score(doc=1783,freq=1.0), product of:
              0.07472045 = queryWeight, product of:
                1.4583359 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.011298906 = queryNorm
              0.28341675 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.023796206 = weight(abstract_txt:given in 1783) [ClassicSimilarity], result of:
            0.023796206 = score(doc=1783,freq=1.0), product of:
              0.080760926 = queryWeight, product of:
                1.5161371 = boost
                4.7144 = idf(docFreq=1030, maxDocs=42306)
                0.011298906 = queryNorm
              0.29465 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7144 = idf(docFreq=1030, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.043707687 = weight(abstract_txt:evaluation in 1783) [ClassicSimilarity], result of:
            0.043707687 = score(doc=1783,freq=2.0), product of:
              0.1100496 = queryWeight, product of:
                2.1675928 = boost
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.011298906 = queryNorm
              0.39716354 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4933925 = idf(docFreq=1285, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.1914877 = weight(abstract_txt:summaries in 1783) [ClassicSimilarity], result of:
            0.1914877 = score(doc=1783,freq=6.0), product of:
              0.17847319 = queryWeight, product of:
                2.2538464 = boost
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.011298906 = queryNorm
              1.0729214 = fieldWeight in 1783, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.03596089 = weight(abstract_txt:presented in 1783) [ClassicSimilarity], result of:
            0.03596089 = score(doc=1783,freq=1.0), product of:
              0.1217438 = queryWeight, product of:
                2.2798533 = boost
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.011298906 = queryNorm
              0.2953817 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.726107 = idf(docFreq=1018, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.047786564 = weight(abstract_txt:automatic in 1783) [ClassicSimilarity], result of:
            0.047786564 = score(doc=1783,freq=1.0), product of:
              0.14715119 = queryWeight, product of:
                2.5064864 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.011298906 = queryNorm
              0.32474467 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
          0.17549402 = weight(abstract_txt:summarization in 1783) [ClassicSimilarity], result of:
            0.17549402 = score(doc=1783,freq=2.0), product of:
              0.27800852 = queryWeight, product of:
                3.4451847 = boost
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.011298906 = queryNorm
              0.6312541 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.0625 = fieldNorm(doc=1783)
        0.4 = coord(10/25)
    
  3. Steinberger, J.; Poesio, M.; Kabadjov, M.A.; Jezek, K.: Two uses of anaphora resolution in summarization (2007) 0.18
    0.18379882 = sum of:
      0.18379882 = product of:
        1.1487427 = sum of:
          0.0341399 = weight(abstract_txt:extracted in 2950) [ClassicSimilarity], result of:
            0.0341399 = score(doc=2950,freq=1.0), product of:
              0.07026747 = queryWeight, product of:
                6.218962 = idf(docFreq=228, maxDocs=42306)
                0.011298906 = queryNorm
              0.4858564 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.218962 = idf(docFreq=228, maxDocs=42306)
                0.078125 = fieldNorm(doc=2950)
          0.012308904 = weight(abstract_txt:information in 2950) [ClassicSimilarity], result of:
            0.012308904 = score(doc=2950,freq=4.0), product of:
              0.032340456 = queryWeight, product of:
                1.1750505 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.011298906 = queryNorm
              0.3806039 = fieldWeight in 2950, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=2950)
          0.15511625 = weight(abstract_txt:summarization in 2950) [ClassicSimilarity], result of:
            0.15511625 = score(doc=2950,freq=1.0), product of:
              0.27800852 = queryWeight, product of:
                3.4451847 = boost
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.011298906 = queryNorm
              0.557955 = fieldWeight in 2950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.078125 = fieldNorm(doc=2950)
          0.9471776 = weight(abstract_txt:summarizer in 2950) [ClassicSimilarity], result of:
            0.9471776 = score(doc=2950,freq=3.0), product of:
              0.7635134 = queryWeight, product of:
                7.3708262 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.011298906 = queryNorm
              1.2405514 = fieldWeight in 2950, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.078125 = fieldNorm(doc=2950)
        0.16 = coord(4/25)
    
  4. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.18
    0.17793034 = sum of:
      0.17793034 = product of:
        0.8896517 = sum of:
          0.035004523 = weight(abstract_txt:quality in 4694) [ClassicSimilarity], result of:
            0.035004523 = score(doc=4694,freq=1.0), product of:
              0.11957563 = queryWeight, product of:
                2.2594607 = boost
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.011298906 = queryNorm
              0.2927396 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6838336 = idf(docFreq=1062, maxDocs=42306)
                0.0625 = fieldNorm(doc=4694)
          0.047786564 = weight(abstract_txt:automatic in 4694) [ClassicSimilarity], result of:
            0.047786564 = score(doc=4694,freq=1.0), product of:
              0.14715119 = queryWeight, product of:
                2.5064864 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.011298906 = queryNorm
              0.32474467 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.0625 = fieldNorm(doc=4694)
          0.065413475 = weight(abstract_txt:text in 4694) [ClassicSimilarity], result of:
            0.065413475 = score(doc=4694,freq=3.0), product of:
              0.14913534 = queryWeight, product of:
                3.2576027 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.011298906 = queryNorm
              0.4386182 = fieldWeight in 4694, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.0625 = fieldNorm(doc=4694)
          0.30396459 = weight(abstract_txt:summarization in 4694) [ClassicSimilarity], result of:
            0.30396459 = score(doc=4694,freq=6.0), product of:
              0.27800852 = queryWeight, product of:
                3.4451847 = boost
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.011298906 = queryNorm
              1.0933642 = fieldWeight in 4694, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.0625 = fieldNorm(doc=4694)
          0.43748257 = weight(abstract_txt:summarizer in 4694) [ClassicSimilarity], result of:
            0.43748257 = score(doc=4694,freq=1.0), product of:
              0.7635134 = queryWeight, product of:
                7.3708262 = boost
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.011298906 = queryNorm
              0.5729861 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.167778 = idf(docFreq=11, maxDocs=42306)
                0.0625 = fieldNorm(doc=4694)
        0.2 = coord(5/25)
    
  5. Maybury, M.T.: Generating summaries from event data (1995) 0.17
    0.16923885 = sum of:
      0.16923885 = product of:
        0.6044245 = sum of:
          0.013761773 = weight(abstract_txt:information in 2418) [ClassicSimilarity], result of:
            0.013761773 = score(doc=2418,freq=5.0), product of:
              0.032340456 = queryWeight, product of:
                1.1750505 = boost
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.011298906 = queryNorm
              0.4255281 = fieldWeight in 2418, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.435865 = idf(docFreq=10064, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.026471283 = weight(abstract_txt:being in 2418) [ClassicSimilarity], result of:
            0.026471283 = score(doc=2418,freq=1.0), product of:
              0.07472045 = queryWeight, product of:
                1.4583359 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.011298906 = queryNorm
              0.35427094 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.029745257 = weight(abstract_txt:given in 2418) [ClassicSimilarity], result of:
            0.029745257 = score(doc=2418,freq=1.0), product of:
              0.080760926 = queryWeight, product of:
                1.5161371 = boost
                4.7144 = idf(docFreq=1030, maxDocs=42306)
                0.011298906 = queryNorm
              0.36831248 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7144 = idf(docFreq=1030, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.09771816 = weight(abstract_txt:summaries in 2418) [ClassicSimilarity], result of:
            0.09771816 = score(doc=2418,freq=1.0), product of:
              0.17847319 = queryWeight, product of:
                2.2538464 = boost
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.011298906 = queryNorm
              0.5475229 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.008293 = idf(docFreq=103, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.059733205 = weight(abstract_txt:automatic in 2418) [ClassicSimilarity], result of:
            0.059733205 = score(doc=2418,freq=1.0), product of:
              0.14715119 = queryWeight, product of:
                2.5064864 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.011298906 = queryNorm
              0.40593085 = fieldWeight in 2418, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.06676234 = weight(abstract_txt:text in 2418) [ClassicSimilarity], result of:
            0.06676234 = score(doc=2418,freq=2.0), product of:
              0.14913534 = queryWeight, product of:
                3.2576027 = boost
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.011298906 = queryNorm
              0.44766283 = fieldWeight in 2418, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0517817 = idf(docFreq=1999, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
          0.3102325 = weight(abstract_txt:summarization in 2418) [ClassicSimilarity], result of:
            0.3102325 = score(doc=2418,freq=4.0), product of:
              0.27800852 = queryWeight, product of:
                3.4451847 = boost
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.011298906 = queryNorm
              1.11591 = fieldWeight in 2418, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1418247 = idf(docFreq=90, maxDocs=42306)
                0.078125 = fieldNorm(doc=2418)
        0.28 = coord(7/25)
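
The score trees above all follow the same arithmetic, so a short sketch may help in reading them. It assumes the standard ClassicSimilarity definitions (tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1))). The function names `idf`, `term_score` and `document_score` are hypothetical helpers chosen for this illustration and are not part of Lucene; the numeric inputs are copied from the first tree in the listing (doc 3727).

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                          # e.g. 1.7320508 for freq=3.0
    term_idf = idf(doc_freq, max_docs)            # e.g. 7.008293 for docFreq=103
    query_weight = boost * term_idf * query_norm  # boost defaults to 1.0 if absent
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores: list[float], matched: int, total: int) -> float:
    """Sum of the term scores scaled by the coordination factor coord(matched/total)."""
    return sum(term_scores) * (matched / total)


# Term "summaries" in doc 3727; the listing reports 0.13540225 for this term.
summaries = term_score(freq=3.0, doc_freq=103, max_docs=42306,
                       boost=2.2538464, query_norm=0.011298906,
                       field_norm=0.0625)
print(summaries)

# The six term scores listed for doc 3727, combined with coord(6/25);
# the listing reports 0.30660748 for the document.
doc_3727 = document_score(
    [0.13540225, 0.035004523, 0.047786564, 0.05340988, 0.24818602, 0.75774205],
    matched=6, total=25)
print(doc_3727)
```

Results can differ from the listing in the last digits, presumably because the listing was produced with single-precision floats while Python uses double precision.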