Document (#25133)

Author
Saggion, H.
Lapalme, G.
Title
Selective analysis for the automatic generation of summaries
Source
Dynamism and stability in knowledge organization: Proceedings of the 6th International ISKO-Conference, 10-13 July 2000, Toronto, Canada. Ed.: C. Beghtol et al.
Imprint
Würzburg : Ergon
Year
2000
Pages
pp. 176-181
Series
Advances in knowledge organization; vol.7
Abstract
Selective Analysis is a new method for text summarization of technical articles whose design is based on the study of a corpus of professional abstracts and technical documents. The method emphasizes the selection of particular types of information and its elaboration, exploring the issue of dynamical summarization. A computer prototype was developed to demonstrate the viability of the approach, and the automatic abstracts were evaluated using human informants. The results so far obtained indicate that the summaries are acceptable in content and text quality.
Theme
Automatisches Abstracting

Similar documents (content)

  1. Sjöbergh, J.: Older versions of the ROUGEeval summarization evaluation system were easier to fool (2007) 0.27
    0.2663842 = sum of:
      0.2663842 = product of:
        1.1099342 = sum of:
          0.057340328 = weight(abstract_txt:selection in 940) [ClassicSimilarity], result of:
            0.057340328 = score(doc=940,freq=1.0), product of:
              0.09748276 = queryWeight, product of:
                1.0036627 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.018060334 = queryNorm
              0.5882099 = fieldWeight in 940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
          0.0487571 = weight(abstract_txt:text in 940) [ClassicSimilarity], result of:
            0.0487571 = score(doc=940,freq=1.0), product of:
              0.11023588 = queryWeight, product of:
                1.5093862 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018060334 = queryNorm
              0.4422979 = fieldWeight in 940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
          0.11644491 = weight(abstract_txt:method in 940) [ClassicSimilarity], result of:
            0.11644491 = score(doc=940,freq=3.0), product of:
              0.13656445 = queryWeight, product of:
                1.6799939 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018060334 = queryNorm
              0.85267365 = fieldWeight in 940, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
          0.14624092 = weight(abstract_txt:automatic in 940) [ClassicSimilarity], result of:
            0.14624092 = score(doc=940,freq=2.0), product of:
              0.18197022 = queryWeight, product of:
                1.9392735 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018060334 = queryNorm
              0.803653 = fieldWeight in 940, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
          0.36279997 = weight(abstract_txt:summaries in 940) [ClassicSimilarity], result of:
            0.36279997 = score(doc=940,freq=2.0), product of:
              0.33347702 = queryWeight, product of:
                2.625257 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.018060334 = queryNorm
              1.0879309 = fieldWeight in 940, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
          0.37835094 = weight(abstract_txt:summarization in 940) [ClassicSimilarity], result of:
            0.37835094 = score(doc=940,freq=2.0), product of:
              0.34293956 = queryWeight, product of:
                2.662243 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.018060334 = queryNorm
              1.1032584 = fieldWeight in 940, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.109375 = fieldNorm(doc=940)
        0.24 = coord(6/25)
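
The per-term entries in the tree above follow the pattern score = queryWeight × fieldWeight, with queryWeight = boost × idf × queryNorm and fieldWeight = tf × idf × fieldNorm, where tf is the square root of the term frequency. The following is a minimal Python sketch of that arithmetic, not the search engine's own code. The function name `term_score` and its parameter names are hypothetical; the numbers are copied from the "selection" entry for doc 940.

```python
import math

def term_score(freq, idf, boost, query_norm, field_norm):
    """Recompute one term's contribution as laid out in the explain tree.

    Hypothetical helper for illustration only; it mirrors the arithmetic
    shown in the listing, not any particular library API.
    """
    tf = math.sqrt(freq)                      # tf(freq) = sqrt(termFreq)
    query_weight = boost * idf * query_norm   # "queryWeight, product of"
    field_weight = tf * idf * field_norm      # "fieldWeight, product of"
    return query_weight * field_weight

# Values from the "selection" term in doc 940.
score = term_score(
    freq=1.0,
    idf=5.377919,
    boost=1.0036627,
    query_norm=0.018060334,
    field_norm=0.109375,
)
print(score)  # should agree with the listed 0.057340328 up to rounding
```

Small differences in the last digits are to be expected, since the listed values appear to be single-precision floats while Python uses double precision.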
    
  2. Maybury, M.T.: Generating summaries from event data (1995) 0.23
    0.23419592 = sum of:
      0.23419592 = product of:
        0.836414 = sum of:
          0.040957376 = weight(abstract_txt:selection in 2349) [ClassicSimilarity], result of:
            0.040957376 = score(doc=2349,freq=1.0), product of:
              0.09748276 = queryWeight, product of:
                1.0036627 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.018060334 = queryNorm
              0.42014992 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.0812239 = weight(abstract_txt:generation in 2349) [ClassicSimilarity], result of:
            0.0812239 = score(doc=2349,freq=3.0), product of:
              0.10668955 = queryWeight, product of:
                1.0499892 = boost
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.018060334 = queryNorm
              0.7613107 = fieldWeight in 2349, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.025684005 = weight(abstract_txt:analysis in 2349) [ClassicSimilarity], result of:
            0.025684005 = score(doc=2349,freq=1.0), product of:
              0.08998254 = queryWeight, product of:
                1.3636974 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.018060334 = queryNorm
              0.2854332 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.049252108 = weight(abstract_txt:text in 2349) [ClassicSimilarity], result of:
            0.049252108 = score(doc=2349,freq=2.0), product of:
              0.11023588 = queryWeight, product of:
                1.5093862 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018060334 = queryNorm
              0.44678837 = fieldWeight in 2349, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.07386282 = weight(abstract_txt:automatic in 2349) [ClassicSimilarity], result of:
            0.07386282 = score(doc=2349,freq=1.0), product of:
              0.18197022 = queryWeight, product of:
                1.9392735 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018060334 = queryNorm
              0.40590608 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.18324167 = weight(abstract_txt:summaries in 2349) [ClassicSimilarity], result of:
            0.18324167 = score(doc=2349,freq=1.0), product of:
              0.33347702 = queryWeight, product of:
                2.625257 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.018060334 = queryNorm
              0.5494881 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.38219213 = weight(abstract_txt:summarization in 2349) [ClassicSimilarity], result of:
            0.38219213 = score(doc=2349,freq=4.0), product of:
              0.34293956 = queryWeight, product of:
                2.662243 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.018060334 = queryNorm
              1.1144592 = fieldWeight in 2349, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
        0.28 = coord(7/25)
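
The top of each tree combines the per-term scores: they are summed, and the sum is multiplied by a coordination factor coord(matched/total), here 7 of 25 query terms. The following is a short Python sketch of that final step, using the seven term scores listed for doc 2349. The variable names are illustrative assumptions.

```python
# Per-term scores copied from the explain tree for doc 2349.
term_scores = [
    0.040957376,  # selection
    0.0812239,    # generation
    0.025684005,  # analysis
    0.049252108,  # text
    0.07386282,   # automatic
    0.18324167,   # summaries
    0.38219213,   # summarization
]

matched_terms = len(term_scores)  # 7 query terms matched this document
total_terms = 25                  # query terms overall, per coord(7/25)

coord = matched_terms / total_terms
final_score = sum(term_scores) * coord
print(final_score)  # should agree with the listed 0.23419592 up to rounding
```

The coordination factor explains why document 3 below, whose raw sum is smaller, still ranks close to this one: it matches 10 of the 25 terms, so its sum is scaled by 0.4 rather than 0.28.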
    
  3. Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I.: Text mining techniques for patent analysis (2007) 0.22
    0.2211237 = sum of:
      0.2211237 = product of:
        0.55280924 = sum of:
          0.032765903 = weight(abstract_txt:selection in 935) [ClassicSimilarity], result of:
            0.032765903 = score(doc=935,freq=1.0), product of:
              0.09748276 = queryWeight, product of:
                1.0036627 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.018060334 = queryNorm
              0.33611995 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.03289829 = weight(abstract_txt:demonstrate in 935) [ClassicSimilarity], result of:
            0.03289829 = score(doc=935,freq=1.0), product of:
              0.097745165 = queryWeight, product of:
                1.0050126 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.018060334 = queryNorm
              0.33657202 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.03751571 = weight(abstract_txt:generation in 935) [ClassicSimilarity], result of:
            0.03751571 = score(doc=935,freq=1.0), product of:
              0.10668955 = queryWeight, product of:
                1.0499892 = boost
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.018060334 = queryNorm
              0.35163435 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6261497 = idf(docFreq=432, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.04777944 = weight(abstract_txt:corpus in 935) [ClassicSimilarity], result of:
            0.04777944 = score(doc=935,freq=1.0), product of:
              0.12535466 = queryWeight, product of:
                1.1381359 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.018060334 = queryNorm
              0.3811541 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.041094407 = weight(abstract_txt:analysis in 935) [ClassicSimilarity], result of:
            0.041094407 = score(doc=935,freq=4.0), product of:
              0.08998254 = queryWeight, product of:
                1.3636974 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.018060334 = queryNorm
              0.45669314 = fieldWeight in 935, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.039401688 = weight(abstract_txt:text in 935) [ClassicSimilarity], result of:
            0.039401688 = score(doc=935,freq=2.0), product of:
              0.11023588 = queryWeight, product of:
                1.5093862 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018060334 = queryNorm
              0.3574307 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.03841686 = weight(abstract_txt:method in 935) [ClassicSimilarity], result of:
            0.03841686 = score(doc=935,freq=1.0), product of:
              0.13656445 = queryWeight, product of:
                1.6799939 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018060334 = queryNorm
              0.28130937 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.052777346 = weight(abstract_txt:technical in 935) [ClassicSimilarity], result of:
            0.052777346 = score(doc=935,freq=1.0), product of:
              0.16876723 = queryWeight, product of:
                1.867596 = boost
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.018060334 = queryNorm
              0.3127227 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.08356623 = weight(abstract_txt:automatic in 935) [ClassicSimilarity], result of:
            0.08356623 = score(doc=935,freq=2.0), product of:
              0.18197022 = queryWeight, product of:
                1.9392735 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018060334 = queryNorm
              0.45923027 = fieldWeight in 935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
          0.14659333 = weight(abstract_txt:summaries in 935) [ClassicSimilarity], result of:
            0.14659333 = score(doc=935,freq=1.0), product of:
              0.33347702 = queryWeight, product of:
                2.625257 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.018060334 = queryNorm
              0.4395905 = fieldWeight in 935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=935)
        0.4 = coord(10/25)
    
  4. Hirao, T.; Okumura, M.; Yasuda, N.; Isozaki, H.: Supervised automatic evaluation for summarization with voted regression model (2007) 0.22
    0.22040722 = sum of:
      0.22040722 = product of:
        0.91836345 = sum of:
          0.040957376 = weight(abstract_txt:selection in 942) [ClassicSimilarity], result of:
            0.040957376 = score(doc=942,freq=1.0), product of:
              0.09748276 = queryWeight, product of:
                1.0036627 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.018060334 = queryNorm
              0.42014992 = fieldWeight in 942, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
          0.07104422 = weight(abstract_txt:obtained in 942) [ClassicSimilarity], result of:
            0.07104422 = score(doc=942,freq=2.0), product of:
              0.1116989 = queryWeight, product of:
                1.0743563 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.018060334 = queryNorm
              0.6360333 = fieldWeight in 942, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
          0.09604215 = weight(abstract_txt:method in 942) [ClassicSimilarity], result of:
            0.09604215 = score(doc=942,freq=4.0), product of:
              0.13656445 = queryWeight, product of:
                1.6799939 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018060334 = queryNorm
              0.7032734 = fieldWeight in 942, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
          0.18092622 = weight(abstract_txt:automatic in 942) [ClassicSimilarity], result of:
            0.18092622 = score(doc=942,freq=6.0), product of:
              0.18197022 = queryWeight, product of:
                1.9392735 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018060334 = queryNorm
              0.99426275 = fieldWeight in 942, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
          0.25914285 = weight(abstract_txt:summaries in 942) [ClassicSimilarity], result of:
            0.25914285 = score(doc=942,freq=2.0), product of:
              0.33347702 = queryWeight, product of:
                2.625257 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.018060334 = queryNorm
              0.7770935 = fieldWeight in 942, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
          0.27025065 = weight(abstract_txt:summarization in 942) [ClassicSimilarity], result of:
            0.27025065 = score(doc=942,freq=2.0), product of:
              0.34293956 = queryWeight, product of:
                2.662243 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.018060334 = queryNorm
              0.78804165 = fieldWeight in 942, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=942)
        0.24 = coord(6/25)
    
  5. Ou, S.; Khoo, S.G.; Goh, D.H.: Automatic multidocument summarization of research abstracts : design and user evaluation (2007) 0.21
    0.20813559 = sum of:
      0.20813559 = product of:
        0.86723167 = sum of:
          0.032408483 = weight(abstract_txt:indicate in 522) [ClassicSimilarity], result of:
            0.032408483 = score(doc=522,freq=1.0), product of:
              0.09677256 = queryWeight, product of:
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.018060334 = queryNorm
              0.33489332 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
          0.06653995 = weight(abstract_txt:method in 522) [ClassicSimilarity], result of:
            0.06653995 = score(doc=522,freq=3.0), product of:
              0.13656445 = queryWeight, product of:
                1.6799939 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018060334 = queryNorm
              0.4872421 = fieldWeight in 522, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
          0.059090253 = weight(abstract_txt:automatic in 522) [ClassicSimilarity], result of:
            0.059090253 = score(doc=522,freq=1.0), product of:
              0.18197022 = queryWeight, product of:
                1.9392735 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018060334 = queryNorm
              0.32472485 = fieldWeight in 522, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
          0.19980577 = weight(abstract_txt:abstracts in 522) [ClassicSimilarity], result of:
            0.19980577 = score(doc=522,freq=5.0), product of:
              0.23973885 = queryWeight, product of:
                2.2259126 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.018060334 = queryNorm
              0.8334309 = fieldWeight in 522, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
          0.29318666 = weight(abstract_txt:summaries in 522) [ClassicSimilarity], result of:
            0.29318666 = score(doc=522,freq=4.0), product of:
              0.33347702 = queryWeight, product of:
                2.625257 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.018060334 = queryNorm
              0.879181 = fieldWeight in 522, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
          0.21620052 = weight(abstract_txt:summarization in 522) [ClassicSimilarity], result of:
            0.21620052 = score(doc=522,freq=2.0), product of:
              0.34293956 = queryWeight, product of:
                2.662243 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.018060334 = queryNorm
              0.6304333 = fieldWeight in 522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=522)
        0.24 = coord(6/25)
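
The idf values in the trees above can be reproduced from the docFreq and maxDocs figures printed next to them. The listed numbers are consistent with the formula idf = 1 + ln(maxDocs / (docFreq + 1)). That the index computes idf in exactly this way is an assumption inferred from the numbers, and the function name `classic_idf` is hypothetical.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as 1 + ln(maxDocs / (docFreq + 1)).

    Assumed formula, chosen because it reproduces the listed values.
    """
    return 1.0 + math.log(max_docs / (doc_freq + 1))

MAX_DOCS = 44218  # maxDocs as printed in every idf line

# (term, docFreq, idf as listed in the explain output)
listed = [
    ("summarization", 95, 7.132539),
    ("summaries", 105, 7.033448),
    ("selection", 554, 5.377919),
    ("analysis", 3112, 3.6535451),
]

for term, doc_freq, listed_idf in listed:
    computed = classic_idf(doc_freq, MAX_DOCS)
    print(term, computed, listed_idf)
```

Rare terms such as "summarization" (95 documents) receive a much higher idf than common ones such as "analysis" (3112 documents), which is why the summarization-related terms dominate every score in the list.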