Document (#38520)

Author
Moreno, J.M.T.
Title
Automatic text summarization
Imprint
Hoboken : Wiley
Year
2014
Pages
320 S
Isbn
978-1-84821-668-6
Abstract
This new textbook examines the motivations and the different algorithms for automatic document summarization (ADS). We performed a recent state of the art. The book shows the main problems of ADS, difficulties and the solutions provided by the community. It presents recent advances in ADS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several exemples are included in order to clarify the theoretical concepts. The books currently available in the area of Automatic Document Summarization are not recent. Powerful algorithms have been developed in recent years that include several applications of ADS. The development of recent technology has impacted on the development of algorithms and their applications. The massive use of social networks and the new forms of the technology requires the adaptation of the classical methods of text summarizers. This is a new textbook on Automatic Text Summarization, based on teaching materials used in two or one-semester courses. It presents a extensive state-of-art and describes the new systems on the subject. Previous automatic summarization books have been either collections of specialized papers, or else authored books with only a chapter or two devoted to the field as a whole. In other hand, the classic books on the subject are not recent.
Content
Automatic Text Summarization Some Important Concepts 23 Single document Summarization 53 Guided Multi-Document Summarization 109 Emerging systems 151 Source and DomainSpecific Summarization 179 Text Abstracting 219 Evaluating Document Summaries 243 Conclusion 275 Information Retrieval NLP and Automatic Text Summarization 281 Automatic Text Summarization Resources 305
Theme
Automatisches Indexieren
DDC
025.4
LCC
P98.5 .A87

Similar documents (author)

  1. Moreno, R.B. -> Bailón-Moreno, R.: 5.59
    5.590608 = sum of:
      5.590608 = weight(author_txt:moreno in 609) [ClassicSimilarity], result of:
        5.590608 = fieldWeight in 609, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.035788 = idf(docFreq=13, maxDocs=43254)
          0.4375 = fieldNorm(doc=609)
    
  2. Moreno, R.R. -> Bailón-Moreno, R.: 5.59
    5.590608 = sum of:
      5.590608 = weight(author_txt:moreno in 1520) [ClassicSimilarity], result of:
        5.590608 = fieldWeight in 1520, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.035788 = idf(docFreq=13, maxDocs=43254)
          0.4375 = fieldNorm(doc=1520)
    
  3. Fernandez, F.S.; Moreno, A.G.: History of information science in Spain : a selected bibliography (1997) 4.52
    4.517894 = sum of:
      4.517894 = weight(author_txt:moreno in 2053) [ClassicSimilarity], result of:
        4.517894 = fieldWeight in 2053, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.035788 = idf(docFreq=13, maxDocs=43254)
          0.5 = fieldNorm(doc=2053)
    
  4. Moreno, N.; Vallecillo, A.: Towards interoperable Web engineering methods (2008) 4.52
    4.517894 = sum of:
      4.517894 = weight(author_txt:moreno in 3861) [ClassicSimilarity], result of:
        4.517894 = fieldWeight in 3861, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.035788 = idf(docFreq=13, maxDocs=43254)
          0.5 = fieldNorm(doc=3861)
    
  5. Moreno Fernández, L.M. -> Fernández, L.M.M.: 3.95
    3.953157 = sum of:
      3.953157 = weight(author_txt:moreno in 5951) [ClassicSimilarity], result of:
        3.953157 = fieldWeight in 5951, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.035788 = idf(docFreq=13, maxDocs=43254)
          0.4375 = fieldNorm(doc=5951)
    

Similar documents (content)

  1. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.27
    0.27396265 = sum of:
      0.27396265 = product of:
        0.978438 = sum of:
          0.019290019 = weight(abstract_txt:been in 4158) [ClassicSimilarity], result of:
            0.019290019 = score(doc=4158,freq=2.0), product of:
              0.060086157 = queryWeight, product of:
                1.0186131 = boost
                3.6321454 = idf(docFreq=3110, maxDocs=43254)
                0.016240595 = queryNorm
              0.32103932 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6321454 = idf(docFreq=3110, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.022381572 = weight(abstract_txt:document in 4158) [ClassicSimilarity], result of:
            0.022381572 = score(doc=4158,freq=1.0), product of:
              0.083590396 = queryWeight, product of:
                1.2014349 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016240595 = queryNorm
              0.26775292 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.03801067 = weight(abstract_txt:several in 4158) [ClassicSimilarity], result of:
            0.03801067 = score(doc=4158,freq=2.0), product of:
              0.09444008 = queryWeight, product of:
                1.2770275 = boost
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.016240595 = queryNorm
              0.40248454 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.04912026 = weight(abstract_txt:text in 4158) [ClassicSimilarity], result of:
            0.04912026 = score(doc=4158,freq=3.0), product of:
              0.112045154 = queryWeight, product of:
                1.7035866 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016240595 = queryNorm
              0.438397 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.11467432 = weight(abstract_txt:algorithms in 4158) [ClassicSimilarity], result of:
            0.11467432 = score(doc=4158,freq=2.0), product of:
              0.22571506 = queryWeight, product of:
                2.4179535 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.016240595 = queryNorm
              0.5080491 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.09985263 = weight(abstract_txt:automatic in 4158) [ClassicSimilarity], result of:
            0.09985263 = score(doc=4158,freq=1.0), product of:
              0.30745554 = queryWeight, product of:
                3.6432016 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.016240595 = queryNorm
              0.32477096 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
          0.6351086 = weight(abstract_txt:summarization in 4158) [ClassicSimilarity], result of:
            0.6351086 = score(doc=4158,freq=6.0), product of:
              0.58084136 = queryWeight, product of:
                5.0074983 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016240595 = queryNorm
              1.0934286 = fieldWeight in 4158, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=4158)
        0.28 = coord(7/25)
    
  2. Smeaton, A.F.: Progress in the application of natural language processing to information retrieval tasks (1992) 0.26
    0.2592985 = sum of:
      0.2592985 = product of:
        1.6206156 = sum of:
          0.120319575 = weight(abstract_txt:text in 80) [ClassicSimilarity], result of:
            0.120319575 = score(doc=80,freq=2.0), product of:
              0.112045154 = queryWeight, product of:
                1.7035866 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016240595 = queryNorm
              1.073849 = fieldWeight in 80, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.1875 = fieldNorm(doc=80)
          0.42363882 = weight(abstract_txt:automatic in 80) [ClassicSimilarity], result of:
            0.42363882 = score(doc=80,freq=2.0), product of:
              0.30745554 = queryWeight, product of:
                3.6432016 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.016240595 = queryNorm
              1.3778864 = fieldWeight in 80, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.1875 = fieldNorm(doc=80)
          0.29881135 = weight(abstract_txt:recent in 80) [ClassicSimilarity], result of:
            0.29881135 = score(doc=80,freq=1.0), product of:
              0.3261772 = queryWeight, product of:
                4.1106405 = boost
                4.8858733 = idf(docFreq=887, maxDocs=43254)
                0.016240595 = queryNorm
              0.9161012 = fieldWeight in 80, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8858733 = idf(docFreq=887, maxDocs=43254)
                0.1875 = fieldNorm(doc=80)
          0.77784586 = weight(abstract_txt:summarization in 80) [ClassicSimilarity], result of:
            0.77784586 = score(doc=80,freq=1.0), product of:
              0.58084136 = queryWeight, product of:
                5.0074983 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016240595 = queryNorm
              1.3391709 = fieldWeight in 80, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.1875 = fieldNorm(doc=80)
        0.16 = coord(4/25)
    
  3. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 0.23
    0.23308854 = sum of:
      0.23308854 = product of:
        1.1654427 = sum of:
          0.033597004 = weight(abstract_txt:several in 2954) [ClassicSimilarity], result of:
            0.033597004 = score(doc=2954,freq=1.0), product of:
              0.09444008 = queryWeight, product of:
                1.2770275 = boost
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.016240595 = queryNorm
              0.35574943 = fieldWeight in 2954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.078125 = fieldNorm(doc=2954)
          0.04041589 = weight(abstract_txt:state in 2954) [ClassicSimilarity], result of:
            0.04041589 = score(doc=2954,freq=1.0), product of:
              0.10682119 = queryWeight, product of:
                1.3581594 = boost
                4.842891 = idf(docFreq=926, maxDocs=43254)
                0.016240595 = queryNorm
              0.37835088 = fieldWeight in 2954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.842891 = idf(docFreq=926, maxDocs=43254)
                0.078125 = fieldNorm(doc=2954)
          0.07089899 = weight(abstract_txt:text in 2954) [ClassicSimilarity], result of:
            0.07089899 = score(doc=2954,freq=4.0), product of:
              0.112045154 = queryWeight, product of:
                1.7035866 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016240595 = queryNorm
              0.63277155 = fieldWeight in 2954, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=2954)
          0.22664505 = weight(abstract_txt:algorithms in 2954) [ClassicSimilarity], result of:
            0.22664505 = score(doc=2954,freq=5.0), product of:
              0.22571506 = queryWeight, product of:
                2.4179535 = boost
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.016240595 = queryNorm
              1.0041202 = fieldWeight in 2954, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.747919 = idf(docFreq=374, maxDocs=43254)
                0.078125 = fieldNorm(doc=2954)
          0.7938857 = weight(abstract_txt:summarization in 2954) [ClassicSimilarity], result of:
            0.7938857 = score(doc=2954,freq=6.0), product of:
              0.58084136 = queryWeight, product of:
                5.0074983 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016240595 = queryNorm
              1.3667858 = fieldWeight in 2954, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.078125 = fieldNorm(doc=2954)
        0.2 = coord(5/25)
    
  4. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.19
    0.18681483 = sum of:
      0.18681483 = product of:
        0.93407416 = sum of:
          0.019290019 = weight(abstract_txt:been in 3720) [ClassicSimilarity], result of:
            0.019290019 = score(doc=3720,freq=2.0), product of:
              0.060086157 = queryWeight, product of:
                1.0186131 = boost
                3.6321454 = idf(docFreq=3110, maxDocs=43254)
                0.016240595 = queryNorm
              0.32103932 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6321454 = idf(docFreq=3110, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.05921607 = weight(abstract_txt:document in 3720) [ClassicSimilarity], result of:
            0.05921607 = score(doc=3720,freq=7.0), product of:
              0.083590396 = queryWeight, product of:
                1.2014349 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016240595 = queryNorm
              0.7084076 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.028359594 = weight(abstract_txt:text in 3720) [ClassicSimilarity], result of:
            0.028359594 = score(doc=3720,freq=1.0), product of:
              0.112045154 = queryWeight, product of:
                1.7035866 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016240595 = queryNorm
              0.25310862 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.14121294 = weight(abstract_txt:automatic in 3720) [ClassicSimilarity], result of:
            0.14121294 = score(doc=3720,freq=2.0), product of:
              0.30745554 = queryWeight, product of:
                3.6432016 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.016240595 = queryNorm
              0.45929548 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.6859956 = weight(abstract_txt:summarization in 3720) [ClassicSimilarity], result of:
            0.6859956 = score(doc=3720,freq=7.0), product of:
              0.58084136 = queryWeight, product of:
                5.0074983 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016240595 = queryNorm
              1.1810378 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
        0.2 = coord(5/25)
    
  5. Moens, M.-F.: Summarizing court decisions (2007) 0.19
    0.18669876 = sum of:
      0.18669876 = product of:
        0.9334938 = sum of:
          0.03357236 = weight(abstract_txt:document in 2955) [ClassicSimilarity], result of:
            0.03357236 = score(doc=2955,freq=1.0), product of:
              0.083590396 = queryWeight, product of:
                1.2014349 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.016240595 = queryNorm
              0.4016294 = fieldWeight in 2955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.09375 = fieldNorm(doc=2955)
          0.03396879 = weight(abstract_txt:presents in 2955) [ClassicSimilarity], result of:
            0.03396879 = score(doc=2955,freq=1.0), product of:
              0.08424715 = queryWeight, product of:
                1.2061454 = boost
                4.3008432 = idf(docFreq=1593, maxDocs=43254)
                0.016240595 = queryNorm
              0.40320405 = fieldWeight in 2955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3008432 = idf(docFreq=1593, maxDocs=43254)
                0.09375 = fieldNorm(doc=2955)
          0.04253939 = weight(abstract_txt:text in 2955) [ClassicSimilarity], result of:
            0.04253939 = score(doc=2955,freq=1.0), product of:
              0.112045154 = queryWeight, product of:
                1.7035866 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.016240595 = queryNorm
              0.37966293 = fieldWeight in 2955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.09375 = fieldNorm(doc=2955)
          0.14977895 = weight(abstract_txt:automatic in 2955) [ClassicSimilarity], result of:
            0.14977895 = score(doc=2955,freq=1.0), product of:
              0.30745554 = queryWeight, product of:
                3.6432016 = boost
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.016240595 = queryNorm
              0.48715645 = fieldWeight in 2955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1963353 = idf(docFreq=650, maxDocs=43254)
                0.09375 = fieldNorm(doc=2955)
          0.6736343 = weight(abstract_txt:summarization in 2955) [ClassicSimilarity], result of:
            0.6736343 = score(doc=2955,freq=3.0), product of:
              0.58084136 = queryWeight, product of:
                5.0074983 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.016240595 = queryNorm
              1.1597561 = fieldWeight in 2955, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.09375 = fieldNorm(doc=2955)
        0.2 = coord(5/25)