Document (#38519)

Author
Moreno, J.M.T.
Title
Automatic text summarization
Imprint
Hoboken : Wiley
Year
2014
Pages
320 S
Isbn
978-1-84821-668-6
Abstract
This new textbook examines the motivations and the different algorithms for automatic document summarization (ADS). We performed a recent state of the art. The book shows the main problems of ADS, difficulties and the solutions provided by the community. It presents recent advances in ADS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several exemples are included in order to clarify the theoretical concepts. The books currently available in the area of Automatic Document Summarization are not recent. Powerful algorithms have been developed in recent years that include several applications of ADS. The development of recent technology has impacted on the development of algorithms and their applications. The massive use of social networks and the new forms of the technology requires the adaptation of the classical methods of text summarizers. This is a new textbook on Automatic Text Summarization, based on teaching materials used in two or one-semester courses. It presents a extensive state-of-art and describes the new systems on the subject. Previous automatic summarization books have been either collections of specialized papers, or else authored books with only a chapter or two devoted to the field as a whole. In other hand, the classic books on the subject are not recent.
Content
Automatic Text Summarization Some Important Concepts 23 Single document Summarization 53 Guided Multi-Document Summarization 109 Emerging systems 151 Source and DomainSpecific Summarization 179 Text Abstracting 219 Evaluating Document Summaries 243 Conclusion 275 Information Retrieval NLP and Automatic Text Summarization 281 Automatic Text Summarization Resources 305
Theme
Automatisches Indexieren
DDC
025.4
LCC
P98.5 .A87

Similar documents (author)

  1. Moreno, R.B. -> Bailón-Moreno, R.: 5.48
    5.4841185 = sum of:
      5.4841185 = weight(author_txt:moreno in 7609) [ClassicSimilarity], result of:
        5.4841185 = fieldWeight in 7609, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.4375 = fieldNorm(doc=7609)
    
  2. Moreno, R.R. -> Bailón-Moreno, R.: 5.48
    5.4841185 = sum of:
      5.4841185 = weight(author_txt:moreno in 55) [ClassicSimilarity], result of:
        5.4841185 = fieldWeight in 55, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.4375 = fieldNorm(doc=55)
    
  3. Schneider, J. Moreno => Moreno Schneider, J.: 4.70
    4.700673 = sum of:
      4.700673 = weight(author_txt:moreno in 195) [ClassicSimilarity], result of:
        4.700673 = fieldWeight in 195, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.375 = fieldNorm(doc=195)
    
  4. Fernandez, F.S.; Moreno, A.G.: History of information science in Spain : a selected bibliography (1997) 4.43
    4.431837 = sum of:
      4.431837 = weight(author_txt:moreno in 52) [ClassicSimilarity], result of:
        4.431837 = fieldWeight in 52, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.5 = fieldNorm(doc=52)
    
  5. Moreno, N.; Vallecillo, A.: Towards interoperable Web engineering methods (2008) 4.43
    4.431837 = sum of:
      4.431837 = weight(author_txt:moreno in 1860) [ClassicSimilarity], result of:
        4.431837 = fieldWeight in 1860, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.5 = fieldNorm(doc=1860)
    

Similar documents (content)

  1. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.27
    0.27342194 = sum of:
      0.27342194 = product of:
        0.9765069 = sum of:
          0.01912802 = weight(abstract_txt:been in 2693) [ClassicSimilarity], result of:
            0.01912802 = score(doc=2693,freq=2.0), product of:
              0.05982146 = queryWeight, product of:
                1.0187827 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016231453 = queryNorm
              0.31975183 = fieldWeight in 2693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.022597726 = weight(abstract_txt:document in 2693) [ClassicSimilarity], result of:
            0.022597726 = score(doc=2693,freq=1.0), product of:
              0.08422936 = queryWeight, product of:
                1.2088845 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016231453 = queryNorm
              0.26828802 = fieldWeight in 2693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.038002074 = weight(abstract_txt:several in 2693) [ClassicSimilarity], result of:
            0.038002074 = score(doc=2693,freq=2.0), product of:
              0.09453991 = queryWeight, product of:
                1.2807391 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.016231453 = queryNorm
              0.4019686 = fieldWeight in 2693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.049084384 = weight(abstract_txt:text in 2693) [ClassicSimilarity], result of:
            0.049084384 = score(doc=2693,freq=3.0), product of:
              0.11212588 = queryWeight, product of:
                1.7082508 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016231453 = queryNorm
              0.4377614 = fieldWeight in 2693, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.11270473 = weight(abstract_txt:algorithms in 2693) [ClassicSimilarity], result of:
            0.11270473 = score(doc=2693,freq=2.0), product of:
              0.22339264 = queryWeight, product of:
                2.4111993 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.016231453 = queryNorm
              0.5045141 = fieldWeight in 2693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.100172274 = weight(abstract_txt:automatic in 2693) [ClassicSimilarity], result of:
            0.100172274 = score(doc=2693,freq=1.0), product of:
              0.30848354 = queryWeight, product of:
                3.6579611 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.016231453 = queryNorm
              0.32472485 = fieldWeight in 2693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.63481766 = weight(abstract_txt:summarization in 2693) [ClassicSimilarity], result of:
            0.63481766 = score(doc=2693,freq=6.0), product of:
              0.58136547 = queryWeight, product of:
                5.021664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016231453 = queryNorm
              1.0919425 = fieldWeight in 2693, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
        0.28 = coord(7/25)
    
  2. Smeaton, A.F.: Progress in the application of natural language processing to information retrieval tasks (1992) 0.26
    0.25887793 = sum of:
      0.25887793 = product of:
        1.6179872 = sum of:
          0.120231695 = weight(abstract_txt:text in 7080) [ClassicSimilarity], result of:
            0.120231695 = score(doc=7080,freq=2.0), product of:
              0.11212588 = queryWeight, product of:
                1.7082508 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016231453 = queryNorm
              1.0722921 = fieldWeight in 7080, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.1875 = fieldNorm(doc=7080)
          0.42499495 = weight(abstract_txt:automatic in 7080) [ClassicSimilarity], result of:
            0.42499495 = score(doc=7080,freq=2.0), product of:
              0.30848354 = queryWeight, product of:
                3.6579611 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.016231453 = queryNorm
              1.3776908 = fieldWeight in 7080, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.1875 = fieldNorm(doc=7080)
          0.2952709 = weight(abstract_txt:recent in 7080) [ClassicSimilarity], result of:
            0.2952709 = score(doc=7080,freq=1.0), product of:
              0.32398653 = queryWeight, product of:
                4.1065507 = boost
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.016231453 = queryNorm
              0.9113678 = fieldWeight in 7080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.1875 = fieldNorm(doc=7080)
          0.7774897 = weight(abstract_txt:summarization in 7080) [ClassicSimilarity], result of:
            0.7774897 = score(doc=7080,freq=1.0), product of:
              0.58136547 = queryWeight, product of:
                5.021664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016231453 = queryNorm
              1.3373511 = fieldWeight in 7080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.1875 = fieldNorm(doc=7080)
        0.16 = coord(4/25)
    
  3. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 0.23
    0.23218988 = sum of:
      0.23218988 = product of:
        1.1609493 = sum of:
          0.033589408 = weight(abstract_txt:several in 953) [ClassicSimilarity], result of:
            0.033589408 = score(doc=953,freq=1.0), product of:
              0.09453991 = queryWeight, product of:
                1.2807391 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.016231453 = queryNorm
              0.35529342 = fieldWeight in 953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.078125 = fieldNorm(doc=953)
          0.040238336 = weight(abstract_txt:state in 953) [ClassicSimilarity], result of:
            0.040238336 = score(doc=953,freq=1.0), product of:
              0.10663676 = queryWeight, product of:
                1.3602118 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.016231453 = queryNorm
              0.37734017 = fieldWeight in 953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.078125 = fieldNorm(doc=953)
          0.070847206 = weight(abstract_txt:text in 953) [ClassicSimilarity], result of:
            0.070847206 = score(doc=953,freq=4.0), product of:
              0.11212588 = queryWeight, product of:
                1.7082508 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016231453 = queryNorm
              0.6318542 = fieldWeight in 953, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=953)
          0.2227523 = weight(abstract_txt:algorithms in 953) [ClassicSimilarity], result of:
            0.2227523 = score(doc=953,freq=5.0), product of:
              0.22339264 = queryWeight, product of:
                2.4111993 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.016231453 = queryNorm
              0.9971336 = fieldWeight in 953, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=953)
          0.7935221 = weight(abstract_txt:summarization in 953) [ClassicSimilarity], result of:
            0.7935221 = score(doc=953,freq=6.0), product of:
              0.58136547 = queryWeight, product of:
                5.021664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016231453 = queryNorm
              1.3649282 = fieldWeight in 953, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=953)
        0.2 = coord(5/25)
    
  4. Oh, H.; Nam, S.; Zhu, Y.: Structured abstract summarization of scientific articles : summarization using full-text section information (2023) 0.20
    0.20027672 = sum of:
      0.20027672 = product of:
        0.83448637 = sum of:
          0.013525554 = weight(abstract_txt:been in 889) [ClassicSimilarity], result of:
            0.013525554 = score(doc=889,freq=1.0), product of:
              0.05982146 = queryWeight, product of:
                1.0187827 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016231453 = queryNorm
              0.22609869 = fieldWeight in 889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
          0.032190666 = weight(abstract_txt:state in 889) [ClassicSimilarity], result of:
            0.032190666 = score(doc=889,freq=1.0), product of:
              0.10663676 = queryWeight, product of:
                1.3602118 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.016231453 = queryNorm
              0.30187213 = fieldWeight in 889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
          0.049084384 = weight(abstract_txt:text in 889) [ClassicSimilarity], result of:
            0.049084384 = score(doc=889,freq=3.0), product of:
              0.11212588 = queryWeight, product of:
                1.7082508 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016231453 = queryNorm
              0.4377614 = fieldWeight in 889, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
          0.079694286 = weight(abstract_txt:algorithms in 889) [ClassicSimilarity], result of:
            0.079694286 = score(doc=889,freq=1.0), product of:
              0.22339264 = queryWeight, product of:
                2.4111993 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.016231453 = queryNorm
              0.35674536 = fieldWeight in 889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
          0.14166498 = weight(abstract_txt:automatic in 889) [ClassicSimilarity], result of:
            0.14166498 = score(doc=889,freq=2.0), product of:
              0.30848354 = queryWeight, product of:
                3.6579611 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.016231453 = queryNorm
              0.45923027 = fieldWeight in 889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
          0.51832646 = weight(abstract_txt:summarization in 889) [ClassicSimilarity], result of:
            0.51832646 = score(doc=889,freq=4.0), product of:
              0.58136547 = queryWeight, product of:
                5.021664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016231453 = queryNorm
              0.89156735 = fieldWeight in 889, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=889)
        0.24 = coord(6/25)
    
  5. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.19
    0.18692027 = sum of:
      0.18692027 = product of:
        0.9346013 = sum of:
          0.01912802 = weight(abstract_txt:been in 1719) [ClassicSimilarity], result of:
            0.01912802 = score(doc=1719,freq=2.0), product of:
              0.05982146 = queryWeight, product of:
                1.0187827 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.016231453 = queryNorm
              0.31975183 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.059787966 = weight(abstract_txt:document in 1719) [ClassicSimilarity], result of:
            0.059787966 = score(doc=1719,freq=7.0), product of:
              0.08422936 = queryWeight, product of:
                1.2088845 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016231453 = queryNorm
              0.70982337 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.028338881 = weight(abstract_txt:text in 1719) [ClassicSimilarity], result of:
            0.028338881 = score(doc=1719,freq=1.0), product of:
              0.11212588 = queryWeight, product of:
                1.7082508 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016231453 = queryNorm
              0.25274166 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.14166498 = weight(abstract_txt:automatic in 1719) [ClassicSimilarity], result of:
            0.14166498 = score(doc=1719,freq=2.0), product of:
              0.30848354 = queryWeight, product of:
                3.6579611 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.016231453 = queryNorm
              0.45923027 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.68568146 = weight(abstract_txt:summarization in 1719) [ClassicSimilarity], result of:
            0.68568146 = score(doc=1719,freq=7.0), product of:
              0.58136547 = queryWeight, product of:
                5.021664 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016231453 = queryNorm
              1.1794327 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
        0.2 = coord(5/25)