Document (#35750)

Author
Marcu, D.
Title
Automatic abstracting and summarization
Source
Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
Imprint
London : Taylor & Francis
Year
2009
Pages
pp. xx-xx
Abstract
After lying dormant for a few decades, the field of automated text summarization has experienced a tremendous resurgence of interest. Recently, many new algorithms and techniques have been proposed for identifying important information in single documents and document collections, and for mapping this information into grammatical, cohesive, and coherent abstracts. Since 1997, annual workshops, conferences, and large-scale comparative evaluations have provided a rich environment for exchanging ideas between researchers in Asia, Europe, and North America. This entry reviews the main developments in the field and provides a guiding map to those interested in understanding the strengths and weaknesses of an increasingly ubiquitous technology.
Footnote
Cf.: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
Theme
Automatic abstracting

Similar documents (content)

  1. Pinfield, S.; Salter, J.; Bath, P.A.; Hubbard, B.; Millington, P.; Anders, J.H.S.; Hussain, A.: Open-access repositories worldwide, 2005-2012 : past growth, current characteristics, and future possibilities (2014) 0.11
    0.1092181 = sum of:
      0.1092181 = product of:
        0.4550754 = sum of:
          0.08456325 = weight(abstract_txt:europe in 3543) [ClassicSimilarity], result of:
            0.08456325 = score(doc=3543,freq=2.0), product of:
              0.14738719 = queryWeight, product of:
                1.0052801 = boost
                6.491228 = idf(docFreq=172, maxDocs=41962)
                0.022586334 = queryNorm
              0.57374895 = fieldWeight in 3543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.491228 = idf(docFreq=172, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
          0.02599592 = weight(abstract_txt:have in 3543) [ClassicSimilarity], result of:
            0.02599592 = score(doc=3543,freq=3.0), product of:
              0.073890455 = queryWeight, product of:
                1.006622 = boost
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.022586334 = queryNorm
              0.351817 = fieldWeight in 3543, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
          0.062517405 = weight(abstract_txt:north in 3543) [ClassicSimilarity], result of:
            0.062517405 = score(doc=3543,freq=1.0), product of:
              0.15182708 = queryWeight, product of:
                1.0203093 = boost
                6.588274 = idf(docFreq=156, maxDocs=41962)
                0.022586334 = queryNorm
              0.41176713 = fieldWeight in 3543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.588274 = idf(docFreq=156, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
          0.065010354 = weight(abstract_txt:experienced in 3543) [ClassicSimilarity], result of:
            0.065010354 = score(doc=3543,freq=1.0), product of:
              0.15583691 = queryWeight, product of:
                1.033695 = boost
                6.6747065 = idf(docFreq=143, maxDocs=41962)
                0.022586334 = queryNorm
              0.41716915 = fieldWeight in 3543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6747065 = idf(docFreq=143, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
          0.10476477 = weight(abstract_txt:america in 3543) [ClassicSimilarity], result of:
            0.10476477 = score(doc=3543,freq=2.0), product of:
              0.17001301 = queryWeight, product of:
                1.0796881 = boost
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.022586334 = queryNorm
              0.6162162 = fieldWeight in 3543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
          0.11222371 = weight(abstract_txt:asia in 3543) [ClassicSimilarity], result of:
            0.11222371 = score(doc=3543,freq=1.0), product of:
              0.22425306 = queryWeight, product of:
                1.2400135 = boost
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.022586334 = queryNorm
              0.5004333 = fieldWeight in 3543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.0625 = fieldNorm(doc=3543)
        0.24 = coord(6/25)
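
The arithmetic in the explanation above can be re-derived by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are consistent with the figures printed for document 3543. The function names `idf` and `term_score` are illustrative and not part of any library; the numeric inputs are copied from the listing. Lucene computes in 32-bit floats, so the results should agree with the listing only to about four or five significant digits.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity form: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explanation tree
    tf = math.sqrt(freq)                          # tf(freq)
    idf_value = idf(doc_freq, max_docs)
    query_weight = boost * idf_value * query_norm
    field_weight = tf * idf_value * field_norm
    return query_weight * field_weight

# Constants shared by every term in the explanation for doc 3543
MAX_DOCS = 41962
QUERY_NORM = 0.022586334
FIELD_NORM = 0.0625

# (term, freq, docFreq, boost), copied from the listing
terms = [
    ("europe", 2.0, 172, 1.0052801),
    ("have", 3.0, 4422, 1.006622),
    ("north", 1.0, 156, 1.0203093),
    ("experienced", 1.0, 143, 1.033695),
    ("america", 2.0, 106, 1.0796881),
    ("asia", 1.0, 37, 1.2400135),
]

scores = {
    term: term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for term, freq, doc_freq, boost in terms
}

# coord(6/25): 6 of the 25 query terms matched this document
coord = 6 / 25
total = sum(scores.values()) * coord

for term, value in scores.items():
    print(f"{term:12s} {value:.6f}")
print(f"{'total':12s} {total:.6f}")
```

The same steps apply to the other entries in the list; only the matched terms, their frequencies, the fieldNorm, and the coord fraction change.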
    
  2. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.09
    0.09254259 = sum of:
      0.09254259 = product of:
        0.5783912 = sum of:
          0.02599592 = weight(abstract_txt:have in 3720) [ClassicSimilarity], result of:
            0.02599592 = score(doc=3720,freq=3.0), product of:
              0.073890455 = queryWeight, product of:
                1.006622 = boost
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.022586334 = queryNorm
              0.351817 = fieldWeight in 3720, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.06044403 = weight(abstract_txt:decades in 3720) [ClassicSimilarity], result of:
            0.06044403 = score(doc=3720,freq=1.0), product of:
              0.14845139 = queryWeight, product of:
                1.0089029 = boost
                6.514621 = idf(docFreq=168, maxDocs=41962)
                0.022586334 = queryNorm
              0.4071638 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.514621 = idf(docFreq=168, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.06604843 = weight(abstract_txt:evaluations in 3720) [ClassicSimilarity], result of:
            0.06604843 = score(doc=3720,freq=1.0), product of:
              0.15749145 = queryWeight, product of:
                1.0391679 = boost
                6.710046 = idf(docFreq=138, maxDocs=41962)
                0.022586334 = queryNorm
              0.41937786 = fieldWeight in 3720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.710046 = idf(docFreq=138, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
          0.42590278 = weight(abstract_txt:summarization in 3720) [ClassicSimilarity], result of:
            0.42590278 = score(doc=3720,freq=7.0), product of:
              0.35936266 = queryWeight, product of:
                2.219927 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.022586334 = queryNorm
              1.1851615 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0625 = fieldNorm(doc=3720)
        0.16 = coord(4/25)
    
  3. Rader, H.B.: Information literacy 1973-2002 : a selected literature review (2002) 0.09
    0.08946032 = sum of:
      0.08946032 = product of:
        0.37275136 = sum of:
          0.029365515 = weight(abstract_txt:have in 2044) [ClassicSimilarity], result of:
            0.029365515 = score(doc=2044,freq=5.0), product of:
              0.073890455 = queryWeight, product of:
                1.006622 = boost
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.022586334 = queryNorm
              0.3974196 = fieldWeight in 2044, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
          0.07479567 = weight(abstract_txt:decades in 2044) [ClassicSimilarity], result of:
            0.07479567 = score(doc=2044,freq=2.0), product of:
              0.14845139 = queryWeight, product of:
                1.0089029 = boost
                6.514621 = idf(docFreq=168, maxDocs=41962)
                0.022586334 = queryNorm
              0.5038395 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.514621 = idf(docFreq=168, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
          0.05953929 = weight(abstract_txt:annual in 2044) [ClassicSimilarity], result of:
            0.05953929 = score(doc=2044,freq=1.0), product of:
              0.16064937 = queryWeight, product of:
                1.0495346 = boost
                6.776985 = idf(docFreq=129, maxDocs=41962)
                0.022586334 = queryNorm
              0.37061638 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.776985 = idf(docFreq=129, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
          0.064819895 = weight(abstract_txt:america in 2044) [ClassicSimilarity], result of:
            0.064819895 = score(doc=2044,freq=1.0), product of:
              0.17001301 = queryWeight, product of:
                1.0796881 = boost
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.022586334 = queryNorm
              0.38126433 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
          0.10847962 = weight(abstract_txt:tremendous in 2044) [ClassicSimilarity], result of:
            0.10847962 = score(doc=2044,freq=1.0), product of:
              0.23964886 = queryWeight, product of:
                1.2818727 = boost
                8.277224 = idf(docFreq=28, maxDocs=41962)
                0.022586334 = queryNorm
              0.45266068 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.277224 = idf(docFreq=28, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
          0.035751376 = weight(abstract_txt:field in 2044) [ClassicSimilarity], result of:
            0.035751376 = score(doc=2044,freq=1.0), product of:
              0.14406167 = queryWeight, product of:
                1.4055505 = boost
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.022586334 = queryNorm
              0.24816716 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2044)
        0.24 = coord(6/25)
    
  4. Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.08
    0.08276282 = sum of:
      0.08276282 = product of:
        0.4138141 = sum of:
          0.01857238 = weight(abstract_txt:have in 69) [ClassicSimilarity], result of:
            0.01857238 = score(doc=69,freq=2.0), product of:
              0.073890455 = queryWeight, product of:
                1.006622 = boost
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.022586334 = queryNorm
              0.2513502 = fieldWeight in 69, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.0546875 = fieldNorm(doc=69)
          0.07252937 = weight(abstract_txt:coherent in 69) [ClassicSimilarity], result of:
            0.07252937 = score(doc=69,freq=1.0), product of:
              0.18323955 = queryWeight, product of:
                1.1208999 = boost
                7.2378006 = idf(docFreq=81, maxDocs=41962)
                0.022586334 = queryNorm
              0.39581722 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2378006 = idf(docFreq=81, maxDocs=41962)
                0.0546875 = fieldNorm(doc=69)
          0.08776319 = weight(abstract_txt:ubiquitous in 69) [ClassicSimilarity], result of:
            0.08776319 = score(doc=69,freq=1.0), product of:
              0.20807418 = queryWeight, product of:
                1.1944455 = boost
                7.712694 = idf(docFreq=50, maxDocs=41962)
                0.022586334 = queryNorm
              0.42178798 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.712694 = idf(docFreq=50, maxDocs=41962)
                0.0546875 = fieldNorm(doc=69)
          0.035751376 = weight(abstract_txt:field in 69) [ClassicSimilarity], result of:
            0.035751376 = score(doc=69,freq=1.0), product of:
              0.14406167 = queryWeight, product of:
                1.4055505 = boost
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.022586334 = queryNorm
              0.24816716 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.0546875 = fieldNorm(doc=69)
          0.19919778 = weight(abstract_txt:summarization in 69) [ClassicSimilarity], result of:
            0.19919778 = score(doc=69,freq=2.0), product of:
              0.35936266 = queryWeight, product of:
                2.219927 = boost
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.022586334 = queryNorm
              0.55430853 = fieldWeight in 69, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.167183 = idf(docFreq=87, maxDocs=41962)
                0.0546875 = fieldNorm(doc=69)
        0.2 = coord(5/25)
    
  5. Hjoerland, B.; Hartel, J.: Introduction to a Special Issue of Knowledge Organization (2003) 0.07
    0.07137119 = sum of:
      0.07137119 = product of:
        0.22303498 = sum of:
          0.026160419 = weight(abstract_txt:europe in 4014) [ClassicSimilarity], result of:
            0.026160419 = score(doc=4014,freq=1.0), product of:
              0.14738719 = queryWeight, product of:
                1.0052801 = boost
                6.491228 = idf(docFreq=172, maxDocs=41962)
                0.022586334 = queryNorm
              0.17749453 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.491228 = idf(docFreq=172, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.013132657 = weight(abstract_txt:have in 4014) [ClassicSimilarity], result of:
            0.013132657 = score(doc=4014,freq=4.0), product of:
              0.073890455 = queryWeight, product of:
                1.006622 = boost
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.022586334 = queryNorm
              0.17773144 = fieldWeight in 4014, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.2499464 = idf(docFreq=4422, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.027351364 = weight(abstract_txt:north in 4014) [ClassicSimilarity], result of:
            0.027351364 = score(doc=4014,freq=1.0), product of:
              0.15182708 = queryWeight, product of:
                1.0203093 = boost
                6.588274 = idf(docFreq=156, maxDocs=41962)
                0.022586334 = queryNorm
              0.18014812 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.588274 = idf(docFreq=156, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.028989565 = weight(abstract_txt:strengths in 4014) [ClassicSimilarity], result of:
            0.028989565 = score(doc=4014,freq=1.0), product of:
              0.15783055 = queryWeight, product of:
                1.0402861 = boost
                6.717266 = idf(docFreq=137, maxDocs=41962)
                0.022586334 = queryNorm
              0.18367524 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717266 = idf(docFreq=137, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.029469999 = weight(abstract_txt:weaknesses in 4014) [ClassicSimilarity], result of:
            0.029469999 = score(doc=4014,freq=1.0), product of:
              0.15956955 = queryWeight, product of:
                1.0460013 = boost
                6.7541704 = idf(docFreq=132, maxDocs=41962)
                0.022586334 = queryNorm
              0.18468435 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7541704 = idf(docFreq=132, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.029769644 = weight(abstract_txt:annual in 4014) [ClassicSimilarity], result of:
            0.029769644 = score(doc=4014,freq=1.0), product of:
              0.16064937 = queryWeight, product of:
                1.0495346 = boost
                6.776985 = idf(docFreq=129, maxDocs=41962)
                0.022586334 = queryNorm
              0.18530819 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.776985 = idf(docFreq=129, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.032409947 = weight(abstract_txt:america in 4014) [ClassicSimilarity], result of:
            0.032409947 = score(doc=4014,freq=1.0), product of:
              0.17001301 = queryWeight, product of:
                1.0796881 = boost
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.022586334 = queryNorm
              0.19063216 = fieldWeight in 4014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9716907 = idf(docFreq=106, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
          0.035751376 = weight(abstract_txt:field in 4014) [ClassicSimilarity], result of:
            0.035751376 = score(doc=4014,freq=4.0), product of:
              0.14406167 = queryWeight, product of:
                1.4055505 = boost
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.022586334 = queryNorm
              0.24816716 = fieldWeight in 4014, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.537914 = idf(docFreq=1219, maxDocs=41962)
                0.02734375 = fieldNorm(doc=4014)
        0.32 = coord(8/25)