Document (#35983)

Author
Wang, W.
Hwang, D.
Title
Abstraction Assistant : an automatic text abstraction system
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.9, S.1790-1799
Year
2010
Abstract
In the interest of standardization and quality assurance, it is desirable for authors and staff of access services to follow the American National Standards Institute (ANSI) guidelines in preparing abstracts. Using the statistical approach an extraction system (the Abstraction Assistant) was developed to generate informative abstracts to meet the ANSI guidelines for structural content elements. The system performance is evaluated by comparing the system-generated abstracts with the author's original abstracts and the manually enhanced system abstracts on three criteria: balance (satisfaction of the ANSI standards), fluency (text coherence), and understandability (clarity). The results suggest that it is possible to use the system output directly without manual modification, but there are issues that need to be addressed in further studies to make the system a better tool.
Theme
Automatisches Abstracting

Similar documents (author)

  1. Wang, H.; Wang, C.: Ontologies for universal information systems (1995) 4.77
    4.7679567 = sum of:
      4.7679567 = weight(author_txt:wang in 3263) [ClassicSimilarity], result of:
        4.7679567 = fieldWeight in 3263, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.7429094 = idf(docFreq=136, maxDocs=42740)
          0.5 = fieldNorm(doc=3263)
    
  2. Wang, C.: ¬The online catalogue, subject access and user reactions : a review (1985) 4.21
    4.2143183 = sum of:
      4.2143183 = weight(author_txt:wang in 986) [ClassicSimilarity], result of:
        4.2143183 = fieldWeight in 986, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.7429094 = idf(docFreq=136, maxDocs=42740)
          0.625 = fieldNorm(doc=986)
    
  3. Wang, C.: Bibliometrics : a textbook (1990) 4.21
    4.2143183 = sum of:
      4.2143183 = weight(author_txt:wang in 5109) [ClassicSimilarity], result of:
        4.2143183 = fieldWeight in 5109, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.7429094 = idf(docFreq=136, maxDocs=42740)
          0.625 = fieldNorm(doc=5109)
    
  4. Wang, P.: Users' information needs at different stages of a research project : a cognitive view (1997) 4.21
    4.2143183 = sum of:
      4.2143183 = weight(author_txt:wang in 1321) [ClassicSimilarity], result of:
        4.2143183 = fieldWeight in 1321, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.7429094 = idf(docFreq=136, maxDocs=42740)
          0.625 = fieldNorm(doc=1321)
    
  5. Wang, D.: Cataloger appraises keyword searching in WorldCat (1997) 4.21
    4.2143183 = sum of:
      4.2143183 = weight(author_txt:wang in 2443) [ClassicSimilarity], result of:
        4.2143183 = fieldWeight in 2443, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.7429094 = idf(docFreq=136, maxDocs=42740)
          0.625 = fieldNorm(doc=2443)
    

Similar documents (content)

  1. Tenopir, C.; Jascó, P.: Quality of abstracts (1993) 0.19
    0.18966252 = sum of:
      0.18966252 = product of:
        1.1853907 = sum of:
          0.10752928 = weight(abstract_txt:informative in 5026) [ClassicSimilarity], result of:
            0.10752928 = score(doc=5026,freq=2.0), product of:
              0.11442101 = queryWeight, product of:
                1.1434284 = boost
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.0141176395 = queryNorm
              0.9397687 = fieldWeight in 5026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.09375 = fieldNorm(doc=5026)
          0.047320586 = weight(abstract_txt:standards in 5026) [ClassicSimilarity], result of:
            0.047320586 = score(doc=5026,freq=1.0), product of:
              0.10508515 = queryWeight, product of:
                1.549679 = boost
                4.8032756 = idf(docFreq=952, maxDocs=42740)
                0.0141176395 = queryNorm
              0.45030707 = fieldWeight in 5026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8032756 = idf(docFreq=952, maxDocs=42740)
                0.09375 = fieldNorm(doc=5026)
          0.52370423 = weight(abstract_txt:ansi in 5026) [ClassicSimilarity], result of:
            0.52370423 = score(doc=5026,freq=2.0), product of:
              0.4741536 = queryWeight, product of:
                4.031589 = boost
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.0141176395 = queryNorm
              1.1045033 = fieldWeight in 5026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.09375 = fieldNorm(doc=5026)
          0.50683665 = weight(abstract_txt:abstracts in 5026) [ClassicSimilarity], result of:
            0.50683665 = score(doc=5026,freq=5.0), product of:
              0.4052689 = queryWeight, product of:
                4.811857 = boost
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0141176395 = queryNorm
              1.2506182 = fieldWeight in 5026, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.09375 = fieldNorm(doc=5026)
        0.16 = coord(4/25)
    
  2. Tibbo, H.R.: Abstracting across the disciplines : a content analysis of abstracts for the natural sciences, the social sciences, and the humanities with implications for abstracting standards and online information retrieval (1992) 0.12
    0.123652495 = sum of:
      0.123652495 = product of:
        1.0304375 = sum of:
          0.10928221 = weight(abstract_txt:standards in 2536) [ClassicSimilarity], result of:
            0.10928221 = score(doc=2536,freq=3.0), product of:
              0.10508515 = queryWeight, product of:
                1.549679 = boost
                4.8032756 = idf(docFreq=952, maxDocs=42740)
                0.0141176395 = queryNorm
              1.0399396 = fieldWeight in 2536, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8032756 = idf(docFreq=952, maxDocs=42740)
                0.125 = fieldNorm(doc=2536)
          0.49375308 = weight(abstract_txt:ansi in 2536) [ClassicSimilarity], result of:
            0.49375308 = score(doc=2536,freq=1.0), product of:
              0.4741536 = queryWeight, product of:
                4.031589 = boost
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.0141176395 = queryNorm
              1.0413357 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.125 = fieldNorm(doc=2536)
          0.4274022 = weight(abstract_txt:abstracts in 2536) [ClassicSimilarity], result of:
            0.4274022 = score(doc=2536,freq=2.0), product of:
              0.4052689 = queryWeight, product of:
                4.811857 = boost
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0141176395 = queryNorm
              1.0546138 = fieldWeight in 2536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.125 = fieldNorm(doc=2536)
        0.12 = coord(3/25)
    
  3. Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.11
    0.10719376 = sum of:
      0.10719376 = product of:
        0.5359688 = sum of:
          0.0846125 = weight(abstract_txt:extraction in 6668) [ClassicSimilarity], result of:
            0.0846125 = score(doc=6668,freq=2.0), product of:
              0.08799942 = queryWeight, product of:
                1.0027577 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.0141176395 = queryNorm
              0.9615121 = fieldWeight in 6668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.109375 = fieldNorm(doc=6668)
          0.0737713 = weight(abstract_txt:manually in 6668) [ClassicSimilarity], result of:
            0.0737713 = score(doc=6668,freq=1.0), product of:
              0.10118707 = queryWeight, product of:
                1.0752727 = boost
                6.665678 = idf(docFreq=147, maxDocs=42740)
                0.0141176395 = queryNorm
              0.7290585 = fieldWeight in 6668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.665678 = idf(docFreq=147, maxDocs=42740)
                0.109375 = fieldNorm(doc=6668)
          0.046804063 = weight(abstract_txt:text in 6668) [ClassicSimilarity], result of:
            0.046804063 = score(doc=6668,freq=2.0), product of:
              0.074711785 = queryWeight, product of:
                1.3066691 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0141176395 = queryNorm
              0.62646157 = fieldWeight in 6668, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.109375 = fieldNorm(doc=6668)
          0.06633931 = weight(abstract_txt:system in 6668) [ClassicSimilarity], result of:
            0.06633931 = score(doc=6668,freq=1.0), product of:
              0.180335 = queryWeight, product of:
                3.797914 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.0141176395 = queryNorm
              0.36786705 = fieldWeight in 6668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.109375 = fieldNorm(doc=6668)
          0.2644416 = weight(abstract_txt:abstracts in 6668) [ClassicSimilarity], result of:
            0.2644416 = score(doc=6668,freq=1.0), product of:
              0.4052689 = queryWeight, product of:
                4.811857 = boost
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0141176395 = queryNorm
              0.65250903 = fieldWeight in 6668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.109375 = fieldNorm(doc=6668)
        0.2 = coord(5/25)
    
  4. Goh, A.; Hui, S.C.; Chan, S.K.: ¬A text extraction system for news reports (1996) 0.10
    0.10216105 = sum of:
      0.10216105 = product of:
        0.51080525 = sum of:
          0.07644807 = weight(abstract_txt:extraction in 6670) [ClassicSimilarity], result of:
            0.07644807 = score(doc=6670,freq=5.0), product of:
              0.08799942 = queryWeight, product of:
                1.0027577 = boost
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.0141176395 = queryNorm
              0.8687338 = fieldWeight in 6670, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.216153 = idf(docFreq=231, maxDocs=42740)
                0.0625 = fieldNorm(doc=6670)
          0.05961621 = weight(abstract_txt:manually in 6670) [ClassicSimilarity], result of:
            0.05961621 = score(doc=6670,freq=2.0), product of:
              0.10118707 = queryWeight, product of:
                1.0752727 = boost
                6.665678 = idf(docFreq=147, maxDocs=42740)
                0.0141176395 = queryNorm
              0.58916825 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.665678 = idf(docFreq=147, maxDocs=42740)
                0.0625 = fieldNorm(doc=6670)
          0.018911697 = weight(abstract_txt:text in 6670) [ClassicSimilarity], result of:
            0.018911697 = score(doc=6670,freq=1.0), product of:
              0.074711785 = queryWeight, product of:
                1.3066691 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0141176395 = queryNorm
              0.2531287 = fieldWeight in 6670, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=6670)
          0.053610254 = weight(abstract_txt:system in 6670) [ClassicSimilarity], result of:
            0.053610254 = score(doc=6670,freq=2.0), product of:
              0.180335 = queryWeight, product of:
                3.797914 = boost
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.0141176395 = queryNorm
              0.29728147 = fieldWeight in 6670, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3633559 = idf(docFreq=4021, maxDocs=42740)
                0.0625 = fieldNorm(doc=6670)
          0.302219 = weight(abstract_txt:abstracts in 6670) [ClassicSimilarity], result of:
            0.302219 = score(doc=6670,freq=4.0), product of:
              0.4052689 = queryWeight, product of:
                4.811857 = boost
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0141176395 = queryNorm
              0.7457246 = fieldWeight in 6670, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0625 = fieldNorm(doc=6670)
        0.2 = coord(5/25)
    
  5. Fidel, R.: Writing abstracts for free-text searching (1986) 0.10
    0.0997881 = sum of:
      0.0997881 = product of:
        0.62367564 = sum of:
          0.07603469 = weight(abstract_txt:informative in 684) [ClassicSimilarity], result of:
            0.07603469 = score(doc=684,freq=1.0), product of:
              0.11442101 = queryWeight, product of:
                1.1434284 = boost
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.0141176395 = queryNorm
              0.6645168 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.09375 = fieldNorm(doc=684)
          0.040117767 = weight(abstract_txt:text in 684) [ClassicSimilarity], result of:
            0.040117767 = score(doc=684,freq=2.0), product of:
              0.074711785 = queryWeight, product of:
                1.3066691 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0141176395 = queryNorm
              0.53696704 = fieldWeight in 684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.09375 = fieldNorm(doc=684)
          0.11492923 = weight(abstract_txt:guidelines in 684) [ClassicSimilarity], result of:
            0.11492923 = score(doc=684,freq=2.0), product of:
              0.15070173 = queryWeight, product of:
                1.855796 = boost
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.0141176395 = queryNorm
              0.7626271 = fieldWeight in 684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.752094 = idf(docFreq=368, maxDocs=42740)
                0.09375 = fieldNorm(doc=684)
          0.39259398 = weight(abstract_txt:abstracts in 684) [ClassicSimilarity], result of:
            0.39259398 = score(doc=684,freq=3.0), product of:
              0.4052689 = queryWeight, product of:
                4.811857 = boost
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.0141176395 = queryNorm
              0.96872467 = fieldWeight in 684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.965797 = idf(docFreq=297, maxDocs=42740)
                0.09375 = fieldNorm(doc=684)
        0.16 = coord(4/25)