Document (#34073)

Author
Otterbacher, J.
Radev, D.
Kareem, O.
Title
Hierarchical summarization for delivering information to mobile devices
Source
Information processing and management. 44(2008) no.2, S.931-947
Year
2008
Abstract
Access to information via handheld devices supports decision making away from one's computer. However, limitations include small screens and constrained wireless bandwidth. We present a summarization method that transforms online content for delivery to small devices. Unlike previous algorithms, ours assumes nothing about document formatting, and induces a hierarchical structure based on the relative importance of sentences within the document. As compared to delivering full documents, the method reduces the bytes transferred by half. An experiment also demonstrates that when given hierarchical summaries, users are no less accurate in answering questions about the documents.

Similar documents (author)

  1. Otterbacher, J.; Radev, D.: Exploring fact-focused relevance and novelty detection (2008) 4.55
    4.554948 = sum of:
      4.554948 = weight(author_txt:radev in 4211) [ClassicSimilarity], result of:
        4.554948 = fieldWeight in 4211, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.109896 = idf(docFreq=12, maxDocs=43254)
          0.5 = fieldNorm(doc=4211)
    
  2. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 3.99
    3.9855795 = sum of:
      3.9855795 = weight(author_txt:radev in 4587) [ClassicSimilarity], result of:
        3.9855795 = fieldWeight in 4587, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.109896 = idf(docFreq=12, maxDocs=43254)
          0.4375 = fieldNorm(doc=4587)
    
  3. Radev, D.R.; Libner, K.; Fan, W.: Getting answers to natural language questions on the Web (2002) 3.42
    3.416211 = sum of:
      3.416211 = weight(author_txt:radev in 205) [ClassicSimilarity], result of:
        3.416211 = fieldWeight in 205, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.109896 = idf(docFreq=12, maxDocs=43254)
          0.375 = fieldNorm(doc=205)
    
  4. Otterbacher, J.; Erkan, G.; Radev, D.R.: Biased LexRank : passage retrieval using random walks with question-based priors (2009) 3.42
    3.416211 = sum of:
      3.416211 = weight(author_txt:radev in 4451) [ClassicSimilarity], result of:
        3.416211 = fieldWeight in 4451, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.109896 = idf(docFreq=12, maxDocs=43254)
          0.375 = fieldNorm(doc=4451)
    
  5. Lam, W.; Chan, K.; Radev, D.; Saggion, H.; Teufel, S.: Context-based generic cross-lingual retrieval of documents and automated summaries (2005) 2.85
    2.8468423 = sum of:
      2.8468423 = weight(author_txt:radev in 3966) [ClassicSimilarity], result of:
        2.8468423 = fieldWeight in 3966, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.109896 = idf(docFreq=12, maxDocs=43254)
          0.3125 = fieldNorm(doc=3966)
    

Similar documents (content)

  1. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.10
    0.098457485 = sum of:
      0.098457485 = product of:
        0.6153593 = sum of:
          0.033010647 = weight(abstract_txt:documents in 3720) [ClassicSimilarity], result of:
            0.033010647 = score(doc=3720,freq=2.0), product of:
              0.0907296 = queryWeight, product of:
                1.1661441 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.018901087 = queryNorm
              0.36383545 = fieldWeight in 3720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.06961785 = weight(abstract_txt:document in 3720) [ClassicSimilarity], result of:
            0.06961785 = score(doc=3720,freq=7.0), product of:
              0.098273724 = queryWeight, product of:
                1.2136583 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.018901087 = queryNorm
              0.7084076 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.3225985 = weight(abstract_txt:summarization in 3720) [ClassicSimilarity], result of:
            0.3225985 = score(doc=3720,freq=7.0), product of:
              0.27314833 = queryWeight, product of:
                2.023378 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.018901087 = queryNorm
              1.1810378 = fieldWeight in 3720, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
          0.19013235 = weight(abstract_txt:hierarchical in 3720) [ClassicSimilarity], result of:
            0.19013235 = score(doc=3720,freq=4.0), product of:
              0.26487285 = queryWeight, product of:
                2.4402936 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.018901087 = queryNorm
              0.717825 = fieldWeight in 3720, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.0625 = fieldNorm(doc=3720)
        0.16 = coord(4/25)
    
  2. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy (2007) 0.09
    0.09077956 = sum of:
      0.09077956 = product of:
        0.45389777 = sum of:
          0.029177565 = weight(abstract_txt:documents in 1262) [ClassicSimilarity], result of:
            0.029177565 = score(doc=1262,freq=1.0), product of:
              0.0907296 = queryWeight, product of:
                1.1661441 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.018901087 = queryNorm
              0.32158816 = fieldWeight in 1262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.078125 = fieldNorm(doc=1262)
          0.032891344 = weight(abstract_txt:document in 1262) [ClassicSimilarity], result of:
            0.032891344 = score(doc=1262,freq=1.0), product of:
              0.098273724 = queryWeight, product of:
                1.2136583 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.018901087 = queryNorm
              0.33469114 = fieldWeight in 1262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.078125 = fieldNorm(doc=1262)
          0.14756702 = weight(abstract_txt:transforms in 1262) [ClassicSimilarity], result of:
            0.14756702 = score(doc=1262,freq=1.0), product of:
              0.21217744 = queryWeight, product of:
                1.260992 = boost
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.018901087 = queryNorm
              0.69548875 = fieldWeight in 1262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.078125 = fieldNorm(doc=1262)
          0.038437534 = weight(abstract_txt:method in 1262) [ClassicSimilarity], result of:
            0.038437534 = score(doc=1262,freq=1.0), product of:
              0.10903184 = queryWeight, product of:
                1.2783636 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.018901087 = queryNorm
              0.35253495 = fieldWeight in 1262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.078125 = fieldNorm(doc=1262)
          0.20582432 = weight(abstract_txt:hierarchical in 1262) [ClassicSimilarity], result of:
            0.20582432 = score(doc=1262,freq=3.0), product of:
              0.26487285 = queryWeight, product of:
                2.4402936 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.018901087 = queryNorm
              0.7770684 = fieldWeight in 1262, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.078125 = fieldNorm(doc=1262)
        0.2 = coord(5/25)
    
  3. O'Kane, K.C.: Generating hierarchical document indices from common denominators in large document collections (1996) 0.09
    0.08959027 = sum of:
      0.08959027 = product of:
        0.44795132 = sum of:
          0.09063322 = weight(abstract_txt:unlike in 5106) [ClassicSimilarity], result of:
            0.09063322 = score(doc=5106,freq=1.0), product of:
              0.13576165 = queryWeight, product of:
                1.0086751 = boost
                7.120968 = idf(docFreq=94, maxDocs=43254)
                0.018901087 = queryNorm
              0.66759074 = fieldWeight in 5106, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.120968 = idf(docFreq=94, maxDocs=43254)
                0.09375 = fieldNorm(doc=5106)
          0.030588008 = weight(abstract_txt:about in 5106) [ClassicSimilarity], result of:
            0.030588008 = score(doc=5106,freq=1.0), product of:
              0.08291434 = queryWeight, product of:
                1.1147887 = boost
                3.9350505 = idf(docFreq=2297, maxDocs=43254)
                0.018901087 = queryNorm
              0.36891097 = fieldWeight in 5106, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9350505 = idf(docFreq=2297, maxDocs=43254)
                0.09375 = fieldNorm(doc=5106)
          0.07893923 = weight(abstract_txt:document in 5106) [ClassicSimilarity], result of:
            0.07893923 = score(doc=5106,freq=4.0), product of:
              0.098273724 = queryWeight, product of:
                1.2136583 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.018901087 = queryNorm
              0.8032588 = fieldWeight in 5106, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.09375 = fieldNorm(doc=5106)
          0.046125043 = weight(abstract_txt:method in 5106) [ClassicSimilarity], result of:
            0.046125043 = score(doc=5106,freq=1.0), product of:
              0.10903184 = queryWeight, product of:
                1.2783636 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.018901087 = queryNorm
              0.42304194 = fieldWeight in 5106, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.09375 = fieldNorm(doc=5106)
          0.20166582 = weight(abstract_txt:hierarchical in 5106) [ClassicSimilarity], result of:
            0.20166582 = score(doc=5106,freq=2.0), product of:
              0.26487285 = queryWeight, product of:
                2.4402936 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.018901087 = queryNorm
              0.7613684 = fieldWeight in 5106, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.09375 = fieldNorm(doc=5106)
        0.2 = coord(5/25)
    
  4. Condron, L.; Tittemore, C.P.: Functional Requirements for Bibliographic Records (2004) 0.08
    0.08065503 = sum of:
      0.08065503 = product of:
        0.40327513 = sum of:
          0.07652819 = weight(abstract_txt:one's in 655) [ClassicSimilarity], result of:
            0.07652819 = score(doc=655,freq=1.0), product of:
              0.15892564 = queryWeight, product of:
                1.0913391 = boost
                7.704553 = idf(docFreq=52, maxDocs=43254)
                0.018901087 = queryNorm
              0.48153457 = fieldWeight in 655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.704553 = idf(docFreq=52, maxDocs=43254)
                0.0625 = fieldNorm(doc=655)
          0.02334205 = weight(abstract_txt:documents in 655) [ClassicSimilarity], result of:
            0.02334205 = score(doc=655,freq=1.0), product of:
              0.0907296 = queryWeight, product of:
                1.1661441 = boost
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.018901087 = queryNorm
              0.25727051 = fieldWeight in 655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1163282 = idf(docFreq=1916, maxDocs=43254)
                0.0625 = fieldNorm(doc=655)
          0.026313076 = weight(abstract_txt:document in 655) [ClassicSimilarity], result of:
            0.026313076 = score(doc=655,freq=1.0), product of:
              0.098273724 = queryWeight, product of:
                1.2136583 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.018901087 = queryNorm
              0.26775292 = fieldWeight in 655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=655)
          0.118053615 = weight(abstract_txt:transforms in 655) [ClassicSimilarity], result of:
            0.118053615 = score(doc=655,freq=1.0), product of:
              0.21217744 = queryWeight, product of:
                1.260992 = boost
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.018901087 = queryNorm
              0.556391 = fieldWeight in 655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.0625 = fieldNorm(doc=655)
          0.15903819 = weight(abstract_txt:delivering in 655) [ClassicSimilarity], result of:
            0.15903819 = score(doc=655,freq=1.0), product of:
              0.32607985 = queryWeight, product of:
                2.2107503 = boost
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.018901087 = queryNorm
              0.48772776 = fieldWeight in 655, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.803644 = idf(docFreq=47, maxDocs=43254)
                0.0625 = fieldNorm(doc=655)
        0.2 = coord(5/25)
    
  5. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.08
    0.080417655 = sum of:
      0.080417655 = product of:
        0.40208828 = sum of:
          0.05262615 = weight(abstract_txt:document in 1783) [ClassicSimilarity], result of:
            0.05262615 = score(doc=1783,freq=4.0), product of:
              0.098273724 = queryWeight, product of:
                1.2136583 = boost
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.018901087 = queryNorm
              0.53550583 = fieldWeight in 1783, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2840466 = idf(docFreq=1620, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.030750027 = weight(abstract_txt:method in 1783) [ClassicSimilarity], result of:
            0.030750027 = score(doc=1783,freq=1.0), product of:
              0.10903184 = queryWeight, product of:
                1.2783636 = boost
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.018901087 = queryNorm
              0.28202796 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5124474 = idf(docFreq=1289, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.051209763 = weight(abstract_txt:small in 1783) [ClassicSimilarity], result of:
            0.051209763 = score(doc=1783,freq=1.0), product of:
              0.15318803 = queryWeight, product of:
                1.5152705 = boost
                5.3486958 = idf(docFreq=558, maxDocs=43254)
                0.018901087 = queryNorm
              0.33429348 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3486958 = idf(docFreq=558, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.17243615 = weight(abstract_txt:summarization in 1783) [ClassicSimilarity], result of:
            0.17243615 = score(doc=1783,freq=2.0), product of:
              0.27314833 = queryWeight, product of:
                2.023378 = boost
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.018901087 = queryNorm
              0.6312913 = fieldWeight in 1783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1422453 = idf(docFreq=92, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
          0.095066175 = weight(abstract_txt:hierarchical in 1783) [ClassicSimilarity], result of:
            0.095066175 = score(doc=1783,freq=1.0), product of:
              0.26487285 = queryWeight, product of:
                2.4402936 = boost
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.018901087 = queryNorm
              0.3589125 = fieldWeight in 1783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7426 = idf(docFreq=376, maxDocs=43254)
                0.0625 = fieldNorm(doc=1783)
        0.2 = coord(5/25)