Document (#34072)

Author
Otterbacher, J.
Radev, D.
Kareem, O.
Title
Hierarchical summarization for delivering information to mobile devices
Source
Information processing and management. 44(2008) no.2, S.931-947
Year
2008
Abstract
Access to information via handheld devices supports decision making away from one's computer. However, limitations include small screens and constrained wireless bandwidth. We present a summarization method that transforms online content for delivery to small devices. Unlike previous algorithms, ours assumes nothing about document formatting, and induces a hierarchical structure based on the relative importance of sentences within the document. As compared to delivering full documents, the method reduces the bytes transferred by half. An experiment also demonstrates that when given hierarchical summaries, users are no less accurate in answering questions about the documents.

Similar documents (author)

  1. Otterbacher, J.; Radev, D.: Exploring fact-focused relevance and novelty detection (2008) 4.57
    4.565969 = sum of:
      4.565969 = weight(author_txt:radev in 2210) [ClassicSimilarity], result of:
        4.565969 = fieldWeight in 2210, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.5 = fieldNorm(doc=2210)
    
  2. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 4.00
    3.9952228 = sum of:
      3.9952228 = weight(author_txt:radev in 3122) [ClassicSimilarity], result of:
        3.9952228 = fieldWeight in 3122, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.4375 = fieldNorm(doc=3122)
    
  3. Radev, D.R.; Libner, K.; Fan, W.: Getting answers to natural language questions on the Web (2002) 3.42
    3.4244766 = sum of:
      3.4244766 = weight(author_txt:radev in 5204) [ClassicSimilarity], result of:
        3.4244766 = fieldWeight in 5204, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.375 = fieldNorm(doc=5204)
    
  4. Otterbacher, J.; Erkan, G.; Radev, D.R.: Biased LexRank : passage retrieval using random walks with question-based priors (2009) 3.42
    3.4244766 = sum of:
      3.4244766 = weight(author_txt:radev in 2450) [ClassicSimilarity], result of:
        3.4244766 = fieldWeight in 2450, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.375 = fieldNorm(doc=2450)
    
  5. Lam, W.; Chan, K.; Radev, D.; Saggion, H.; Teufel, S.: Context-based generic cross-lingual retrieval of documents and automated summaries (2005) 2.85
    2.8537307 = sum of:
      2.8537307 = weight(author_txt:radev in 1965) [ClassicSimilarity], result of:
        2.8537307 = fieldWeight in 1965, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.131938 = idf(docFreq=12, maxDocs=44218)
          0.3125 = fieldNorm(doc=1965)
    

Similar documents (content)

  1. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.13
    0.13456199 = sum of:
      0.13456199 = product of:
        0.6728099 = sum of:
          0.05832386 = weight(abstract_txt:summaries in 1719) [ClassicSimilarity], result of:
            0.05832386 = score(doc=1719,freq=1.0), product of:
              0.1326777 = queryWeight, product of:
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01886382 = queryNorm
              0.4395905 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.033188675 = weight(abstract_txt:documents in 1719) [ClassicSimilarity], result of:
            0.033188675 = score(doc=1719,freq=2.0), product of:
              0.091108814 = queryWeight, product of:
                1.1719153 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01886382 = queryNorm
              0.36427513 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.07015912 = weight(abstract_txt:document in 1719) [ClassicSimilarity], result of:
            0.07015912 = score(doc=1719,freq=7.0), product of:
              0.09884025 = queryWeight, product of:
                1.220627 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01886382 = queryNorm
              0.70982337 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.32184947 = weight(abstract_txt:summarization in 1719) [ClassicSimilarity], result of:
            0.32184947 = score(doc=1719,freq=7.0), product of:
              0.27288496 = queryWeight, product of:
                2.028177 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01886382 = queryNorm
              1.1794327 = fieldWeight in 1719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
          0.1892888 = weight(abstract_txt:hierarchical in 1719) [ClassicSimilarity], result of:
            0.1892888 = score(doc=1719,freq=4.0), product of:
              0.2642434 = queryWeight, product of:
                2.444352 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01886382 = queryNorm
              0.71634257 = fieldWeight in 1719, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=1719)
        0.2 = coord(5/25)
    
  2. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.13
    0.13056536 = sum of:
      0.13056536 = product of:
        0.5440223 = sum of:
          0.14286369 = weight(abstract_txt:summaries in 657) [ClassicSimilarity], result of:
            0.14286369 = score(doc=657,freq=6.0), product of:
              0.1326777 = queryWeight, product of:
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01886382 = queryNorm
              1.0767725 = fieldWeight in 657, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.05303531 = weight(abstract_txt:document in 657) [ClassicSimilarity], result of:
            0.05303531 = score(doc=657,freq=4.0), product of:
              0.09884025 = queryWeight, product of:
                1.220627 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01886382 = queryNorm
              0.53657603 = fieldWeight in 657, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.030569188 = weight(abstract_txt:method in 657) [ClassicSimilarity], result of:
            0.030569188 = score(doc=657,freq=1.0), product of:
              0.10866751 = queryWeight, product of:
                1.27987 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01886382 = queryNorm
              0.28130937 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.05087401 = weight(abstract_txt:small in 657) [ClassicSimilarity], result of:
            0.05087401 = score(doc=657,freq=1.0), product of:
              0.152607 = queryWeight, product of:
                1.5167124 = boost
                5.333859 = idf(docFreq=579, maxDocs=44218)
                0.01886382 = queryNorm
              0.3333662 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.333859 = idf(docFreq=579, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.17203577 = weight(abstract_txt:summarization in 657) [ClassicSimilarity], result of:
            0.17203577 = score(doc=657,freq=2.0), product of:
              0.27288496 = queryWeight, product of:
                2.028177 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01886382 = queryNorm
              0.6304333 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.0946444 = weight(abstract_txt:hierarchical in 657) [ClassicSimilarity], result of:
            0.0946444 = score(doc=657,freq=1.0), product of:
              0.2642434 = queryWeight, product of:
                2.444352 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01886382 = queryNorm
              0.35817128 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
        0.24 = coord(6/25)
    
  3. Pons-Porrata, A.; Berlanga-Llavori, R.; Ruiz-Shulcloper, J.: Topic discovery based on text mining techniques (2007) 0.10
    0.10323011 = sum of:
      0.10323011 = product of:
        0.51615053 = sum of:
          0.10310298 = weight(abstract_txt:summaries in 916) [ClassicSimilarity], result of:
            0.10310298 = score(doc=916,freq=2.0), product of:
              0.1326777 = queryWeight, product of:
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01886382 = queryNorm
              0.7770935 = fieldWeight in 916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=916)
          0.041485842 = weight(abstract_txt:documents in 916) [ClassicSimilarity], result of:
            0.041485842 = score(doc=916,freq=2.0), product of:
              0.091108814 = queryWeight, product of:
                1.1719153 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01886382 = queryNorm
              0.4553439 = fieldWeight in 916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=916)
          0.038211484 = weight(abstract_txt:method in 916) [ClassicSimilarity], result of:
            0.038211484 = score(doc=916,freq=1.0), product of:
              0.10866751 = queryWeight, product of:
                1.27987 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.01886382 = queryNorm
              0.3516367 = fieldWeight in 916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=916)
          0.21504472 = weight(abstract_txt:summarization in 916) [ClassicSimilarity], result of:
            0.21504472 = score(doc=916,freq=2.0), product of:
              0.27288496 = queryWeight, product of:
                2.028177 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01886382 = queryNorm
              0.78804165 = fieldWeight in 916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=916)
          0.11830549 = weight(abstract_txt:hierarchical in 916) [ClassicSimilarity], result of:
            0.11830549 = score(doc=916,freq=1.0), product of:
              0.2642434 = queryWeight, product of:
                2.444352 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01886382 = queryNorm
              0.4477141 = fieldWeight in 916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=916)
        0.2 = coord(5/25)
    
  4. Xiong, S.; Ji, D.: Query-focused multi-document summarization using hypergraph-based ranking (2016) 0.10
    0.0964934 = sum of:
      0.0964934 = product of:
        0.482467 = sum of:
          0.072904825 = weight(abstract_txt:summaries in 2972) [ClassicSimilarity], result of:
            0.072904825 = score(doc=2972,freq=1.0), product of:
              0.1326777 = queryWeight, product of:
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01886382 = queryNorm
              0.5494881 = fieldWeight in 2972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=2972)
          0.029334923 = weight(abstract_txt:documents in 2972) [ClassicSimilarity], result of:
            0.029334923 = score(doc=2972,freq=1.0), product of:
              0.091108814 = queryWeight, product of:
                1.1719153 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01886382 = queryNorm
              0.32197678 = fieldWeight in 2972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2972)
          0.046877034 = weight(abstract_txt:document in 2972) [ClassicSimilarity], result of:
            0.046877034 = score(doc=2972,freq=2.0), product of:
              0.09884025 = queryWeight, product of:
                1.220627 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01886382 = queryNorm
              0.4742707 = fieldWeight in 2972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2972)
          0.21504472 = weight(abstract_txt:summarization in 2972) [ClassicSimilarity], result of:
            0.21504472 = score(doc=2972,freq=2.0), product of:
              0.27288496 = queryWeight, product of:
                2.028177 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01886382 = queryNorm
              0.78804165 = fieldWeight in 2972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=2972)
          0.11830549 = weight(abstract_txt:hierarchical in 2972) [ClassicSimilarity], result of:
            0.11830549 = score(doc=2972,freq=1.0), product of:
              0.2642434 = queryWeight, product of:
                2.444352 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01886382 = queryNorm
              0.4477141 = fieldWeight in 2972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=2972)
        0.2 = coord(5/25)
    
  5. Chang, Y.-W.: Influence of human behavior and the principle of least effort on library and information science research (2016) 0.10
    0.0964934 = sum of:
      0.0964934 = product of:
        0.482467 = sum of:
          0.072904825 = weight(abstract_txt:summaries in 2973) [ClassicSimilarity], result of:
            0.072904825 = score(doc=2973,freq=1.0), product of:
              0.1326777 = queryWeight, product of:
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.01886382 = queryNorm
              0.5494881 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=2973)
          0.029334923 = weight(abstract_txt:documents in 2973) [ClassicSimilarity], result of:
            0.029334923 = score(doc=2973,freq=1.0), product of:
              0.091108814 = queryWeight, product of:
                1.1719153 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.01886382 = queryNorm
              0.32197678 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2973)
          0.046877034 = weight(abstract_txt:document in 2973) [ClassicSimilarity], result of:
            0.046877034 = score(doc=2973,freq=2.0), product of:
              0.09884025 = queryWeight, product of:
                1.220627 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.01886382 = queryNorm
              0.4742707 = fieldWeight in 2973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=2973)
          0.21504472 = weight(abstract_txt:summarization in 2973) [ClassicSimilarity], result of:
            0.21504472 = score(doc=2973,freq=2.0), product of:
              0.27288496 = queryWeight, product of:
                2.028177 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.01886382 = queryNorm
              0.78804165 = fieldWeight in 2973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=2973)
          0.11830549 = weight(abstract_txt:hierarchical in 2973) [ClassicSimilarity], result of:
            0.11830549 = score(doc=2973,freq=1.0), product of:
              0.2642434 = queryWeight, product of:
                2.444352 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.01886382 = queryNorm
              0.4477141 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=2973)
        0.2 = coord(5/25)