Document (#42245)

Author
Chin, J.Y.
Bhowmick, S.S.
Jatowt, A.
Title
On-demand recent personal tweets summarization on mobile devices
Source
Journal of the Association for Information Science and Technology. 70(2019) no.6, S.547-562
Year
2019
Abstract
Tweets summarization aims to find a group of representative tweets for a specific set of input tweets or a given topic. In recent times, there have been several research efforts toward devising a variety of techniques to summarize tweets in Twitter. However, these techniques are either not personal (that is, consider only tweets in the timeline of a specific user) or are too expensive to be realized on a mobile device. Given that 80% of active Twitter users access the site on mobile devices, in this article we present a lightweight, personal, on-demand, topic modeling-based tweets summarization engine called TOTEM, designed for such devices. Specifically, TOTEM first preprocesses recent tweets in a user's timeline and exploits Latent Dirichlet Allocation-based topic modeling to assign each preprocessed tweet to a topic. Then it generates a ranked list of relevant tweets, a topic label, and a topic summary for each of the topics. Our experimental study with real-world data sets demonstrates the superiority of TOTEM.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24137.

Similar documents (author)

  1. Jatowt, A.; Yeung, C.M.A.; Tanaka, K.: Generic method for detecting focus time of documents (2015) 3.71
    3.7087662 = sum of:
      3.7087662 = weight(author_txt:jatowt in 4666) [ClassicSimilarity], result of:
        3.7087662 = fieldWeight in 4666, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.890043 = idf(docFreq=5, maxDocs=43556)
          0.375 = fieldNorm(doc=4666)
    
  2. Joho, H.; Jatowt, A.; Blanco, R.: Temporal information searching behaviour and strategies (2015) 3.71
    3.7087662 = sum of:
      3.7087662 = weight(author_txt:jatowt in 4672) [ClassicSimilarity], result of:
        3.7087662 = fieldWeight in 4672, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.890043 = idf(docFreq=5, maxDocs=43556)
          0.375 = fieldNorm(doc=4672)
    
  3. Lee, J.; Jatowt, A.; Kim, K.-S..: Discovering underlying sensations of human emotions based on social media (2021) 3.71
    3.7087662 = sum of:
      3.7087662 = weight(author_txt:jatowt in 2450) [ClassicSimilarity], result of:
        3.7087662 = fieldWeight in 2450, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.890043 = idf(docFreq=5, maxDocs=43556)
          0.375 = fieldNorm(doc=2450)
    
  4. Zielinski, K.; Nielek, R.; Wierzbicki, A.; Jatowt, A.: Computing controversy : formal model and algorithms for detecting controversy on Wikipedia and in search queries (2018) 3.09
    3.0906386 = sum of:
      3.0906386 = weight(author_txt:jatowt in 1379) [ClassicSimilarity], result of:
        3.0906386 = fieldWeight in 1379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.890043 = idf(docFreq=5, maxDocs=43556)
          0.3125 = fieldNorm(doc=1379)
    

Similar documents (content)

  1. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.36
    0.3595743 = sum of:
      0.3595743 = product of:
        1.2841939 = sum of:
          0.0127213765 = weight(abstract_txt:each in 4333) [ClassicSimilarity], result of:
            0.0127213765 = score(doc=4333,freq=1.0), product of:
              0.049351536 = queryWeight, product of:
                1.109689 = boost
                4.12433 = idf(docFreq=1914, maxDocs=43556)
                0.010783158 = queryNorm
              0.25777063 = fieldWeight in 4333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.12433 = idf(docFreq=1914, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.098025754 = weight(abstract_txt:tweet in 4333) [ClassicSimilarity], result of:
            0.098025754 = score(doc=4333,freq=3.0), product of:
              0.105954885 = queryWeight, product of:
                1.1497315 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.010783158 = queryNorm
              0.925165 = fieldWeight in 4333, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.014668895 = weight(abstract_txt:specific in 4333) [ClassicSimilarity], result of:
            0.014668895 = score(doc=4333,freq=1.0), product of:
              0.054267883 = queryWeight, product of:
                1.16365 = boost
                4.3248844 = idf(docFreq=1566, maxDocs=43556)
                0.010783158 = queryNorm
              0.27030528 = fieldWeight in 4333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3248844 = idf(docFreq=1566, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.039954174 = weight(abstract_txt:modeling in 4333) [ClassicSimilarity], result of:
            0.039954174 = score(doc=4333,freq=1.0), product of:
              0.105840705 = queryWeight, product of:
                1.6250896 = boost
                6.0398955 = idf(docFreq=281, maxDocs=43556)
                0.010783158 = queryNorm
              0.37749347 = fieldWeight in 4333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0398955 = idf(docFreq=281, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.15054716 = weight(abstract_txt:twitter in 4333) [ClassicSimilarity], result of:
            0.15054716 = score(doc=4333,freq=6.0), product of:
              0.14104009 = queryWeight, product of:
                1.8759543 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.010783158 = queryNorm
              1.0674069 = fieldWeight in 4333, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.07116878 = weight(abstract_txt:topic in 4333) [ClassicSimilarity], result of:
            0.07116878 = score(doc=4333,freq=1.0), product of:
              0.22430798 = queryWeight, product of:
                4.0976415 = boost
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.010783158 = queryNorm
              0.31728154 = fieldWeight in 4333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
          0.8971077 = weight(abstract_txt:tweets in 4333) [ClassicSimilarity], result of:
            0.8971077 = score(doc=4333,freq=6.0), product of:
              0.7653523 = queryWeight, product of:
                9.270174 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.010783158 = queryNorm
              1.17215 = fieldWeight in 4333, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.0625 = fieldNorm(doc=4333)
        0.28 = coord(7/25)
    
  2. Sedhai, S.; Sun, A.: ¬An analysis of 14 Million tweets on hashtag-oriented spamming* (2017) 0.20
    0.19905086 = sum of:
      0.19905086 = product of:
        1.2440679 = sum of:
          0.098025754 = weight(abstract_txt:tweet in 681) [ClassicSimilarity], result of:
            0.098025754 = score(doc=681,freq=3.0), product of:
              0.105954885 = queryWeight, product of:
                1.1497315 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.010783158 = queryNorm
              0.925165 = fieldWeight in 681, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.0625 = fieldNorm(doc=681)
          0.13743012 = weight(abstract_txt:twitter in 681) [ClassicSimilarity], result of:
            0.13743012 = score(doc=681,freq=5.0), product of:
              0.14104009 = queryWeight, product of:
                1.8759543 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.010783158 = queryNorm
              0.9744047 = fieldWeight in 681, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.0625 = fieldNorm(doc=681)
          0.03962508 = weight(abstract_txt:personal in 681) [ClassicSimilarity], result of:
            0.03962508 = score(doc=681,freq=1.0), product of:
              0.12049114 = queryWeight, product of:
                2.123607 = boost
                5.261808 = idf(docFreq=613, maxDocs=43556)
                0.010783158 = queryNorm
              0.328863 = fieldWeight in 681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.261808 = idf(docFreq=613, maxDocs=43556)
                0.0625 = fieldNorm(doc=681)
          0.9689869 = weight(abstract_txt:tweets in 681) [ClassicSimilarity], result of:
            0.9689869 = score(doc=681,freq=7.0), product of:
              0.7653523 = queryWeight, product of:
                9.270174 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.010783158 = queryNorm
              1.2660666 = fieldWeight in 681, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.0625 = fieldNorm(doc=681)
        0.16 = coord(4/25)
    
  3. Gonçalo Oliveira, H.: Automatic generation of poetry inspired by Twitter trends (2016) 0.19
    0.1930529 = sum of:
      0.1930529 = product of:
        0.9652645 = sum of:
          0.02358696 = weight(abstract_txt:given in 4386) [ClassicSimilarity], result of:
            0.02358696 = score(doc=4386,freq=1.0), product of:
              0.06418782 = queryWeight, product of:
                1.265544 = boost
                4.703589 = idf(docFreq=1072, maxDocs=43556)
                0.010783158 = queryNorm
              0.36746788 = fieldWeight in 4386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.703589 = idf(docFreq=1072, maxDocs=43556)
                0.078125 = fieldNorm(doc=4386)
          0.076825775 = weight(abstract_txt:twitter in 4386) [ClassicSimilarity], result of:
            0.076825775 = score(doc=4386,freq=1.0), product of:
              0.14104009 = queryWeight, product of:
                1.8759543 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.010783158 = queryNorm
              0.5447088 = fieldWeight in 4386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.078125 = fieldNorm(doc=4386)
          0.039498128 = weight(abstract_txt:recent in 4386) [ClassicSimilarity], result of:
            0.039498128 = score(doc=4386,freq=1.0), product of:
              0.10361422 = queryWeight, product of:
                1.9692745 = boost
                4.879408 = idf(docFreq=899, maxDocs=43556)
                0.010783158 = queryNorm
              0.38120374 = fieldWeight in 4386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.879408 = idf(docFreq=899, maxDocs=43556)
                0.078125 = fieldNorm(doc=4386)
          0.17792195 = weight(abstract_txt:topic in 4386) [ClassicSimilarity], result of:
            0.17792195 = score(doc=4386,freq=4.0), product of:
              0.22430798 = queryWeight, product of:
                4.0976415 = boost
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.010783158 = queryNorm
              0.79320383 = fieldWeight in 4386, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.078125 = fieldNorm(doc=4386)
          0.6474317 = weight(abstract_txt:tweets in 4386) [ClassicSimilarity], result of:
            0.6474317 = score(doc=4386,freq=2.0), product of:
              0.7653523 = queryWeight, product of:
                9.270174 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.010783158 = queryNorm
              0.84592634 = fieldWeight in 4386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.078125 = fieldNorm(doc=4386)
        0.2 = coord(5/25)
    
  4. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.19
    0.19246326 = sum of:
      0.19246326 = product of:
        0.9623163 = sum of:
          0.05211387 = weight(abstract_txt:superiority in 958) [ClassicSimilarity], result of:
            0.05211387 = score(doc=958,freq=1.0), product of:
              0.100285195 = queryWeight, product of:
                1.1185473 = boost
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.010783158 = queryNorm
              0.51965666 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.314507 = idf(docFreq=28, maxDocs=43556)
                0.0625 = fieldNorm(doc=958)
          0.056595195 = weight(abstract_txt:tweet in 958) [ClassicSimilarity], result of:
            0.056595195 = score(doc=958,freq=1.0), product of:
              0.105954885 = queryWeight, product of:
                1.1497315 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.010783158 = queryNorm
              0.5341443 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.0625 = fieldNorm(doc=958)
          0.014668895 = weight(abstract_txt:specific in 958) [ClassicSimilarity], result of:
            0.014668895 = score(doc=958,freq=1.0), product of:
              0.054267883 = queryWeight, product of:
                1.16365 = boost
                4.3248844 = idf(docFreq=1566, maxDocs=43556)
                0.010783158 = queryNorm
              0.27030528 = fieldWeight in 958, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3248844 = idf(docFreq=1566, maxDocs=43556)
                0.0625 = fieldNorm(doc=958)
          0.10645292 = weight(abstract_txt:twitter in 958) [ClassicSimilarity], result of:
            0.10645292 = score(doc=958,freq=3.0), product of:
              0.14104009 = queryWeight, product of:
                1.8759543 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.010783158 = queryNorm
              0.75477064 = fieldWeight in 958, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.0625 = fieldNorm(doc=958)
          0.73248535 = weight(abstract_txt:tweets in 958) [ClassicSimilarity], result of:
            0.73248535 = score(doc=958,freq=4.0), product of:
              0.7653523 = queryWeight, product of:
                9.270174 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.010783158 = queryNorm
              0.9570564 = fieldWeight in 958, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.0625 = fieldNorm(doc=958)
        0.2 = coord(5/25)
    
  5. Cotelo, J.M.; Cruz, F.L.; Troyano, J.A.: Dynamic topic-related tweet retrieval (2014) 0.16
    0.15694538 = sum of:
      0.15694538 = product of:
        0.98090863 = sum of:
          0.070744 = weight(abstract_txt:tweet in 3215) [ClassicSimilarity], result of:
            0.070744 = score(doc=3215,freq=1.0), product of:
              0.105954885 = queryWeight, product of:
                1.1497315 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.010783158 = queryNorm
              0.6676804 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.078125 = fieldNorm(doc=3215)
          0.108648054 = weight(abstract_txt:twitter in 3215) [ClassicSimilarity], result of:
            0.108648054 = score(doc=3215,freq=2.0), product of:
              0.14104009 = queryWeight, product of:
                1.8759543 = boost
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.010783158 = queryNorm
              0.77033454 = fieldWeight in 3215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9722724 = idf(docFreq=110, maxDocs=43556)
                0.078125 = fieldNorm(doc=3215)
          0.15408492 = weight(abstract_txt:topic in 3215) [ClassicSimilarity], result of:
            0.15408492 = score(doc=3215,freq=3.0), product of:
              0.22430798 = queryWeight, product of:
                4.0976415 = boost
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.010783158 = queryNorm
              0.68693465 = fieldWeight in 3215, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0765047 = idf(docFreq=738, maxDocs=43556)
                0.078125 = fieldNorm(doc=3215)
          0.6474317 = weight(abstract_txt:tweets in 3215) [ClassicSimilarity], result of:
            0.6474317 = score(doc=3215,freq=2.0), product of:
              0.7653523 = queryWeight, product of:
                9.270174 = boost
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.010783158 = queryNorm
              0.84592634 = fieldWeight in 3215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.656451 = idf(docFreq=55, maxDocs=43556)
                0.078125 = fieldNorm(doc=3215)
        0.16 = coord(4/25)