Document (#42247)

Author
Chin, J.Y.
Bhowmick, S.S.
Jatowt, A.
Title
On-demand recent personal tweets summarization on mobile devices
Source
Journal of the Association for Information Science and Technology. 70(2019) no.6, S.547-562
Year
2019
Abstract
Tweets summarization aims to find a group of representative tweets for a specific set of input tweets or a given topic. In recent times, there have been several research efforts toward devising a variety of techniques to summarize tweets in Twitter. However, these techniques are either not personal (that is, consider only tweets in the timeline of a specific user) or are too expensive to be realized on a mobile device. Given that 80% of active Twitter users access the site on mobile devices, in this article we present a lightweight, personal, on-demand, topic modeling-based tweets summarization engine called TOTEM, designed for such devices. Specifically, TOTEM first preprocesses recent tweets in a user's timeline and exploits Latent Dirichlet Allocation-based topic modeling to assign each preprocessed tweet to a topic. Then it generates a ranked list of relevant tweets, a topic label, and a topic summary for each of the topics. Our experimental study with real-world data sets demonstrates the superiority of TOTEM.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24137.

Similar documents (author)

  1. Jatowt, A.; Yeung, C.M.A.; Tanaka, K.: Generic method for detecting focus time of documents (2015) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 2668) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 2668, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=2668)
    
  2. Joho, H.; Jatowt, A.; Blanco, R.: Temporal information searching behaviour and strategies (2015) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 2674) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 2674, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=2674)
    
  3. Lee, J.; Jatowt, A.; Kim, K.-S..: Discovering underlying sensations of human emotions based on social media (2021) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:jatowt in 163) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 163, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=163)
    
  4. Zielinski, K.; Nielek, R.; Wierzbicki, A.; Jatowt, A.: Computing controversy : formal model and algorithms for detecting controversy on Wikipedia and in search queries (2018) 3.10
    3.0953524 = sum of:
      3.0953524 = weight(author_txt:jatowt in 5093) [ClassicSimilarity], result of:
        3.0953524 = fieldWeight in 5093, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.3125 = fieldNorm(doc=5093)
    

Similar documents (content)

  1. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.36
    0.35560527 = sum of:
      0.35560527 = product of:
        1.2700188 = sum of:
          0.012832282 = weight(abstract_txt:each in 2335) [ClassicSimilarity], result of:
            0.012832282 = score(doc=2335,freq=1.0), product of:
              0.049849328 = queryWeight, product of:
                1.1080514 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.01092282 = queryNorm
              0.25742137 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.09832889 = weight(abstract_txt:tweet in 2335) [ClassicSimilarity], result of:
            0.09832889 = score(doc=2335,freq=3.0), product of:
              0.106625326 = queryWeight, product of:
                1.1458967 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.01092282 = queryNorm
              0.9221907 = fieldWeight in 2335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.01474664 = weight(abstract_txt:specific in 2335) [ClassicSimilarity], result of:
            0.01474664 = score(doc=2335,freq=1.0), product of:
              0.054691363 = queryWeight, product of:
                1.1606189 = boost
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.01092282 = queryNorm
              0.2696338 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.039934695 = weight(abstract_txt:modeling in 2335) [ClassicSimilarity], result of:
            0.039934695 = score(doc=2335,freq=1.0), product of:
              0.10625685 = queryWeight, product of:
                1.61774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.01092282 = queryNorm
              0.37583172 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.14893027 = weight(abstract_txt:twitter in 2335) [ClassicSimilarity], result of:
            0.14893027 = score(doc=2335,freq=6.0), product of:
              0.14062469 = queryWeight, product of:
                1.8610629 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.01092282 = queryNorm
              1.059062 = fieldWeight in 2335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.071476474 = weight(abstract_txt:topic in 2335) [ClassicSimilarity], result of:
            0.071476474 = score(doc=2335,freq=1.0), product of:
              0.22591195 = queryWeight, product of:
                4.085644 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.01092282 = queryNorm
              0.31639087 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.88376963 = weight(abstract_txt:tweets in 2335) [ClassicSimilarity], result of:
            0.88376963 = score(doc=2335,freq=6.0), product of:
              0.76097393 = queryWeight, product of:
                9.183779 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01092282 = queryNorm
              1.1613665 = fieldWeight in 2335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
        0.28 = coord(7/25)
    
  2. Zheng, H.; Goh, D.H.-L.; Lee, E.W.J.; Lee, C.S.; Theng, Y.-L.: Understanding the effects of message cues on COVID-19 information sharing on Twitter (2022) 0.26
    0.26417822 = sum of:
      0.26417822 = product of:
        1.1007426 = sum of:
          0.040351227 = weight(abstract_txt:allocation in 564) [ClassicSimilarity], result of:
            0.040351227 = score(doc=564,freq=1.0), product of:
              0.08492154 = queryWeight, product of:
                1.0226433 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.01092282 = queryNorm
              0.47515893 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
          0.051805712 = weight(abstract_txt:dirichlet in 564) [ClassicSimilarity], result of:
            0.051805712 = score(doc=564,freq=1.0), product of:
              0.100314826 = queryWeight, product of:
                1.1114702 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01092282 = queryNorm
              0.5164313 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
          0.039934695 = weight(abstract_txt:modeling in 564) [ClassicSimilarity], result of:
            0.039934695 = score(doc=564,freq=1.0), product of:
              0.10625685 = queryWeight, product of:
                1.61774 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.01092282 = queryNorm
              0.37583172 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
          0.060800523 = weight(abstract_txt:twitter in 564) [ClassicSimilarity], result of:
            0.060800523 = score(doc=564,freq=1.0), product of:
              0.14062469 = queryWeight, product of:
                1.8610629 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.01092282 = queryNorm
              0.43236023 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
          0.101083 = weight(abstract_txt:topic in 564) [ClassicSimilarity], result of:
            0.101083 = score(doc=564,freq=2.0), product of:
              0.22591195 = queryWeight, product of:
                4.085644 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.01092282 = queryNorm
              0.44744426 = fieldWeight in 564, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
          0.80676746 = weight(abstract_txt:tweets in 564) [ClassicSimilarity], result of:
            0.80676746 = score(doc=564,freq=5.0), product of:
              0.76097393 = queryWeight, product of:
                9.183779 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01092282 = queryNorm
              1.0601776 = fieldWeight in 564, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=564)
        0.24 = coord(6/25)
    
  3. Sedhai, S.; Sun, A.: ¬An analysis of 14 Million tweets on hashtag-oriented spamming* (2017) 0.20
    0.19660057 = sum of:
      0.19660057 = product of:
        1.2287536 = sum of:
          0.09832889 = weight(abstract_txt:tweet in 3683) [ClassicSimilarity], result of:
            0.09832889 = score(doc=3683,freq=3.0), product of:
              0.106625326 = queryWeight, product of:
                1.1458967 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.01092282 = queryNorm
              0.9221907 = fieldWeight in 3683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=3683)
          0.1359541 = weight(abstract_txt:twitter in 3683) [ClassicSimilarity], result of:
            0.1359541 = score(doc=3683,freq=5.0), product of:
              0.14062469 = queryWeight, product of:
                1.8610629 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.01092282 = queryNorm
              0.96678686 = fieldWeight in 3683, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=3683)
          0.039890453 = weight(abstract_txt:personal in 3683) [ClassicSimilarity], result of:
            0.039890453 = score(doc=3683,freq=1.0), product of:
              0.12154388 = queryWeight, product of:
                2.119056 = boost
                5.2511673 = idf(docFreq=629, maxDocs=44218)
                0.01092282 = queryNorm
              0.32819796 = fieldWeight in 3683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2511673 = idf(docFreq=629, maxDocs=44218)
                0.0625 = fieldNorm(doc=3683)
          0.9545801 = weight(abstract_txt:tweets in 3683) [ClassicSimilarity], result of:
            0.9545801 = score(doc=3683,freq=7.0), product of:
              0.76097393 = queryWeight, product of:
                9.183779 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01092282 = queryNorm
              1.254419 = fieldWeight in 3683, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=3683)
        0.16 = coord(4/25)
    
  4. Gonçalo Oliveira, H.: Automatic generation of poetry inspired by Twitter trends (2016) 0.19
    0.19117884 = sum of:
      0.19117884 = product of:
        0.95589423 = sum of:
          0.023851978 = weight(abstract_txt:given in 2388) [ClassicSimilarity], result of:
            0.023851978 = score(doc=2388,freq=1.0), product of:
              0.06494309 = queryWeight, product of:
                1.2647269 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.01092282 = queryNorm
              0.36727506 = fieldWeight in 2388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.078125 = fieldNorm(doc=2388)
          0.07600065 = weight(abstract_txt:twitter in 2388) [ClassicSimilarity], result of:
            0.07600065 = score(doc=2388,freq=1.0), product of:
              0.14062469 = queryWeight, product of:
                1.8610629 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.01092282 = queryNorm
              0.5404503 = fieldWeight in 2388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.078125 = fieldNorm(doc=2388)
          0.039544728 = weight(abstract_txt:recent in 2388) [ClassicSimilarity], result of:
            0.039544728 = score(doc=2388,freq=1.0), product of:
              0.104137264 = queryWeight, product of:
                1.9614578 = boost
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.01092282 = queryNorm
              0.37973657 = fieldWeight in 2388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.860628 = idf(docFreq=930, maxDocs=44218)
                0.078125 = fieldNorm(doc=2388)
          0.1786912 = weight(abstract_txt:topic in 2388) [ClassicSimilarity], result of:
            0.1786912 = score(doc=2388,freq=4.0), product of:
              0.22591195 = queryWeight, product of:
                4.085644 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.01092282 = queryNorm
              0.7909772 = fieldWeight in 2388, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=2388)
          0.6378057 = weight(abstract_txt:tweets in 2388) [ClassicSimilarity], result of:
            0.6378057 = score(doc=2388,freq=2.0), product of:
              0.76097393 = queryWeight, product of:
                9.183779 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01092282 = queryNorm
              0.83814394 = fieldWeight in 2388, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.078125 = fieldNorm(doc=2388)
        0.2 = coord(5/25)
    
  5. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.19
    0.19016925 = sum of:
      0.19016925 = product of:
        0.9508462 = sum of:
          0.0524249 = weight(abstract_txt:superiority in 4672) [ClassicSimilarity], result of:
            0.0524249 = score(doc=4672,freq=1.0), product of:
              0.10111256 = queryWeight, product of:
                1.1158808 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.01092282 = queryNorm
              0.5184806 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.05677021 = weight(abstract_txt:tweet in 4672) [ClassicSimilarity], result of:
            0.05677021 = score(doc=4672,freq=1.0), product of:
              0.106625326 = queryWeight, product of:
                1.1458967 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.01092282 = queryNorm
              0.5324271 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.01474664 = weight(abstract_txt:specific in 4672) [ClassicSimilarity], result of:
            0.01474664 = score(doc=4672,freq=1.0), product of:
              0.054691363 = queryWeight, product of:
                1.1606189 = boost
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.01092282 = queryNorm
              0.2696338 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314141 = idf(docFreq=1607, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.1053096 = weight(abstract_txt:twitter in 4672) [ClassicSimilarity], result of:
            0.1053096 = score(doc=4672,freq=3.0), product of:
              0.14062469 = queryWeight, product of:
                1.8610629 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.01092282 = queryNorm
              0.7488699 = fieldWeight in 4672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.7215948 = weight(abstract_txt:tweets in 4672) [ClassicSimilarity], result of:
            0.7215948 = score(doc=4672,freq=4.0), product of:
              0.76097393 = queryWeight, product of:
                9.183779 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01092282 = queryNorm
              0.94825166 = fieldWeight in 4672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
        0.2 = coord(5/25)