Document (#42248)

Author
Chin, J.Y.
Bhowmick, S.S.
Jatowt, A.
Title
On-demand recent personal tweets summarization on mobile devices
Source
Journal of the Association for Information Science and Technology. 70(2019) no.6, p.547-562
Year
2019
Abstract
Tweets summarization aims to find a group of representative tweets for a specific set of input tweets or a given topic. In recent times, there have been several research efforts toward devising a variety of techniques to summarize tweets in Twitter. However, these techniques are either not personal (that is, they do not restrict themselves to tweets in the timeline of a specific user) or are too expensive to be realized on a mobile device. Given that 80% of active Twitter users access the site on mobile devices, in this article we present a lightweight, personal, on-demand, topic modeling-based tweets summarization engine called TOTEM, designed for such devices. Specifically, TOTEM first preprocesses recent tweets in a user's timeline and exploits Latent Dirichlet Allocation-based topic modeling to assign each preprocessed tweet to a topic. Then it generates a ranked list of relevant tweets, a topic label, and a topic summary for each of the topics. Our experimental study with real-world data sets demonstrates the superiority of TOTEM.
Content
Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24137.

Similar documents (content)

  1. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.37
    0.3677184 = sum of:
      0.3677184 = product of:
        1.31328 = sum of:
          0.012533808 = weight(abstract_txt:each in 4336) [ClassicSimilarity], result of:
            0.012533808 = score(doc=4336,freq=1.0), product of:
              0.048522502 = queryWeight, product of:
                1.1105024 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.010572163 = queryNorm
              0.2583092 = fieldWeight in 4336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.09842587 = weight(abstract_txt:tweet in 4336) [ClassicSimilarity], result of:
            0.09842587 = score(doc=4336,freq=3.0), product of:
              0.105497845 = queryWeight, product of:
                1.1578563 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.010572163 = queryNorm
              0.9329657 = fieldWeight in 4336, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.014412102 = weight(abstract_txt:specific in 4336) [ClassicSimilarity], result of:
            0.014412102 = score(doc=4336,freq=1.0), product of:
              0.05325651 = queryWeight, product of:
                1.1634139 = boost
                4.3298674 = idf(docFreq=1529, maxDocs=42740)
                0.010572163 = queryNorm
              0.2706167 = fieldWeight in 4336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3298674 = idf(docFreq=1529, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.03996607 = weight(abstract_txt:modeling in 4336) [ClassicSimilarity], result of:
            0.03996607 = score(doc=4336,freq=1.0), product of:
              0.10511922 = queryWeight, product of:
                1.6345152 = boost
                6.083161 = idf(docFreq=264, maxDocs=42740)
                0.010572163 = queryNorm
              0.38019755 = fieldWeight in 4336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.083161 = idf(docFreq=264, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.15909897 = weight(abstract_txt:twitter in 4336) [ClassicSimilarity], result of:
            0.15909897 = score(doc=4336,freq=6.0), product of:
              0.14530559 = queryWeight, product of:
                1.9217153 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010572163 = queryNorm
              1.0949267 = fieldWeight in 4336, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.070438385 = weight(abstract_txt:topic in 4336) [ClassicSimilarity], result of:
            0.070438385 = score(doc=4336,freq=1.0), product of:
              0.22120817 = queryWeight, product of:
                4.1068525 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.010572163 = queryNorm
              0.31842577 = fieldWeight in 4336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
          0.91840476 = weight(abstract_txt:tweets in 4336) [ClassicSimilarity], result of:
            0.91840476 = score(doc=4336,freq=6.0), product of:
              0.7719651 = queryWeight, product of:
                9.396215 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.010572163 = queryNorm
              1.1896973 = fieldWeight in 4336, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=4336)
        0.28 = coord(7/25)
    
  2. Sedhai, S.; Sun, A.: An analysis of 14 Million tweets on hashtag-oriented spamming (2017) 0.20
    0.20396845 = sum of:
      0.20396845 = product of:
        1.2748028 = sum of:
          0.09842587 = weight(abstract_txt:tweet in 5684) [ClassicSimilarity], result of:
            0.09842587 = score(doc=5684,freq=3.0), product of:
              0.105497845 = queryWeight, product of:
                1.1578563 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.010572163 = queryNorm
              0.9329657 = fieldWeight in 5684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.0625 = fieldNorm(doc=5684)
          0.14523682 = weight(abstract_txt:twitter in 5684) [ClassicSimilarity], result of:
            0.14523682 = score(doc=5684,freq=5.0), product of:
              0.14530559 = queryWeight, product of:
                1.9217153 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010572163 = queryNorm
              0.99952674 = fieldWeight in 5684, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.0625 = fieldNorm(doc=5684)
          0.039149653 = weight(abstract_txt:personal in 5684) [ClassicSimilarity], result of:
            0.039149653 = score(doc=5684,freq=1.0), product of:
              0.1186871 = queryWeight, product of:
                2.127136 = boost
                5.277696 = idf(docFreq=592, maxDocs=42740)
                0.010572163 = queryNorm
              0.329856 = fieldWeight in 5684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.277696 = idf(docFreq=592, maxDocs=42740)
                0.0625 = fieldNorm(doc=5684)
          0.9919905 = weight(abstract_txt:tweets in 5684) [ClassicSimilarity], result of:
            0.9919905 = score(doc=5684,freq=7.0), product of:
              0.7719651 = queryWeight, product of:
                9.396215 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.010572163 = queryNorm
              1.2850199 = fieldWeight in 5684, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=5684)
        0.16 = coord(4/25)
    
  3. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.20
    0.19712225 = sum of:
      0.19712225 = product of:
        0.9856112 = sum of:
          0.05199855 = weight(abstract_txt:superiority in 673) [ClassicSimilarity], result of:
            0.05199855 = score(doc=673,freq=1.0), product of:
              0.09943485 = queryWeight, product of:
                1.1240929 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.010572163 = queryNorm
              0.5229409 = fieldWeight in 673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.0625 = fieldNorm(doc=673)
          0.056826204 = weight(abstract_txt:tweet in 673) [ClassicSimilarity], result of:
            0.056826204 = score(doc=673,freq=1.0), product of:
              0.105497845 = queryWeight, product of:
                1.1578563 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.010572163 = queryNorm
              0.538648 = fieldWeight in 673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.0625 = fieldNorm(doc=673)
          0.014412102 = weight(abstract_txt:specific in 673) [ClassicSimilarity], result of:
            0.014412102 = score(doc=673,freq=1.0), product of:
              0.05325651 = queryWeight, product of:
                1.1634139 = boost
                4.3298674 = idf(docFreq=1529, maxDocs=42740)
                0.010572163 = queryNorm
              0.2706167 = fieldWeight in 673, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3298674 = idf(docFreq=1529, maxDocs=42740)
                0.0625 = fieldNorm(doc=673)
          0.11249995 = weight(abstract_txt:twitter in 673) [ClassicSimilarity], result of:
            0.11249995 = score(doc=673,freq=3.0), product of:
              0.14530559 = queryWeight, product of:
                1.9217153 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010572163 = queryNorm
              0.77423006 = fieldWeight in 673, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.0625 = fieldNorm(doc=673)
          0.74987435 = weight(abstract_txt:tweets in 673) [ClassicSimilarity], result of:
            0.74987435 = score(doc=673,freq=4.0), product of:
              0.7719651 = queryWeight, product of:
                9.396215 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.010572163 = queryNorm
              0.97138375 = fieldWeight in 673, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=673)
        0.2 = coord(5/25)
    
  4. Gonçalo Oliveira, H.: Automatic generation of poetry inspired by Twitter trends (2016) 0.20
    0.19643378 = sum of:
      0.19643378 = product of:
        0.9821689 = sum of:
          0.023078727 = weight(abstract_txt:given in 4389) [ClassicSimilarity], result of:
            0.023078727 = score(doc=4389,freq=1.0), product of:
              0.06281871 = queryWeight, product of:
                1.2635499 = boost
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.010572163 = queryNorm
              0.36738616 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.702543 = idf(docFreq=1053, maxDocs=42740)
                0.078125 = fieldNorm(doc=4389)
          0.08118985 = weight(abstract_txt:twitter in 4389) [ClassicSimilarity], result of:
            0.08118985 = score(doc=4389,freq=1.0), product of:
              0.14530559 = queryWeight, product of:
                1.9217153 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010572163 = queryNorm
              0.5587524 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.078125 = fieldNorm(doc=4389)
          0.039002877 = weight(abstract_txt:recent in 4389) [ClassicSimilarity], result of:
            0.039002877 = score(doc=4389,freq=1.0), product of:
              0.102025636 = queryWeight, product of:
                1.9721874 = boost
                4.8932486 = idf(docFreq=870, maxDocs=42740)
                0.010572163 = queryNorm
              0.38228506 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8932486 = idf(docFreq=870, maxDocs=42740)
                0.078125 = fieldNorm(doc=4389)
          0.17609596 = weight(abstract_txt:topic in 4389) [ClassicSimilarity], result of:
            0.17609596 = score(doc=4389,freq=4.0), product of:
              0.22120817 = queryWeight, product of:
                4.1068525 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.010572163 = queryNorm
              0.79606444 = fieldWeight in 4389, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.078125 = fieldNorm(doc=4389)
          0.6628015 = weight(abstract_txt:tweets in 4389) [ClassicSimilarity], result of:
            0.6628015 = score(doc=4389,freq=2.0), product of:
              0.7719651 = queryWeight, product of:
                9.396215 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.010572163 = queryNorm
              0.85859 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.078125 = fieldNorm(doc=4389)
        0.2 = coord(5/25)
    
  5. Cotelo, J.M.; Cruz, F.L.; Troyano, J.A.: Dynamic topic-related tweet retrieval (2014) 0.16
    0.1601852 = sum of:
      0.1601852 = product of:
        1.0011575 = sum of:
          0.071032755 = weight(abstract_txt:tweet in 3218) [ClassicSimilarity], result of:
            0.071032755 = score(doc=3218,freq=1.0), product of:
              0.105497845 = queryWeight, product of:
                1.1578563 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.010572163 = queryNorm
              0.67331004 = fieldWeight in 3218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.078125 = fieldNorm(doc=3218)
          0.11481978 = weight(abstract_txt:twitter in 3218) [ClassicSimilarity], result of:
            0.11481978 = score(doc=3218,freq=2.0), product of:
              0.14530559 = queryWeight, product of:
                1.9217153 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010572163 = queryNorm
              0.7901952 = fieldWeight in 3218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.078125 = fieldNorm(doc=3218)
          0.15250356 = weight(abstract_txt:topic in 3218) [ClassicSimilarity], result of:
            0.15250356 = score(doc=3218,freq=3.0), product of:
              0.22120817 = queryWeight, product of:
                4.1068525 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.010572163 = queryNorm
              0.689412 = fieldWeight in 3218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.078125 = fieldNorm(doc=3218)
          0.6628015 = weight(abstract_txt:tweets in 3218) [ClassicSimilarity], result of:
            0.6628015 = score(doc=3218,freq=2.0), product of:
              0.7719651 = queryWeight, product of:
                9.396215 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.010572163 = queryNorm
              0.85859 = fieldWeight in 3218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.078125 = fieldNorm(doc=3218)
        0.16 = coord(4/25)
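
How the scores above are composed

The explain trees above follow the pattern of Lucene's ClassicSimilarity: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is multiplied by the coord factor (matched terms / query terms). The sketch below recomputes entry 5 (doc 3218) from the numbers printed in its tree. It assumes the usual ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function and variable names are illustrative only and are not part of the catalogue software. Lucene works in 32-bit floats, so the last digits may differ slightly from the listing.

```python
import math

MAX_DOCS = 42740          # maxDocs as printed in the explain trees
QUERY_NORM = 0.010572163  # queryNorm as printed in the explain trees


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as assumed for ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    """Contribution of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (freq, docFreq, boost, fieldNorm) copied from the tree of doc 3218
terms_3218 = {
    "tweet":   (1.0, 20, 1.1578563, 0.078125),
    "twitter": (2.0, 90, 1.9217153, 0.078125),
    "topic":   (3.0, 711, 4.1068525, 0.078125),
    "tweets":  (2.0, 48, 9.396215, 0.078125),
}

# Sum of the per-term contributions, then scale by coord(4/25)
total = sum(term_score(*values) for values in terms_3218.values())
coord = 4 / 25
final = total * coord

print(f"sum of term scores: {total:.7f}")
print(f"final score:        {final:.7f}")
```

The two printed values should land close to the 1.0011575 and 0.1601852 shown for entry 5. The same function can be applied to the other entries by swapping in their freq, docFreq, boost and fieldNorm values and the matching coord factor.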