Document (#39888)

Author
Leginus, M.
Zhai, C.X.
Dolog, P.
Title
Personalized generation of word clouds from tweets
Source
Journal of the Association for Information Science and Technology. 67(2016) no.5, S.1021-1032
Year
2016
Abstract
Active users of Twitter are often overwhelmed with the vast amount of tweets. In this work we attempt to help users browsing a large number of accumulated posts. We propose a personalized word cloud generation as a means for users' navigation. Various user past activities such as user published tweets, retweets, and seen but not retweeted tweets are leveraged for enhanced personalization of word clouds. The best personalization results are attained with user past retweets. However, users' own past tweets are not as useful as retweets for personalization. Negative preferences derived from seen but not retweeted tweets further enhance personalized word cloud generation. The ranking combination method outperforms the preranking approach and provides a general framework for combined ranking of various user past information for enhanced word cloud generation. To better capture subtle differences of generated word clouds, we propose an evaluation of word clouds with a mean average precision measure.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23494/abstract.
Theme
Informetrie
Object
Twitter

Similar documents (author)

  1. Zhai, C.X.; Lafferty, J.: ¬A risk minimization framework for information retrieval (2006) 4.73
    4.72773 = sum of:
      4.72773 = weight(author_txt:zhai in 2959) [ClassicSimilarity], result of:
        4.72773 = fieldWeight in 2959, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.45546 = idf(docFreq=8, maxDocs=42306)
          0.5 = fieldNorm(doc=2959)
    
  2. Zhai, Y; Ding, Y.; Wang, F.: Measuring the diffusion of an innovation : a citation analysis (2018) 3.55
    3.5457973 = sum of:
      3.5457973 = weight(author_txt:zhai in 1035) [ClassicSimilarity], result of:
        3.5457973 = fieldWeight in 1035, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.45546 = idf(docFreq=8, maxDocs=42306)
          0.375 = fieldNorm(doc=1035)
    
  3. Vinod Vydiswaran, V.G.; Zhai, C.X.; Roth, D.; Pirolli, P.: Overcoming bias to learn about controversial topics (2015) 2.95
    2.9548311 = sum of:
      2.9548311 = weight(author_txt:zhai in 4130) [ClassicSimilarity], result of:
        2.9548311 = fieldWeight in 4130, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.45546 = idf(docFreq=8, maxDocs=42306)
          0.3125 = fieldNorm(doc=4130)
    
  4. Zhang, J.; Zhai, S.; Liu, H.; Stevenson, J.A.: Social network analysis on a topic-based navigation guidance system in a public health portal (2016) 2.95
    2.9548311 = sum of:
      2.9548311 = weight(author_txt:zhai in 4888) [ClassicSimilarity], result of:
        2.9548311 = fieldWeight in 4888, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.45546 = idf(docFreq=8, maxDocs=42306)
          0.3125 = fieldNorm(doc=4888)
    
  5. Zhang, J.; Zhai, S.; Stevenson, J.A.; Xia, L.: Optimization of the subject directory in a government agriculture department web portal (2016) 2.95
    2.9548311 = sum of:
      2.9548311 = weight(author_txt:zhai in 7) [ClassicSimilarity], result of:
        2.9548311 = fieldWeight in 7, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.45546 = idf(docFreq=8, maxDocs=42306)
          0.3125 = fieldNorm(doc=7)
    

Similar documents (content)

  1. Sedhai, S.; Sun, A.: ¬An analysis of 14 Million tweets on hashtag-oriented spamming* (2017) 0.24
    0.23829508 = sum of:
      0.23829508 = product of:
        0.85105383 = sum of:
          0.06768268 = weight(abstract_txt:twitter in 602) [ClassicSimilarity], result of:
            0.06768268 = score(doc=602,freq=5.0), product of:
              0.06738736 = queryWeight, product of:
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.009376576 = queryNorm
              1.0043825 = fieldWeight in 602, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.00394565 = weight(abstract_txt:with in 602) [ClassicSimilarity], result of:
            0.00394565 = score(doc=602,freq=1.0), product of:
              0.02498635 = queryWeight, product of:
                1.0546852 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.009376576 = queryNorm
              0.15791221 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.014158677 = weight(abstract_txt:various in 602) [ClassicSimilarity], result of:
            0.014158677 = score(doc=602,freq=1.0), product of:
              0.05116133 = queryWeight, product of:
                1.232244 = boost
                4.427931 = idf(docFreq=1372, maxDocs=42306)
                0.009376576 = queryNorm
              0.27674568 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.427931 = idf(docFreq=1372, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.021135489 = weight(abstract_txt:users in 602) [ClassicSimilarity], result of:
            0.021135489 = score(doc=602,freq=2.0), product of:
              0.06682438 = queryWeight, product of:
                1.9916282 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.009376576 = queryNorm
              0.31628412 = fieldWeight in 602, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.036313478 = weight(abstract_txt:user in 602) [ClassicSimilarity], result of:
            0.036313478 = score(doc=602,freq=5.0), product of:
              0.070630446 = queryWeight, product of:
                2.0475607 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.009376576 = queryNorm
              0.5141335 = fieldWeight in 602, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.09292722 = weight(abstract_txt:word in 602) [ClassicSimilarity], result of:
            0.09292722 = score(doc=602,freq=1.0), product of:
              0.27229813 = queryWeight, product of:
                5.318414 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.009376576 = queryNorm
              0.34127012 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
          0.61489064 = weight(abstract_txt:tweets in 602) [ClassicSimilarity], result of:
            0.61489064 = score(doc=602,freq=7.0), product of:
              0.47657698 = queryWeight, product of:
                6.514078 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              1.2902231 = fieldWeight in 602, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=602)
        0.28 = coord(7/25)
    
  2. Cai, F.; Wang, S.; Rijke, M.de: Behavior-based personalization in web search (2017) 0.20
    0.2025143 = sum of:
      0.2025143 = product of:
        0.5625397 = sum of:
          0.030414924 = weight(abstract_txt:outperforms in 446) [ClassicSimilarity], result of:
            0.030414924 = score(doc=446,freq=1.0), product of:
              0.06760433 = queryWeight, product of:
                1.0016086 = boost
                7.198337 = idf(docFreq=85, maxDocs=42306)
                0.009376576 = queryNorm
              0.44989607 = fieldWeight in 446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.198337 = idf(docFreq=85, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.0055799917 = weight(abstract_txt:with in 446) [ClassicSimilarity], result of:
            0.0055799917 = score(doc=446,freq=2.0), product of:
              0.02498635 = queryWeight, product of:
                1.0546852 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.009376576 = queryNorm
              0.22332159 = fieldWeight in 446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.02864987 = weight(abstract_txt:ranking in 446) [ClassicSimilarity], result of:
            0.02864987 = score(doc=446,freq=1.0), product of:
              0.08184808 = queryWeight, product of:
                1.5585834 = boost
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.009376576 = queryNorm
              0.3500372 = fieldWeight in 446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.059469078 = weight(abstract_txt:seen in 446) [ClassicSimilarity], result of:
            0.059469078 = score(doc=446,freq=3.0), product of:
              0.09234508 = queryWeight, product of:
                1.6555133 = boost
                5.9489017 = idf(docFreq=299, maxDocs=42306)
                0.009376576 = queryNorm
              0.6439875 = fieldWeight in 446, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9489017 = idf(docFreq=299, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.036607742 = weight(abstract_txt:users in 446) [ClassicSimilarity], result of:
            0.036607742 = score(doc=446,freq=6.0), product of:
              0.06682438 = queryWeight, product of:
                1.9916282 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.009376576 = queryNorm
              0.5478202 = fieldWeight in 446, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.0281283 = weight(abstract_txt:user in 446) [ClassicSimilarity], result of:
            0.0281283 = score(doc=446,freq=3.0), product of:
              0.070630446 = queryWeight, product of:
                2.0475607 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.009376576 = queryNorm
              0.3982461 = fieldWeight in 446, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.16455919 = weight(abstract_txt:personalized in 446) [ClassicSimilarity], result of:
            0.16455919 = score(doc=446,freq=3.0), product of:
              0.20835222 = queryWeight, product of:
                3.0455835 = boost
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.009376576 = queryNorm
              0.78981245 = fieldWeight in 446, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.295975 = idf(docFreq=77, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.11620341 = weight(abstract_txt:personalization in 446) [ClassicSimilarity], result of:
            0.11620341 = score(doc=446,freq=1.0), product of:
              0.23828849 = queryWeight, product of:
                3.257039 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              0.48765853 = fieldWeight in 446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
          0.09292722 = weight(abstract_txt:word in 446) [ClassicSimilarity], result of:
            0.09292722 = score(doc=446,freq=1.0), product of:
              0.27229813 = queryWeight, product of:
                5.318414 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.009376576 = queryNorm
              0.34127012 = fieldWeight in 446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=446)
        0.36 = coord(9/25)
    
  3. Gorrell, G.; Bontcheva, K.: Classifying Twitter favorites : Like, bookmark, or Thanks? (2016) 0.15
    0.15071352 = sum of:
      0.15071352 = product of:
        0.627973 = sum of:
          0.05242678 = weight(abstract_txt:twitter in 4488) [ClassicSimilarity], result of:
            0.05242678 = score(doc=4488,freq=3.0), product of:
              0.06738736 = queryWeight, product of:
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.009376576 = queryNorm
              0.77799135 = fieldWeight in 4488, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
          0.00394565 = weight(abstract_txt:with in 4488) [ClassicSimilarity], result of:
            0.00394565 = score(doc=4488,freq=1.0), product of:
              0.02498635 = queryWeight, product of:
                1.0546852 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.009376576 = queryNorm
              0.15791221 = fieldWeight in 4488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
          0.029890096 = weight(abstract_txt:users in 4488) [ClassicSimilarity], result of:
            0.029890096 = score(doc=4488,freq=4.0), product of:
              0.06682438 = queryWeight, product of:
                1.9916282 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.009376576 = queryNorm
              0.4472933 = fieldWeight in 4488, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
          0.02296666 = weight(abstract_txt:user in 4488) [ClassicSimilarity], result of:
            0.02296666 = score(doc=4488,freq=2.0), product of:
              0.070630446 = queryWeight, product of:
                2.0475607 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.009376576 = queryNorm
              0.32516658 = fieldWeight in 4488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
          0.11620341 = weight(abstract_txt:personalization in 4488) [ClassicSimilarity], result of:
            0.11620341 = score(doc=4488,freq=1.0), product of:
              0.23828849 = queryWeight, product of:
                3.257039 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              0.48765853 = fieldWeight in 4488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
          0.40254042 = weight(abstract_txt:tweets in 4488) [ClassicSimilarity], result of:
            0.40254042 = score(doc=4488,freq=3.0), product of:
              0.47657698 = queryWeight, product of:
                6.514078 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              0.8446493 = fieldWeight in 4488, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=4488)
        0.24 = coord(6/25)
    
  4. Chin, J.Y.; Bhowmick, S.S.; Jatowt, A.: On-demand recent personal tweets summarization on mobile devices (2019) 0.15
    0.14705653 = sum of:
      0.14705653 = product of:
        0.73528266 = sum of:
          0.042806286 = weight(abstract_txt:twitter in 2164) [ClassicSimilarity], result of:
            0.042806286 = score(doc=2164,freq=2.0), product of:
              0.06738736 = queryWeight, product of:
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.009376576 = queryNorm
              0.63522726 = fieldWeight in 2164, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.0625 = fieldNorm(doc=2164)
          0.00394565 = weight(abstract_txt:with in 2164) [ClassicSimilarity], result of:
            0.00394565 = score(doc=2164,freq=1.0), product of:
              0.02498635 = queryWeight, product of:
                1.0546852 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.009376576 = queryNorm
              0.15791221 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=2164)
          0.014945048 = weight(abstract_txt:users in 2164) [ClassicSimilarity], result of:
            0.014945048 = score(doc=2164,freq=1.0), product of:
              0.06682438 = queryWeight, product of:
                1.9916282 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.009376576 = queryNorm
              0.22364666 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=2164)
          0.016239882 = weight(abstract_txt:user in 2164) [ClassicSimilarity], result of:
            0.016239882 = score(doc=2164,freq=1.0), product of:
              0.070630446 = queryWeight, product of:
                2.0475607 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.009376576 = queryNorm
              0.2299275 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=2164)
          0.6573458 = weight(abstract_txt:tweets in 2164) [ClassicSimilarity], result of:
            0.6573458 = score(doc=2164,freq=8.0), product of:
              0.47657698 = queryWeight, product of:
                6.514078 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              1.3793066 = fieldWeight in 2164, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=2164)
        0.2 = coord(5/25)
    
  5. Arakawa, Y.; Kameda, A.; Aizawa, A.; Suzuki, T.: Adding Twitter-specific features to stylistic features for classifying tweets by user type and number of retweets (2014) 0.15
    0.14551853 = sum of:
      0.14551853 = product of:
        0.7275926 = sum of:
          0.05242678 = weight(abstract_txt:twitter in 3308) [ClassicSimilarity], result of:
            0.05242678 = score(doc=3308,freq=3.0), product of:
              0.06738736 = queryWeight, product of:
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.009376576 = queryNorm
              0.77799135 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.186776 = idf(docFreq=86, maxDocs=42306)
                0.0625 = fieldNorm(doc=3308)
          0.0055799917 = weight(abstract_txt:with in 3308) [ClassicSimilarity], result of:
            0.0055799917 = score(doc=3308,freq=2.0), product of:
              0.02498635 = queryWeight, product of:
                1.0546852 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.009376576 = queryNorm
              0.22332159 = fieldWeight in 3308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=3308)
          0.032479763 = weight(abstract_txt:user in 3308) [ClassicSimilarity], result of:
            0.032479763 = score(doc=3308,freq=4.0), product of:
              0.070630446 = queryWeight, product of:
                2.0475607 = boost
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.009376576 = queryNorm
              0.459855 = fieldWeight in 3308, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.67884 = idf(docFreq=2903, maxDocs=42306)
                0.0625 = fieldNorm(doc=3308)
          0.23456563 = weight(abstract_txt:retweets in 3308) [ClassicSimilarity], result of:
            0.23456563 = score(doc=3308,freq=1.0), product of:
              0.3805982 = queryWeight, product of:
                4.116279 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.009376576 = queryNorm
              0.6163078 = fieldWeight in 3308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.0625 = fieldNorm(doc=3308)
          0.40254042 = weight(abstract_txt:tweets in 3308) [ClassicSimilarity], result of:
            0.40254042 = score(doc=3308,freq=3.0), product of:
              0.47657698 = queryWeight, product of:
                6.514078 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.009376576 = queryNorm
              0.8446493 = fieldWeight in 3308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=3308)
        0.2 = coord(5/25)