Document (#39339)

Author
Kong, S.
Ye, F.
Feng, L.
Zhao, Z.
Title
Towards the prediction problems of bursting hashtags on Twitter
Source
Journal of the Association for Information Science and Technology. 66(2015) no.12, S.2566-2579
Year
2015
Abstract
Hundreds of thousands of hashtags are generated every day on Twitter. Only a few will burst and become trending topics. In this article, we provide the definition of a bursting hashtag and conduct a systematic study of a series of challenging prediction problems that span the entire life cycles of bursting hashtags. Around the problem of "how to build a system to predict bursting hashtags," we explore different types of features and present machine learning solutions. On real data sets from Twitter, experiments are conducted to evaluate the effectiveness of the proposed solutions and the contributions of features.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23342/abstract.
Theme
Internet
Data Mining
Object
Twitter

Similar documents (author)

  1. Feng, S.: ¬A comparative study of indexing languages in single and multidatabase searching (1989) 2.57
    2.5720606 = sum of:
      2.5720606 = product of:
        5.144121 = sum of:
          5.144121 = weight(author_txt:feng in 2494) [ClassicSimilarity], result of:
            5.144121 = score(doc=2494,freq=1.0), product of:
              0.85579824 = queryWeight, product of:
                1.2862054 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.069183305 = queryNorm
              6.010904 = fieldWeight in 2494, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=2494)
        0.5 = coord(1/2)
    
  2. Feng, Y.; Agosto, D.E.: Revisiting personal information management through information practices with activity tracking technology (2019) 2.06
    2.0576484 = sum of:
      2.0576484 = product of:
        4.115297 = sum of:
          4.115297 = weight(author_txt:feng in 5438) [ClassicSimilarity], result of:
            4.115297 = score(doc=5438,freq=1.0), product of:
              0.85579824 = queryWeight, product of:
                1.2862054 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.069183305 = queryNorm
              4.808723 = fieldWeight in 5438, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.5 = fieldNorm(doc=5438)
        0.5 = coord(1/2)
    
  3. Feng, L.; Jeusfeld, M.A.; Hoppenbrouwers, J.: Beyond information searching and browsing : acquiring knowledge from digital libraries (2005) 1.54
    1.5432363 = sum of:
      1.5432363 = product of:
        3.0864725 = sum of:
          3.0864725 = weight(author_txt:feng in 1000) [ClassicSimilarity], result of:
            3.0864725 = score(doc=1000,freq=1.0), product of:
              0.85579824 = queryWeight, product of:
                1.2862054 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.069183305 = queryNorm
              3.606542 = fieldWeight in 1000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=1000)
        0.5 = coord(1/2)
    
  4. Xu, G.; Cao, Y.; Ren, Y.; Li, X.; Feng, Z.: Network security situation awareness based on semantic ontology and user-defined rules for Internet of Things (2017) 1.29
    1.2860303 = sum of:
      1.2860303 = product of:
        2.5720606 = sum of:
          2.5720606 = weight(author_txt:feng in 306) [ClassicSimilarity], result of:
            2.5720606 = score(doc=306,freq=1.0), product of:
              0.85579824 = queryWeight, product of:
                1.2862054 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.069183305 = queryNorm
              3.005452 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=306)
        0.5 = coord(1/2)
    
  5. Zhao, L.: Save space for "newcomers" : analyzing problems in book number assignment under the LCC system (2004) 1.21
    1.208788 = sum of:
      1.208788 = product of:
        2.417576 = sum of:
          2.417576 = weight(author_txt:zhao in 3081) [ClassicSimilarity], result of:
            2.417576 = score(doc=3081,freq=1.0), product of:
              0.5173098 = queryWeight, product of:
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.069183305 = queryNorm
              4.6733623 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.625 = fieldNorm(doc=3081)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Çelebi, A.; Özgür, A.: Segmenting hashtags and analyzing their grammatical structure (2018) 0.43
    0.43049285 = sum of:
      0.43049285 = product of:
        1.7937202 = sum of:
          0.041656896 = weight(abstract_txt:challenging in 4221) [ClassicSimilarity], result of:
            0.041656896 = score(doc=4221,freq=1.0), product of:
              0.09907879 = queryWeight, product of:
                1.2189062 = boost
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.012083263 = queryNorm
              0.42044213 = fieldWeight in 4221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
          0.0361974 = weight(abstract_txt:features in 4221) [ClassicSimilarity], result of:
            0.0361974 = score(doc=4221,freq=2.0), product of:
              0.09022101 = queryWeight, product of:
                1.644935 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.012083263 = queryNorm
              0.4012081 = fieldWeight in 4221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
          0.117309414 = weight(abstract_txt:trending in 4221) [ClassicSimilarity], result of:
            0.117309414 = score(doc=4221,freq=1.0), product of:
              0.19758077 = queryWeight, product of:
                1.7212828 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012083263 = queryNorm
              0.5937289 = fieldWeight in 4221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
          0.28368592 = weight(abstract_txt:hashtag in 4221) [ClassicSimilarity], result of:
            0.28368592 = score(doc=4221,freq=5.0), product of:
              0.2081731 = queryWeight, product of:
                1.7668196 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.012083263 = queryNorm
              1.3627405 = fieldWeight in 4221, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
          0.13590229 = weight(abstract_txt:twitter in 4221) [ClassicSimilarity], result of:
            0.13590229 = score(doc=4221,freq=1.0), product of:
              0.31432652 = queryWeight, product of:
                3.7603743 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.012083263 = queryNorm
              0.43236023 = fieldWeight in 4221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
          1.1789683 = weight(abstract_txt:hashtags in 4221) [ClassicSimilarity], result of:
            1.1789683 = score(doc=4221,freq=8.0), product of:
              0.73032165 = queryWeight, product of:
                6.6186132 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012083263 = queryNorm
              1.6143138 = fieldWeight in 4221, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=4221)
        0.24 = coord(6/25)
    
  2. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.35
    0.34979066 = sum of:
      0.34979066 = product of:
        1.4574611 = sum of:
          0.04258953 = weight(abstract_txt:predict in 967) [ClassicSimilarity], result of:
            0.04258953 = score(doc=967,freq=1.0), product of:
              0.10055214 = queryWeight, product of:
                1.2279356 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.012083263 = queryNorm
              0.42355666 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.06771914 = weight(abstract_txt:features in 967) [ClassicSimilarity], result of:
            0.06771914 = score(doc=967,freq=7.0), product of:
              0.09022101 = queryWeight, product of:
                1.644935 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.012083263 = queryNorm
              0.75059164 = fieldWeight in 967, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.21974216 = weight(abstract_txt:hashtag in 967) [ClassicSimilarity], result of:
            0.21974216 = score(doc=967,freq=3.0), product of:
              0.2081731 = queryWeight, product of:
                1.7668196 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.012083263 = queryNorm
              1.0555743 = fieldWeight in 967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.10155585 = weight(abstract_txt:prediction in 967) [ClassicSimilarity], result of:
            0.10155585 = score(doc=967,freq=1.0), product of:
              0.22611848 = queryWeight, product of:
                2.6041317 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.012083263 = queryNorm
              0.44912672 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.30388674 = weight(abstract_txt:twitter in 967) [ClassicSimilarity], result of:
            0.30388674 = score(doc=967,freq=5.0), product of:
              0.31432652 = queryWeight, product of:
                3.7603743 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.012083263 = queryNorm
              0.96678686 = fieldWeight in 967, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.7219677 = weight(abstract_txt:hashtags in 967) [ClassicSimilarity], result of:
            0.7219677 = score(doc=967,freq=3.0), product of:
              0.73032165 = queryWeight, product of:
                6.6186132 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012083263 = queryNorm
              0.9885613 = fieldWeight in 967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
        0.24 = coord(6/25)
    
  3. Chang, H.-C.; Iyer, I.: Trends in Twitter hashtag applications : design features for value-added dimensions to future library catalogues (2012) 0.24
    0.24034145 = sum of:
      0.24034145 = product of:
        1.5021341 = sum of:
          0.031994287 = weight(abstract_txt:features in 5574) [ClassicSimilarity], result of:
            0.031994287 = score(doc=5574,freq=1.0), product of:
              0.09022101 = queryWeight, product of:
                1.644935 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.012083263 = queryNorm
              0.35462123 = fieldWeight in 5574, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.078125 = fieldNorm(doc=5574)
          0.3171705 = weight(abstract_txt:hashtag in 5574) [ClassicSimilarity], result of:
            0.3171705 = score(doc=5574,freq=4.0), product of:
              0.2081731 = queryWeight, product of:
                1.7668196 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.012083263 = queryNorm
              1.5235902 = fieldWeight in 5574, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=5574)
          0.4161141 = weight(abstract_txt:twitter in 5574) [ClassicSimilarity], result of:
            0.4161141 = score(doc=5574,freq=6.0), product of:
              0.31432652 = queryWeight, product of:
                3.7603743 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.012083263 = queryNorm
              1.3238275 = fieldWeight in 5574, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.078125 = fieldNorm(doc=5574)
          0.7368552 = weight(abstract_txt:hashtags in 5574) [ClassicSimilarity], result of:
            0.7368552 = score(doc=5574,freq=2.0), product of:
              0.73032165 = queryWeight, product of:
                6.6186132 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012083263 = queryNorm
              1.0089462 = fieldWeight in 5574, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=5574)
        0.16 = coord(4/25)
    
  4. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 0.18
    0.1809256 = sum of:
      0.1809256 = product of:
        1.130785 = sum of:
          0.0412862 = weight(abstract_txt:systematic in 5775) [ClassicSimilarity], result of:
            0.0412862 = score(doc=5775,freq=1.0), product of:
              0.07516204 = queryWeight, product of:
                1.0616447 = boost
                5.8591566 = idf(docFreq=342, maxDocs=44218)
                0.012083263 = queryNorm
              0.5492959 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8591566 = idf(docFreq=342, maxDocs=44218)
                0.09375 = fieldNorm(doc=5775)
          0.17596412 = weight(abstract_txt:trending in 5775) [ClassicSimilarity], result of:
            0.17596412 = score(doc=5775,freq=1.0), product of:
              0.19758077 = queryWeight, product of:
                1.7212828 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012083263 = queryNorm
              0.89059335 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=5775)
          0.2882923 = weight(abstract_txt:twitter in 5775) [ClassicSimilarity], result of:
            0.2882923 = score(doc=5775,freq=2.0), product of:
              0.31432652 = queryWeight, product of:
                3.7603743 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.012083263 = queryNorm
              0.9171746 = fieldWeight in 5775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.09375 = fieldNorm(doc=5775)
          0.62524235 = weight(abstract_txt:hashtags in 5775) [ClassicSimilarity], result of:
            0.62524235 = score(doc=5775,freq=1.0), product of:
              0.73032165 = queryWeight, product of:
                6.6186132 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012083263 = queryNorm
              0.85611916 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.09375 = fieldNorm(doc=5775)
        0.16 = coord(4/25)
    
  5. Alkhodair, S.A.; Fung, B.C.M.; Patrick, O.R.; Hung, C.K.: Improving interpretations of topic modeling in microblogs (2018) 0.15
    0.1481517 = sum of:
      0.1481517 = product of:
        0.92594814 = sum of:
          0.023002557 = weight(abstract_txt:life in 4181) [ClassicSimilarity], result of:
            0.023002557 = score(doc=4181,freq=1.0), product of:
              0.06668685 = queryWeight, product of:
                5.5189433 = idf(docFreq=481, maxDocs=44218)
                0.012083263 = queryNorm
              0.34493396 = fieldWeight in 4181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5189433 = idf(docFreq=481, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.041656896 = weight(abstract_txt:challenging in 4181) [ClassicSimilarity], result of:
            0.041656896 = score(doc=4181,freq=1.0), product of:
              0.09907879 = queryWeight, product of:
                1.2189062 = boost
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.012083263 = queryNorm
              0.42044213 = fieldWeight in 4181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.727074 = idf(docFreq=143, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.27180457 = weight(abstract_txt:twitter in 4181) [ClassicSimilarity], result of:
            0.27180457 = score(doc=4181,freq=4.0), product of:
              0.31432652 = queryWeight, product of:
                3.7603743 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.012083263 = queryNorm
              0.86472046 = fieldWeight in 4181, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
          0.58948416 = weight(abstract_txt:hashtags in 4181) [ClassicSimilarity], result of:
            0.58948416 = score(doc=4181,freq=2.0), product of:
              0.73032165 = queryWeight, product of:
                6.6186132 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012083263 = queryNorm
              0.8071569 = fieldWeight in 4181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=4181)
        0.16 = coord(4/25)