Document (#38218)

Author
Cotelo, J.M.
Cruz, F.L.
Troyano, J.A.
Title
Dynamic topic-related tweet retrieval
Source
Journal of the Association for Information Science and Technology. 65(2014) no.3, pp.513-523
Year
2014
Abstract
Twitter is a social network in which people publish brief, publicly accessible instant messages. With its exponential growth and the public nature and transversality of its contents, more researchers are using Twitter as a source of data for multiple purposes. In this context, the ability to retrieve those messages (tweets) related to a certain topic becomes critical. In this work, we define the topic-related tweet retrieval task and propose a dynamic, graph-based method with which to address it. We have applied our method to capture a data set containing tweets related to the participation of the Spanish team in the Euro 2012 soccer competition, measuring the precision and recall against other simple but commonly used approaches. The results demonstrate the effectiveness of our method, which significantly increases coverage of the chosen topic and is able to capture related subtopics that were unknown a priori.
Object
Twitter

Similar documents (author)

  1. Díaz, N.P. Cruz -> Cruz Díaz, N.P.: 4.98
    4.982081 = sum of:
      4.982081 = weight(author_txt:cruz in 233) [ClassicSimilarity], result of:
        4.982081 = fieldWeight in 233, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=233)
    
  2. Cruz, T. Trindade -> Trindade Cruz, T.: 4.98
    4.982081 = sum of:
      4.982081 = weight(author_txt:cruz in 4843) [ClassicSimilarity], result of:
        4.982081 = fieldWeight in 4843, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=4843)
    
  3. Trindade Cruz, T.: Digital heritage : challenges and opportunities in the access and organisation of digital knowledge in contemporary societies (2018) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:cruz in 4846) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 4846, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=4846)
    
  4. Barrueco Cruz, J.M.; Krichel, T.: Subject description in the Academic Metadata Format (2003) 4.11
    4.1100073 = sum of:
      4.1100073 = weight(author_txt:cruz in 3548) [ClassicSimilarity], result of:
        4.1100073 = fieldWeight in 3548, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.4375 = fieldNorm(doc=3548)
    
  5. Cruz, J.M.B.; Garcia, J.A.C.; Lopez, R.F.: Preprints: communication through electronic nets : an example of bibliographic control (1996) 3.52
    3.5228634 = sum of:
      3.5228634 = weight(author_txt:cruz in 4723) [ClassicSimilarity], result of:
        3.5228634 = fieldWeight in 4723, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=4723)
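
The author scores above can be recomputed from the values printed in each tree. The sketch below is only an illustration: the function names are hypothetical, and the idf formula, 1 + ln(maxDocs / (docFreq + 1)), is assumed because it reproduces the printed idf of 9.394302. Other Lucene versions may compute idf slightly differently.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf form; chosen because it matches idf(docFreq=9, maxDocs=44218)
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # tf is the square root of the raw term frequency (1.4142135 for freq=2.0)
    return math.sqrt(freq)

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as listed in each explain tree
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Inputs copied from entry 1 (doc 233) and entry 5 (doc 4723)
top = field_weight(freq=2.0, doc_freq=9, max_docs=44218, field_norm=0.375)
last = field_weight(freq=1.0, doc_freq=9, max_docs=44218, field_norm=0.375)
print(round(top, 2), round(last, 2))
```

Because the query has a single term (author_txt:cruz), the ranking depends only on the term frequency and the field norm of each record.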
    

Similar documents (content)

  1. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.37
    0.3659779 = sum of:
      0.3659779 = product of:
        1.143681 = sum of:
          0.0996565 = weight(abstract_txt:instant in 4672) [ClassicSimilarity], result of:
            0.0996565 = score(doc=4672,freq=1.0), product of:
              0.19142643 = queryWeight, product of:
                1.2637938 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.018184524 = queryNorm
              0.5205995 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.012836233 = weight(abstract_txt:which in 4672) [ClassicSimilarity], result of:
            0.012836233 = score(doc=4672,freq=1.0), product of:
              0.070414744 = queryWeight, product of:
                1.3276014 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.018184524 = queryNorm
              0.18229467 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.09434051 = weight(abstract_txt:method in 4672) [ClassicSimilarity], result of:
            0.09434051 = score(doc=4672,freq=4.0), product of:
              0.16768107 = queryWeight, product of:
                2.048698 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018184524 = queryNorm
              0.56261873 = fieldWeight in 4672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.11375871 = weight(abstract_txt:messages in 4672) [ClassicSimilarity], result of:
            0.11375871 = score(doc=4672,freq=1.0), product of:
              0.2634296 = queryWeight, product of:
                2.0966337 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.018184524 = queryNorm
              0.43183723 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.19775265 = weight(abstract_txt:twitter in 4672) [ClassicSimilarity], result of:
            0.19775265 = score(doc=4672,freq=3.0), product of:
              0.2640681 = queryWeight, product of:
                2.099173 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018184524 = queryNorm
              0.7488699 = fieldWeight in 4672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.301117 = weight(abstract_txt:tweets in 4672) [ClassicSimilarity], result of:
            0.301117 = score(doc=4672,freq=4.0), product of:
              0.31754968 = queryWeight, product of:
                2.3019512 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018184524 = queryNorm
              0.94825166 = fieldWeight in 4672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.21320866 = weight(abstract_txt:tweet in 4672) [ClassicSimilarity], result of:
            0.21320866 = score(doc=4672,freq=1.0), product of:
              0.40044668 = queryWeight, product of:
                2.5850124 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018184524 = queryNorm
              0.5324271 = fieldWeight in 4672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
          0.11101081 = weight(abstract_txt:related in 4672) [ClassicSimilarity], result of:
            0.11101081 = score(doc=4672,freq=3.0), product of:
              0.24388845 = queryWeight, product of:
                3.1897447 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.018184524 = queryNorm
              0.45517042 = fieldWeight in 4672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=4672)
        0.32 = coord(8/25)
    
  2. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.35
    0.35189858 = sum of:
      0.35189858 = product of:
        1.2567806 = sum of:
          0.038293947 = weight(abstract_txt:retrieval in 2335) [ClassicSimilarity], result of:
            0.038293947 = score(doc=2335,freq=7.0), product of:
              0.066639066 = queryWeight, product of:
                1.0545198 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018184524 = queryNorm
              0.5746471 = fieldWeight in 2335, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.047170255 = weight(abstract_txt:method in 2335) [ClassicSimilarity], result of:
            0.047170255 = score(doc=2335,freq=1.0), product of:
              0.16768107 = queryWeight, product of:
                2.048698 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.018184524 = queryNorm
              0.28130937 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.2796645 = weight(abstract_txt:twitter in 2335) [ClassicSimilarity], result of:
            0.2796645 = score(doc=2335,freq=6.0), product of:
              0.2640681 = queryWeight, product of:
                2.099173 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018184524 = queryNorm
              1.059062 = fieldWeight in 2335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.36879155 = weight(abstract_txt:tweets in 2335) [ClassicSimilarity], result of:
            0.36879155 = score(doc=2335,freq=6.0), product of:
              0.31754968 = queryWeight, product of:
                2.3019512 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018184524 = queryNorm
              1.1613665 = fieldWeight in 2335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.3692882 = weight(abstract_txt:tweet in 2335) [ClassicSimilarity], result of:
            0.3692882 = score(doc=2335,freq=3.0), product of:
              0.40044668 = queryWeight, product of:
                2.5850124 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018184524 = queryNorm
              0.9221907 = fieldWeight in 2335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.08948006 = weight(abstract_txt:topic in 2335) [ClassicSimilarity], result of:
            0.08948006 = score(doc=2335,freq=1.0), product of:
              0.28281492 = queryWeight, product of:
                3.0722492 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.018184524 = queryNorm
              0.31639087 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.06409212 = weight(abstract_txt:related in 2335) [ClassicSimilarity], result of:
            0.06409212 = score(doc=2335,freq=1.0), product of:
              0.24388845 = queryWeight, product of:
                3.1897447 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.018184524 = queryNorm
              0.26279277 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
        0.28 = coord(7/25)
    
  3. Bae, Y.; Lee, H.: Sentiment analysis of twitter audiences : measuring the positive or negative influence of popular twitterers (2012) 0.23
    0.2343473 = sum of:
      0.2343473 = product of:
        0.9764471 = sum of:
          0.16087912 = weight(abstract_txt:messages in 520) [ClassicSimilarity], result of:
            0.16087912 = score(doc=520,freq=2.0), product of:
              0.2634296 = queryWeight, product of:
                2.0966337 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.018184524 = queryNorm
              0.6107101 = fieldWeight in 520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
          0.16146436 = weight(abstract_txt:twitter in 520) [ClassicSimilarity], result of:
            0.16146436 = score(doc=520,freq=2.0), product of:
              0.2640681 = queryWeight, product of:
                2.099173 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018184524 = queryNorm
              0.6114497 = fieldWeight in 520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
          0.26077497 = weight(abstract_txt:tweets in 520) [ClassicSimilarity], result of:
            0.26077497 = score(doc=520,freq=3.0), product of:
              0.31754968 = queryWeight, product of:
                2.3019512 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018184524 = queryNorm
              0.82121 = fieldWeight in 520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
          0.21320866 = weight(abstract_txt:tweet in 520) [ClassicSimilarity], result of:
            0.21320866 = score(doc=520,freq=1.0), product of:
              0.40044668 = queryWeight, product of:
                2.5850124 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018184524 = queryNorm
              0.5324271 = fieldWeight in 520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
          0.08948006 = weight(abstract_txt:topic in 520) [ClassicSimilarity], result of:
            0.08948006 = score(doc=520,freq=1.0), product of:
              0.28281492 = queryWeight, product of:
                3.0722492 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.018184524 = queryNorm
              0.31639087 = fieldWeight in 520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
          0.09063995 = weight(abstract_txt:related in 520) [ClassicSimilarity], result of:
            0.09063995 = score(doc=520,freq=2.0), product of:
              0.24388845 = queryWeight, product of:
                3.1897447 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.018184524 = queryNorm
              0.3716451 = fieldWeight in 520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=520)
        0.24 = coord(6/25)
    
  4. Fang, Z.; Dudek, J.; Costas, R.: Facing the volatility of tweets in altmetric research (2022) 0.23
    0.23117743 = sum of:
      0.23117743 = product of:
        0.9632393 = sum of:
          0.03579921 = weight(abstract_txt:data in 605) [ClassicSimilarity], result of:
            0.03579921 = score(doc=605,freq=5.0), product of:
              0.061422445 = queryWeight, product of:
                1.012404 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018184524 = queryNorm
              0.582836 = fieldWeight in 605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
          0.01809219 = weight(abstract_txt:retrieval in 605) [ClassicSimilarity], result of:
            0.01809219 = score(doc=605,freq=1.0), product of:
              0.066639066 = queryWeight, product of:
                1.0545198 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018184524 = queryNorm
              0.27149525 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
          0.08281199 = weight(abstract_txt:dynamic in 605) [ClassicSimilarity], result of:
            0.08281199 = score(doc=605,freq=1.0), product of:
              0.18370892 = queryWeight, product of:
                1.7508761 = boost
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.018184524 = queryNorm
              0.45077825 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
          0.28543136 = weight(abstract_txt:twitter in 605) [ClassicSimilarity], result of:
            0.28543136 = score(doc=605,freq=4.0), product of:
              0.2640681 = queryWeight, product of:
                2.099173 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018184524 = queryNorm
              1.0809005 = fieldWeight in 605, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
          0.46098942 = weight(abstract_txt:tweets in 605) [ClassicSimilarity], result of:
            0.46098942 = score(doc=605,freq=6.0), product of:
              0.31754968 = queryWeight, product of:
                2.3019512 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018184524 = queryNorm
              1.4517081 = fieldWeight in 605, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
          0.080115154 = weight(abstract_txt:related in 605) [ClassicSimilarity], result of:
            0.080115154 = score(doc=605,freq=1.0), product of:
              0.24388845 = queryWeight, product of:
                3.1897447 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.018184524 = queryNorm
              0.32849097 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.078125 = fieldNorm(doc=605)
        0.24 = coord(6/25)
    
  5. Yi, K.; Choi, N.; Kim, Y.S.: A content analysis of Twitter hyperlinks and their application in web resource indexing (2016) 0.22
    0.2174639 = sum of:
      0.2174639 = product of:
        1.0873195 = sum of:
          0.19703588 = weight(abstract_txt:messages in 3075) [ClassicSimilarity], result of:
            0.19703588 = score(doc=3075,freq=3.0), product of:
              0.2634296 = queryWeight, product of:
                2.0966337 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.018184524 = queryNorm
              0.747964 = fieldWeight in 3075, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=3075)
          0.16146436 = weight(abstract_txt:twitter in 3075) [ClassicSimilarity], result of:
            0.16146436 = score(doc=3075,freq=2.0), product of:
              0.2640681 = queryWeight, product of:
                2.099173 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018184524 = queryNorm
              0.6114497 = fieldWeight in 3075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=3075)
          0.21292187 = weight(abstract_txt:tweets in 3075) [ClassicSimilarity], result of:
            0.21292187 = score(doc=3075,freq=2.0), product of:
              0.31754968 = queryWeight, product of:
                2.3019512 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.018184524 = queryNorm
              0.6705152 = fieldWeight in 3075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=3075)
          0.42641732 = weight(abstract_txt:tweet in 3075) [ClassicSimilarity], result of:
            0.42641732 = score(doc=3075,freq=4.0), product of:
              0.40044668 = queryWeight, product of:
                2.5850124 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018184524 = queryNorm
              1.0648541 = fieldWeight in 3075, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=3075)
          0.08948006 = weight(abstract_txt:topic in 3075) [ClassicSimilarity], result of:
            0.08948006 = score(doc=3075,freq=1.0), product of:
              0.28281492 = queryWeight, product of:
                3.0722492 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.018184524 = queryNorm
              0.31639087 = fieldWeight in 3075, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=3075)
        0.2 = coord(5/25)
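
The content scores combine three factors per term (queryWeight, fieldWeight, and the coord factor applied to the sum). The following sketch recomputes the score of entry 5 (doc 3075) from the numbers printed above. The function and variable names are hypothetical, and the idf formula is an assumption chosen because it reproduces the printed idf values.

```python
import math

MAX_DOCS = 44218          # maxDocs as printed in every idf line
QUERY_NORM = 0.018184524  # queryNorm as printed in every queryWeight

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * query_norm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

def document_score(terms, field_norm, matched, total_clauses):
    # Sum the per-term scores, then scale by coord(matched/total_clauses)
    raw = sum(term_score(b, f, df, field_norm) for b, f, df in terms.values())
    return raw * (matched / total_clauses)

# (boost, freq, docFreq) for the five matching terms of doc 3075
terms_3075 = {
    "messages": (2.0966337, 3.0, 119),
    "twitter": (2.099173, 2.0, 118),
    "tweets": (2.3019512, 2.0, 60),
    "tweet": (2.5850124, 4.0, 23),
    "topic": (3.0722492, 1.0, 760),
}

score_3075 = document_score(terms_3075, field_norm=0.0625,
                            matched=5, total_clauses=25)
print(round(score_3075, 2))
```

The coord factor explains why entry 1 outranks entry 2 although its raw sum is lower (1.143681 against 1.2567806): it matches 8 of the 25 query clauses, while entry 2 matches 7.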