Document (#36456)

Author
Efron, M.
Title
Information search and retrieval in microblogs
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.996-1008
Year
2011
Series
Advances in information science
Abstract
Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblogs. A microblog is a stream of text that is written by an author over time. It comprises many very brief updates that are presented to the microblog's readers in reverse-chronological order. Today, the service called Twitter is the most popular microblogging platform. Although microblogging is increasingly popular, methods for organizing and providing access to microblog data are still new. This review offers an introduction to the problems that face researchers and developers of IR systems in microblog settings. After an overview of microblogs and the behavior surrounding them, the review describes established problems in microblog retrieval, such as entity search and sentiment analysis, and modeling abstractions, such as authority and quality. The review also treats user-created metadata that often appear in microblogs. Because the problem of microblog search is so new, the review concludes with a discussion of particularly pressing research issues yet to be studied in the field.

Similar documents (author)

  1. Efron, M.: Eigenvalue-based model selection during Latent Semantic Indexing (2005) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:efron in 3685) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 3685, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=3685)
    
  2. Efron, M.: Shannon meets Shortz : a probabilistic model of crossword puzzle difficulty (2008) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:efron in 1620) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 1620, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=1620)
    
  3. Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:efron in 2020) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 2020, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=2020)
    
  4. Efron, M.: Linear time series models for term weighting in information retrieval (2010) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:efron in 3688) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 3688, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=3688)
    
  5. Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:efron in 3469) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 3469, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=3469)
    

Similar documents (content)

  1. Jansen, B.J.; Zhang, M.; Sobel, K.; Chowdury, A.: Twitter power : tweets as electronic word of mouth (2009) 0.30
    0.29631886 = sum of:
      0.29631886 = product of:
        1.4815943 = sum of:
          0.05985132 = weight(abstract_txt:sentiment in 3157) [ClassicSimilarity], result of:
            0.05985132 = score(doc=3157,freq=2.0), product of:
              0.08964291 = queryWeight, product of:
                1.1025462 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010763572 = queryNorm
              0.6676637 = fieldWeight in 3157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=3157)
          0.0073892768 = weight(abstract_txt:that in 3157) [ClassicSimilarity], result of:
            0.0073892768 = score(doc=3157,freq=2.0), product of:
              0.035282128 = queryWeight, product of:
                1.383395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.010763572 = queryNorm
              0.20943399 = fieldWeight in 3157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3157)
          0.2203259 = weight(abstract_txt:microblogging in 3157) [ClassicSimilarity], result of:
            0.2203259 = score(doc=3157,freq=3.0), product of:
              0.2352286 = queryWeight, product of:
                2.5257995 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.010763572 = queryNorm
              0.9366459 = fieldWeight in 3157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0625 = fieldNorm(doc=3157)
          0.6512595 = weight(abstract_txt:microblogs in 3157) [ClassicSimilarity], result of:
            0.6512595 = score(doc=3157,freq=4.0), product of:
              0.5545995 = queryWeight, product of:
                5.4847717 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.010763572 = queryNorm
              1.1742878 = fieldWeight in 3157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3157)
          0.54276836 = weight(abstract_txt:microblog in 3157) [ClassicSimilarity], result of:
            0.54276836 = score(doc=3157,freq=2.0), product of:
              0.66660184 = queryWeight, product of:
                6.7229066 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.010763572 = queryNorm
              0.81423175 = fieldWeight in 3157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3157)
        0.2 = coord(5/25)
    
  2. Bandaragoda, T.R.; Silva, D. de; Alahakoon, D.: Automatic event detection in microblogs using incremental machine learning (2017) 0.27
    0.27093953 = sum of:
      0.27093953 = product of:
        1.3546976 = sum of:
          0.011164089 = weight(abstract_txt:such in 3826) [ClassicSimilarity], result of:
            0.011164089 = score(doc=3826,freq=2.0), product of:
              0.03687163 = queryWeight, product of:
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.010763572 = queryNorm
              0.30278262 = fieldWeight in 3826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0625 = fieldNorm(doc=3826)
          0.03250631 = weight(abstract_txt:twitter in 3826) [ClassicSimilarity], result of:
            0.03250631 = score(doc=3826,freq=1.0), product of:
              0.07518339 = queryWeight, product of:
                1.0097172 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.010763572 = queryNorm
              0.43236023 = fieldWeight in 3826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=3826)
          0.12720522 = weight(abstract_txt:microblogging in 3826) [ClassicSimilarity], result of:
            0.12720522 = score(doc=3826,freq=1.0), product of:
              0.2352286 = queryWeight, product of:
                2.5257995 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.010763572 = queryNorm
              0.5407728 = fieldWeight in 3826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0625 = fieldNorm(doc=3826)
          0.32562974 = weight(abstract_txt:microblogs in 3826) [ClassicSimilarity], result of:
            0.32562974 = score(doc=3826,freq=1.0), product of:
              0.5545995 = queryWeight, product of:
                5.4847717 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.010763572 = queryNorm
              0.5871439 = fieldWeight in 3826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3826)
          0.8581922 = weight(abstract_txt:microblog in 3826) [ClassicSimilarity], result of:
            0.8581922 = score(doc=3826,freq=5.0), product of:
              0.66660184 = queryWeight, product of:
                6.7229066 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.010763572 = queryNorm
              1.2874135 = fieldWeight in 3826, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3826)
        0.2 = coord(5/25)
    
  3. Sin, S.-C.J.: Social media and problematic everyday life information-seeking outcomes : differences across use frequency, gender, and problem-solving styles (2016) 0.22
    0.22457401 = sum of:
      0.22457401 = product of:
        0.9357251 = sum of:
          0.007894204 = weight(abstract_txt:such in 3043) [ClassicSimilarity], result of:
            0.007894204 = score(doc=3043,freq=1.0), product of:
              0.03687163 = queryWeight, product of:
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.010763572 = queryNorm
              0.21409966 = fieldWeight in 3043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
          0.009346139 = weight(abstract_txt:information in 3043) [ClassicSimilarity], result of:
            0.009346139 = score(doc=3043,freq=5.0), product of:
              0.027623713 = queryWeight, product of:
                1.0600845 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010763572 = queryNorm
              0.33833754 = fieldWeight in 3043, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
          0.005225008 = weight(abstract_txt:that in 3043) [ClassicSimilarity], result of:
            0.005225008 = score(doc=3043,freq=1.0), product of:
              0.035282128 = queryWeight, product of:
                1.383395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.010763572 = queryNorm
              0.1480922 = fieldWeight in 3043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
          0.04486164 = weight(abstract_txt:media in 3043) [ClassicSimilarity], result of:
            0.04486164 = score(doc=3043,freq=3.0), product of:
              0.081413515 = queryWeight, product of:
                1.4859427 = boost
                5.090237 = idf(docFreq=739, maxDocs=44218)
                0.010763572 = queryNorm
              0.55103433 = fieldWeight in 3043, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.090237 = idf(docFreq=739, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
          0.32562974 = weight(abstract_txt:microblogs in 3043) [ClassicSimilarity], result of:
            0.32562974 = score(doc=3043,freq=1.0), product of:
              0.5545995 = queryWeight, product of:
                5.4847717 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.010763572 = queryNorm
              0.5871439 = fieldWeight in 3043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
          0.54276836 = weight(abstract_txt:microblog in 3043) [ClassicSimilarity], result of:
            0.54276836 = score(doc=3043,freq=2.0), product of:
              0.66660184 = queryWeight, product of:
                6.7229066 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.010763572 = queryNorm
              0.81423175 = fieldWeight in 3043, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3043)
        0.24 = coord(6/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.14
    0.13850437 = sum of:
      0.13850437 = product of:
        0.8656523 = sum of:
          0.12696381 = weight(abstract_txt:sentiment in 5003) [ClassicSimilarity], result of:
            0.12696381 = score(doc=5003,freq=9.0), product of:
              0.08964291 = queryWeight, product of:
                1.1025462 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010763572 = queryNorm
              1.4163285 = fieldWeight in 5003, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.029263591 = weight(abstract_txt:called in 5003) [ClassicSimilarity], result of:
            0.029263591 = score(doc=5003,freq=1.0), product of:
              0.08831583 = queryWeight, product of:
                1.5476513 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.010763572 = queryNorm
              0.3313516 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.32562974 = weight(abstract_txt:microblogs in 5003) [ClassicSimilarity], result of:
            0.32562974 = score(doc=5003,freq=1.0), product of:
              0.5545995 = queryWeight, product of:
                5.4847717 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.010763572 = queryNorm
              0.5871439 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
          0.3837952 = weight(abstract_txt:microblog in 5003) [ClassicSimilarity], result of:
            0.3837952 = score(doc=5003,freq=1.0), product of:
              0.66660184 = queryWeight, product of:
                6.7229066 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.010763572 = queryNorm
              0.5757488 = fieldWeight in 5003, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=5003)
        0.16 = coord(4/25)
    
  5. Moulahi, B.; Tamine, L.; Yahia, S.B.: iAggregator: multidimensional relevance aggregation based on a fuzzy operator (2014) 0.13
    0.12943287 = sum of:
      0.12943287 = product of:
        0.46226025 = sum of:
          0.007894204 = weight(abstract_txt:such in 1501) [ClassicSimilarity], result of:
            0.007894204 = score(doc=1501,freq=1.0), product of:
              0.03687163 = queryWeight, product of:
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.010763572 = queryNorm
              0.21409966 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.0041797203 = weight(abstract_txt:information in 1501) [ClassicSimilarity], result of:
            0.0041797203 = score(doc=1501,freq=1.0), product of:
              0.027623713 = queryWeight, product of:
                1.0600845 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.010763572 = queryNorm
              0.15130915 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.005225008 = weight(abstract_txt:that in 1501) [ClassicSimilarity], result of:
            0.005225008 = score(doc=1501,freq=1.0), product of:
              0.035282128 = queryWeight, product of:
                1.383395 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.010763572 = queryNorm
              0.1480922 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.017483298 = weight(abstract_txt:retrieval in 1501) [ClassicSimilarity], result of:
            0.017483298 = score(doc=1501,freq=2.0), product of:
              0.056918856 = queryWeight, product of:
                1.5216947 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.010763572 = queryNorm
              0.3071618 = fieldWeight in 1501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.029263591 = weight(abstract_txt:called in 1501) [ClassicSimilarity], result of:
            0.029263591 = score(doc=1501,freq=1.0), product of:
              0.08831583 = queryWeight, product of:
                1.5476513 = boost
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.010763572 = queryNorm
              0.3313516 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3016257 = idf(docFreq=598, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.01441921 = weight(abstract_txt:search in 1501) [ClassicSimilarity], result of:
            0.01441921 = score(doc=1501,freq=1.0), product of:
              0.06306836 = queryWeight, product of:
                1.6017886 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.010763572 = queryNorm
              0.22862828 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
          0.3837952 = weight(abstract_txt:microblog in 1501) [ClassicSimilarity], result of:
            0.3837952 = score(doc=1501,freq=1.0), product of:
              0.66660184 = queryWeight, product of:
                6.7229066 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.010763572 = queryNorm
              0.5757488 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=1501)
        0.28 = coord(7/25)