Document (#36457)

Author
Efron, M.
Title
Information search and retrieval in microblogs
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.6, pp. 996-1008
Year
2011
Series
Advances in information science
Abstract
Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblogs. A microblog is a stream of text that is written by an author over time. It comprises many very brief updates that are presented to the microblog's readers in reverse-chronological order. Today, the service called Twitter is the most popular microblogging platform. Although microblogging is increasingly popular, methods for organizing and providing access to microblog data are still new. This review offers an introduction to the problems that face researchers and developers of IR systems in microblog settings. After an overview of microblogs and the behavior surrounding them, the review describes established problems in microblog retrieval, such as entity search and sentiment analysis, and modeling abstractions, such as authority and quality. The review also treats user-created metadata that often appear in microblogs. Because the problem of microblog search is so new, the review concludes with a discussion of particularly pressing research issues yet to be studied in the field.

Similar documents (author)

  1. Efron, M.: Eigenvalue-based model selection during Latent Semantic Indexing (2005) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:efron in 4686) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 4686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=4686)
    
  2. Efron, M.: Shannon meets Shortz : a probabilistic model of crossword puzzle difficulty (2008) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:efron in 3621) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 3621, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=3621)
    
  3. Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:efron in 4021) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 4021, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=4021)
    
  4. Efron, M.: Linear time series models for term weighting in information retrieval (2010) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:efron in 689) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 689, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=689)
    
  5. Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 4.86
    4.85849 = sum of:
      4.85849 = weight(author_txt:efron in 470) [ClassicSimilarity], result of:
        4.85849 = fieldWeight in 470, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.5 = fieldNorm(doc=470)
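
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); that idf form is inferred because it matches the 9.71698 shown in the listing, not taken from this system's configuration. The function names are illustrative only, and the fieldNorm values are copied from the listing rather than derived.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Assumed idf form; chosen because it reproduces the listed 9.71698.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entries 1-4 use fieldNorm 0.625; entry 5 uses fieldNorm 0.5.
score_single_author = field_weight(1.0, 6, 42740, 0.625)
score_two_authors = field_weight(1.0, 6, 42740, 0.5)

print(round(score_single_author, 4), round(score_two_authors, 4))
```

Under these assumptions, the only factor separating entry 5 from entries 1-4 is the smaller fieldNorm; tf and idf are identical across all five records.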
    

Similar documents (content)

  1. Jansen, B.J.; Zhang, M.; Sobel, K.; Chowdury, A.: Twitter power : tweets as electronic word of mouth (2009) 0.30
    0.29726398 = sum of:
      0.29726398 = product of:
        1.4863199 = sum of:
          0.061792742 = weight(abstract_txt:sentiment in 158) [ClassicSimilarity], result of:
            0.061792742 = score(doc=158,freq=2.0), product of:
              0.091101594 = queryWeight, product of:
                1.1117942 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.010677881 = queryNorm
              0.6782839 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=158)
          0.0075106714 = weight(abstract_txt:that in 158) [ClassicSimilarity], result of:
            0.0075106714 = score(doc=158,freq=2.0), product of:
              0.03548462 = queryWeight, product of:
                1.3877509 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010677881 = queryNorm
              0.21165991 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=158)
          0.22196355 = weight(abstract_txt:microblogging in 158) [ClassicSimilarity], result of:
            0.22196355 = score(doc=158,freq=3.0), product of:
              0.2351807 = queryWeight, product of:
                2.5262554 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.010677881 = queryNorm
              0.94380003 = fieldWeight in 158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0625 = fieldNorm(doc=158)
          0.6343561 = weight(abstract_txt:microblogs in 158) [ClassicSimilarity], result of:
            0.6343561 = score(doc=158,freq=4.0), product of:
              0.54216695 = queryWeight, product of:
                5.424478 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              1.1700382 = fieldWeight in 158, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=158)
          0.56069684 = weight(abstract_txt:microblog in 158) [ClassicSimilarity], result of:
            0.56069684 = score(doc=158,freq=2.0), product of:
              0.6777087 = queryWeight, product of:
                6.780597 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.827342 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=158)
        0.2 = coord(5/25)
    
  2. Bandaragoda, T.R.; Silva, D. de; Alahakoon, D.: Automatic event detection in microblogs using incremental machine learning (2017) 0.28
    0.27569628 = sum of:
      0.27569628 = product of:
        1.3784814 = sum of:
          0.011240982 = weight(abstract_txt:such in 5827) [ClassicSimilarity], result of:
            0.011240982 = score(doc=5827,freq=2.0), product of:
              0.03685082 = queryWeight, product of:
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.010677881 = queryNorm
              0.3050402 = fieldWeight in 5827, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.0625 = fieldNorm(doc=5827)
          0.035372127 = weight(abstract_txt:twitter in 5827) [ClassicSimilarity], result of:
            0.035372127 = score(doc=5827,freq=1.0), product of:
              0.07913193 = queryWeight, product of:
                1.036185 = boost
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.010677881 = queryNorm
              0.44700193 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.152031 = idf(docFreq=90, maxDocs=42740)
                0.0625 = fieldNorm(doc=5827)
          0.12815072 = weight(abstract_txt:microblogging in 5827) [ClassicSimilarity], result of:
            0.12815072 = score(doc=5827,freq=1.0), product of:
              0.2351807 = queryWeight, product of:
                2.5262554 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.010677881 = queryNorm
              0.5449032 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0625 = fieldNorm(doc=5827)
          0.31717804 = weight(abstract_txt:microblogs in 5827) [ClassicSimilarity], result of:
            0.31717804 = score(doc=5827,freq=1.0), product of:
              0.54216695 = queryWeight, product of:
                5.424478 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.5850191 = fieldWeight in 5827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=5827)
          0.8865396 = weight(abstract_txt:microblog in 5827) [ClassicSimilarity], result of:
            0.8865396 = score(doc=5827,freq=5.0), product of:
              0.6777087 = queryWeight, product of:
                6.780597 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              1.3081425 = fieldWeight in 5827, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=5827)
        0.2 = coord(5/25)
    
  3. Sin, S.-C.J.: Social media and problematic everyday life information-seeking outcomes : differences across use frequency, gender, and problem-solving styles (2016) 0.23
    0.22703661 = sum of:
      0.22703661 = product of:
        0.9459859 = sum of:
          0.007948575 = weight(abstract_txt:such in 5044) [ClassicSimilarity], result of:
            0.007948575 = score(doc=5044,freq=1.0), product of:
              0.03685082 = queryWeight, product of:
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.010677881 = queryNorm
              0.215696 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
          0.009307946 = weight(abstract_txt:information in 5044) [ClassicSimilarity], result of:
            0.009307946 = score(doc=5044,freq=5.0), product of:
              0.027407154 = queryWeight, product of:
                1.0562191 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.010677881 = queryNorm
              0.33961737 = fieldWeight in 5044, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
          0.005310847 = weight(abstract_txt:that in 5044) [ClassicSimilarity], result of:
            0.005310847 = score(doc=5044,freq=1.0), product of:
              0.03548462 = queryWeight, product of:
                1.3877509 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010677881 = queryNorm
              0.14966616 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
          0.045543686 = weight(abstract_txt:media in 5044) [ClassicSimilarity], result of:
            0.045543686 = score(doc=5044,freq=3.0), product of:
              0.08181495 = queryWeight, product of:
                1.4900223 = boost
                5.1422696 = idf(docFreq=678, maxDocs=42740)
                0.010677881 = queryNorm
              0.55666703 = fieldWeight in 5044, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1422696 = idf(docFreq=678, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
          0.31717804 = weight(abstract_txt:microblogs in 5044) [ClassicSimilarity], result of:
            0.31717804 = score(doc=5044,freq=1.0), product of:
              0.54216695 = queryWeight, product of:
                5.424478 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.5850191 = fieldWeight in 5044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
          0.56069684 = weight(abstract_txt:microblog in 5044) [ClassicSimilarity], result of:
            0.56069684 = score(doc=5044,freq=2.0), product of:
              0.6777087 = queryWeight, product of:
                6.780597 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.827342 = fieldWeight in 5044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=5044)
        0.24 = coord(6/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.14
    0.13983183 = sum of:
      0.13983183 = product of:
        0.87394893 = sum of:
          0.1310822 = weight(abstract_txt:sentiment in 1004) [ClassicSimilarity], result of:
            0.1310822 = score(doc=1004,freq=9.0), product of:
              0.091101594 = queryWeight, product of:
                1.1117942 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.010677881 = queryNorm
              1.4388574 = fieldWeight in 1004, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.0625 = fieldNorm(doc=1004)
          0.029216101 = weight(abstract_txt:called in 1004) [ClassicSimilarity], result of:
            0.029216101 = score(doc=1004,freq=1.0), product of:
              0.08776792 = queryWeight, product of:
                1.5432786 = boost
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.010677881 = queryNorm
              0.33287904 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.0625 = fieldNorm(doc=1004)
          0.31717804 = weight(abstract_txt:microblogs in 1004) [ClassicSimilarity], result of:
            0.31717804 = score(doc=1004,freq=1.0), product of:
              0.54216695 = queryWeight, product of:
                5.424478 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.5850191 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=1004)
          0.39647254 = weight(abstract_txt:microblog in 1004) [ClassicSimilarity], result of:
            0.39647254 = score(doc=1004,freq=1.0), product of:
              0.6777087 = queryWeight, product of:
                6.780597 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.5850191 = fieldWeight in 1004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=1004)
        0.16 = coord(4/25)
    
  5. Moulahi, B.; Tamine, L.; Yahia, S.B.: iAggregator: multidimensional relevance aggregation based on a fuzzy operator (2014) 0.13
    0.1328029 = sum of:
      0.1328029 = product of:
        0.47429606 = sum of:
          0.007948575 = weight(abstract_txt:such in 3502) [ClassicSimilarity], result of:
            0.007948575 = score(doc=3502,freq=1.0), product of:
              0.03685082 = queryWeight, product of:
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.010677881 = queryNorm
              0.215696 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.00416264 = weight(abstract_txt:information in 3502) [ClassicSimilarity], result of:
            0.00416264 = score(doc=3502,freq=1.0), product of:
              0.027407154 = queryWeight, product of:
                1.0562191 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.010677881 = queryNorm
              0.1518815 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.005310847 = weight(abstract_txt:that in 3502) [ClassicSimilarity], result of:
            0.005310847 = score(doc=3502,freq=1.0), product of:
              0.03548462 = queryWeight, product of:
                1.3877509 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010677881 = queryNorm
              0.14966616 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.017062562 = weight(abstract_txt:retrieval in 3502) [ClassicSimilarity], result of:
            0.017062562 = score(doc=3502,freq=2.0), product of:
              0.05571484 = queryWeight, product of:
                1.5059394 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.010677881 = queryNorm
              0.30624807 = fieldWeight in 3502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.029216101 = weight(abstract_txt:called in 3502) [ClassicSimilarity], result of:
            0.029216101 = score(doc=3502,freq=1.0), product of:
              0.08776792 = queryWeight, product of:
                1.5432786 = boost
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.010677881 = queryNorm
              0.33287904 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3260646 = idf(docFreq=564, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.014122802 = weight(abstract_txt:search in 3502) [ClassicSimilarity], result of:
            0.014122802 = score(doc=3502,freq=1.0), product of:
              0.06188214 = queryWeight, product of:
                1.5871016 = boost
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.010677881 = queryNorm
              0.22822097 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6515355 = idf(docFreq=3014, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
          0.39647254 = weight(abstract_txt:microblog in 3502) [ClassicSimilarity], result of:
            0.39647254 = score(doc=3502,freq=1.0), product of:
              0.6777087 = queryWeight, product of:
                6.780597 = boost
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.010677881 = queryNorm
              0.5850191 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.360306 = idf(docFreq=9, maxDocs=42740)
                0.0625 = fieldNorm(doc=3502)
        0.28 = coord(7/25)
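
The content-similarity scores combine per-term products with a coordination factor. As a hedged illustration, the sketch below recomputes the score of the first content match (doc=158) from the boost, docFreq, termFreq, queryNorm, fieldNorm, and coord values shown in its explain tree. The idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption that happens to match the listed idf values, and all variable and function names are hypothetical.

```python
import math

MAX_DOCS = 42740
QUERY_NORM = 0.010677881   # copied from the explain output
FIELD_NORM = 0.0625        # fieldNorm(doc=158)
COORD = 5 / 25             # 5 of 25 query terms matched

# (term, boost, docFreq, termFreq) as listed for doc=158
TERMS = [
    ("sentiment", 1.1117942, 53, 2.0),
    ("that", 1.3877509, 10595, 2.0),
    ("microblogging", 2.5262554, 18, 3.0),
    ("microblogs", 5.424478, 9, 4.0),
    ("microblog", 6.780597, 9, 2.0),
]

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed classic idf form (matches the listed values).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq):
    # score = queryWeight * fieldWeight
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight

raw_sum = sum(term_score(b, df, f) for _, b, df, f in TERMS)
final_score = raw_sum * COORD

print(round(raw_sum, 5), round(final_score, 5))
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "microblog" (docFreq=9) contribute roughly in proportion to idf squared, which is why they dominate the sums in every entry of this list.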