Document (#36457)

Author
Efron, M.
Title
Information search and retrieval in microblogs
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.996-1008
Year
2011
Series
Advances in information science
Abstract
Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblogs. A microblog is a stream of text that is written by an author over time. It comprises many very brief updates that are presented to the microblog's readers in reverse-chronological order. Today, the service called Twitter is the most popular microblogging platform. Although microblogging is increasingly popular, methods for organizing and providing access to microblog data are still new. This review offers an introduction to the problems that face researchers and developers of IR systems in microblog settings. After an overview of microblogs and the behavior surrounding them, the review describes established problems in microblog retrieval, such as entity search and sentiment analysis, and modeling abstractions, such as authority and quality. The review also treats user-created metadata that often appear in microblogs. Because the problem of microblog search is so new, the review concludes with a discussion of particularly pressing research issues yet to be studied in the field.

Similar documents (author)

  1. Efron, M.: Eigenvalue-based model selection during Latent Semantic Indexing (2005) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:efron in 5686) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 5686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=5686)
    
  2. Efron, M.: Shannon meets Shortz : a probabilistic model of crossword puzzle difficulty (2008) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:efron in 3621) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 3621, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=3621)
    
  3. Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:efron in 4021) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 4021, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=4021)
    
  4. Efron, M.: Linear time series models for term weighting in information retrieval (2010) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:efron in 153) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 153, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=153)
    
  5. Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 4.86
    4.8644676 = sum of:
      4.8644676 = weight(author_txt:efron in 470) [ClassicSimilarity], result of:
        4.8644676 = fieldWeight in 470, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.5 = fieldNorm(doc=470)
    

Similar documents (content)

  1. Jansen, B.J.; Zhang, M.; Sobel, K.; Chowdury, A.: Twitter power : tweets as electronic word of mouth (2009) 0.30
    0.29546544 = sum of:
      0.29546544 = product of:
        1.4773271 = sum of:
          0.062300194 = weight(abstract_txt:sentiment in 158) [ClassicSimilarity], result of:
            0.062300194 = score(doc=158,freq=2.0), product of:
              0.092142865 = queryWeight, product of:
                1.1103814 = boost
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.01084818 = queryNorm
              0.67612606 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.0625 = fieldNorm(doc=158)
          0.0075569046 = weight(abstract_txt:that in 158) [ClassicSimilarity], result of:
            0.0075569046 = score(doc=158,freq=2.0), product of:
              0.035841383 = queryWeight, product of:
                1.3850443 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.01084818 = queryNorm
              0.210843 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=158)
          0.21915334 = weight(abstract_txt:microblogging in 158) [ClassicSimilarity], result of:
            0.21915334 = score(doc=158,freq=3.0), product of:
              0.23457432 = queryWeight, product of:
                2.5055122 = boost
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.01084818 = queryNorm
              0.9342598 = fieldWeight in 158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.0625 = fieldNorm(doc=158)
          0.64818555 = weight(abstract_txt:microblogs in 158) [ClassicSimilarity], result of:
            0.64818555 = score(doc=158,freq=4.0), product of:
              0.55328006 = queryWeight, product of:
                5.4418154 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.01084818 = queryNorm
              1.1715325 = fieldWeight in 158, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=158)
          0.5401311 = weight(abstract_txt:microblog in 158) [ClassicSimilarity], result of:
            0.5401311 = score(doc=158,freq=2.0), product of:
              0.66495395 = queryWeight, product of:
                6.6699424 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.01084818 = queryNorm
              0.81228346 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=158)
        0.2 = coord(5/25)
    
  2. Bandaragoda, T.R.; Silva, D. de; Alahakoon, D.: Automatic event detection in microblogs using incremental machine learning (2017) 0.27
    0.2700161 = sum of:
      0.2700161 = product of:
        1.3500805 = sum of:
          0.011376612 = weight(abstract_txt:such in 5291) [ClassicSimilarity], result of:
            0.011376612 = score(doc=5291,freq=2.0), product of:
              0.03736693 = queryWeight, product of:
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.01084818 = queryNorm
              0.30445668 = fieldWeight in 5291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.0625 = fieldNorm(doc=5291)
          0.034060568 = weight(abstract_txt:twitter in 5291) [ClassicSimilarity], result of:
            0.034060568 = score(doc=5291,freq=1.0), product of:
              0.07762115 = queryWeight, product of:
                1.019134 = boost
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.01084818 = queryNorm
              0.43880528 = fieldWeight in 5291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0208845 = idf(docFreq=104, maxDocs=43254)
                0.0625 = fieldNorm(doc=5291)
          0.12652825 = weight(abstract_txt:microblogging in 5291) [ClassicSimilarity], result of:
            0.12652825 = score(doc=5291,freq=1.0), product of:
              0.23457432 = queryWeight, product of:
                2.5055122 = boost
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.01084818 = queryNorm
              0.53939515 = fieldWeight in 5291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.0625 = fieldNorm(doc=5291)
          0.32409278 = weight(abstract_txt:microblogs in 5291) [ClassicSimilarity], result of:
            0.32409278 = score(doc=5291,freq=1.0), product of:
              0.55328006 = queryWeight, product of:
                5.4418154 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.01084818 = queryNorm
              0.58576626 = fieldWeight in 5291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=5291)
          0.85402226 = weight(abstract_txt:microblog in 5291) [ClassicSimilarity], result of:
            0.85402226 = score(doc=5291,freq=5.0), product of:
              0.66495395 = queryWeight, product of:
                6.6699424 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.01084818 = queryNorm
              1.284333 = fieldWeight in 5291, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=5291)
        0.2 = coord(5/25)
    
  3. Sin, S.-C.J.: Social media and problematic everyday life information-seeking outcomes : differences across use frequency, gender, and problem-solving styles (2016) 0.22
    0.22385241 = sum of:
      0.22385241 = product of:
        0.9327184 = sum of:
          0.008044479 = weight(abstract_txt:such in 4508) [ClassicSimilarity], result of:
            0.008044479 = score(doc=4508,freq=1.0), product of:
              0.03736693 = queryWeight, product of:
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.01084818 = queryNorm
              0.2152834 = fieldWeight in 4508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
          0.009437327 = weight(abstract_txt:information in 4508) [ClassicSimilarity], result of:
            0.009437327 = score(doc=4508,freq=5.0), product of:
              0.027824575 = queryWeight, product of:
                1.0568569 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.01084818 = queryNorm
              0.33917236 = fieldWeight in 4508, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
          0.0053435387 = weight(abstract_txt:that in 4508) [ClassicSimilarity], result of:
            0.0053435387 = score(doc=4508,freq=1.0), product of:
              0.035841383 = queryWeight, product of:
                1.3850443 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.01084818 = queryNorm
              0.14908852 = fieldWeight in 4508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
          0.04566921 = weight(abstract_txt:media in 4508) [ClassicSimilarity], result of:
            0.04566921 = score(doc=4508,freq=3.0), product of:
              0.08245127 = queryWeight, product of:
                1.4854395 = boost
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.01084818 = queryNorm
              0.5538933 = fieldWeight in 4508, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
          0.32409278 = weight(abstract_txt:microblogs in 4508) [ClassicSimilarity], result of:
            0.32409278 = score(doc=4508,freq=1.0), product of:
              0.55328006 = queryWeight, product of:
                5.4418154 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.01084818 = queryNorm
              0.58576626 = fieldWeight in 4508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
          0.5401311 = weight(abstract_txt:microblog in 4508) [ClassicSimilarity], result of:
            0.5401311 = score(doc=4508,freq=2.0), product of:
              0.66495395 = queryWeight, product of:
                6.6699424 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.01084818 = queryNorm
              0.81228346 = fieldWeight in 4508, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=4508)
        0.24 = coord(6/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.14
    0.13883385 = sum of:
      0.13883385 = product of:
        0.86771154 = sum of:
          0.13215867 = weight(abstract_txt:sentiment in 4) [ClassicSimilarity], result of:
            0.13215867 = score(doc=4,freq=9.0), product of:
              0.092142865 = queryWeight, product of:
                1.1103814 = boost
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.01084818 = queryNorm
              1.4342799 = fieldWeight in 4, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.649493 = idf(docFreq=55, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.029529687 = weight(abstract_txt:called in 4) [ClassicSimilarity], result of:
            0.029529687 = score(doc=4,freq=1.0), product of:
              0.08891902 = queryWeight, product of:
                1.5426011 = boost
                5.3135424 = idf(docFreq=578, maxDocs=43254)
                0.01084818 = queryNorm
              0.3320964 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3135424 = idf(docFreq=578, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.32409278 = weight(abstract_txt:microblogs in 4) [ClassicSimilarity], result of:
            0.32409278 = score(doc=4,freq=1.0), product of:
              0.55328006 = queryWeight, product of:
                5.4418154 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.01084818 = queryNorm
              0.58576626 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
          0.38193038 = weight(abstract_txt:microblog in 4) [ClassicSimilarity], result of:
            0.38193038 = score(doc=4,freq=1.0), product of:
              0.66495395 = queryWeight, product of:
                6.6699424 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.01084818 = queryNorm
              0.57437116 = fieldWeight in 4, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=4)
        0.16 = coord(4/25)
    
  5. Moulahi, B.; Tamine, L.; Yahia, S.B.: iAggregator: multidimensional relevance aggregation based on a fuzzy operator (2014) 0.13
    0.12905349 = sum of:
      0.12905349 = product of:
        0.4609053 = sum of:
          0.008044479 = weight(abstract_txt:such in 2966) [ClassicSimilarity], result of:
            0.008044479 = score(doc=2966,freq=1.0), product of:
              0.03736693 = queryWeight, product of:
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.01084818 = queryNorm
              0.2152834 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4445343 = idf(docFreq=3752, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.004220501 = weight(abstract_txt:information in 2966) [ClassicSimilarity], result of:
            0.004220501 = score(doc=2966,freq=1.0), product of:
              0.027824575 = queryWeight, product of:
                1.0568569 = boost
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.01084818 = queryNorm
              0.1516825 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.42692 = idf(docFreq=10382, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.0053435387 = weight(abstract_txt:that in 2966) [ClassicSimilarity], result of:
            0.0053435387 = score(doc=2966,freq=1.0), product of:
              0.035841383 = queryWeight, product of:
                1.3850443 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.01084818 = queryNorm
              0.14908852 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.017444698 = weight(abstract_txt:retrieval in 2966) [ClassicSimilarity], result of:
            0.017444698 = score(doc=2966,freq=2.0), product of:
              0.05687894 = queryWeight, product of:
                1.5110459 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01084818 = queryNorm
              0.3066987 = fieldWeight in 2966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.029529687 = weight(abstract_txt:called in 2966) [ClassicSimilarity], result of:
            0.029529687 = score(doc=2966,freq=1.0), product of:
              0.08891902 = queryWeight, product of:
                1.5426011 = boost
                5.3135424 = idf(docFreq=578, maxDocs=43254)
                0.01084818 = queryNorm
              0.3320964 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3135424 = idf(docFreq=578, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.014392043 = weight(abstract_txt:search in 2966) [ClassicSimilarity], result of:
            0.014392043 = score(doc=2966,freq=1.0), product of:
              0.06303777 = queryWeight, product of:
                1.5907515 = boost
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.01084818 = queryNorm
              0.22830826 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6529322 = idf(docFreq=3046, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
          0.38193038 = weight(abstract_txt:microblog in 2966) [ClassicSimilarity], result of:
            0.38193038 = score(doc=2966,freq=1.0), product of:
              0.66495395 = queryWeight, product of:
                6.6699424 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.01084818 = queryNorm
              0.57437116 = fieldWeight in 2966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0625 = fieldNorm(doc=2966)
        0.28 = coord(7/25)