Search (3 results, page 1 of 1)

  • author_ss:"Efron, M."
  • language_ss:"e"
  • year_i:[2010 TO 2020}
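  These facets are Solr filter queries: the _ss and _i suffixes follow the common dynamic-field convention (multi-valued string, integer), and year_i:[2010 TO 2020} is a half-open range, 2010 inclusive and 2020 exclusive. A minimal Python sketch of issuing the same search against Solr's standard HTTP API (the host and collection name are hypothetical; only the filter values come from this page):

      import requests  # third-party HTTP client (pip install requests)

      # Hypothetical Solr endpoint; the fq values are the facets shown above.
      resp = requests.get(
          "http://localhost:8983/solr/litdok/select",
          params={
              "q": "*:*",
              "fq": [
                  'author_ss:"Efron, M."',
                  'language_ss:"e"',
                  "year_i:[2010 TO 2020}",  # [ inclusive, } exclusive
              ],
              "wt": "json",
          },
      )
      print(resp.json()["response"]["numFound"])  # 3 for this result page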
  1. Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 0.01
    0.007346594 = product of:
      0.036732968 = sum of:
        0.012212053 = product of:
          0.03663616 = sum of:
            0.03663616 = weight(_text_:problem in 3469) [ClassicSimilarity], result of:
              0.03663616 = score(doc=3469,freq=2.0), product of:
                0.1302053 = queryWeight, product of:
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.03067635 = queryNorm
                0.28137225 = fieldWeight in 3469, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3469)
          0.33333334 = coord(1/3)
        0.024520915 = product of:
          0.07356274 = sum of:
            0.07356274 = weight(_text_:2010 in 3469) [ClassicSimilarity], result of:
              0.07356274 = score(doc=3469,freq=5.0), product of:
                0.14672957 = queryWeight, product of:
                  4.7831497 = idf(docFreq=1005, maxDocs=44218)
                  0.03067635 = queryNorm
                0.5013491 = fieldWeight in 3469, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.7831497 = idf(docFreq=1005, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3469)
          0.33333334 = coord(1/3)
      0.2 = coord(2/10)
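    The nested breakdown above is Lucene's explain output for its classic TF-IDF scorer (ClassicSimilarity): per term, tf is the square root of the within-document frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and the term weight is (idf * queryNorm) * (tf * idf * fieldNorm); the coord factors then scale by the fraction of query clauses matched. A minimal Python sketch that recomputes the document score from the printed constants (the function name is my own):

      import math

      def term_weight(freq, doc_freq, max_docs, query_norm, field_norm):
          """One term's contribution, per Lucene ClassicSimilarity."""
          tf = math.sqrt(freq)
          idf = 1.0 + math.log(max_docs / (doc_freq + 1))
          return (idf * query_norm) * (tf * idf * field_norm)

      # Constants copied from the explain tree for doc 3469.
      w_problem = term_weight(2.0, 1723, 44218, 0.03067635, 0.046875)
      w_2010 = term_weight(5.0, 1005, 44218, 0.03067635, 0.046875)

      # Each term sits under coord(1/3); the sum is scaled by coord(2/10).
      score = (w_problem / 3 + w_2010 / 3) * (2 / 10)
      print(score)  # ~0.007346594, matching the score shown above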
    
    Abstract
    Ranking information retrieval (IR) systems with respect to their effectiveness is a crucial operation during IR evaluation, as well as during data fusion. This article offers a novel method of approaching the system-ranking problem, based on the widely studied idea of polyrepresentation. The principle of polyrepresentation suggests that a single information need can be represented by many query articulations, or what we call query aspects. By skimming the top k (where k is small) documents retrieved by a single system for multiple query aspects, we collect a set of documents that are likely to be relevant to a given test topic. Labeling these skimmed documents as putatively relevant lets us build pseudo-relevance judgments without undue human intervention. We report experiments where using these pseudo-relevance judgments delivers a rank ordering of IR systems that correlates highly with rankings based on human relevance judgments.
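    A minimal sketch of the pipeline the abstract describes, pooling the top-k documents across query aspects into pseudo-relevance judgments and then scoring a system against them (all names, k=5, and P@10 as the effectiveness measure are my own illustrative choices, not the article's):

      def pseudo_qrels(retrieve, aspects, k=5):
          """Pool the top-k documents retrieved for each query aspect and
          label the pooled set as putatively relevant."""
          relevant = set()
          for aspect in aspects:
              relevant.update(retrieve(aspect)[:k])
          return relevant

      def precision_at(ranking, qrels, n=10):
          """Effectiveness of one system's ranking under the pseudo-qrels."""
          return sum(1 for doc in ranking[:n] if doc in qrels) / n

    Ordering systems by such per-topic scores, averaged over test topics, yields the system ranking that the article correlates with rankings derived from human judgments.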
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.6, pp.1081-1091
    Year
    2010
  2. Efron, M.: Linear time series models for term weighting in information retrieval (2010) 0.00
    0.0024520915 = product of:
      0.024520915 = sum of:
        0.024520915 = product of:
          0.07356274 = sum of:
            0.07356274 = weight(_text_:2010 in 3688) [ClassicSimilarity], result of:
              0.07356274 = score(doc=3688,freq=5.0), product of:
                0.14672957 = queryWeight, product of:
                  4.7831497 = idf(docFreq=1005, maxDocs=44218)
                  0.03067635 = queryNorm
                0.5013491 = fieldWeight in 3688, product of:
                  2.236068 = tf(freq=5.0), with freq of:
                    5.0 = termFreq=5.0
                  4.7831497 = idf(docFreq=1005, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3688)
          0.33333334 = coord(1/3)
      0.1 = coord(1/10)
    
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.7, pp.1299-1312
    Year
    2010
  3. Efron, M.: Information search and retrieval in microblogs (2011) 0.00
    0.0012212053 = product of:
      0.012212053 = sum of:
        0.012212053 = product of:
          0.03663616 = sum of:
            0.03663616 = weight(_text_:problem in 4455) [ClassicSimilarity], result of:
              0.03663616 = score(doc=4455,freq=2.0), product of:
                0.1302053 = queryWeight, product of:
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.03067635 = queryNorm
                0.28137225 = fieldWeight in 4455, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.244485 = idf(docFreq=1723, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4455)
          0.33333334 = coord(1/3)
      0.1 = coord(1/10)
    
    Abstract
    Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblogs. A microblog is a stream of text that is written by an author over time. It comprises many very brief updates that are presented to the microblog's readers in reverse-chronological order. Today, the service called Twitter is the most popular microblogging platform. Although microblogging is increasingly popular, methods for organizing and providing access to microblog data are still new. This review offers an introduction to the problems that face researchers and developers of IR systems in microblog settings. After an overview of microblogs and the behavior surrounding them, the review describes established problems in microblog retrieval, such as entity search and sentiment analysis, and modeling abstractions, such as authority and quality. The review also treats user-created metadata that often appear in microblogs. Because the problem of microblog search is so new, the review concludes with a discussion of particularly pressing research issues yet to be studied in the field.
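    As a toy illustration of the presentation model the abstract describes, a term filter over a stream of brief updates returned in reverse-chronological order (all names are hypothetical; real microblog search involves far more, as the review discusses):

      from dataclasses import dataclass

      @dataclass
      class Post:
          author: str
          text: str
          timestamp: float  # seconds since the epoch

      def microblog_search(stream, term):
          """Filter brief updates by a query term; newest first."""
          hits = [p for p in stream if term.lower() in p.text.lower()]
          return sorted(hits, key=lambda p: p.timestamp, reverse=True)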