-
Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010)
0.02
0.019288149 = product of:
0.038576297 = sum of:
0.038576297 = product of:
0.057864446 = sum of:
0.05430262 = weight(_text_:k in 3469) [ClassicSimilarity], result of:
0.05430262 = score(doc=3469,freq=4.0), product of:
0.16225883 = queryWeight, product of:
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.04545348 = queryNorm
0.33466667 = fieldWeight in 3469, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.569778 = idf(docFreq=3384, maxDocs=44218)
0.046875 = fieldNorm(doc=3469)
0.003561823 = weight(_text_:s in 3469) [ClassicSimilarity], result of:
0.003561823 = score(doc=3469,freq=2.0), product of:
0.049418733 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.04545348 = queryNorm
0.072074346 = fieldWeight in 3469, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=3469)
0.6666667 = coord(2/3)
0.5 = coord(1/2)
- Abstract
- Ranking information retrieval (IR) systems with respect to their effectiveness is a crucial operation during IR evaluation, as well as during data fusion. This article offers a novel method of approaching the system-ranking problem, based on the widely studied idea of polyrepresentation. The principle of polyrepresentation suggests that a single information need can be represented by many query articulations-what we call query aspects. By skimming the top k (where k is small) documents retrieved by a single system for multiple query aspects, we collect a set of documents that are likely to be relevant to a given test topic. Labeling these skimmed documents as putatively relevant lets us build pseudorelevance judgments without undue human intervention. We report experiments where using these pseudorelevance judgments delivers a rank ordering of IR systems that correlates highly with rankings based on human relevance judgments.
- Source
- Journal of the American Society for Information Science and Technology. 61(2010) no.6, S.1081-1091
-
Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008)
0.00
5.936372E-4 = product of:
0.0011872743 = sum of:
0.0011872743 = product of:
0.003561823 = sum of:
0.003561823 = weight(_text_:s in 2020) [ClassicSimilarity], result of:
0.003561823 = score(doc=2020,freq=2.0), product of:
0.049418733 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.04545348 = queryNorm
0.072074346 = fieldWeight in 2020, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=2020)
0.33333334 = coord(1/3)
0.5 = coord(1/2)
- Source
- Information processing and management. 44(2008) no.1, S.163-180
-
Efron, M.: Linear time series models for term weighting in information retrieval (2010)
0.00
5.936372E-4 = product of:
0.0011872743 = sum of:
0.0011872743 = product of:
0.003561823 = sum of:
0.003561823 = weight(_text_:s in 3688) [ClassicSimilarity], result of:
0.003561823 = score(doc=3688,freq=2.0), product of:
0.049418733 = queryWeight, product of:
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.04545348 = queryNorm
0.072074346 = fieldWeight in 3688, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.0872376 = idf(docFreq=40523, maxDocs=44218)
0.046875 = fieldNorm(doc=3688)
0.33333334 = coord(1/3)
0.5 = coord(1/2)
- Source
- Journal of the American Society for Information Science and Technology. 61(2010) no.7, S.1299-1312