Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022)
0.01
0.006490764 = product of:
0.051926114 = sum of:
0.0074839313 = weight(_text_:information in 638) [ClassicSimilarity], result of:
0.0074839313 = score(doc=638,freq=2.0), product of:
0.055122808 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.031400457 = queryNorm
0.13576832 = fieldWeight in 638, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=638)
0.044442184 = weight(_text_:retrieval in 638) [ClassicSimilarity], result of:
0.044442184 = score(doc=638,freq=8.0), product of:
0.09498371 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.031400457 = queryNorm
0.46789268 = fieldWeight in 638, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=638)
0.125 = coord(2/16)
- Abstract
- Many successful retrieval models are derived based on or conform to the probability ranking principle (PRP). We present a new derivation of a document ranking function given by the probability of relevance of a document, conforming to the PRP. Our formulation yields a family of retrieval models, called probabilistic binary relevance (PBR) models, with various instantiations obtained by different probability estimations. By extensive experiments on a range of TREC collections, improvement of the PBR models over some established baselines with statistical significance is observed, especially in the large Clueweb09 Cat-B collection.
- Source
- Journal of the Association for Information Science and Technology. 73(2022) no.8, S.1140-1154