-
Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004)
0.01
0.012689516 = product of:
0.03172379 = sum of:
0.021944191 = product of:
0.06583257 = sum of:
0.06583257 = weight(_text_:f in 2558) [ClassicSimilarity], result of:
0.06583257 = score(doc=2558,freq=6.0), product of:
0.14385001 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.036090754 = queryNorm
0.45764732 = fieldWeight in 2558, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.33333334 = coord(1/3)
0.009779599 = product of:
0.029338794 = sum of:
0.029338794 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
0.029338794 = score(doc=2558,freq=2.0), product of:
0.12638368 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.036090754 = queryNorm
0.23214069 = fieldWeight in 2558, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.33333334 = coord(1/3)
0.4 = coord(2/5)
- Abstract
- The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
- Date
- 14. 8.2004 19:17:22
-
Egghe, L.: Existence theorem of the quadruple (P, R, F, M) : precision, recall, fallout and miss (2007)
0.01
0.005665965 = product of:
0.028329825 = sum of:
0.028329825 = product of:
0.08498947 = sum of:
0.08498947 = weight(_text_:f in 2011) [ClassicSimilarity], result of:
0.08498947 = score(doc=2011,freq=10.0), product of:
0.14385001 = queryWeight, product of:
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.036090754 = queryNorm
0.5908201 = fieldWeight in 2011, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
3.985786 = idf(docFreq=2232, maxDocs=44218)
0.046875 = fieldNorm(doc=2011)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Abstract
- In an earlier paper [Egghe, L. (2004). A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface. Information Processing and Management, 40, 21-30] we showed that, given an IR system, and if P denotes precision, R recall, F fallout and M miss (re-introduced in the paper mentioned above), we have the following relationship between P, R, F and M: P/(1-P)*(1-R)/R*F/(1-F)*(1-M)/M = 1. In this paper we prove the (more difficult) converse: given any four rational numbers in the interval ]0, 1[ satisfying the above equation, then there exists an IR system such that these four numbers (in any order) are the precision, recall, fallout and miss of this IR system. As a consequence we show that any three rational numbers in ]0, 1[ represent any three measures taken from precision, recall, fallout and miss of a certain IR system. We also show that this result is also true for two numbers instead of three.
-
Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012)
0.00
0.0032598663 = product of:
0.016299332 = sum of:
0.016299332 = product of:
0.048897993 = sum of:
0.048897993 = weight(_text_:22 in 4992) [ClassicSimilarity], result of:
0.048897993 = score(doc=4992,freq=2.0), product of:
0.12638368 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.036090754 = queryNorm
0.38690117 = fieldWeight in 4992, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=4992)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Date
- 14. 2.2012 12:53:22
-
Egghe, L.: ¬A noninformetric analysis of the relationship between citation age and journal productivity (2001)
0.00
0.0019736742 = product of:
0.00986837 = sum of:
0.00986837 = product of:
0.029605111 = sum of:
0.029605111 = weight(_text_:29 in 5685) [ClassicSimilarity], result of:
0.029605111 = score(doc=5685,freq=2.0), product of:
0.12695599 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.036090754 = queryNorm
0.23319192 = fieldWeight in 5685, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=5685)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Date
- 29. 9.2001 13:59:34
-
Egghe, L.: Influence of adding or deleting items and sources on the h-index (2010)
0.00
0.0019736742 = product of:
0.00986837 = sum of:
0.00986837 = product of:
0.029605111 = sum of:
0.029605111 = weight(_text_:29 in 3336) [ClassicSimilarity], result of:
0.029605111 = score(doc=3336,freq=2.0), product of:
0.12695599 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.036090754 = queryNorm
0.23319192 = fieldWeight in 3336, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=3336)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Date
- 31. 5.2010 15:02:29
-
Egghe, L.; Rousseau, R.: Averaging and globalising quotients of informetric and scientometric data (1996)
0.00
0.0019559197 = product of:
0.009779599 = sum of:
0.009779599 = product of:
0.029338794 = sum of:
0.029338794 = weight(_text_:22 in 7659) [ClassicSimilarity], result of:
0.029338794 = score(doc=7659,freq=2.0), product of:
0.12638368 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.036090754 = queryNorm
0.23214069 = fieldWeight in 7659, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=7659)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Source
- Journal of information science. 22(1996) no.3, S.165-170
-
Egghe, L.: Properties of the n-overlap vector and n-overlap similarity theory (2006)
0.00
0.0016447286 = product of:
0.008223643 = sum of:
0.008223643 = product of:
0.024670927 = sum of:
0.024670927 = weight(_text_:29 in 194) [ClassicSimilarity], result of:
0.024670927 = score(doc=194,freq=2.0), product of:
0.12695599 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.036090754 = queryNorm
0.19432661 = fieldWeight in 194, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=194)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Date
- 3. 1.2007 14:26:29
-
Egghe, L.: Untangling Herdan's law and Heaps' law : mathematical and informetric arguments (2007)
0.00
0.0016447286 = product of:
0.008223643 = sum of:
0.008223643 = product of:
0.024670927 = sum of:
0.024670927 = weight(_text_:29 in 271) [ClassicSimilarity], result of:
0.024670927 = score(doc=271,freq=2.0), product of:
0.12695599 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.036090754 = queryNorm
0.19432661 = fieldWeight in 271, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=271)
0.33333334 = coord(1/3)
0.2 = coord(1/5)
- Date
- 29. 4.2007 19:51:08