-
Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004)
0.00
0.0016497315 = product of:
0.019796778 = sum of:
0.01086849 = product of:
0.03260547 = sum of:
0.03260547 = weight(_text_:p in 2558) [ClassicSimilarity], result of:
0.03260547 = score(doc=2558,freq=6.0), product of:
0.078979194 = queryWeight, product of:
3.5955126 = idf(docFreq=3298, maxDocs=44218)
0.021966046 = queryNorm
0.4128362 = fieldWeight in 2558, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.5955126 = idf(docFreq=3298, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.33333334 = coord(1/3)
0.008928288 = product of:
0.017856576 = sum of:
0.017856576 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
0.017856576 = score(doc=2558,freq=2.0), product of:
0.07692135 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.021966046 = queryNorm
0.23214069 = fieldWeight in 2558, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.5 = coord(1/2)
0.083333336 = coord(2/24)
- Abstract
- The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
- Date
- 14. 8.2004 19:17:22
-
Egghe, L.: Empirical and combinatorial study of country occurrences in multi-authored papers (2006)
0.00
7.947839E-4 = product of:
0.009537406 = sum of:
0.004768703 = weight(_text_:und in 81) [ClassicSimilarity], result of:
0.004768703 = score(doc=81,freq=2.0), product of:
0.04868482 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.021966046 = queryNorm
0.09795051 = fieldWeight in 81, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.03125 = fieldNorm(doc=81)
0.004768703 = weight(_text_:und in 81) [ClassicSimilarity], result of:
0.004768703 = score(doc=81,freq=2.0), product of:
0.04868482 = queryWeight, product of:
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.021966046 = queryNorm
0.09795051 = fieldWeight in 81, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.216367 = idf(docFreq=13101, maxDocs=44218)
0.03125 = fieldNorm(doc=81)
0.083333336 = coord(2/24)
- Source
- Information - Wissenschaft und Praxis. 57(2006) H.8, S.427-432
-
Egghe, L.: Existence theorem of the quadruple (P, R, F, M) : precision, recall, fallout and miss (2007)
0.00
5.8463163E-4 = product of:
0.014031159 = sum of:
0.014031159 = product of:
0.042093474 = sum of:
0.042093474 = weight(_text_:p in 2011) [ClassicSimilarity], result of:
0.042093474 = score(doc=2011,freq=10.0), product of:
0.078979194 = queryWeight, product of:
3.5955126 = idf(docFreq=3298, maxDocs=44218)
0.021966046 = queryNorm
0.5329692 = fieldWeight in 2011, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
3.5955126 = idf(docFreq=3298, maxDocs=44218)
0.046875 = fieldNorm(doc=2011)
0.33333334 = coord(1/3)
0.041666668 = coord(1/24)
- Abstract
- In an earlier paper [Egghe, L. (2004). A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface. Information Processing and Management, 40, 21-30] we showed that, given an IR system, and if P denotes precision, R recall, F fallout and M miss (re-introduced in the paper mentioned above), we have the following relationship between P, R, F and M: P/(1-P)*(1-R)/R*F/(1-F)*(1-M)/M = 1. In this paper we prove the (more difficult) converse: given any four rational numbers in the interval ]0, 1[ satisfying the above equation, then there exists an IR system such that these four numbers (in any order) are the precision, recall, fallout and miss of this IR system. As a consequence we show that any three rational numbers in ]0, 1[ represent any three measures taken from precision, recall, fallout and miss of a certain IR system. We also show that this result is also true for two numbers instead of three.
-
Egghe, L.: ¬A noninformetric analysis of the relationship between citation age and journal productivity (2001)
0.00
3.7538886E-4 = product of:
0.009009332 = sum of:
0.009009332 = product of:
0.018018665 = sum of:
0.018018665 = weight(_text_:29 in 5685) [ClassicSimilarity], result of:
0.018018665 = score(doc=5685,freq=2.0), product of:
0.07726968 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.021966046 = queryNorm
0.23319192 = fieldWeight in 5685, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=5685)
0.5 = coord(1/2)
0.041666668 = coord(1/24)
- Date
- 29. 9.2001 13:59:34
-
Egghe, L.: Properties of the n-overlap vector and n-overlap similarity theory (2006)
0.00
3.1282406E-4 = product of:
0.0075077773 = sum of:
0.0075077773 = product of:
0.015015555 = sum of:
0.015015555 = weight(_text_:29 in 194) [ClassicSimilarity], result of:
0.015015555 = score(doc=194,freq=2.0), product of:
0.07726968 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.021966046 = queryNorm
0.19432661 = fieldWeight in 194, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=194)
0.5 = coord(1/2)
0.041666668 = coord(1/24)
- Date
- 3. 1.2007 14:26:29
-
Egghe, L.: Untangling Herdan's law and Heaps' law : mathematical and informetric arguments (2007)
0.00
3.1282406E-4 = product of:
0.0075077773 = sum of:
0.0075077773 = product of:
0.015015555 = sum of:
0.015015555 = weight(_text_:29 in 271) [ClassicSimilarity], result of:
0.015015555 = score(doc=271,freq=2.0), product of:
0.07726968 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.021966046 = queryNorm
0.19432661 = fieldWeight in 271, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=271)
0.5 = coord(1/2)
0.041666668 = coord(1/24)
- Date
- 29. 4.2007 19:51:08