-
Egghe, L.; Rousseau, R.: Introduction to informetrics : quantitative methods in library, documentation and information science (1990)
0.01
0.0109192245 = product of:
0.032757673 = sum of:
0.020491816 = weight(_text_:information in 1515) [ClassicSimilarity], result of:
0.020491816 = score(doc=1515,freq=10.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.3035872 = fieldWeight in 1515, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=1515)
0.012265856 = product of:
0.03679757 = sum of:
0.03679757 = weight(_text_:29 in 1515) [ClassicSimilarity], result of:
0.03679757 = score(doc=1515,freq=2.0), product of:
0.13525672 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03845047 = queryNorm
0.27205724 = fieldWeight in 1515, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=1515)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- COMPASS
- Information science / Statistical mathematics
- Date
- 29. 2.2008 19:02:46
- LCSH
- Information science / Statistical methods
- Subject
- Information science / Statistical mathematics
Information science / Statistical methods
-
Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012)
0.01
0.010152256 = product of:
0.030456766 = sum of:
0.01309174 = weight(_text_:information in 4992) [ClassicSimilarity], result of:
0.01309174 = score(doc=4992,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.19395474 = fieldWeight in 4992, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=4992)
0.017365025 = product of:
0.052095074 = sum of:
0.052095074 = weight(_text_:22 in 4992) [ClassicSimilarity], result of:
0.052095074 = score(doc=4992,freq=2.0), product of:
0.13464698 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03845047 = queryNorm
0.38690117 = fieldWeight in 4992, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.078125 = fieldNorm(doc=4992)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Date
- 14. 2.2012 12:53:22
- Source
- Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.429
-
Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004)
0.01
0.008008117 = product of:
0.02402435 = sum of:
0.013605336 = weight(_text_:information in 2558) [ClassicSimilarity], result of:
0.013605336 = score(doc=2558,freq=6.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.20156369 = fieldWeight in 2558, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.010419015 = product of:
0.031257045 = sum of:
0.031257045 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
0.031257045 = score(doc=2558,freq=2.0), product of:
0.13464698 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03845047 = queryNorm
0.23214069 = fieldWeight in 2558, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=2558)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Abstract
- The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
- Date
- 14. 8.2004 19:17:22
- Source
- Information processing and management. 40(2004) no.1, S.21-30
-
Egghe, L.: ¬A noninformetric analysis of the relationship between citation age and journal productivity (2001)
0.01
0.0061228788 = product of:
0.018368635 = sum of:
0.007855045 = weight(_text_:information in 5685) [ClassicSimilarity], result of:
0.007855045 = score(doc=5685,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.116372846 = fieldWeight in 5685, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=5685)
0.010513592 = product of:
0.031540774 = sum of:
0.031540774 = weight(_text_:29 in 5685) [ClassicSimilarity], result of:
0.031540774 = score(doc=5685,freq=2.0), product of:
0.13525672 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03845047 = queryNorm
0.23319192 = fieldWeight in 5685, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=5685)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Date
- 29. 9.2001 13:59:34
- Source
- Journal of the American Society for Information Science and technology. 52(2001) no.5, S.371-377
-
Egghe, L.: Influence of adding or deleting items and sources on the h-index (2010)
0.01
0.0061228788 = product of:
0.018368635 = sum of:
0.007855045 = weight(_text_:information in 3336) [ClassicSimilarity], result of:
0.007855045 = score(doc=3336,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.116372846 = fieldWeight in 3336, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=3336)
0.010513592 = product of:
0.031540774 = sum of:
0.031540774 = weight(_text_:29 in 3336) [ClassicSimilarity], result of:
0.031540774 = score(doc=3336,freq=2.0), product of:
0.13525672 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03845047 = queryNorm
0.23319192 = fieldWeight in 3336, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.046875 = fieldNorm(doc=3336)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Date
- 31. 5.2010 15:02:29
- Source
- Journal of the American Society for Information Science and Technology. 61(2010) no.2, S.370-373
-
Egghe, L.; Rousseau, R.: Averaging and globalising quotients of informetric and scientometric data (1996)
0.01
0.0060913535 = product of:
0.01827406 = sum of:
0.007855045 = weight(_text_:information in 7659) [ClassicSimilarity], result of:
0.007855045 = score(doc=7659,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.116372846 = fieldWeight in 7659, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=7659)
0.010419015 = product of:
0.031257045 = sum of:
0.031257045 = weight(_text_:22 in 7659) [ClassicSimilarity], result of:
0.031257045 = score(doc=7659,freq=2.0), product of:
0.13464698 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.03845047 = queryNorm
0.23214069 = fieldWeight in 7659, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=7659)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Source
- Journal of information science. 22(1996) no.3, S.165-170
-
Egghe, L.: Untangling Herdan's law and Heaps' law : mathematical and informetric arguments (2007)
0.01
0.006006195 = product of:
0.018018585 = sum of:
0.009257258 = weight(_text_:information in 271) [ClassicSimilarity], result of:
0.009257258 = score(doc=271,freq=4.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.13714671 = fieldWeight in 271, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=271)
0.008761327 = product of:
0.02628398 = sum of:
0.02628398 = weight(_text_:29 in 271) [ClassicSimilarity], result of:
0.02628398 = score(doc=271,freq=2.0), product of:
0.13525672 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03845047 = queryNorm
0.19432661 = fieldWeight in 271, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=271)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Abstract
- Herdan's law in linguistics and Heaps' law in information retrieval are different formulations of the same phenomenon. Stated briefly and in linguistic terms they state that vocabularies' sizes are concave increasing power laws of texts' sizes. This study investigates these laws from a purely mathematical and informetric point of view. A general informetric argument shows that the problem of proving these laws is, in fact, ill-posed. Using the more general terminology of sources and items, the author shows by presenting exact formulas from Lotkaian informetrics that the total number T of sources is not only a function of the total number A of items, but is also a function of several parameters (e.g., the parameters occurring in Lotka's law). Consequently, it is shown that a fixed T(or A) value can lead to different possible A (respectively, T) values. Limiting the T(A)-variability to increasing samples (e.g., in a text as done in linguistics) the author then shows, in a purely mathematical way, that for large sample sizes T~ A**phi, where phi is a constant, phi < 1 but close to 1, hence roughly, Heaps' or Herdan's law can be proved without using any linguistic or informetric argument. The author also shows that for smaller samples, a is not a constant but essentially decreases as confirmed by practical examples. Finally, an exact informetric argument on random sampling in the items shows that, in most cases, T= T(A) is a concavely increasing function, in accordance with practical examples.
- Date
- 29. 4.2007 19:51:08
- Source
- Journal of the American Society for Information Science and Technology. 58(2007) no.5, S.702-709
-
Egghe, L.: Properties of the n-overlap vector and n-overlap similarity theory (2006)
0.01
0.0051023993 = product of:
0.015307197 = sum of:
0.00654587 = weight(_text_:information in 194) [ClassicSimilarity], result of:
0.00654587 = score(doc=194,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.09697737 = fieldWeight in 194, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=194)
0.008761327 = product of:
0.02628398 = sum of:
0.02628398 = weight(_text_:29 in 194) [ClassicSimilarity], result of:
0.02628398 = score(doc=194,freq=2.0), product of:
0.13525672 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.03845047 = queryNorm
0.19432661 = fieldWeight in 194, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=194)
0.33333334 = coord(1/3)
0.33333334 = coord(2/6)
- Date
- 3. 1.2007 14:26:29
- Source
- Journal of the American Society for Information Science and Technology. 57(2006) no.9, S.1165-1177
-
Egghe, L.; Rousseau, R.: ¬A theoretical study of recall and precision using a topological approach to information retrieval (1998)
0.00
0.0030234081 = product of:
0.018140448 = sum of:
0.018140448 = weight(_text_:information in 3267) [ClassicSimilarity], result of:
0.018140448 = score(doc=3267,freq=6.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.2687516 = fieldWeight in 3267, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=3267)
0.16666667 = coord(1/6)
- Abstract
- Topologies for information retrieval systems are generated by certain subsets, called retrievals. Shows how recall and precision can be expressed using only retrievals. Investigates different types of retrieval systems: both threshold systems and close match systems and both optimal and non optimal retrieval. Highlights the relation with the hypergeometric and some non-standard distributions
- Source
- Information processing and management. 34(1998) nos.2/3, S.191-218
-
Egghe, L.; Rousseau, R.: Topological aspects of information retrieval (1998)
0.00
0.002645482 = product of:
0.015872892 = sum of:
0.015872892 = weight(_text_:information in 2157) [ClassicSimilarity], result of:
0.015872892 = score(doc=2157,freq=6.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23515764 = fieldWeight in 2157, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=2157)
0.16666667 = coord(1/6)
- Abstract
- Let (DS, DQ, sim) be a retrieval system consisting of a document space DS, a query space QS, and a function sim, expressing the similarity between a document and a query. Following D.M. Everett and S.C. Cater (1992), we introduce topologies on the document space. These topologies are generated by the similarity function sim and the query space QS. 3 topologies will be studied: the retrieval topology, the similarity topology and the (pseudo-)metric one. It is shown that the retrieval topology is the coarsest of the three, while the (pseudo-)metric is the strongest. These 3 topologies are generally different, reflecting distinct topological aspects of information retrieval. We present necessary and sufficient conditions for these topological aspects to be equal
- Source
- Journal of the American Society for Information Science. 49(1998) no.13, S.1144-1160
-
Egghe, L.: Expansion of the field of informetrics : the second special issue (2006)
0.00
0.0026183482 = product of:
0.01571009 = sum of:
0.01571009 = weight(_text_:information in 7119) [ClassicSimilarity], result of:
0.01571009 = score(doc=7119,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23274569 = fieldWeight in 7119, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=7119)
0.16666667 = coord(1/6)
- Source
- Information processing and management. 42(2006) no.6, S.1405-1407
-
Egghe, L.: Expansion of the field of informetrics : origins and consequences (2005)
0.00
0.0026183482 = product of:
0.01571009 = sum of:
0.01571009 = weight(_text_:information in 1910) [ClassicSimilarity], result of:
0.01571009 = score(doc=1910,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23274569 = fieldWeight in 1910, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=1910)
0.16666667 = coord(1/6)
- Source
- Information processing and management. 41(2005) no.6, S.1311-1316
-
Egghe, L.: Special features of the author - publication relationship and a new explanation of Lotka's law based on convolution theory (1994)
0.00
0.0026183482 = product of:
0.01571009 = sum of:
0.01571009 = weight(_text_:information in 5068) [ClassicSimilarity], result of:
0.01571009 = score(doc=5068,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23274569 = fieldWeight in 5068, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=5068)
0.16666667 = coord(1/6)
- Source
- Journal of the American Society for Information Science. 45(1994) no.6, S.422-427
-
Egghe, L.: Note on a possible decomposition of the h-Index (2013)
0.00
0.0026183482 = product of:
0.01571009 = sum of:
0.01571009 = weight(_text_:information in 683) [ClassicSimilarity], result of:
0.01571009 = score(doc=683,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23274569 = fieldWeight in 683, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=683)
0.16666667 = coord(1/6)
- Source
- Journal of the American Society for Information Science and Technology. 64(2013) no.4, S.871
-
Egghe, L.: ¬The Hirsch index and related impact measures (2010)
0.00
0.0026183482 = product of:
0.01571009 = sum of:
0.01571009 = weight(_text_:information in 1597) [ClassicSimilarity], result of:
0.01571009 = score(doc=1597,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.23274569 = fieldWeight in 1597, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=1597)
0.16666667 = coord(1/6)
- Source
- Annual review of information science and technology. 44(2010) no.1, S.65-114
-
Egghe, L.; Rousseau, R.: Duality in information retrieval and the hypegeometric distribution (1997)
0.00
0.0024686023 = product of:
0.014811614 = sum of:
0.014811614 = weight(_text_:information in 647) [ClassicSimilarity], result of:
0.014811614 = score(doc=647,freq=4.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.21943474 = fieldWeight in 647, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=647)
0.16666667 = coord(1/6)
- Abstract
- Asserts that duality is an important topic in informetrics, especially in connection with the classical informetric laws. Yet this concept is less studied in information retrieval. It deals with the unification or symmetry between queries and documents, search formulation versus indexing, and relevant versus retrieved documents. Elaborates these ideas and highlights the connection with the hypergeometric distribution
-
Egghe, L.: Existence theorem of the quadruple (P, R, F, M) : precision, recall, fallout and miss (2007)
0.00
0.002267556 = product of:
0.013605336 = sum of:
0.013605336 = weight(_text_:information in 2011) [ClassicSimilarity], result of:
0.013605336 = score(doc=2011,freq=6.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.20156369 = fieldWeight in 2011, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=2011)
0.16666667 = coord(1/6)
- Abstract
- In an earlier paper [Egghe, L. (2004). A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface. Information Processing and Management, 40, 21-30] we showed that, given an IR system, and if P denotes precision, R recall, F fallout and M miss (re-introduced in the paper mentioned above), we have the following relationship between P, R, F and M: P/(1-P)*(1-R)/R*F/(1-F)*(1-M)/M = 1. In this paper we prove the (more difficult) converse: given any four rational numbers in the interval ]0, 1[ satisfying the above equation, then there exists an IR system such that these four numbers (in any order) are the precision, recall, fallout and miss of this IR system. As a consequence we show that any three rational numbers in ]0, 1[ represent any three measures taken from precision, recall, fallout and miss of a certain IR system. We also show that this result is also true for two numbers instead of three.
- Source
- Information processing and management. 43(2007) no.1, S.265-272
-
Egghe, L.: ¬A good normalized impact and concentration measure (2014)
0.00
0.0021819568 = product of:
0.01309174 = sum of:
0.01309174 = weight(_text_:information in 1508) [ClassicSimilarity], result of:
0.01309174 = score(doc=1508,freq=2.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.19395474 = fieldWeight in 1508, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=1508)
0.16666667 = coord(1/6)
- Source
- Journal of the Association for Information Science and Technology. 65(2014) no.10, S.2052-2054
-
Egghe, L.: Vector retrieval, fuzzy retrieval and the universal fuzzy IR surface for IR evaluation (2004)
0.00
0.0021600272 = product of:
0.012960162 = sum of:
0.012960162 = weight(_text_:information in 2531) [ClassicSimilarity], result of:
0.012960162 = score(doc=2531,freq=4.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.1920054 = fieldWeight in 2531, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=2531)
0.16666667 = coord(1/6)
- Abstract
- It is shown that vector information retrieval (IR) and general fuzzy IR uses two types of fuzzy set operations: the original "Zadeh min-max operations" and the so-called "probabilistic sum and algebraic product operations". The universal IR surface, valid for classical 0-1 IR (i.e. where ordinary sets are used) and used in IR evaluation, is extended to and reproved for vector IR, using the probabilistic sum and algebraic product model. We also show (by counterexample) that, using the "Zadeh min-max" fuzzy model, yields a breakdown of this IR surface.
- Source
- Information processing and management. 40(2004) no.4, S.603-618
-
Egghe, L.: ¬The influence of transformations on the h-index and the g-index (2008)
0.00
0.0021600272 = product of:
0.012960162 = sum of:
0.012960162 = weight(_text_:information in 1881) [ClassicSimilarity], result of:
0.012960162 = score(doc=1881,freq=4.0), product of:
0.067498945 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.03845047 = queryNorm
0.1920054 = fieldWeight in 1881, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=1881)
0.16666667 = coord(1/6)
- Abstract
- In a previous article, we introduced a general transformation on sources and one on items in an arbitrary information production process (IPP). In this article, we investigate the influence of these transformations on the h-index and on the g-index. General formulae that describe this influence are presented. These are applied to the case that the size-frequency function is Lotkaian (i.e., is a decreasing power function). We further show that the h-index of the transformed IPP belongs to the interval bounded by the two transformations of the h-index of the original IPP, and we also show that this property is not true for the g-index.
- Source
- Journal of the American Society for Information Science and Technology. 59(2008) no.8, S.1304-1312