Search (14 results, page 1 of 1)

Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004) 0.02
```
0.020988256 = product of:
  0.05247064 = sum of:
    0.035095613 = weight(_text_:m in 2558) [ClassicSimilarity], result of:
      0.035095613 = score(doc=2558,freq=8.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.3299248 = fieldWeight in 2558, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.017375026 = product of:
      0.03475005 = sum of:
        0.03475005 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
          0.03475005 = score(doc=2558,freq=2.0), product of:
            0.14969394 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04274735 = queryNorm
            0.23214069 = fieldWeight in 2558, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2558)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.

Date

14. 8.2004 19:17:22
Egghe, L.: Mathematical theory of the h- and g-index in case of fractional counting of authorship (2008) 0.02
```
0.019584393 = product of:
  0.09792196 = sum of:
    0.09792196 = weight(_text_:g in 2004) [ClassicSimilarity], result of:
      0.09792196 = score(doc=2004,freq=12.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.60988986 = fieldWeight in 2004, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=2004)
  0.2 = coord(1/5)
```
Abstract

This article studies the h-index (Hirsch index) and the g-index of authors, in case one counts authorship of the cited articles in a fractional way. There are two ways to do this: One counts the citations to these papers in a fractional way or one counts the ranks of the papers in a fractional way as credit for an author. In both cases, we define the fractional h- and g-indexes, and we present inequalities (both upper and lower bounds) between these fractional h- and g-indexes and their corresponding unweighted values (also involving, of course, the coauthorship distribution). Wherever applicable, examples and counterexamples are provided. In a concrete example (the publication citation list of the present author), we make explicit calculations of these fractional h- and g-indexes and show that they are not very different from the unweighted ones.

Object

g-index

Egghe, L.: ¬A good normalized impact and concentration measure (2014) 0.02

0.01884509 = product of:
  0.09422545 = sum of:
    0.09422545 = weight(_text_:g in 1508) [ClassicSimilarity], result of:
      0.09422545 = score(doc=1508,freq=4.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.5868668 = fieldWeight in 1508, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.078125 = fieldNorm(doc=1508)
  0.2 = coord(1/5)

Abstract: It is shown that a normalized version of the g-index is a good normalized impact and concentration measure. A proposal for such a measure by Bartolucci is improved.
Object: g-index

Egghe, L.: ¬The influence of transformations on the h-index and the g-index (2008) 0.02
```
0.018655688 = product of:
  0.09327844 = sum of:
    0.09327844 = weight(_text_:g in 1881) [ClassicSimilarity], result of:
      0.09327844 = score(doc=1881,freq=8.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.5809685 = fieldWeight in 1881, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
  0.2 = coord(1/5)
```
Abstract

In a previous article, we introduced a general transformation on sources and one on items in an arbitrary information production process (IPP). In this article, we investigate the influence of these transformations on the h-index and on the g-index. General formulae that describe this influence are presented. These are applied to the case that the size-frequency function is Lotkaian (i.e., is a decreasing power function). We further show that the h-index of the transformed IPP belongs to the interval bounded by the two transformations of the h-index of the original IPP, and we also show that this property is not true for the g-index.

Object

g-index
Egghe, L.: Remarks on the paper by A. De Visscher, "what does the g-index really measure?" (2012) 0.02
```
0.016156299 = product of:
  0.0807815 = sum of:
    0.0807815 = weight(_text_:g in 463) [ClassicSimilarity], result of:
      0.0807815 = score(doc=463,freq=6.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.5031335 = fieldWeight in 463, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
  0.2 = coord(1/5)
```
Abstract

The author presents a different view on properties of impact measures than given in the paper of De Visscher (2011). He argues that a good impact measure works better when citations are concentrated rather than spread out over articles. The author also presents theoretical evidence that the g-index and the R-index can be close to the square root of the total number of citations, whereas this is not the case for the A-index. Here the author confirms an assertion of De Visscher.

Object

g-index
Egghe, L.: Empirical and combinatorial study of country occurrences in multi-authored papers (2006) 0.01
```
0.0107218865 = product of:
  0.05360943 = sum of:
    0.05360943 = weight(_text_:m in 81) [ClassicSimilarity], result of:
      0.05360943 = score(doc=81,freq=42.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.5039684 = fieldWeight in 81, product of:
          6.4807405 = tf(freq=42.0), with freq of:
            42.0 = termFreq=42.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
  0.2 = coord(1/5)
```
Abstract

Papers written by several authors can be classified according to the countries of the author affiliations. The empirical part of this paper consists of two datasets. One dataset consists of 1,035 papers retrieved via the search "pedagog*" in the years 2004 and 2005 (up to October) in Academic Search Elite which is a case where phi(m) = the number of papers with m =1, 2,3 ... authors is decreasing, hence most of the papers have a low number of authors. Here we find that #, m = the number of times a country occurs j times in a m-authored paper, j =1, ..., m-1 is decreasing and that # m, m is much higher than all the other #j, m values. The other dataset consists of 3,271 papers retrieved via the search "enzyme" in the year 2005 (up to October) in the same database which is a case of a non-decreasing phi(m): most papers have 3 or 4 authors and we even find many papers with a much higher number of authors. In this case we show again that # m, m is much higher than the other #j, m values but that #j, m is not decreasing anymore in j =1, ..., m-1, although #1, m is (apart from # m, m) the largest number amongst the #j,m. The combinatorial part gives a proof of the fact that #j,m decreases for j = 1, m-1, supposing that all cases are equally possible. This shows that the first dataset is more conform with this model than the second dataset. Explanations for these findings are given. From the data we also find the (we think: new) distribution of number of papers with n =1, 2,3,... countries (i.e. where there are n different countries involved amongst the m (a n) authors of a paper): a fast decreasing function e.g. as a power law with a very large Lotka exponent.
Egghe, L.: Mathematical study of h-index sequences (2009) 0.01
```
0.009422545 = product of:
  0.047112726 = sum of:
    0.047112726 = weight(_text_:g in 4217) [ClassicSimilarity], result of:
      0.047112726 = score(doc=4217,freq=4.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.2934334 = fieldWeight in 4217, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4217)
  0.2 = coord(1/5)
```
Abstract

This paper studies mathematical properties of h-index sequences as developed by Liang [Liang, L. (2006). h-Index sequence and h-index matrix: Constructions and applications. Scientometrics, 69(1), 153-159]. For practical reasons, Liming studies such sequences where the time goes backwards while it is more logical to use the time going forward (real career periods). Both type of h-index sequences are studied here and their interrelations are revealed. We show cases where these sequences are convex, linear and concave. We also show that, when one of the sequences is convex then the other one is concave, showing that the reverse-time sequence, in general, cannot be used to derive similar properties of the (difficult to obtain) forward time sequence. We show that both sequences are the same if and only if the author produces the same number of papers per year. If the author produces an increasing number of papers per year, then Liang's h-sequences are above the "normal" ones. All these results are also valid for g- and R-sequences. The results are confirmed by the h-, g- and R-sequences (forward and reverse time) of the author.

Egghe, L.: On the law of Zipf-Mandelbrot for multi-word phrases (1999) 0.01

0.00935883 = product of:
  0.04679415 = sum of:
    0.04679415 = weight(_text_:m in 3058) [ClassicSimilarity], result of:
      0.04679415 = score(doc=3058,freq=8.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.4398997 = fieldWeight in 3058, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
  0.2 = coord(1/5)

Abstract: This article studies the probabilities of the occurence of multi-word (m-word) phrases (m=2,3,...) in relation to the probabilities of occurence of the single words. It is well known that, in the latter case, the lae of Zipf is valid (i.e., a power law). We prove that in the case of m-word phrases (m>=2), this is not the case. We present 2 independent proof of this

Egghe, L.: Existence theorem of the quadruple (P, R, F, M) : precision, recall, fallout and miss (2007) 0.01
```
0.008596634 = product of:
  0.042983167 = sum of:
    0.042983167 = weight(_text_:m in 2011) [ClassicSimilarity], result of:
      0.042983167 = score(doc=2011,freq=12.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.40407366 = fieldWeight in 2011, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
  0.2 = coord(1/5)
```
Abstract

In an earlier paper [Egghe, L. (2004). A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface. Information Processing and Management, 40, 21-30] we showed that, given an IR system, and if P denotes precision, R recall, F fallout and M miss (re-introduced in the paper mentioned above), we have the following relationship between P, R, F and M: P/(1-P)*(1-R)/R*F/(1-F)*(1-M)/M = 1. In this paper we prove the (more difficult) converse: given any four rational numbers in the interval ]0, 1[ satisfying the above equation, then there exists an IR system such that these four numbers (in any order) are the precision, recall, fallout and miss of this IR system. As a consequence we show that any three rational numbers in ]0, 1[ represent any three measures taken from precision, recall, fallout and miss of a certain IR system. We also show that this result is also true for two numbers instead of three.

Egghe, L.; Rousseau, R.; Hooydonk, G. van: Methods for accrediting publications to authors or countries : consequences for evaluation studies (2000) 0.01

0.007995294 = product of:
  0.03997647 = sum of:
    0.03997647 = weight(_text_:g in 4384) [ClassicSimilarity], result of:
      0.03997647 = score(doc=4384,freq=2.0), product of:
        0.1605568 = queryWeight, product of:
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.04274735 = queryNorm
        0.24898648 = fieldWeight in 4384, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.7559474 = idf(docFreq=2809, maxDocs=44218)
          0.046875 = fieldNorm(doc=4384)
  0.2 = coord(1/5)

Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 0.01

0.005791676 = product of:
  0.02895838 = sum of:
    0.02895838 = product of:
      0.05791676 = sum of:
        0.05791676 = weight(_text_:22 in 4992) [ClassicSimilarity], result of:
          0.05791676 = score(doc=4992,freq=2.0), product of:
            0.14969394 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04274735 = queryNorm
            0.38690117 = fieldWeight in 4992, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4992)
      0.5 = coord(1/2)
  0.2 = coord(1/5)

Date: 14. 2.2012 12:53:22

Egghe, L.; Rousseau, R.: Introduction to informetrics : quantitative methods in library, documentation and information science (1990) 0.00

0.0040944885 = product of:
  0.02047244 = sum of:
    0.02047244 = weight(_text_:m in 1515) [ClassicSimilarity], result of:
      0.02047244 = score(doc=1515,freq=2.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.19245613 = fieldWeight in 1515, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1515)
  0.2 = coord(1/5)

Type: m

Egghe, L.; Rousseau, R.: Averaging and globalising quotients of informetric and scientometric data (1996) 0.00

0.0034750053 = product of:
  0.017375026 = sum of:
    0.017375026 = product of:
      0.03475005 = sum of:
        0.03475005 = weight(_text_:22 in 7659) [ClassicSimilarity], result of:
          0.03475005 = score(doc=7659,freq=2.0), product of:
            0.14969394 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04274735 = queryNorm
            0.23214069 = fieldWeight in 7659, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=7659)
      0.5 = coord(1/2)
  0.2 = coord(1/5)

Source: Journal of information science. 22(1996) no.3, S.165-170

Rousseau, R.; Egghe, L.; Guns, R.: Becoming metric-wise : a bibliometric guide for researchers (2018) 0.00

0.0029246346 = product of:
  0.014623173 = sum of:
    0.014623173 = weight(_text_:m in 5226) [ClassicSimilarity], result of:
      0.014623173 = score(doc=5226,freq=2.0), product of:
        0.106374584 = queryWeight, product of:
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.04274735 = queryNorm
        0.13746867 = fieldWeight in 5226, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.4884486 = idf(docFreq=9980, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5226)
  0.2 = coord(1/5)

Type: m

Search (14 results, page 1 of 1)

Authors

Years

Types

Subjects

Classifications