Search (60 results, page 1 of 3)

  • × author_ss:"Egghe, L."
  1. Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 0.09
    0.08650949 = product of:
      0.14418247 = sum of:
        0.006095233 = weight(_text_:a in 4992) [ClassicSimilarity], result of:
          0.006095233 = score(doc=4992,freq=2.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.12739488 = fieldWeight in 4992, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=4992)
        0.109977536 = weight(_text_:63 in 4992) [ClassicSimilarity], result of:
          0.109977536 = score(doc=4992,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.541139 = fieldWeight in 4992, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.078125 = fieldNorm(doc=4992)
        0.02810971 = product of:
          0.05621942 = sum of:
            0.05621942 = weight(_text_:22 in 4992) [ClassicSimilarity], result of:
              0.05621942 = score(doc=4992,freq=2.0), product of:
                0.14530693 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041494574 = queryNorm
                0.38690117 = fieldWeight in 4992, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4992)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Date
    14. 2.2012 12:53:22
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.429
    Type
    a
  2. Egghe, L.: Remarks on the paper by A. De Visscher, "what does the g-index really measure?" (2012) 0.03
    0.034609932 = product of:
      0.08652483 = sum of:
        0.0095405495 = weight(_text_:a in 463) [ClassicSimilarity], result of:
          0.0095405495 = score(doc=463,freq=10.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.19940455 = fieldWeight in 463, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=463)
        0.07698428 = weight(_text_:63 in 463) [ClassicSimilarity], result of:
          0.07698428 = score(doc=463,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.37879732 = fieldWeight in 463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.0546875 = fieldNorm(doc=463)
      0.4 = coord(2/5)
    
    Abstract
    The author presents a different view on properties of impact measures than given in the paper of De Visscher (2011). He argues that a good impact measure works better when citations are concentrated rather than spread out over articles. The author also presents theoretical evidence that the g-index and the R-index can be close to the square root of the total number of citations, whereas this is not the case for the A-index. Here the author confirms an assertion of De Visscher.
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.10, S.2118-2121
    Type
    a
  3. Egghe, L.; Rousseau, R.: ¬The Hirsch index of a shifted Lotka function and its relation with the impact factor (2012) 0.03
    0.0332073 = product of:
      0.08301825 = sum of:
        0.0060339733 = weight(_text_:a in 243) [ClassicSimilarity], result of:
          0.0060339733 = score(doc=243,freq=4.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.12611452 = fieldWeight in 243, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=243)
        0.07698428 = weight(_text_:63 in 243) [ClassicSimilarity], result of:
          0.07698428 = score(doc=243,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.37879732 = fieldWeight in 243, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.0546875 = fieldNorm(doc=243)
      0.4 = coord(2/5)
    
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.5, S.1048-1053
    Type
    a
  4. Egghe, L.; Guns, R.; Rousseau, R.: Thoughts on uncitedness : Nobel laureates and Fields medalists as case studies (2011) 0.03
    0.029320324 = product of:
      0.07330081 = sum of:
        0.0073142797 = weight(_text_:a in 4994) [ClassicSimilarity], result of:
          0.0073142797 = score(doc=4994,freq=8.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.15287387 = fieldWeight in 4994, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=4994)
        0.06598653 = weight(_text_:63 in 4994) [ClassicSimilarity], result of:
          0.06598653 = score(doc=4994,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.32468343 = fieldWeight in 4994, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.046875 = fieldNorm(doc=4994)
      0.4 = coord(2/5)
    
    Abstract
    Contrary to what one might expect, Nobel laureates and Fields medalists have a rather large fraction (10% or more) of uncited publications. This is the case for (in total) 75 examined researchers from the fields of mathematics (Fields medalists), physics, chemistry, and physiology or medicine (Nobel laureates). We study several indicators for these researchers, including the h-index, total number of publications, average number of citations per publication, the number (and fraction) of uncited publications, and their interrelations. The most remarkable result is a positive correlation between the h-index and the number of uncited articles. We also present a Lotkaian model, which partially explains the empirically found regularities.
    Footnote
    Vgl.: Erratum. In: Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.429.
    Type
    a
  5. Egghe, L.; Guns, R.: Applications of the generalized law of Benford to informetric data (2012) 0.03
    0.029320324 = product of:
      0.07330081 = sum of:
        0.0073142797 = weight(_text_:a in 376) [ClassicSimilarity], result of:
          0.0073142797 = score(doc=376,freq=8.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.15287387 = fieldWeight in 376, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=376)
        0.06598653 = weight(_text_:63 in 376) [ClassicSimilarity], result of:
          0.06598653 = score(doc=376,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.32468343 = fieldWeight in 376, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.046875 = fieldNorm(doc=376)
      0.4 = coord(2/5)
    
    Abstract
    In a previous work (Egghe, 2011), the first author showed that Benford's law (describing the logarithmic distribution of the numbers 1, 2, ... , 9 as first digits of data in decimal form) is related to the classical law of Zipf with exponent 1. The work of Campanario and Coslado (2011), however, shows that Benford's law does not always fit practical data in a statistical sense. In this article, we use a generalization of Benford's law related to the general law of Zipf with exponent ? > 0. Using data from Campanario and Coslado, we apply nonlinear least squares to determine the optimal ? and show that this generalized law of Benford fits the data better than the classical law of Benford.
    Source
    Journal of the American Society for Information Science and Technology. 63(2012) no.8, S.1662-1665
    Type
    a
  6. Egghe, L.; Rousseau, R.: Introduction to informetrics : quantitative methods in library, documentation and information science (1990) 0.02
    0.015396856 = product of:
      0.07698428 = sum of:
        0.07698428 = weight(_text_:63 in 1515) [ClassicSimilarity], result of:
          0.07698428 = score(doc=1515,freq=2.0), product of:
            0.20323344 = queryWeight, product of:
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.041494574 = queryNorm
            0.37879732 = fieldWeight in 1515, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.8978314 = idf(docFreq=896, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1515)
      0.2 = coord(1/5)
    
    Signature
    63 BCGS 1010
  7. Egghe, L.: Properties of the n-overlap vector and n-overlap similarity theory (2006) 0.02
    0.015130216 = product of:
      0.03782554 = sum of:
        0.006814678 = weight(_text_:a in 194) [ClassicSimilarity], result of:
          0.006814678 = score(doc=194,freq=10.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.14243183 = fieldWeight in 194, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=194)
        0.031010862 = product of:
          0.062021725 = sum of:
            0.062021725 = weight(_text_:dewey in 194) [ClassicSimilarity], result of:
              0.062021725 = score(doc=194,freq=2.0), product of:
                0.21583907 = queryWeight, product of:
                  5.2016215 = idf(docFreq=661, maxDocs=44218)
                  0.041494574 = queryNorm
                0.2873517 = fieldWeight in 194, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.2016215 = idf(docFreq=661, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=194)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    In the first part of this article the author defines the n-overlap vector whose coordinates consist of the fraction of the objects (e.g., books, N-grams, etc.) that belong to 1, 2, , n sets (more generally: families) (e.g., libraries, databases, etc.). With the aid of the Lorenz concentration theory, a theory of n-overlap similarity is conceived together with corresponding measures, such as the generalized Jaccard index (generalizing the well-known Jaccard index in case n 5 2). Next, the distributional form of the n-overlap vector is determined assuming certain distributions of the object's and of the set (family) sizes. In this section the decreasing power law and decreasing exponential distribution is explained for the n-overlap vector. Both item (token) n-overlap and source (type) n-overlap are studied. The n-overlap properties of objects indexed by a hierarchical system (e.g., books indexed by numbers from a UDC or Dewey system or by N-grams) are presented in the final section. The author shows how the results given in the previous section can be applied as well as how the Lorenz order of the n-overlap vector is respected by an increase or a decrease of the level of refinement in the hierarchical system (e.g., the value N in N-grams).
    Type
    a
  8. Egghe, L.; Rousseau, R.: Averaging and globalising quotients of informetric and scientometric data (1996) 0.01
    0.011134898 = product of:
      0.027837245 = sum of:
        0.0109714195 = weight(_text_:a in 7659) [ClassicSimilarity], result of:
          0.0109714195 = score(doc=7659,freq=18.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.22931081 = fieldWeight in 7659, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=7659)
        0.016865825 = product of:
          0.03373165 = sum of:
            0.03373165 = weight(_text_:22 in 7659) [ClassicSimilarity], result of:
              0.03373165 = score(doc=7659,freq=2.0), product of:
                0.14530693 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041494574 = queryNorm
                0.23214069 = fieldWeight in 7659, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7659)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    It is possible, using ISI's Journal Citation Report (JCR), to calculate average impact factors (AIF) for LCR's subject categories but it can be more useful to know the global Impact Factor (GIF) of a subject category and compare the 2 values. Reports results of a study to compare the relationships between AIFs and GIFs of subjects, based on the particular case of the average impact factor of a subfield versus the impact factor of this subfield as a whole, the difference being studied between an average of quotients, denoted as AQ, and a global average, obtained as a quotient of averages, and denoted as GQ. In the case of impact factors, AQ becomes the average impact factor of a field, and GQ becomes its global impact factor. Discusses a number of applications of this technique in the context of informetrics and scientometrics
    Source
    Journal of information science. 22(1996) no.3, S.165-170
    Type
    a
  9. Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004) 0.01
    0.010017375 = product of:
      0.02504344 = sum of:
        0.008177614 = weight(_text_:a in 2558) [ClassicSimilarity], result of:
          0.008177614 = score(doc=2558,freq=10.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.1709182 = fieldWeight in 2558, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2558)
        0.016865825 = product of:
          0.03373165 = sum of:
            0.03373165 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
              0.03373165 = score(doc=2558,freq=2.0), product of:
                0.14530693 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041494574 = queryNorm
                0.23214069 = fieldWeight in 2558, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2558)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
    Date
    14. 8.2004 19:17:22
    Type
    a
  10. Egghe, L.: ¬A good normalized impact and concentration measure (2014) 0.00
    0.0029860423 = product of:
      0.014930211 = sum of:
        0.014930211 = weight(_text_:a in 1508) [ClassicSimilarity], result of:
          0.014930211 = score(doc=1508,freq=12.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.3120525 = fieldWeight in 1508, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=1508)
      0.2 = coord(1/5)
    
    Abstract
    It is shown that a normalized version of the g-index is a good normalized impact and concentration measure. A proposal for such a measure by Bartolucci is improved.
    Type
    a
  11. Egghe, L.: Untangling Herdan's law and Heaps' law : mathematical and informetric arguments (2007) 0.00
    0.0025859883 = product of:
      0.0129299415 = sum of:
        0.0129299415 = weight(_text_:a in 271) [ClassicSimilarity], result of:
          0.0129299415 = score(doc=271,freq=36.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.27024537 = fieldWeight in 271, product of:
              6.0 = tf(freq=36.0), with freq of:
                36.0 = termFreq=36.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=271)
      0.2 = coord(1/5)
    
    Abstract
    Herdan's law in linguistics and Heaps' law in information retrieval are different formulations of the same phenomenon. Stated briefly and in linguistic terms they state that vocabularies' sizes are concave increasing power laws of texts' sizes. This study investigates these laws from a purely mathematical and informetric point of view. A general informetric argument shows that the problem of proving these laws is, in fact, ill-posed. Using the more general terminology of sources and items, the author shows by presenting exact formulas from Lotkaian informetrics that the total number T of sources is not only a function of the total number A of items, but is also a function of several parameters (e.g., the parameters occurring in Lotka's law). Consequently, it is shown that a fixed T(or A) value can lead to different possible A (respectively, T) values. Limiting the T(A)-variability to increasing samples (e.g., in a text as done in linguistics) the author then shows, in a purely mathematical way, that for large sample sizes T~ A**phi, where phi is a constant, phi < 1 but close to 1, hence roughly, Heaps' or Herdan's law can be proved without using any linguistic or informetric argument. The author also shows that for smaller samples, a is not a constant but essentially decreases as confirmed by practical examples. Finally, an exact informetric argument on random sampling in the items shows that, in most cases, T= T(A) is a concavely increasing function, in accordance with practical examples.
    Type
    a
  12. Egghe, L.: ¬A rationale for the Hirsch-index rank-order distribution and a comparison with the impact factor rank-order distribution (2009) 0.00
    0.0025599978 = product of:
      0.012799989 = sum of:
        0.012799989 = weight(_text_:a in 3124) [ClassicSimilarity], result of:
          0.012799989 = score(doc=3124,freq=18.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.26752928 = fieldWeight in 3124, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3124)
      0.2 = coord(1/5)
    
    Abstract
    We present a rationale for the Hirsch-index rank-order distribution and prove that it is a power law (hence a straight line in the log-log scale). This is confirmed by experimental data of Pyykkö and by data produced in this article on 206 mathematics journals. This distribution is of a completely different nature than the impact factor (IF) rank-order distribution which (as proved in a previous article) is S-shaped. This is also confirmed by our example. Only in the log-log scale of the h-index distribution do we notice a concave deviation of the straight line for higher ranks. This phenomenon is discussed.
    Type
    a
  13. Egghe, L.: Dynamic h-index : the Hirsch index in function of time (2007) 0.00
    0.0023888338 = product of:
      0.011944169 = sum of:
        0.011944169 = weight(_text_:a in 147) [ClassicSimilarity], result of:
          0.011944169 = score(doc=147,freq=12.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.24964198 = fieldWeight in 147, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=147)
      0.2 = coord(1/5)
    
    Abstract
    When there are a group of articles and the present time is fixed we can determine the unique number h being the number of articles that received h or more citations while the other articles received a number of citations which is not larger than h. In this article, the time dependence of the h-index is determined. This is important to describe the expected career evolution of a scientist's work or of a journal's production in a fixed year.
    Type
    a
  14. Egghe, L.; Rousseau, R.: Topological aspects of information retrieval (1998) 0.00
    0.0022577061 = product of:
      0.01128853 = sum of:
        0.01128853 = weight(_text_:a in 2157) [ClassicSimilarity], result of:
          0.01128853 = score(doc=2157,freq=14.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.23593865 = fieldWeight in 2157, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2157)
      0.2 = coord(1/5)
    
    Abstract
    Let (DS, DQ, sim) be a retrieval system consisting of a document space DS, a query space QS, and a function sim, expressing the similarity between a document and a query. Following D.M. Everett and S.C. Cater (1992), we introduce topologies on the document space. These topologies are generated by the similarity function sim and the query space QS. 3 topologies will be studied: the retrieval topology, the similarity topology and the (pseudo-)metric one. It is shown that the retrieval topology is the coarsest of the three, while the (pseudo-)metric is the strongest. These 3 topologies are generally different, reflecting distinct topological aspects of information retrieval. We present necessary and sufficient conditions for these topological aspects to be equal
    Type
    a
  15. Egghe, L.: New relations between similarity measures for vectors based on vector norms (2009) 0.00
    0.002194284 = product of:
      0.0109714195 = sum of:
        0.0109714195 = weight(_text_:a in 2708) [ClassicSimilarity], result of:
          0.0109714195 = score(doc=2708,freq=18.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.22931081 = fieldWeight in 2708, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2708)
      0.2 = coord(1/5)
    
    Abstract
    The well-known similarity measures Jaccard, Salton's cosine, Dice, and several related overlap measures for vectors are compared. While general relations are not possible to prove, we study these measures on the trajectories of the form [X]=a[Y], where a > 0 is a constant and [·] denotes the Euclidean norm of a vector. In this case, direct functional relations between these measures are proved. For Jaccard, we prove that it is a convexly increasing function of Salton's cosine measure, but always smaller than or equal to the latter, hereby explaining a curve, experimentally found by Leydesdorff. All the other measures have a linear relation with Salton's cosine, reducing even to equality, in case a = 1. Hence, for equally normed vectors (e.g., for normalized vectors) we, essentially, only have Jaccard's measure and Salton's cosine measure since all the other measures are equal to the latter.
    Type
    a
  16. Egghe, L.; Liang, L.; Rousseau, R.: ¬A relation between h-index and impact factor in the power-law model (2009) 0.00
    0.0021806972 = product of:
      0.010903485 = sum of:
        0.010903485 = weight(_text_:a in 6759) [ClassicSimilarity], result of:
          0.010903485 = score(doc=6759,freq=10.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.22789092 = fieldWeight in 6759, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=6759)
      0.2 = coord(1/5)
    
    Abstract
    Using a power-law model, the two best-known topics in citation analysis, namely the impact factor and the Hirsch index, are unified into one relation (not a function). The validity of our model is, at least in a qualitative way, confirmed by real data.
    Type
    a
  17. Egghe, L.: On the relation between the association strength and other similarity measures (2010) 0.00
    0.0021806972 = product of:
      0.010903485 = sum of:
        0.010903485 = weight(_text_:a in 3598) [ClassicSimilarity], result of:
          0.010903485 = score(doc=3598,freq=10.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.22789092 = fieldWeight in 3598, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=3598)
      0.2 = coord(1/5)
    
    Abstract
    A graph in van Eck and Waltman [JASIST, 60(8), 2009, p. 1644], representing the relation between the association strength and the cosine, is partially explained as a sheaf of parabolas, each parabola being the functional relation between these similarity measures on the trajectories x*y=a, a constant. Based on earlier obtained relations between cosine and other similarity measures (e.g., Jaccard index), we can prove new relations between the association strength and these other measures.
    Type
    a
  18. Egghe, L.: Theory of the topical coverage of multiple databases (2013) 0.00
    0.0020902294 = product of:
      0.010451147 = sum of:
        0.010451147 = weight(_text_:a in 526) [ClassicSimilarity], result of:
          0.010451147 = score(doc=526,freq=12.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.21843673 = fieldWeight in 526, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=526)
      0.2 = coord(1/5)
    
    Abstract
    We present a model that describes which fraction of the literature on a certain topic we will find when we use n (n = 1, 2, .) databases. It is a generalization of the theory of discovering usability problems. We prove that, in all practical cases, this fraction is a concave function of n, the number of used databases, thereby explaining some graphs that exist in the literature. We also study limiting features of this fraction for n very high and we characterize the case that we find all literature on a certain topic for n high enough.
    Type
    a
  19. Egghe, L.: Special features of the author - publication relationship and a new explanation of Lotka's law based on convolution theory (1994) 0.00
    0.0020687906 = product of:
      0.010343953 = sum of:
        0.010343953 = weight(_text_:a in 5068) [ClassicSimilarity], result of:
          0.010343953 = score(doc=5068,freq=4.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.2161963 = fieldWeight in 5068, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=5068)
      0.2 = coord(1/5)
    
    Type
    a
  20. Egghe, L.: Note on a possible decomposition of the h-Index (2013) 0.00
    0.0020687906 = product of:
      0.010343953 = sum of:
        0.010343953 = weight(_text_:a in 683) [ClassicSimilarity], result of:
          0.010343953 = score(doc=683,freq=4.0), product of:
            0.047845192 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.041494574 = queryNorm
            0.2161963 = fieldWeight in 683, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.09375 = fieldNorm(doc=683)
      0.2 = coord(1/5)
    
    Type
    a