Search (60 results, page 1 of 3)

Egghe, L.: ¬A universal method of information retrieval evaluation : the "missing" link M and the universal IR surface (2004) 0.08

0.075962216 = product of:
  0.13022095 = sum of:
    0.021355685 = product of:
      0.06406705 = sum of:
        0.06406705 = weight(_text_:f in 2558) [ClassicSimilarity], result of:
          0.06406705 = score(doc=2558,freq=6.0), product of:
            0.13999219 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.035122856 = queryNorm
            0.45764732 = fieldWeight in 2558, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=2558)
      0.33333334 = coord(1/3)
    0.012427893 = weight(_text_:information in 2558) [ClassicSimilarity], result of:
      0.012427893 = score(doc=2558,freq=6.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.20156369 = fieldWeight in 2558, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.014216291 = weight(_text_:for in 2558) [ClassicSimilarity], result of:
      0.014216291 = score(doc=2558,freq=6.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.21557912 = fieldWeight in 2558, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.02592062 = weight(_text_:the in 2558) [ClassicSimilarity], result of:
      0.02592062 = score(doc=2558,freq=40.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4677496 = fieldWeight in 2558, product of:
          6.3245554 = tf(freq=40.0), with freq of:
            40.0 = termFreq=40.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.016103853 = weight(_text_:of in 2558) [ClassicSimilarity], result of:
      0.016103853 = score(doc=2558,freq=16.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.2932045 = fieldWeight in 2558, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.02592062 = weight(_text_:the in 2558) [ClassicSimilarity], result of:
      0.02592062 = score(doc=2558,freq=40.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4677496 = fieldWeight in 2558, product of:
          6.3245554 = tf(freq=40.0), with freq of:
            40.0 = termFreq=40.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2558)
    0.014275986 = product of:
      0.028551972 = sum of:
        0.028551972 = weight(_text_:22 in 2558) [ClassicSimilarity], result of:
          0.028551972 = score(doc=2558,freq=2.0), product of:
            0.12299426 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035122856 = queryNorm
            0.23214069 = fieldWeight in 2558, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=2558)
      0.5 = coord(1/2)
  0.5833333 = coord(7/12)

Abstract: The paper shows that the present evaluation methods in information retrieval (basically recall R and precision P and in some cases fallout F ) lack universal comparability in the sense that their values depend on the generality of the IR problem. A solution is given by using all "parts" of the database, including the non-relevant documents and also the not-retrieved documents. It turns out that the solution is given by introducing the measure M being the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that - independent of the IR problem or of the IR action - the quadruple (P,R,F,M) belongs to a universal IR surface, being the same for all IR-activities. This universality is then exploited by defining a new measure for evaluation in IR allowing for unbiased comparisons of all IR results. We also show that only using one, two or even three measures from the set {P,R,F,M} necessary leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
Date: 14. 8.2004 19:17:22
Source: Information processing and management. 40(2004) no.1, S.21-30

Egghe, L.: Existence theorem of the quadruple (P, R, F, M) : precision, recall, fallout and miss (2007) 0.05

0.047856554 = product of:
  0.09571311 = sum of:
    0.027570067 = product of:
      0.0827102 = sum of:
        0.0827102 = weight(_text_:f in 2011) [ClassicSimilarity], result of:
          0.0827102 = score(doc=2011,freq=10.0), product of:
            0.13999219 = queryWeight, product of:
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.035122856 = queryNorm
            0.5908201 = fieldWeight in 2011, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.985786 = idf(docFreq=2232, maxDocs=44218)
              0.046875 = fieldNorm(doc=2011)
      0.33333334 = coord(1/3)
    0.012427893 = weight(_text_:information in 2011) [ClassicSimilarity], result of:
      0.012427893 = score(doc=2011,freq=6.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.20156369 = fieldWeight in 2011, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
    0.008207779 = weight(_text_:for in 2011) [ClassicSimilarity], result of:
      0.008207779 = score(doc=2011,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.12446466 = fieldWeight in 2011, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
    0.01738808 = weight(_text_:the in 2011) [ClassicSimilarity], result of:
      0.01738808 = score(doc=2011,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.31377596 = fieldWeight in 2011, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
    0.012731214 = weight(_text_:of in 2011) [ClassicSimilarity], result of:
      0.012731214 = score(doc=2011,freq=10.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.23179851 = fieldWeight in 2011, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
    0.01738808 = weight(_text_:the in 2011) [ClassicSimilarity], result of:
      0.01738808 = score(doc=2011,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.31377596 = fieldWeight in 2011, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2011)
  0.5 = coord(6/12)

Abstract: In an earlier paper [Egghe, L. (2004). A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface. Information Processing and Management, 40, 21-30] we showed that, given an IR system, and if P denotes precision, R recall, F fallout and M miss (re-introduced in the paper mentioned above), we have the following relationship between P, R, F and M: P/(1-P)*(1-R)/R*F/(1-F)*(1-M)/M = 1. In this paper we prove the (more difficult) converse: given any four rational numbers in the interval ]0, 1[ satisfying the above equation, then there exists an IR system such that these four numbers (in any order) are the precision, recall, fallout and miss of this IR system. As a consequence we show that any three rational numbers in ]0, 1[ represent any three measures taken from precision, recall, fallout and miss of a certain IR system. We also show that this result is also true for two numbers instead of three.
Source: Information processing and management. 43(2007) no.1, S.265-272

Egghe, L.; Rousseau, R.: Averaging and globalising quotients of informetric and scientometric data (1996) 0.04

0.044183705 = product of:
  0.08836741 = sum of:
    0.007175247 = weight(_text_:information in 7659) [ClassicSimilarity], result of:
      0.007175247 = score(doc=7659,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.116372846 = fieldWeight in 7659, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=7659)
    0.008207779 = weight(_text_:for in 7659) [ClassicSimilarity], result of:
      0.008207779 = score(doc=7659,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.12446466 = fieldWeight in 7659, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=7659)
    0.018328644 = weight(_text_:the in 7659) [ClassicSimilarity], result of:
      0.018328644 = score(doc=7659,freq=20.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.3307489 = fieldWeight in 7659, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=7659)
    0.02205111 = weight(_text_:of in 7659) [ClassicSimilarity], result of:
      0.02205111 = score(doc=7659,freq=30.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.4014868 = fieldWeight in 7659, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=7659)
    0.018328644 = weight(_text_:the in 7659) [ClassicSimilarity], result of:
      0.018328644 = score(doc=7659,freq=20.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.3307489 = fieldWeight in 7659, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=7659)
    0.014275986 = product of:
      0.028551972 = sum of:
        0.028551972 = weight(_text_:22 in 7659) [ClassicSimilarity], result of:
          0.028551972 = score(doc=7659,freq=2.0), product of:
            0.12299426 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035122856 = queryNorm
            0.23214069 = fieldWeight in 7659, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=7659)
      0.5 = coord(1/2)
  0.5 = coord(6/12)

Abstract: It is possible, using ISI's Journal Citation Report (JCR), to calculate average impact factors (AIF) for LCR's subject categories but it can be more useful to know the global Impact Factor (GIF) of a subject category and compare the 2 values. Reports results of a study to compare the relationships between AIFs and GIFs of subjects, based on the particular case of the average impact factor of a subfield versus the impact factor of this subfield as a whole, the difference being studied between an average of quotients, denoted as AQ, and a global average, obtained as a quotient of averages, and denoted as GQ. In the case of impact factors, AQ becomes the average impact factor of a field, and GQ becomes its global impact factor. Discusses a number of applications of this technique in the context of informetrics and scientometrics
Source: Journal of information science. 22(1996) no.3, S.165-170

Egghe, L.: On the law of Zipf-Mandelbrot for multi-word phrases (1999) 0.04

0.040289335 = product of:
  0.09669441 = sum of:
    0.009566996 = weight(_text_:information in 3058) [ClassicSimilarity], result of:
      0.009566996 = score(doc=3058,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.1551638 = fieldWeight in 3058, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
    0.015476737 = weight(_text_:for in 3058) [ClassicSimilarity], result of:
      0.015476737 = score(doc=3058,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.23469281 = fieldWeight in 3058, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
    0.024438193 = weight(_text_:the in 3058) [ClassicSimilarity], result of:
      0.024438193 = score(doc=3058,freq=20.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.44099852 = fieldWeight in 3058, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
    0.022774287 = weight(_text_:of in 3058) [ClassicSimilarity], result of:
      0.022774287 = score(doc=3058,freq=18.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.41465375 = fieldWeight in 3058, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
    0.024438193 = weight(_text_:the in 3058) [ClassicSimilarity], result of:
      0.024438193 = score(doc=3058,freq=20.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.44099852 = fieldWeight in 3058, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=3058)
  0.41666666 = coord(5/12)

Abstract: This article studies the probabilities of the occurence of multi-word (m-word) phrases (m=2,3,...) in relation to the probabilities of occurence of the single words. It is well known that, in the latter case, the lae of Zipf is valid (i.e., a power law). We prove that in the case of m-word phrases (m>=2), this is not the case. We present 2 independent proof of this
Source: Journal of the American Society for Information Science. 50(1999) no.3, S.233-241

Egghe, L.; Rousseau, R.: ¬A measure for the cohesion of weighted networks (2003) 0.04

0.040250868 = product of:
  0.09660208 = sum of:
    0.005979372 = weight(_text_:information in 5157) [ClassicSimilarity], result of:
      0.005979372 = score(doc=5157,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.09697737 = fieldWeight in 5157, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5157)
    0.01675406 = weight(_text_:for in 5157) [ClassicSimilarity], result of:
      0.01675406 = score(doc=5157,freq=12.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.2540624 = fieldWeight in 5157, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5157)
    0.027746364 = weight(_text_:the in 5157) [ClassicSimilarity], result of:
      0.027746364 = score(doc=5157,freq=66.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.500696 = fieldWeight in 5157, product of:
          8.124039 = tf(freq=66.0), with freq of:
            66.0 = termFreq=66.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5157)
    0.018375926 = weight(_text_:of in 5157) [ClassicSimilarity], result of:
      0.018375926 = score(doc=5157,freq=30.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.33457235 = fieldWeight in 5157, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5157)
    0.027746364 = weight(_text_:the in 5157) [ClassicSimilarity], result of:
      0.027746364 = score(doc=5157,freq=66.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.500696 = fieldWeight in 5157, product of:
          8.124039 = tf(freq=66.0), with freq of:
            66.0 = termFreq=66.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5157)
  0.41666666 = coord(5/12)

Abstract: Measurement of the degree of interconnectedness in graph like networks of hyperlinks or citations can indicate the existence of research fields and assist in comparative evaluation of research efforts. In this issue we begin with Egghe and Rousseau who review compactness measures and investigate the compactness of a network as a weighted graph with dissimilarity values characterizing the arcs between nodes. They make use of a generalization of the Botofogo, Rivlin, Shneiderman, (BRS) compaction measure which treats the distance between unreachable nodes not as infinity but rather as the number of nodes in the network. The dissimilarity values are determined by summing the reciprocals of the weights of the arcs in the shortest chain between two nodes where no weight is smaller than one. The BRS measure is then the maximum value for the sum of the dissimilarity measures less the actual sum divided by the difference between the maximum and minimum. The Wiener index, the sum of all elements in the dissimilarity matrix divided by two, is then computed for Small's particle physics co-citation data as well as the BRS measure, the dissimilarity values and shortest paths. The compactness measure for the weighted network is smaller than for the un-weighted. When the bibliographic coupling network is utilized it is shown to be less compact than the co-citation network which indicates that the new measure produces results that confirm to an obvious case.
Source: Journal of the American Society for Information Science and technology. 54(2003) no.3, S.193-202

Egghe, L.: ¬The influence of transformations on the h-index and the g-index (2008) 0.04

0.039894864 = product of:
  0.09574768 = sum of:
    0.0118385535 = weight(_text_:information in 1881) [ClassicSimilarity], result of:
      0.0118385535 = score(doc=1881,freq=4.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.1920054 = fieldWeight in 1881, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
    0.013542145 = weight(_text_:for in 1881) [ClassicSimilarity], result of:
      0.013542145 = score(doc=1881,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.20535621 = fieldWeight in 1881, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
    0.027048122 = weight(_text_:the in 1881) [ClassicSimilarity], result of:
      0.027048122 = score(doc=1881,freq=32.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4880959 = fieldWeight in 1881, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
    0.016270736 = weight(_text_:of in 1881) [ClassicSimilarity], result of:
      0.016270736 = score(doc=1881,freq=12.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.29624295 = fieldWeight in 1881, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
    0.027048122 = weight(_text_:the in 1881) [ClassicSimilarity], result of:
      0.027048122 = score(doc=1881,freq=32.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4880959 = fieldWeight in 1881, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1881)
  0.41666666 = coord(5/12)

Abstract: In a previous article, we introduced a general transformation on sources and one on items in an arbitrary information production process (IPP). In this article, we investigate the influence of these transformations on the h-index and on the g-index. General formulae that describe this influence are presented. These are applied to the case that the size-frequency function is Lotkaian (i.e., is a decreasing power function). We further show that the h-index of the transformed IPP belongs to the interval bounded by the two transformations of the h-index of the original IPP, and we also show that this property is not true for the g-index.
Source: Journal of the American Society for Information Science and Technology. 59(2008) no.8, S.1304-1312

Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 0.04

0.03912053 = product of:
  0.07824106 = sum of:
    0.011958744 = weight(_text_:information in 4992) [ClassicSimilarity], result of:
      0.011958744 = score(doc=4992,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.19395474 = fieldWeight in 4992, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.078125 = fieldNorm(doc=4992)
    0.013679632 = weight(_text_:for in 4992) [ClassicSimilarity], result of:
      0.013679632 = score(doc=4992,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.20744109 = fieldWeight in 4992, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.078125 = fieldNorm(doc=4992)
    0.009660044 = weight(_text_:the in 4992) [ClassicSimilarity], result of:
      0.009660044 = score(doc=4992,freq=2.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.17431997 = fieldWeight in 4992, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.078125 = fieldNorm(doc=4992)
    0.009489287 = weight(_text_:of in 4992) [ClassicSimilarity], result of:
      0.009489287 = score(doc=4992,freq=2.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.17277241 = fieldWeight in 4992, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.078125 = fieldNorm(doc=4992)
    0.009660044 = weight(_text_:the in 4992) [ClassicSimilarity], result of:
      0.009660044 = score(doc=4992,freq=2.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.17431997 = fieldWeight in 4992, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.078125 = fieldNorm(doc=4992)
    0.023793312 = product of:
      0.047586624 = sum of:
        0.047586624 = weight(_text_:22 in 4992) [ClassicSimilarity], result of:
          0.047586624 = score(doc=4992,freq=2.0), product of:
            0.12299426 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.035122856 = queryNorm
            0.38690117 = fieldWeight in 4992, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4992)
      0.5 = coord(1/2)
  0.5 = coord(6/12)

Date: 14. 2.2012 12:53:22
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.429

Egghe, L.: Informetric explanation of some Leiden Ranking graphs (2014) 0.04

0.03827717 = product of:
  0.09186521 = sum of:
    0.009566996 = weight(_text_:information in 1236) [ClassicSimilarity], result of:
      0.009566996 = score(doc=1236,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.1551638 = fieldWeight in 1236, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=1236)
    0.018955056 = weight(_text_:for in 1236) [ClassicSimilarity], result of:
      0.018955056 = score(doc=1236,freq=6.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.28743884 = fieldWeight in 1236, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0625 = fieldNorm(doc=1236)
    0.023184106 = weight(_text_:the in 1236) [ClassicSimilarity], result of:
      0.023184106 = score(doc=1236,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.41836792 = fieldWeight in 1236, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=1236)
    0.016974952 = weight(_text_:of in 1236) [ClassicSimilarity], result of:
      0.016974952 = score(doc=1236,freq=10.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.3090647 = fieldWeight in 1236, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=1236)
    0.023184106 = weight(_text_:the in 1236) [ClassicSimilarity], result of:
      0.023184106 = score(doc=1236,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.41836792 = fieldWeight in 1236, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=1236)
  0.41666666 = coord(5/12)

Abstract: The S-shaped functional relation between the mean citation score and the proportion of top 10% publications for the 500 Leiden Ranking universities is explained using results of the shifted Lotka function. Also the concave or convex relation between the proportion of top 100?% publications, for different fractions ?, is explained using the obtained new informetric model.
Source: Journal of the Association for Information Science and Technology. 65(2014) no.4, S.737-741

Egghe, L.; Rousseau, R.: Topological aspects of information retrieval (1998) 0.04

0.038190056 = product of:
  0.09165613 = sum of:
    0.014499208 = weight(_text_:information in 2157) [ClassicSimilarity], result of:
      0.014499208 = score(doc=2157,freq=6.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.23515764 = fieldWeight in 2157, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2157)
    0.013542145 = weight(_text_:for in 2157) [ClassicSimilarity], result of:
      0.013542145 = score(doc=2157,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.20535621 = fieldWeight in 2157, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2157)
    0.024380848 = weight(_text_:the in 2157) [ClassicSimilarity], result of:
      0.024380848 = score(doc=2157,freq=26.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4399637 = fieldWeight in 2157, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2157)
    0.0148530835 = weight(_text_:of in 2157) [ClassicSimilarity], result of:
      0.0148530835 = score(doc=2157,freq=10.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.2704316 = fieldWeight in 2157, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2157)
    0.024380848 = weight(_text_:the in 2157) [ClassicSimilarity], result of:
      0.024380848 = score(doc=2157,freq=26.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4399637 = fieldWeight in 2157, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2157)
  0.41666666 = coord(5/12)

Abstract: Let (DS, DQ, sim) be a retrieval system consisting of a document space DS, a query space QS, and a function sim, expressing the similarity between a document and a query. Following D.M. Everett and S.C. Cater (1992), we introduce topologies on the document space. These topologies are generated by the similarity function sim and the query space QS. 3 topologies will be studied: the retrieval topology, the similarity topology and the (pseudo-)metric one. It is shown that the retrieval topology is the coarsest of the three, while the (pseudo-)metric is the strongest. These 3 topologies are generally different, reflecting distinct topological aspects of information retrieval. We present necessary and sufficient conditions for these topological aspects to be equal
Source: Journal of the American Society for Information Science. 49(1998) no.13, S.1144-1160

Egghe, L.: Dynamic h-index : the Hirsch index in function of time (2007) 0.04

0.0368128 = product of:
  0.08835072 = sum of:
    0.009566996 = weight(_text_:information in 147) [ClassicSimilarity], result of:
      0.009566996 = score(doc=147,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.1551638 = fieldWeight in 147, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0625 = fieldNorm(doc=147)
    0.010943705 = weight(_text_:for in 147) [ClassicSimilarity], result of:
      0.010943705 = score(doc=147,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.16595288 = fieldWeight in 147, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0625 = fieldNorm(doc=147)
    0.023184106 = weight(_text_:the in 147) [ClassicSimilarity], result of:
      0.023184106 = score(doc=147,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.41836792 = fieldWeight in 147, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=147)
    0.021471804 = weight(_text_:of in 147) [ClassicSimilarity], result of:
      0.021471804 = score(doc=147,freq=16.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.39093933 = fieldWeight in 147, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0625 = fieldNorm(doc=147)
    0.023184106 = weight(_text_:the in 147) [ClassicSimilarity], result of:
      0.023184106 = score(doc=147,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.41836792 = fieldWeight in 147, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0625 = fieldNorm(doc=147)
  0.41666666 = coord(5/12)

Abstract: When there are a group of articles and the present time is fixed we can determine the unique number h being the number of articles that received h or more citations while the other articles received a number of citations which is not larger than h. In this article, the time dependence of the h-index is determined. This is important to describe the expected career evolution of a scientist's work or of a journal's production in a fixed year.
Source: Journal of the American Society for Information Science and Technology. 58(2007) no.3, S.452-454

Egghe, L.: Note on a possible decomposition of the h-Index (2013) 0.04

0.036260813 = product of:
  0.087025955 = sum of:
    0.014350494 = weight(_text_:information in 683) [ClassicSimilarity], result of:
      0.014350494 = score(doc=683,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.23274569 = fieldWeight in 683, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.09375 = fieldNorm(doc=683)
    0.016415559 = weight(_text_:for in 683) [ClassicSimilarity], result of:
      0.016415559 = score(doc=683,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.24892932 = fieldWeight in 683, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.09375 = fieldNorm(doc=683)
    0.020078024 = weight(_text_:the in 683) [ClassicSimilarity], result of:
      0.020078024 = score(doc=683,freq=6.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36231726 = fieldWeight in 683, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.09375 = fieldNorm(doc=683)
    0.016103853 = weight(_text_:of in 683) [ClassicSimilarity], result of:
      0.016103853 = score(doc=683,freq=4.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.2932045 = fieldWeight in 683, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.09375 = fieldNorm(doc=683)
    0.020078024 = weight(_text_:the in 683) [ClassicSimilarity], result of:
      0.020078024 = score(doc=683,freq=6.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36231726 = fieldWeight in 683, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.09375 = fieldNorm(doc=683)
  0.41666666 = coord(5/12)

Series: Letter to the editor
Source: Journal of the American Society for Information Science and Technology. 64(2013) no.4, S.871

Egghe, L.: Remarks on the paper by A. De Visscher, "what does the g-index really measure?" (2012) 0.04

0.03622737 = product of:
  0.0869457 = sum of:
    0.008371122 = weight(_text_:information in 463) [ClassicSimilarity], result of:
      0.008371122 = score(doc=463,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.13576832 = fieldWeight in 463, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
    0.013542145 = weight(_text_:for in 463) [ClassicSimilarity], result of:
      0.013542145 = score(doc=463,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.20535621 = fieldWeight in 463, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
    0.024380848 = weight(_text_:the in 463) [ClassicSimilarity], result of:
      0.024380848 = score(doc=463,freq=26.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4399637 = fieldWeight in 463, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
    0.016270736 = weight(_text_:of in 463) [ClassicSimilarity], result of:
      0.016270736 = score(doc=463,freq=12.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.29624295 = fieldWeight in 463, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
    0.024380848 = weight(_text_:the in 463) [ClassicSimilarity], result of:
      0.024380848 = score(doc=463,freq=26.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4399637 = fieldWeight in 463, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=463)
  0.41666666 = coord(5/12)

Abstract: The author presents a different view on properties of impact measures than given in the paper of De Visscher (2011). He argues that a good impact measure works better when citations are concentrated rather than spread out over articles. The author also presents theoretical evidence that the g-index and the R-index can be close to the square root of the total number of citations, whereas this is not the case for the A-index. Here the author confirms an assertion of De Visscher.
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.10, S.2118-2121

Egghe, L.; Rousseau, R.: ¬The Hirsch index of a shifted Lotka function and its relation with the impact factor (2012) 0.04

0.03620675 = product of:
  0.086896196 = sum of:
    0.008371122 = weight(_text_:information in 243) [ClassicSimilarity], result of:
      0.008371122 = score(doc=243,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.13576832 = fieldWeight in 243, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=243)
    0.009575742 = weight(_text_:for in 243) [ClassicSimilarity], result of:
      0.009575742 = score(doc=243,freq=2.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.14520876 = fieldWeight in 243, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=243)
    0.027048122 = weight(_text_:the in 243) [ClassicSimilarity], result of:
      0.027048122 = score(doc=243,freq=32.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4880959 = fieldWeight in 243, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=243)
    0.0148530835 = weight(_text_:of in 243) [ClassicSimilarity], result of:
      0.0148530835 = score(doc=243,freq=10.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.2704316 = fieldWeight in 243, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=243)
    0.027048122 = weight(_text_:the in 243) [ClassicSimilarity], result of:
      0.027048122 = score(doc=243,freq=32.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4880959 = fieldWeight in 243, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=243)
  0.41666666 = coord(5/12)

Abstract: Based on earlier results about the shifted Lotka function, we prove an implicit functional relation between the Hirsch index (h-index) and the total number of sources (T). It is shown that the corresponding function, h(T), is concavely increasing. Next, we construct an implicit relation between the h-index and the impact factor IF (an average number of items per source). The corresponding function h(IF) is increasing and we show that if the parameter C in the numerator of the shifted Lotka function is high, then the relation between the h-index and the impact factor is almost linear.
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.5, S.1048-1053

Egghe, L.: Empirical and combinatorial study of country occurrences in multi-authored papers (2006) 0.04

0.036196023 = product of:
  0.07239205 = sum of:
    0.004783498 = weight(_text_:information in 81) [ClassicSimilarity], result of:
      0.004783498 = score(doc=81,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.0775819 = fieldWeight in 81, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
    0.007624971 = weight(_text_:und in 81) [ClassicSimilarity], result of:
      0.007624971 = score(doc=81,freq=2.0), product of:
        0.07784514 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.035122856 = queryNorm
        0.09795051 = fieldWeight in 81, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
    0.0077383686 = weight(_text_:for in 81) [ClassicSimilarity], result of:
      0.0077383686 = score(doc=81,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.117346406 = fieldWeight in 81, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
    0.018531177 = weight(_text_:the in 81) [ClassicSimilarity], result of:
      0.018531177 = score(doc=81,freq=46.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.3344037 = fieldWeight in 81, product of:
          6.78233 = tf(freq=46.0), with freq of:
            46.0 = termFreq=46.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
    0.015182858 = weight(_text_:of in 81) [ClassicSimilarity], result of:
      0.015182858 = score(doc=81,freq=32.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.27643585 = fieldWeight in 81, product of:
          5.656854 = tf(freq=32.0), with freq of:
            32.0 = termFreq=32.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
    0.018531177 = weight(_text_:the in 81) [ClassicSimilarity], result of:
      0.018531177 = score(doc=81,freq=46.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.3344037 = fieldWeight in 81, product of:
          6.78233 = tf(freq=46.0), with freq of:
            46.0 = termFreq=46.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.03125 = fieldNorm(doc=81)
  0.5 = coord(6/12)

Abstract: Papers written by several authors can be classified according to the countries of the author affiliations. The empirical part of this paper consists of two datasets. One dataset consists of 1,035 papers retrieved via the search "pedagog*" in the years 2004 and 2005 (up to October) in Academic Search Elite which is a case where phi(m) = the number of papers with m =1, 2,3 ... authors is decreasing, hence most of the papers have a low number of authors. Here we find that #, m = the number of times a country occurs j times in a m-authored paper, j =1, ..., m-1 is decreasing and that # m, m is much higher than all the other #j, m values. The other dataset consists of 3,271 papers retrieved via the search "enzyme" in the year 2005 (up to October) in the same database which is a case of a non-decreasing phi(m): most papers have 3 or 4 authors and we even find many papers with a much higher number of authors. In this case we show again that # m, m is much higher than the other #j, m values but that #j, m is not decreasing anymore in j =1, ..., m-1, although #1, m is (apart from # m, m) the largest number amongst the #j,m. The combinatorial part gives a proof of the fact that #j,m decreases for j = 1, m-1, supposing that all cases are equally possible. This shows that the first dataset is more conform with this model than the second dataset. Explanations for these findings are given. From the data we also find the (we think: new) distribution of number of papers with n =1, 2,3,... countries (i.e. where there are n different countries involved amongst the m (a n) authors of a paper): a fast decreasing function e.g. as a power law with a very large Lotka exponent.
Source: Information - Wissenschaft und Praxis. 57(2006) H.8, S.427-432

Egghe, L.; Leydesdorff, L.: ¬The relation between Pearson's correlation coefficient r and Salton's cosine measure (2009) 0.04

0.03611513 = product of:
  0.08667631 = sum of:
    0.007175247 = weight(_text_:information in 2803) [ClassicSimilarity], result of:
      0.007175247 = score(doc=2803,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.116372846 = fieldWeight in 2803, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2803)
    0.014216291 = weight(_text_:for in 2803) [ClassicSimilarity], result of:
      0.014216291 = score(doc=2803,freq=6.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.21557912 = fieldWeight in 2803, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=2803)
    0.024590457 = weight(_text_:the in 2803) [ClassicSimilarity], result of:
      0.024590457 = score(doc=2803,freq=36.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.44374618 = fieldWeight in 2803, product of:
          6.0 = tf(freq=36.0), with freq of:
            36.0 = termFreq=36.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2803)
    0.016103853 = weight(_text_:of in 2803) [ClassicSimilarity], result of:
      0.016103853 = score(doc=2803,freq=16.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.2932045 = fieldWeight in 2803, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2803)
    0.024590457 = weight(_text_:the in 2803) [ClassicSimilarity], result of:
      0.024590457 = score(doc=2803,freq=36.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.44374618 = fieldWeight in 2803, product of:
          6.0 = tf(freq=36.0), with freq of:
            36.0 = termFreq=36.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2803)
  0.41666666 = coord(5/12)

Abstract: The relation between Pearson's correlation coefficient and Salton's cosine measure is revealed based on the different possible values of the division of the L1-norm and the L2-norm of a vector. These different values yield a sheaf of increasingly straight lines which together form a cloud of points, being the investigated relation. The theoretical results are tested against the author co-citation relations among 24 informetricians for whom two matrices can be constructed, based on co-citations: the asymmetric occurrence matrix and the symmetric co-citation matrix. Both examples completely confirm the theoretical results. The results enable us to specify an algorithm that provides a threshold value for the cosine above which none of the corresponding Pearson correlations would be negative. Using this threshold value can be expected to optimize the visualization of the vector space.
Source: Journal of the American Society for Information Science and Technology. 60(2009) no.5, S.1027-1036

Egghe, L.: ¬A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law (2010) 0.04

0.035867404 = product of:
  0.08608177 = sum of:
    0.008371122 = weight(_text_:information in 3432) [ClassicSimilarity], result of:
      0.008371122 = score(doc=3432,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.13576832 = fieldWeight in 3432, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3432)
    0.016585674 = weight(_text_:for in 3432) [ClassicSimilarity], result of:
      0.016585674 = score(doc=3432,freq=6.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.25150898 = fieldWeight in 3432, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3432)
    0.02242712 = weight(_text_:the in 3432) [ClassicSimilarity], result of:
      0.02242712 = score(doc=3432,freq=22.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.40470776 = fieldWeight in 3432, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3432)
    0.016270736 = weight(_text_:of in 3432) [ClassicSimilarity], result of:
      0.016270736 = score(doc=3432,freq=12.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.29624295 = fieldWeight in 3432, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3432)
    0.02242712 = weight(_text_:the in 3432) [ClassicSimilarity], result of:
      0.02242712 = score(doc=3432,freq=22.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.40470776 = fieldWeight in 3432, product of:
          4.690416 = tf(freq=22.0), with freq of:
            22.0 = termFreq=22.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3432)
  0.41666666 = coord(5/12)

Abstract: Naranan's important theorem, published in Nature in 1970, states that if the number of journals grows exponentially and if the number of articles in each journal grows exponentially (at the same rate for each journal), then the system satisfies Lotka's law and a formula for the Lotka's exponent is given in function of the growth rates of the journals and the articles. This brief communication re-proves this result by showing that the system satisfies Zipf's law, which is equivalent with Lotka's law. The proof is short and algebraic and does not use infinitesimal arguments.
Source: Journal of the American Society for Information Science and Technology. 61(2010) no.12, S.2581-2583

Egghe, L.; Rousseau, R.; Rousseau, S.: TOP-curves (2007) 0.04

0.03560687 = product of:
  0.08545648 = sum of:
    0.008371122 = weight(_text_:information in 50) [ClassicSimilarity], result of:
      0.008371122 = score(doc=50,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.13576832 = fieldWeight in 50, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0546875 = fieldNorm(doc=50)
    0.016585674 = weight(_text_:for in 50) [ClassicSimilarity], result of:
      0.016585674 = score(doc=50,freq=6.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.25150898 = fieldWeight in 50, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0546875 = fieldNorm(doc=50)
    0.020286093 = weight(_text_:the in 50) [ClassicSimilarity], result of:
      0.020286093 = score(doc=50,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36607194 = fieldWeight in 50, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=50)
    0.0199275 = weight(_text_:of in 50) [ClassicSimilarity], result of:
      0.0199275 = score(doc=50,freq=18.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.36282203 = fieldWeight in 50, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0546875 = fieldNorm(doc=50)
    0.020286093 = weight(_text_:the in 50) [ClassicSimilarity], result of:
      0.020286093 = score(doc=50,freq=18.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36607194 = fieldWeight in 50, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0546875 = fieldNorm(doc=50)
  0.41666666 = coord(5/12)

Abstract: Several characteristics of classical Lorenz curves make them unsuitable for the study of a group of topperformers. TOP-curves, defined as a kind of mirror image of TIP-curves used in poverty studies, are shown to possess the properties necessary for adequate empirical ranking of various data arrays, based on the properties of the highest performers (i.e., the core). TOP-curves and essential TOP-curves, also introduced in this article, simultaneously represent the incidence, intensity, and inequality among the top. It is shown that TOPdominance partial order, introduced in this article, is stronger than Lorenz dominance order. In this way, this article contributes to the study of cores, a central issue in applied informetrics.
Source: Journal of the American Society for Information Science and Technology. 58(2007) no.6, S.777-785

Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.04

0.035437603 = product of:
  0.08505025 = sum of:
    0.007175247 = weight(_text_:information in 3678) [ClassicSimilarity], result of:
      0.007175247 = score(doc=3678,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.116372846 = fieldWeight in 3678, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=3678)
    0.016415559 = weight(_text_:for in 3678) [ClassicSimilarity], result of:
      0.016415559 = score(doc=3678,freq=8.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.24892932 = fieldWeight in 3678, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=3678)
    0.020078024 = weight(_text_:the in 3678) [ClassicSimilarity], result of:
      0.020078024 = score(doc=3678,freq=24.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36231726 = fieldWeight in 3678, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=3678)
    0.021303395 = weight(_text_:of in 3678) [ClassicSimilarity], result of:
      0.021303395 = score(doc=3678,freq=28.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.38787308 = fieldWeight in 3678, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=3678)
    0.020078024 = weight(_text_:the in 3678) [ClassicSimilarity], result of:
      0.020078024 = score(doc=3678,freq=24.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.36231726 = fieldWeight in 3678, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=3678)
  0.41666666 = coord(5/12)

Abstract: In this article concentration (i.e., inequality) aspects of the functions of Zipf and of Lotka are studied. Since both functions are power laws (i.e., they are mathematically the same) it suffices to develop one concentration theory for power laws and apply it twice for the different interpretations of the laws of Zipf and Lotka. After a brief repetition of the functional relationships between Zipf's law and Lotka's law, we prove that Price's law of concentration is equivalent with Zipf's law. A major part of this article is devoted to the development of continuous concentration theory, based an Lorenz curves. The Lorenz curve for power functions is calculated and, based an this, some important concentration measures such as the ones of Gini, Theil, and the variation coefficient. Using Lorenz curves, it is shown that the concentration of a power law increases with its exponent and this result is interpreted in terms of the functions of Zipf and Lotka.
Source: Journal of the American Society for Information Science and Technology. 56(2005) no.9, S.935-945

Egghe, L.: ¬A model for the size-frequency function of coauthor pairs (2008) 0.04

0.035086416 = product of:
  0.0842074 = sum of:
    0.007175247 = weight(_text_:information in 2366) [ClassicSimilarity], result of:
      0.007175247 = score(doc=2366,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.116372846 = fieldWeight in 2366, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.046875 = fieldNorm(doc=2366)
    0.011607553 = weight(_text_:for in 2366) [ClassicSimilarity], result of:
      0.011607553 = score(doc=2366,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.17601961 = fieldWeight in 2366, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.046875 = fieldNorm(doc=2366)
    0.021686744 = weight(_text_:the in 2366) [ClassicSimilarity], result of:
      0.021686744 = score(doc=2366,freq=28.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.39134735 = fieldWeight in 2366, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2366)
    0.02205111 = weight(_text_:of in 2366) [ClassicSimilarity], result of:
      0.02205111 = score(doc=2366,freq=30.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.4014868 = fieldWeight in 2366, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.046875 = fieldNorm(doc=2366)
    0.021686744 = weight(_text_:the in 2366) [ClassicSimilarity], result of:
      0.021686744 = score(doc=2366,freq=28.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.39134735 = fieldWeight in 2366, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.046875 = fieldNorm(doc=2366)
  0.41666666 = coord(5/12)

Abstract: Lotka's law was formulated to describe the number of authors with a certain number of publications. Empirical results (Morris & Goldstein, 2007) indicate that Lotka's law is also valid if one counts the number of publications of coauthor pairs. This article gives a simple model proving this to be true, with the same Lotka exponent, if the number of coauthored papers is proportional to the number of papers of the individual coauthors. Under the assumption that this number of coauthored papers is more than proportional to the number of papers of the individual authors (to be explained in the article), we can prove that the size-frequency function of coauthor pairs is Lotkaian with an exponent that is higher than that of the Lotka function of individual authors, a fact that is confirmed in experimental results.
Source: Journal of the American Society for Information Science and Technology. 59(2008) no.13, S.2133-2137

Egghe, L.: Properties of the n-overlap vector and n-overlap similarity theory (2006) 0.03

0.034833424 = product of:
  0.08360022 = sum of:
    0.005979372 = weight(_text_:information in 194) [ClassicSimilarity], result of:
      0.005979372 = score(doc=194,freq=2.0), product of:
        0.0616574 = queryWeight, product of:
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.035122856 = queryNorm
        0.09697737 = fieldWeight in 194, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.7554779 = idf(docFreq=20772, maxDocs=44218)
          0.0390625 = fieldNorm(doc=194)
    0.00967296 = weight(_text_:for in 194) [ClassicSimilarity], result of:
      0.00967296 = score(doc=194,freq=4.0), product of:
        0.06594466 = queryWeight, product of:
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.035122856 = queryNorm
        0.14668301 = fieldWeight in 194, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.8775425 = idf(docFreq=18385, maxDocs=44218)
          0.0390625 = fieldNorm(doc=194)
    0.025097532 = weight(_text_:the in 194) [ClassicSimilarity], result of:
      0.025097532 = score(doc=194,freq=54.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4528966 = fieldWeight in 194, product of:
          7.3484693 = tf(freq=54.0), with freq of:
            54.0 = termFreq=54.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=194)
    0.01775283 = weight(_text_:of in 194) [ClassicSimilarity], result of:
      0.01775283 = score(doc=194,freq=28.0), product of:
        0.054923624 = queryWeight, product of:
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.035122856 = queryNorm
        0.32322758 = fieldWeight in 194, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.5637573 = idf(docFreq=25162, maxDocs=44218)
          0.0390625 = fieldNorm(doc=194)
    0.025097532 = weight(_text_:the in 194) [ClassicSimilarity], result of:
      0.025097532 = score(doc=194,freq=54.0), product of:
        0.05541559 = queryWeight, product of:
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.035122856 = queryNorm
        0.4528966 = fieldWeight in 194, product of:
          7.3484693 = tf(freq=54.0), with freq of:
            54.0 = termFreq=54.0
          1.5777643 = idf(docFreq=24812, maxDocs=44218)
          0.0390625 = fieldNorm(doc=194)
  0.41666666 = coord(5/12)

Abstract: In the first part of this article the author defines the n-overlap vector whose coordinates consist of the fraction of the objects (e.g., books, N-grams, etc.) that belong to 1, 2, , n sets (more generally: families) (e.g., libraries, databases, etc.). With the aid of the Lorenz concentration theory, a theory of n-overlap similarity is conceived together with corresponding measures, such as the generalized Jaccard index (generalizing the well-known Jaccard index in case n 5 2). Next, the distributional form of the n-overlap vector is determined assuming certain distributions of the object's and of the set (family) sizes. In this section the decreasing power law and decreasing exponential distribution is explained for the n-overlap vector. Both item (token) n-overlap and source (type) n-overlap are studied. The n-overlap properties of objects indexed by a hierarchical system (e.g., books indexed by numbers from a UDC or Dewey system or by N-grams) are presented in the final section. The author shows how the results given in the previous section can be applied as well as how the Lorenz order of the n-overlap vector is respected by an increase or a decrease of the level of refinement in the hierarchical system (e.g., the value N in N-grams).
Source: Journal of the American Society for Information Science and Technology. 57(2006) no.9, S.1165-1177

Search (60 results, page 1 of 3)

Authors

Years

Types

Themes

Subjects

Classifications