Document (#37378)

Author
Egghe, L.
Guns, R.
Title
Applications of the generalized law of Benford to informetric data
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.8, S.1662-1665
Year
2012
Series
Brief communication
Abstract
In a previous work (Egghe, 2011), the first author showed that Benford's law (describing the logarithmic distribution of the numbers 1, 2, ... , 9 as first digits of data in decimal form) is related to the classical law of Zipf with exponent 1. The work of Campanario and Coslado (2011), however, shows that Benford's law does not always fit practical data in a statistical sense. In this article, we use a generalization of Benford's law related to the general law of Zipf with exponent ? > 0. Using data from Campanario and Coslado, we apply nonlinear least squares to determine the optimal ? and show that this generalized law of Benford fits the data better than the classical law of Benford.
Theme
Informetrie
Object
Zipf-Gesetz
Benford-Gesetz

Similar documents (author)

  1. Egghe, L.; Guns, R.; Rousseau, R.: Thoughts on uncitedness : Nobel laureates and Fields medalists as case studies (2011) 4.54
    4.5415845 = sum of:
      4.5415845 = sum of:
        1.4972583 = weight(author_txt:egghe in 1995) [ClassicSimilarity], result of:
          1.4972583 = score(doc=1995,freq=1.0), product of:
            0.5288207 = queryWeight, product of:
              7.550175 = idf(docFreq=59, maxDocs=41962)
              0.07004085 = queryNorm
            2.8313158 = fieldWeight in 1995, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.550175 = idf(docFreq=59, maxDocs=41962)
              0.375 = fieldNorm(doc=1995)
        3.044326 = weight(author_txt:guns in 1995) [ClassicSimilarity], result of:
          3.044326 = score(doc=1995,freq=1.0), product of:
            0.8487336 = queryWeight, product of:
              1.2668684 = boost
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.07004085 = queryNorm
            3.586904 = fieldWeight in 1995, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.375 = fieldNorm(doc=1995)
    
  2. Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 3.78
    3.7846537 = sum of:
      3.7846537 = sum of:
        1.2477154 = weight(author_txt:egghe in 1993) [ClassicSimilarity], result of:
          1.2477154 = score(doc=1993,freq=1.0), product of:
            0.5288207 = queryWeight, product of:
              7.550175 = idf(docFreq=59, maxDocs=41962)
              0.07004085 = queryNorm
            2.3594298 = fieldWeight in 1993, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.550175 = idf(docFreq=59, maxDocs=41962)
              0.3125 = fieldNorm(doc=1993)
        2.5369384 = weight(author_txt:guns in 1993) [ClassicSimilarity], result of:
          2.5369384 = score(doc=1993,freq=1.0), product of:
            0.8487336 = queryWeight, product of:
              1.2668684 = boost
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.07004085 = queryNorm
            2.9890869 = fieldWeight in 1993, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.3125 = fieldNorm(doc=1993)
    
  3. Guns, R.: ¬The three dimensions of informetrics : a conceptual view (2013) 2.54
    2.5369384 = sum of:
      2.5369384 = product of:
        5.073877 = sum of:
          5.073877 = weight(author_txt:guns in 2399) [ClassicSimilarity], result of:
            5.073877 = score(doc=2399,freq=1.0), product of:
              0.8487336 = queryWeight, product of:
                1.2668684 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.07004085 = queryNorm
              5.9781737 = fieldWeight in 2399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.625 = fieldNorm(doc=2399)
        0.5 = coord(1/2)
    
  4. Guns, R.: Tracing the origins of the semantic web (2013) 2.54
    2.5369384 = sum of:
      2.5369384 = product of:
        5.073877 = sum of:
          5.073877 = weight(author_txt:guns in 3094) [ClassicSimilarity], result of:
            5.073877 = score(doc=3094,freq=1.0), product of:
              0.8487336 = queryWeight, product of:
                1.2668684 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.07004085 = queryNorm
              5.9781737 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.625 = fieldNorm(doc=3094)
        0.5 = coord(1/2)
    
  5. Guns, R.; Rousseau, R.: Simulating growth of the h-index (2009) 2.03
    2.0295508 = sum of:
      2.0295508 = product of:
        4.0591016 = sum of:
          4.0591016 = weight(author_txt:guns in 4718) [ClassicSimilarity], result of:
            4.0591016 = score(doc=4718,freq=1.0), product of:
              0.8487336 = queryWeight, product of:
                1.2668684 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.07004085 = queryNorm
              4.782539 = fieldWeight in 4718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.5 = fieldNorm(doc=4718)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Shan, S.: On the generalized Zipf distribution : part I (2005) 0.21
    0.20649402 = sum of:
      0.20649402 = product of:
        1.2905877 = sum of:
          0.11926931 = weight(abstract_txt:informetric in 3062) [ClassicSimilarity], result of:
            0.11926931 = score(doc=3062,freq=1.0), product of:
              0.16409846 = queryWeight, product of:
                1.3214376 = boost
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.016017875 = queryNorm
              0.7268156 = fieldWeight in 3062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.09375 = fieldNorm(doc=3062)
          0.14167194 = weight(abstract_txt:generalization in 3062) [ClassicSimilarity], result of:
            0.14167194 = score(doc=3062,freq=1.0), product of:
              0.1840523 = queryWeight, product of:
                1.3994746 = boost
                8.210532 = idf(docFreq=30, maxDocs=41962)
                0.016017875 = queryNorm
              0.76973736 = fieldWeight in 3062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.210532 = idf(docFreq=30, maxDocs=41962)
                0.09375 = fieldNorm(doc=3062)
          0.31331667 = weight(abstract_txt:generalized in 3062) [ClassicSimilarity], result of:
            0.31331667 = score(doc=3062,freq=3.0), product of:
              0.2729254 = queryWeight, product of:
                2.4100797 = boost
                7.069809 = idf(docFreq=96, maxDocs=41962)
                0.016017875 = queryNorm
              1.1479938 = fieldWeight in 3062, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.069809 = idf(docFreq=96, maxDocs=41962)
                0.09375 = fieldNorm(doc=3062)
          0.71632975 = weight(abstract_txt:zipf in 3062) [ClassicSimilarity], result of:
            0.71632975 = score(doc=3062,freq=5.0), product of:
              0.39949745 = queryWeight, product of:
                2.9158583 = boost
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.016017875 = queryNorm
              1.7930772 = fieldWeight in 3062, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.09375 = fieldNorm(doc=3062)
        0.16 = coord(4/25)
    
  2. Milojevic, S.: Power law distributions in information science : making the case for logarithmic binning (2010) 0.14
    0.13786316 = sum of:
      0.13786316 = product of:
        0.8616448 = sum of:
          0.010778121 = weight(abstract_txt:that in 1114) [ClassicSimilarity], result of:
            0.010778121 = score(doc=1114,freq=1.0), product of:
              0.047660045 = queryWeight, product of:
                1.2334805 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.016017875 = queryNorm
              0.22614583 = fieldWeight in 1114, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.09375 = fieldNorm(doc=1114)
          0.2951177 = weight(abstract_txt:logarithmic in 1114) [ClassicSimilarity], result of:
            0.2951177 = score(doc=1114,freq=2.0), product of:
              0.23827156 = queryWeight, product of:
                1.5923207 = boost
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.016017875 = queryNorm
              1.2385771 = fieldWeight in 1114, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.341934 = idf(docFreq=9, maxDocs=41962)
                0.09375 = fieldNorm(doc=1114)
          0.050192866 = weight(abstract_txt:data in 1114) [ClassicSimilarity], result of:
            0.050192866 = score(doc=1114,freq=1.0), product of:
              0.15758082 = queryWeight, product of:
                2.89555 = boost
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.016017875 = queryNorm
              0.3185214 = fieldWeight in 1114, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.09375 = fieldNorm(doc=1114)
          0.5055561 = weight(abstract_txt:exponent in 1114) [ClassicSimilarity], result of:
            0.5055561 = score(doc=1114,freq=2.0), product of:
              0.42979854 = queryWeight, product of:
                3.0244184 = boost
                8.871931 = idf(docFreq=15, maxDocs=41962)
                0.016017875 = queryNorm
              1.176263 = fieldWeight in 1114, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.871931 = idf(docFreq=15, maxDocs=41962)
                0.09375 = fieldNorm(doc=1114)
        0.16 = coord(4/25)
    
  3. Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.13
    0.1309448 = sum of:
      0.1309448 = product of:
        0.81840503 = sum of:
          0.045412224 = weight(abstract_txt:apply in 4679) [ClassicSimilarity], result of:
            0.045412224 = score(doc=4679,freq=1.0), product of:
              0.09734673 = queryWeight, product of:
                1.0177828 = boost
                5.9711967 = idf(docFreq=290, maxDocs=41962)
                0.016017875 = queryNorm
              0.46649975 = fieldWeight in 4679, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9711967 = idf(docFreq=290, maxDocs=41962)
                0.078125 = fieldNorm(doc=4679)
          0.012702136 = weight(abstract_txt:that in 4679) [ClassicSimilarity], result of:
            0.012702136 = score(doc=4679,freq=2.0), product of:
              0.047660045 = queryWeight, product of:
                1.2334805 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.016017875 = queryNorm
              0.2665154 = fieldWeight in 4679, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.078125 = fieldNorm(doc=4679)
          0.46238887 = weight(abstract_txt:zipf in 4679) [ClassicSimilarity], result of:
            0.46238887 = score(doc=4679,freq=3.0), product of:
              0.39949745 = queryWeight, product of:
                2.9158583 = boost
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.016017875 = queryNorm
              1.1574264 = fieldWeight in 4679, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.078125 = fieldNorm(doc=4679)
          0.2979018 = weight(abstract_txt:exponent in 4679) [ClassicSimilarity], result of:
            0.2979018 = score(doc=4679,freq=1.0), product of:
              0.42979854 = queryWeight, product of:
                3.0244184 = boost
                8.871931 = idf(docFreq=15, maxDocs=41962)
                0.016017875 = queryNorm
              0.69311965 = fieldWeight in 4679, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.871931 = idf(docFreq=15, maxDocs=41962)
                0.078125 = fieldNorm(doc=4679)
        0.16 = coord(4/25)
    
  4. Sarabia, J.M.; Sarabia, M.: Explicit expressions for the Leimkuhler curve in parametric families (2008) 0.12
    0.12253121 = sum of:
      0.12253121 = product of:
        0.51054674 = sum of:
          0.019477248 = weight(abstract_txt:work in 4121) [ClassicSimilarity], result of:
            0.019477248 = score(doc=4121,freq=1.0), product of:
              0.080942124 = queryWeight, product of:
                1.3124921 = boost
                3.8501086 = idf(docFreq=2426, maxDocs=41962)
                0.016017875 = queryNorm
              0.24063179 = fieldWeight in 4121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8501086 = idf(docFreq=2426, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
          0.07951287 = weight(abstract_txt:informetric in 4121) [ClassicSimilarity], result of:
            0.07951287 = score(doc=4121,freq=1.0), product of:
              0.16409846 = queryWeight, product of:
                1.3214376 = boost
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.016017875 = queryNorm
              0.4845437 = fieldWeight in 4121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
          0.03649702 = weight(abstract_txt:first in 4121) [ClassicSimilarity], result of:
            0.03649702 = score(doc=4121,freq=2.0), product of:
              0.097645245 = queryWeight, product of:
                1.4415674 = boost
                4.2287426 = idf(docFreq=1661, maxDocs=41962)
                0.016017875 = queryNorm
              0.37377158 = fieldWeight in 4121, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2287426 = idf(docFreq=1661, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
          0.1327199 = weight(abstract_txt:classical in 4121) [ClassicSimilarity], result of:
            0.1327199 = score(doc=4121,freq=2.0), product of:
              0.2309069 = queryWeight, product of:
                2.216807 = boost
                6.5028563 = idf(docFreq=170, maxDocs=41962)
                0.016017875 = queryNorm
              0.5747767 = fieldWeight in 4121, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5028563 = idf(docFreq=170, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
          0.2088778 = weight(abstract_txt:generalized in 4121) [ClassicSimilarity], result of:
            0.2088778 = score(doc=4121,freq=3.0), product of:
              0.2729254 = queryWeight, product of:
                2.4100797 = boost
                7.069809 = idf(docFreq=96, maxDocs=41962)
                0.016017875 = queryNorm
              0.76532924 = fieldWeight in 4121, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.069809 = idf(docFreq=96, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
          0.03346191 = weight(abstract_txt:data in 4121) [ClassicSimilarity], result of:
            0.03346191 = score(doc=4121,freq=1.0), product of:
              0.15758082 = queryWeight, product of:
                2.89555 = boost
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.016017875 = queryNorm
              0.21234761 = fieldWeight in 4121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.0625 = fieldNorm(doc=4121)
        0.24 = coord(6/25)
    
  5. Burrell, Q.L.: "Ambiguity" ans scientometric measurement : a dissenting view (2001) 0.12
    0.1169241 = sum of:
      0.1169241 = product of:
        0.5846205 = sum of:
          0.017963534 = weight(abstract_txt:that in 982) [ClassicSimilarity], result of:
            0.017963534 = score(doc=982,freq=4.0), product of:
              0.047660045 = queryWeight, product of:
                1.2334805 = boost
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.016017875 = queryNorm
              0.3769097 = fieldWeight in 982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4122221 = idf(docFreq=10221, maxDocs=41962)
                0.078125 = fieldNorm(doc=982)
          0.14056022 = weight(abstract_txt:informetric in 982) [ClassicSimilarity], result of:
            0.14056022 = score(doc=982,freq=2.0), product of:
              0.16409846 = queryWeight, product of:
                1.3214376 = boost
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.016017875 = queryNorm
              0.85656035 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7526994 = idf(docFreq=48, maxDocs=41962)
                0.078125 = fieldNorm(doc=982)
          0.117308944 = weight(abstract_txt:classical in 982) [ClassicSimilarity], result of:
            0.117308944 = score(doc=982,freq=1.0), product of:
              0.2309069 = queryWeight, product of:
                2.216807 = boost
                6.5028563 = idf(docFreq=170, maxDocs=41962)
                0.016017875 = queryNorm
              0.50803566 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5028563 = idf(docFreq=170, maxDocs=41962)
                0.078125 = fieldNorm(doc=982)
          0.041827388 = weight(abstract_txt:data in 982) [ClassicSimilarity], result of:
            0.041827388 = score(doc=982,freq=1.0), product of:
              0.15758082 = queryWeight, product of:
                2.89555 = boost
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.016017875 = queryNorm
              0.2654345 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3975618 = idf(docFreq=3815, maxDocs=41962)
                0.078125 = fieldNorm(doc=982)
          0.26696035 = weight(abstract_txt:zipf in 982) [ClassicSimilarity], result of:
            0.26696035 = score(doc=982,freq=1.0), product of:
              0.39949745 = queryWeight, product of:
                2.9158583 = boost
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.016017875 = queryNorm
              0.6682404 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.078125 = fieldNorm(doc=982)
        0.2 = coord(5/25)