Document (#37377)

Author
Egghe, L.
Guns, R.
Title
Applications of the generalized law of Benford to informetric data
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.8, p.1662-1665
Year
2012
Series
Brief communication
Abstract
In a previous work (Egghe, 2011), the first author showed that Benford's law (describing the logarithmic distribution of the numbers 1, 2, ..., 9 as first digits of data in decimal form) is related to the classical law of Zipf with exponent 1. The work of Campanario and Coslado (2011), however, shows that Benford's law does not always fit practical data in a statistical sense. In this article, we use a generalization of Benford's law related to the general law of Zipf with exponent β > 0. Using data from Campanario and Coslado, we apply nonlinear least squares to determine the optimal β and show that this generalized law of Benford fits the data better than the classical law of Benford.
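The fitting idea in the abstract can be illustrated with a minimal Python sketch. It assumes one commonly used one-parameter power-law generalization of Benford's law; whether this matches the exact parametrization in the article is an assumption. The exponent is written `beta` here, and the function names are hypothetical. A brute-force grid search over the sum of squared errors stands in for a proper nonlinear least squares solver, and the input shares are synthetic (generated from the model itself), not the data of Campanario and Coslado.

```python
import math


def generalized_benford(d, beta):
    """First-digit probability for digit d (1..9) under an assumed
    power-law generalization; beta == 1 gives the classical law."""
    if abs(beta - 1.0) < 1e-12:
        return math.log10(1.0 + 1.0 / d)
    e = 1.0 - beta
    return ((d + 1) ** e - d ** e) / (10.0 ** e - 1.0)


def sse(beta, observed):
    """Sum of squared errors between observed digit shares and the model."""
    return sum(
        (observed[d - 1] - generalized_benford(d, beta)) ** 2
        for d in range(1, 10)
    )


def fit_beta(observed):
    """Grid search for the beta in [0.01, 5.0] (step 0.001) that minimizes
    the SSE. A simple stand-in for a nonlinear least squares routine."""
    candidates = [k / 1000 for k in range(10, 5001)]
    return min(candidates, key=lambda beta: sse(beta, observed))


# Synthetic first-digit shares generated with beta = 1.3 (illustration only).
synthetic = [generalized_benford(d, 1.3) for d in range(1, 10)]
beta_hat = fit_beta(synthetic)

print("fitted beta:", beta_hat)
print("SSE, generalized law:", sse(beta_hat, synthetic))
print("SSE, classical law:  ", sse(1.0, synthetic))
```

With real first-digit counts, `observed` would hold the nine relative frequencies for the digits 1 to 9; comparing the two SSE values then shows how much the extra parameter improves the fit.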
Theme
Informetrics
Object
Zipf's law
Benford's law

Similar documents (author)

  1. Egghe, L.; Guns, R.; Rousseau, R.: Thoughts on uncitedness : Nobel laureates and Fields medalists as case studies (2011) 4.43
  2. Rousseau, R.; Egghe, L.; Guns, R.: Becoming metric-wise : a bibliometric guide for researchers (2018) 4.43
  3. Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 3.70
  4. Guns, R.: The three dimensions of informetrics : a conceptual view (2013) 2.35
  5. Guns, R.: Tracing the origins of the semantic web (2013) 2.35

Similar documents (content)

  1. Shan, S.: On the generalized Zipf distribution : part I (2005) 0.21
  2. Milojevic, S.: Power law distributions in information science : making the case for logarithmic binning (2010) 0.18
  3. Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.13
  4. Sarabia, J.M.; Sarabia, M.: Explicit expressions for the Leimkuhler curve in parametric families (2008) 0.12
  5. Burrell, Q.L.: "Ambiguity" and scientometric measurement : a dissenting view (2001) 0.12