Document (#30141)

Author
Newby, G.B.
Greenberg, J.
Jones, P.
Title
Open source software development and Lotka's law : bibliometric patterns in programming
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.2, pp.169-178
Year
2003
Abstract
Newby, Greenberg, and Jones analyze the programming productivity of open source software developers by counting registered developers' contributions found in the Linux Software Map (LSM) and in SourceForge. Using seven years of data from a subset of the Linux directory tree, the LSM data provided 4503 files with 3341 unique author names. The distribution follows Lotka's Law with an exponent of 2.82, as verified by the Kolmogorov-Smirnov (K-S) one-sample goodness-of-fit test. SourceForge data is broken into developers and administrators, but when both were counted as authors, a Lotka distribution with an exponent of 2.55 produced the lowest error. This fit would not be significant by the K-S test, but the 3.54% maximum error would indicate a fit, which calls into question the appropriateness of K-S for large populations of authors.
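The abstract's closing point, that a small maximum error can still fail the K-S test when there are many authors, can be illustrated with a minimal sketch. It assumes the common form of Lotka's law, f(x) = C / x^n, normalized over a truncated range of contribution counts, and the usual large-sample K-S critical value of 1.63 / sqrt(N) at the 0.01 level. The function names and the choice of significance level are illustrative assumptions, not details taken from the paper.

```python
import math


def lotka_pmf(n, x_max):
    """Expected share of authors with x = 1..x_max contributions under f(x) = C / x**n.

    Normalizing over a truncated range is a simplification made for this sketch.
    """
    weights = [x ** -n for x in range(1, x_max + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def ks_max_deviation(observed_counts, n):
    """Largest absolute gap between observed and expected cumulative shares.

    observed_counts[i] is the number of authors with i + 1 contributions.
    """
    total_authors = sum(observed_counts)
    expected = lotka_pmf(n, len(observed_counts))
    d_max = 0.0
    cum_obs = 0.0
    cum_exp = 0.0
    for count, share in zip(observed_counts, expected):
        cum_obs += count / total_authors
        cum_exp += share
        d_max = max(d_max, abs(cum_obs - cum_exp))
    return d_max


def ks_critical(num_authors, coefficient=1.63):
    """Large-sample K-S critical value; 1.63 corresponds to the 0.01 level (assumed here)."""
    return coefficient / math.sqrt(num_authors)
```

Under these assumptions, `ks_critical(3341)` is about 0.028 for the LSM author count. Because the threshold shrinks as 1 / sqrt(N), a maximum deviation of 0.0354 would pass at this level only for populations of roughly 2,100 authors or fewer, which is consistent with the abstract's doubt about K-S for large author populations.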
Theme
Informetrics

Similar documents (author)

  1. Newby, G.B.: Navigation: a fundamental concept for information systems with implications for information retrieval (1991) 1.37
    1.3746605 = sum of:
      1.3746605 = product of:
        4.1239815 = sum of:
          4.1239815 = weight(author_txt:newby in 3687) [ClassicSimilarity], result of:
            4.1239815 = score(doc=3687,freq=1.0), product of:
              0.71628135 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.058533717 = queryNorm
              5.7574883 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=3687)
        0.33333334 = coord(1/3)
    
  2. Newby, G.B.: ¬An investigation of the role of navigation for information retrieval (1992) 1.37
    1.3746605 = sum of:
      1.3746605 = product of:
        4.1239815 = sum of:
          4.1239815 = weight(author_txt:newby in 4504) [ClassicSimilarity], result of:
            4.1239815 = score(doc=4504,freq=1.0), product of:
              0.71628135 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.058533717 = queryNorm
              5.7574883 = fieldWeight in 4504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=4504)
        0.33333334 = coord(1/3)
    
  3. Newby, G.B.: Virtual reality (1993) 1.37
    1.3746605 = sum of:
      1.3746605 = product of:
        4.1239815 = sum of:
          4.1239815 = weight(author_txt:newby in 7238) [ClassicSimilarity], result of:
            4.1239815 = score(doc=7238,freq=1.0), product of:
              0.71628135 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.058533717 = queryNorm
              5.7574883 = fieldWeight in 7238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=7238)
        0.33333334 = coord(1/3)
    
  4. Newby, G.B.: ¬The maturation of norms for computer-mediated communication (1993) 1.37
    1.3746605 = sum of:
      1.3746605 = product of:
        4.1239815 = sum of:
          4.1239815 = weight(author_txt:newby in 8624) [ClassicSimilarity], result of:
            4.1239815 = score(doc=8624,freq=1.0), product of:
              0.71628135 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.058533717 = queryNorm
              5.7574883 = fieldWeight in 8624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=8624)
        0.33333334 = coord(1/3)
    
  5. Newby, G.B.: Virtual reality and the entertainment industry (1994) 1.37
    1.3746605 = sum of:
      1.3746605 = product of:
        4.1239815 = sum of:
          4.1239815 = weight(author_txt:newby in 202) [ClassicSimilarity], result of:
            4.1239815 = score(doc=202,freq=1.0), product of:
              0.71628135 = queryWeight, product of:
                1.3283867 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.058533717 = queryNorm
              5.7574883 = fieldWeight in 202, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=202)
        0.33333334 = coord(1/3)
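
The score breakdowns in these lists follow the classic Lucene TF-IDF formula, as their labels show (ClassicSimilarity, queryWeight, fieldWeight, coord). A minimal sketch that recomputes the author-match score of 1.3746605 from the figures printed in the breakdowns is given here. The function names are illustrative assumptions, and Python's double-precision floats will differ from Lucene's single-precision values in the last digits.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# Values copied from the breakdown for author_txt:newby.
weight = term_score(
    freq=1.0,
    doc_freq=11,
    max_docs=44218,
    boost=1.3283867,
    query_norm=0.058533717,
    field_norm=0.625,
)
coord = 1 / 3  # one of three query clauses matched
score = weight * coord
print(round(score, 4))
```

The same function applies to the content-based list: each matching abstract term contributes one `term_score`, the contributions are summed, and the sum is multiplied by the coord factor shown for that document (for example 7/25).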
    

Similar documents (content)

  1. Kretschmer, H.; Rousseau, R.: Author inflation leads to a breakdown of Lotka's law : in and out of context (2001) 0.22
    0.21924013 = sum of:
      0.21924013 = product of:
        0.78300047 = sum of:
          0.08518931 = weight(abstract_txt:counting in 5205) [ClassicSimilarity], result of:
            0.08518931 = score(doc=5205,freq=1.0), product of:
              0.14342874 = queryWeight, product of:
                1.0126958 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.018629376 = queryNorm
              0.59394866 = fieldWeight in 5205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.11813821 = weight(abstract_txt:lotka in 5205) [ClassicSimilarity], result of:
            0.11813821 = score(doc=5205,freq=1.0), product of:
              0.17836365 = queryWeight, product of:
                1.1293124 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.018629376 = queryNorm
              0.66234463 = fieldWeight in 5205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.06745867 = weight(abstract_txt:authors in 5205) [ClassicSimilarity], result of:
            0.06745867 = score(doc=5205,freq=3.0), product of:
              0.107244305 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.018629376 = queryNorm
              0.62901866 = fieldWeight in 5205, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.045004297 = weight(abstract_txt:would in 5205) [ClassicSimilarity], result of:
            0.045004297 = score(doc=5205,freq=1.0), product of:
              0.11809335 = queryWeight, product of:
                1.299537 = boost
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.018629376 = queryNorm
              0.38109088 = fieldWeight in 5205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.877963 = idf(docFreq=914, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.0215995 = weight(abstract_txt:data in 5205) [ClassicSimilarity], result of:
            0.0215995 = score(doc=5205,freq=1.0), product of:
              0.08286713 = queryWeight, product of:
                1.3332534 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018629376 = queryNorm
              0.26065218 = fieldWeight in 5205, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.096121125 = weight(abstract_txt:distribution in 5205) [ClassicSimilarity], result of:
            0.096121125 = score(doc=5205,freq=2.0), product of:
              0.15545046 = queryWeight, product of:
                1.4909803 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.018629376 = queryNorm
              0.61833924 = fieldWeight in 5205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
          0.34948933 = weight(abstract_txt:lotka's in 5205) [ClassicSimilarity], result of:
            0.34948933 = score(doc=5205,freq=2.0), product of:
              0.36756608 = queryWeight, product of:
                2.292681 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018629376 = queryNorm
              0.95082045 = fieldWeight in 5205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=5205)
        0.28 = coord(7/25)
    
  2. Egghe, L.: ¬A model for the size-frequency function of coauthor pairs (2008) 0.15
    0.15453957 = sum of:
      0.15453957 = product of:
        0.96587235 = sum of:
          0.16707265 = weight(abstract_txt:lotka in 2366) [ClassicSimilarity], result of:
            0.16707265 = score(doc=2366,freq=2.0), product of:
              0.17836365 = queryWeight, product of:
                1.1293124 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.018629376 = queryNorm
              0.93669677 = fieldWeight in 2366, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.078125 = fieldNorm(doc=2366)
          0.06745867 = weight(abstract_txt:authors in 2366) [ClassicSimilarity], result of:
            0.06745867 = score(doc=2366,freq=3.0), product of:
              0.107244305 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.018629376 = queryNorm
              0.62901866 = fieldWeight in 2366, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.078125 = fieldNorm(doc=2366)
          0.34948933 = weight(abstract_txt:lotka's in 2366) [ClassicSimilarity], result of:
            0.34948933 = score(doc=2366,freq=2.0), product of:
              0.36756608 = queryWeight, product of:
                2.292681 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018629376 = queryNorm
              0.95082045 = fieldWeight in 2366, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=2366)
          0.38185173 = weight(abstract_txt:exponent in 2366) [ClassicSimilarity], result of:
            0.38185173 = score(doc=2366,freq=2.0), product of:
              0.3899204 = queryWeight, product of:
                2.3613691 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018629376 = queryNorm
              0.9793068 = fieldWeight in 2366, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.078125 = fieldNorm(doc=2366)
        0.16 = coord(4/25)
    
  3. Egghe, L.: Type/Token-Taken informetrics (2003) 0.15
    0.14754272 = sum of:
      0.14754272 = product of:
        0.61476135 = sum of:
          0.031157829 = weight(abstract_txt:authors in 1608) [ClassicSimilarity], result of:
            0.031157829 = score(doc=1608,freq=1.0), product of:
              0.107244305 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.018629376 = queryNorm
              0.2905313 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.03481574 = weight(abstract_txt:open in 1608) [ClassicSimilarity], result of:
            0.03481574 = score(doc=1608,freq=1.0), product of:
              0.11548171 = queryWeight, product of:
                1.285087 = boost
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.018629376 = queryNorm
              0.3014827 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8237233 = idf(docFreq=965, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.058181904 = weight(abstract_txt:source in 1608) [ClassicSimilarity], result of:
            0.058181904 = score(doc=1608,freq=2.0), product of:
              0.12907578 = queryWeight, product of:
                1.3586209 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.018629376 = queryNorm
              0.4507577 = fieldWeight in 1608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.0768969 = weight(abstract_txt:distribution in 1608) [ClassicSimilarity], result of:
            0.0768969 = score(doc=1608,freq=2.0), product of:
              0.15545046 = queryWeight, product of:
                1.4909803 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.018629376 = queryNorm
              0.4946714 = fieldWeight in 1608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.19770104 = weight(abstract_txt:lotka's in 1608) [ClassicSimilarity], result of:
            0.19770104 = score(doc=1608,freq=1.0), product of:
              0.36756608 = queryWeight, product of:
                2.292681 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018629376 = queryNorm
              0.5378653 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
          0.21600796 = weight(abstract_txt:exponent in 1608) [ClassicSimilarity], result of:
            0.21600796 = score(doc=1608,freq=1.0), product of:
              0.3899204 = queryWeight, product of:
                2.3613691 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018629376 = queryNorm
              0.55397964 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=1608)
        0.24 = coord(6/25)
    
  4. Smolinsky, L.J.: Discrete power law with exponential cutoff and Lotka's law (2017) 0.13
    0.12911019 = sum of:
      0.12911019 = product of:
        0.80693865 = sum of:
          0.042764828 = weight(abstract_txt:data in 3699) [ClassicSimilarity], result of:
            0.042764828 = score(doc=3699,freq=2.0), product of:
              0.08286713 = queryWeight, product of:
                1.3332534 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018629376 = queryNorm
              0.516065 = fieldWeight in 3699, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.109375 = fieldNorm(doc=3699)
          0.069769435 = weight(abstract_txt:test in 3699) [ClassicSimilarity], result of:
            0.069769435 = score(doc=3699,freq=1.0), product of:
              0.12640014 = queryWeight, product of:
                1.3444656 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.018629376 = queryNorm
              0.55197275 = fieldWeight in 3699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.109375 = fieldNorm(doc=3699)
          0.09515505 = weight(abstract_txt:distribution in 3699) [ClassicSimilarity], result of:
            0.09515505 = score(doc=3699,freq=1.0), product of:
              0.15545046 = queryWeight, product of:
                1.4909803 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.018629376 = queryNorm
              0.6121246 = fieldWeight in 3699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.109375 = fieldNorm(doc=3699)
          0.59924936 = weight(abstract_txt:lotka's in 3699) [ClassicSimilarity], result of:
            0.59924936 = score(doc=3699,freq=3.0), product of:
              0.36756608 = queryWeight, product of:
                2.292681 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018629376 = queryNorm
              1.6303174 = fieldWeight in 3699, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.109375 = fieldNorm(doc=3699)
        0.16 = coord(4/25)
    
  5. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 0.11
    0.11491155 = sum of:
      0.11491155 = product of:
        0.5745577 = sum of:
          0.09451056 = weight(abstract_txt:lotka in 609) [ClassicSimilarity], result of:
            0.09451056 = score(doc=609,freq=1.0), product of:
              0.17836365 = queryWeight, product of:
                1.1293124 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.018629376 = queryNorm
              0.5298757 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.051838797 = weight(abstract_txt:data in 609) [ClassicSimilarity], result of:
            0.051838797 = score(doc=609,freq=9.0), product of:
              0.08286713 = queryWeight, product of:
                1.3332534 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.018629376 = queryNorm
              0.62556523 = fieldWeight in 609, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.039868247 = weight(abstract_txt:test in 609) [ClassicSimilarity], result of:
            0.039868247 = score(doc=609,freq=1.0), product of:
              0.12640014 = queryWeight, product of:
                1.3444656 = boost
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.018629376 = queryNorm
              0.315413 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046608 = idf(docFreq=772, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.10874864 = weight(abstract_txt:distribution in 609) [ClassicSimilarity], result of:
            0.10874864 = score(doc=609,freq=4.0), product of:
              0.15545046 = queryWeight, product of:
                1.4909803 = boost
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.018629376 = queryNorm
              0.699571 = fieldWeight in 609, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.596568 = idf(docFreq=445, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
          0.27959147 = weight(abstract_txt:lotka's in 609) [ClassicSimilarity], result of:
            0.27959147 = score(doc=609,freq=2.0), product of:
              0.36756608 = queryWeight, product of:
                2.292681 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018629376 = queryNorm
              0.76065636 = fieldWeight in 609, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=609)
        0.2 = coord(5/25)