Document (#30142)

Author
Newby, G.B.
Greenberg, J.
Jones, P.
Title
Open source software development and Lotka's law : bibliometric patterns in programming
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.2, pp. 169-178
Year
2003
Abstract
Newby, Greenberg, and Jones analyze programming productivity in open source software by counting registered developers' contributions found in the Linux Software Map (LSM) and in SourceForge. Using seven years of data from a subset of the Linux directory tree, the LSM data provided 4503 files with 3341 unique author names. The distribution follows Lotka's law with an exponent of 2.82, as verified by the Kolmogorov-Smirnov (K-S) one-sample goodness-of-fit test. The SourceForge data are broken into developers and administrators, but when both were used as authors, a Lotka distribution with an exponent of 2.55 produces the lowest error. This fit would not be significant by the K-S test, but the 3.54% maximum error would indicate a fit and calls into question the appropriateness of K-S for large populations of authors.
Theme
Informetrics
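
The abstract's point about the K-S test and large author populations can be illustrated with a minimal sketch. It assumes the common form of Lotka's law, in which the fraction of authors with x contributions is proportional to 1/x^n, and it assumes the usual large-sample K-S critical value at the 0.01 level, 1.63/sqrt(N). The function names, the truncation of the normalising sum at the largest observed x, and the population sizes in the loop are illustrative assumptions, not details taken from the paper.

```python
import math


def lotka_pmf(exponent, max_x):
    """Expected fraction of authors with x = 1..max_x contributions.

    Assumption: the normalising sum is truncated at max_x.
    """
    weights = [x ** -exponent for x in range(1, max_x + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def ks_max_deviation(observed_counts, exponent):
    """Largest gap between observed and expected cumulative fractions.

    observed_counts[i] is the number of authors with i + 1 contributions.
    """
    n_authors = sum(observed_counts)
    expected = lotka_pmf(exponent, len(observed_counts))
    obs_cum = 0.0
    exp_cum = 0.0
    d_max = 0.0
    for count, p in zip(observed_counts, expected):
        obs_cum += count / n_authors
        exp_cum += p
        d_max = max(d_max, abs(obs_cum - exp_cum))
    return d_max


def ks_critical_value(n_authors, coefficient=1.63):
    """Large-sample K-S critical value (1.63 corresponds to the 0.01 level)."""
    return coefficient / math.sqrt(n_authors)


# The same 3.54% maximum deviation passes or fails depending only on N.
# These population sizes are hypothetical, chosen to show the trend.
reported_deviation = 0.0354
for n in (100, 1000, 10000, 100000):
    fits = reported_deviation <= ks_critical_value(n)
    print(n, round(ks_critical_value(n), 4), "fit" if fits else "rejected")
```

Because the critical value shrinks with the square root of the number of authors, a deviation that looks small in practical terms is still rejected once the population is large enough, which is the concern the abstract raises.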

Similar documents (author)

  1. Newby, G.B.: Navigation: a fundamental concept for information systems with implications for information retrieval (1991) 1.35
    1.3499926 = sum of:
      1.3499926 = product of:
        4.049978 = sum of:
          4.049978 = weight(author_txt:newby in 3687) [ClassicSimilarity], result of:
            4.049978 = score(doc=3687,freq=1.0), product of:
              0.7062932 = queryWeight, product of:
                1.323491 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.058166977 = queryNorm
              5.734131 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.625 = fieldNorm(doc=3687)
        0.33333334 = coord(1/3)
    
  2. Newby, G.B.: An investigation of the role of navigation for information retrieval (1992) 1.35
    1.3499926 = sum of:
      1.3499926 = product of:
        4.049978 = sum of:
          4.049978 = weight(author_txt:newby in 4504) [ClassicSimilarity], result of:
            4.049978 = score(doc=4504,freq=1.0), product of:
              0.7062932 = queryWeight, product of:
                1.323491 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.058166977 = queryNorm
              5.734131 = fieldWeight in 4504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.625 = fieldNorm(doc=4504)
        0.33333334 = coord(1/3)
    
  3. Newby, G.B.: Virtual reality (1993) 1.35
    1.3499926 = sum of:
      1.3499926 = product of:
        4.049978 = sum of:
          4.049978 = weight(author_txt:newby in 7238) [ClassicSimilarity], result of:
            4.049978 = score(doc=7238,freq=1.0), product of:
              0.7062932 = queryWeight, product of:
                1.323491 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.058166977 = queryNorm
              5.734131 = fieldWeight in 7238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.625 = fieldNorm(doc=7238)
        0.33333334 = coord(1/3)
    
  4. Newby, G.B.: The maturation of norms for computer-mediated communication (1993) 1.35
    1.3499926 = sum of:
      1.3499926 = product of:
        4.049978 = sum of:
          4.049978 = weight(author_txt:newby in 624) [ClassicSimilarity], result of:
            4.049978 = score(doc=624,freq=1.0), product of:
              0.7062932 = queryWeight, product of:
                1.323491 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.058166977 = queryNorm
              5.734131 = fieldWeight in 624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.625 = fieldNorm(doc=624)
        0.33333334 = coord(1/3)
    
  5. Newby, G.B.: Virtual reality and the entertainment industry (1994) 1.35
    1.3499926 = sum of:
      1.3499926 = product of:
        4.049978 = sum of:
          4.049978 = weight(author_txt:newby in 271) [ClassicSimilarity], result of:
            4.049978 = score(doc=271,freq=1.0), product of:
              0.7062932 = queryWeight, product of:
                1.323491 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.058166977 = queryNorm
              5.734131 = fieldWeight in 271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.625 = fieldNorm(doc=271)
        0.33333334 = coord(1/3)
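
The score trees for the author matches all follow the same arithmetic, which can be reproduced with a short sketch. It assumes the default ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and takes boost, queryNorm, and fieldNorm directly from the trees. The function names are hypothetical and are not part of Lucene's API.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # query side of the tree
    field_weight = math.sqrt(freq) * idf * field_norm  # document side of the tree
    return query_weight * field_weight


# Values copied from the tree for author_txt:newby.
term_score = classic_term_score(
    freq=1.0,
    doc_freq=11,
    max_docs=42596,
    boost=1.323491,
    query_norm=0.058166977,
    field_norm=0.625,
)

# Only one of the three query clauses (the three author names) matched,
# so the sum is scaled by coord(1/3).
final_score = term_score * (1 / 3)

print(term_score, final_score)  # roughly 4.05 and 1.35
```

Lucene computes in 32-bit floats, so the last digits can differ slightly from the double-precision result here.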
    

Similar documents (content)

  1. Kretschmer, H.; Rousseau, R.: Author inflation leads to a breakdown of Lotka's law : in and out of context (2001) 0.22
    0.21821514 = sum of:
      0.21821514 = product of:
        0.7793398 = sum of:
          0.08561578 = weight(abstract_txt:counting in 206) [ClassicSimilarity], result of:
            0.08561578 = score(doc=206,freq=1.0), product of:
              0.14388329 = queryWeight, product of:
                1.0067801 = boost
                7.616464 = idf(docFreq=56, maxDocs=42596)
                0.018763864 = queryNorm
              0.59503627 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.616464 = idf(docFreq=56, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.11652512 = weight(abstract_txt:lotka in 206) [ClassicSimilarity], result of:
            0.11652512 = score(doc=206,freq=1.0), product of:
              0.17670718 = queryWeight, product of:
                1.1157235 = boost
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.018763864 = queryNorm
              0.65942496 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.06857615 = weight(abstract_txt:authors in 206) [ClassicSimilarity], result of:
            0.06857615 = score(doc=206,freq=3.0), product of:
              0.10840754 = queryWeight, product of:
                1.235874 = boost
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.018763864 = queryNorm
              0.6325773 = fieldWeight in 206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.0448705 = weight(abstract_txt:would in 206) [ClassicSimilarity], result of:
            0.0448705 = score(doc=206,freq=1.0), product of:
              0.11783974 = queryWeight, product of:
                1.2885176 = boost
                4.873928 = idf(docFreq=884, maxDocs=42596)
                0.018763864 = queryNorm
              0.38077563 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.873928 = idf(docFreq=884, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.022333065 = weight(abstract_txt:data in 206) [ClassicSimilarity], result of:
            0.022333065 = score(doc=206,freq=1.0), product of:
              0.08471893 = queryWeight, product of:
                1.3380746 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.018763864 = queryNorm
              0.26361364 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.09663387 = weight(abstract_txt:distribution in 206) [ClassicSimilarity], result of:
            0.09663387 = score(doc=206,freq=2.0), product of:
              0.15597706 = queryWeight, product of:
                1.4824321 = boost
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.018763864 = queryNorm
              0.61953896 = fieldWeight in 206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
          0.34478536 = weight(abstract_txt:lotka's in 206) [ClassicSimilarity], result of:
            0.34478536 = score(doc=206,freq=2.0), product of:
              0.36420038 = queryWeight, product of:
                2.2652423 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.018763864 = queryNorm
              0.9466914 = fieldWeight in 206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.078125 = fieldNorm(doc=206)
        0.28 = coord(7/25)
    
  2. Egghe, L.: A model for the size-frequency function of coauthor pairs (2008) 0.15
    0.15405236 = sum of:
      0.15405236 = product of:
        0.9628273 = sum of:
          0.1647914 = weight(abstract_txt:lotka in 3546) [ClassicSimilarity], result of:
            0.1647914 = score(doc=3546,freq=2.0), product of:
              0.17670718 = queryWeight, product of:
                1.1157235 = boost
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.018763864 = queryNorm
              0.9325677 = fieldWeight in 3546, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.078125 = fieldNorm(doc=3546)
          0.06857615 = weight(abstract_txt:authors in 3546) [ClassicSimilarity], result of:
            0.06857615 = score(doc=3546,freq=3.0), product of:
              0.10840754 = queryWeight, product of:
                1.235874 = boost
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.018763864 = queryNorm
              0.6325773 = fieldWeight in 3546, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.078125 = fieldNorm(doc=3546)
          0.34478536 = weight(abstract_txt:lotka's in 3546) [ClassicSimilarity], result of:
            0.34478536 = score(doc=3546,freq=2.0), product of:
              0.36420038 = queryWeight, product of:
                2.2652423 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.018763864 = queryNorm
              0.9466914 = fieldWeight in 3546, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.078125 = fieldNorm(doc=3546)
          0.38467446 = weight(abstract_txt:exponent in 3546) [ClassicSimilarity], result of:
            0.38467446 = score(doc=3546,freq=2.0), product of:
              0.391775 = queryWeight, product of:
                2.3494318 = boost
                8.886927 = idf(docFreq=15, maxDocs=42596)
                0.018763864 = queryNorm
              0.98187596 = fieldWeight in 3546, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.886927 = idf(docFreq=15, maxDocs=42596)
                0.078125 = fieldNorm(doc=3546)
        0.16 = coord(4/25)
    
  3. Egghe, L.: Type/Token-Taken informetrics (2003) 0.15
    0.14810072 = sum of:
      0.14810072 = product of:
        0.61708635 = sum of:
          0.03167397 = weight(abstract_txt:authors in 2609) [ClassicSimilarity], result of:
            0.03167397 = score(doc=2609,freq=1.0), product of:
              0.10840754 = queryWeight, product of:
                1.235874 = boost
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.018763864 = queryNorm
              0.29217497 = fieldWeight in 2609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6747994 = idf(docFreq=1079, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
          0.036663778 = weight(abstract_txt:open in 2609) [ClassicSimilarity], result of:
            0.036663778 = score(doc=2609,freq=1.0), product of:
              0.11951323 = queryWeight, product of:
                1.2976347 = boost
                4.9084144 = idf(docFreq=854, maxDocs=42596)
                0.018763864 = queryNorm
              0.3067759 = fieldWeight in 2609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9084144 = idf(docFreq=854, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
          0.0587967 = weight(abstract_txt:source in 2609) [ClassicSimilarity], result of:
            0.0587967 = score(doc=2609,freq=2.0), product of:
              0.12996203 = queryWeight, product of:
                1.3531711 = boost
                5.1184855 = idf(docFreq=692, maxDocs=42596)
                0.018763864 = queryNorm
              0.45241445 = fieldWeight in 2609, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1184855 = idf(docFreq=692, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
          0.07730709 = weight(abstract_txt:distribution in 2609) [ClassicSimilarity], result of:
            0.07730709 = score(doc=2609,freq=2.0), product of:
              0.15597706 = queryWeight, product of:
                1.4824321 = boost
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.018763864 = queryNorm
              0.49563116 = fieldWeight in 2609, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
          0.19504006 = weight(abstract_txt:lotka's in 2609) [ClassicSimilarity], result of:
            0.19504006 = score(doc=2609,freq=1.0), product of:
              0.36420038 = queryWeight, product of:
                2.2652423 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.018763864 = queryNorm
              0.53552955 = fieldWeight in 2609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
          0.21760474 = weight(abstract_txt:exponent in 2609) [ClassicSimilarity], result of:
            0.21760474 = score(doc=2609,freq=1.0), product of:
              0.391775 = queryWeight, product of:
                2.3494318 = boost
                8.886927 = idf(docFreq=15, maxDocs=42596)
                0.018763864 = queryNorm
              0.5554329 = fieldWeight in 2609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.886927 = idf(docFreq=15, maxDocs=42596)
                0.0625 = fieldNorm(doc=2609)
        0.24 = coord(6/25)
    
  4. Smolinsky, L.J.: Discrete power law with exponential cutoff and Lotka's law (2017) 0.13
    0.12825088 = sum of:
      0.12825088 = product of:
        0.80156803 = sum of:
          0.04421721 = weight(abstract_txt:data in 4700) [ClassicSimilarity], result of:
            0.04421721 = score(doc=4700,freq=2.0), product of:
              0.08471893 = queryWeight, product of:
                1.3380746 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.018763864 = queryNorm
              0.52192837 = fieldWeight in 4700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.109375 = fieldNorm(doc=4700)
          0.0705044 = weight(abstract_txt:test in 4700) [ClassicSimilarity], result of:
            0.0705044 = score(doc=4700,freq=1.0), product of:
              0.12726527 = queryWeight, product of:
                1.3390582 = boost
                5.065102 = idf(docFreq=730, maxDocs=42596)
                0.018763864 = queryNorm
              0.55399555 = fieldWeight in 4700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.065102 = idf(docFreq=730, maxDocs=42596)
                0.109375 = fieldNorm(doc=4700)
          0.095662646 = weight(abstract_txt:distribution in 4700) [ClassicSimilarity], result of:
            0.095662646 = score(doc=4700,freq=1.0), product of:
              0.15597706 = queryWeight, product of:
                1.4824321 = boost
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.018763864 = queryNorm
              0.6133123 = fieldWeight in 4700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.109375 = fieldNorm(doc=4700)
          0.5911838 = weight(abstract_txt:lotka's in 4700) [ClassicSimilarity], result of:
            0.5911838 = score(doc=4700,freq=3.0), product of:
              0.36420038 = queryWeight, product of:
                2.2652423 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.018763864 = queryNorm
              1.6232376 = fieldWeight in 4700, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.109375 = fieldNorm(doc=4700)
        0.16 = coord(4/25)
    
  5. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 0.11
    0.11445295 = sum of:
      0.11445295 = product of:
        0.57226473 = sum of:
          0.0932201 = weight(abstract_txt:lotka in 914) [ClassicSimilarity], result of:
            0.0932201 = score(doc=914,freq=1.0), product of:
              0.17670718 = queryWeight, product of:
                1.1157235 = boost
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.018763864 = queryNorm
              0.52753997 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.4406395 = idf(docFreq=24, maxDocs=42596)
                0.0625 = fieldNorm(doc=914)
          0.053599354 = weight(abstract_txt:data in 914) [ClassicSimilarity], result of:
            0.053599354 = score(doc=914,freq=9.0), product of:
              0.08471893 = queryWeight, product of:
                1.3380746 = boost
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.018763864 = queryNorm
              0.6326727 = fieldWeight in 914, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.3742545 = idf(docFreq=3964, maxDocs=42596)
                0.0625 = fieldNorm(doc=914)
          0.040288225 = weight(abstract_txt:test in 914) [ClassicSimilarity], result of:
            0.040288225 = score(doc=914,freq=1.0), product of:
              0.12726527 = queryWeight, product of:
                1.3390582 = boost
                5.065102 = idf(docFreq=730, maxDocs=42596)
                0.018763864 = queryNorm
              0.31656888 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.065102 = idf(docFreq=730, maxDocs=42596)
                0.0625 = fieldNorm(doc=914)
          0.10932874 = weight(abstract_txt:distribution in 914) [ClassicSimilarity], result of:
            0.10932874 = score(doc=914,freq=4.0), product of:
              0.15597706 = queryWeight, product of:
                1.4824321 = boost
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.018763864 = queryNorm
              0.70092833 = fieldWeight in 914, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6074266 = idf(docFreq=424, maxDocs=42596)
                0.0625 = fieldNorm(doc=914)
          0.2758283 = weight(abstract_txt:lotka's in 914) [ClassicSimilarity], result of:
            0.2758283 = score(doc=914,freq=2.0), product of:
              0.36420038 = queryWeight, product of:
                2.2652423 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.018763864 = queryNorm
              0.7573531 = fieldWeight in 914, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.0625 = fieldNorm(doc=914)
        0.2 = coord(5/25)
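
For the content matches, the final score is the sum of the per-term weights multiplied by the coord factor, that is, the share of the 25 query terms that occur in the document's abstract. A minimal sketch using the numbers from entry 2 (Egghe 2008); the helper name is hypothetical:

```python
def combine_term_scores(term_scores, total_query_terms):
    """Sum the matched term weights and scale by coord(matched/total)."""
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord


# Per-term weights copied from the tree for doc 3546:
# lotka, authors, lotka's, exponent.
egghe_2008_terms = [0.1647914, 0.06857615, 0.34478536, 0.38467446]

score = combine_term_scores(egghe_2008_terms, total_query_terms=25)
print(score)  # close to the 0.15405236 shown in the tree
```

The coord factor explains why a document with a few strong matches can still rank below one that matches more query terms with lower weights, as with entries 1 and 2 above.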