Document (#36114)

Author
Milojevic, S.
Title
Power law distributions in information science : making the case for logarithmic binning
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.12, S.2417-2425
Year
2010
Abstract
We suggest partial logarithmic binning as the method of choice for uncovering the nature of many distributions encountered in information science (IS). Logarithmic binning retrieves information and trends "not visible" in noisy power law tails. We also argue that obtaining the exponent from logarithmically binned data using a simple least square method is in some cases warranted in addition to methods such as the maximum likelihood. We also show why often-used cumulative distributions can make it difficult to distinguish noise from genuine features and to obtain an accurate power law exponent of the underlying distribution. The treatment is nontechnical, aimed at IS researchers with little or no background in mathematics.
Theme
Informetrie

Similar documents (author)

  1. Milojevic, S.: Modes of collaboration in modern science : beyond power laws and preferential attachment (2010) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:milojevic in 3592) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 3592, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=3592)
    
  2. Zhang, G.; Ding, Y.; Milojevic, S.: Citation content analysis (CCA) : a framework for syntactic and semantic analysis of citation content (2013) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:milojevic in 975) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 975, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=975)
    
  3. Milojevic, S.; Sugimoto, C.R.; Yan, E.; Ding, Y.: ¬The cognitive structure of Library and Information Science : analysis of article title words (2011) 3.10
    3.0953524 = sum of:
      3.0953524 = weight(author_txt:milojevic in 4608) [ClassicSimilarity], result of:
        3.0953524 = fieldWeight in 4608, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.3125 = fieldNorm(doc=4608)
    
  4. Hu, B.; Dong, X.; Zhang, C.; Bowman, T.D.; Ding, Y.; Milojevic, S.; Ni, C.; Yan, E.; Larivière, V.: ¬A lead-lag analysis of the topic evolution patterns for preprints and publications (2015) 2.17
    2.1667466 = sum of:
      2.1667466 = weight(author_txt:milojevic in 2337) [ClassicSimilarity], result of:
        2.1667466 = fieldWeight in 2337, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.21875 = fieldNorm(doc=2337)
    

Similar documents (content)

  1. Leydesdorff, L.; Bensman, S.: Classification and Powerlaws : the logarithmic transformation (2006) 0.24
    0.23740879 = sum of:
      0.23740879 = product of:
        1.1870439 = sum of:
          0.0073497565 = weight(abstract_txt:information in 6007) [ClassicSimilarity], result of:
            0.0073497565 = score(doc=6007,freq=1.0), product of:
              0.03885955 = queryWeight, product of:
                1.068584 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015021177 = queryNorm
              0.18913643 = fieldWeight in 6007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=6007)
          0.019874496 = weight(abstract_txt:science in 6007) [ClassicSimilarity], result of:
            0.019874496 = score(doc=6007,freq=1.0), product of:
              0.065889485 = queryWeight, product of:
                1.1361147 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.015021177 = queryNorm
              0.3016338 = fieldWeight in 6007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.078125 = fieldNorm(doc=6007)
          0.15359443 = weight(abstract_txt:tails in 6007) [ClassicSimilarity], result of:
            0.15359443 = score(doc=6007,freq=1.0), product of:
              0.20442109 = queryWeight, product of:
                1.415018 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.015021177 = queryNorm
              0.751363 = fieldWeight in 6007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.078125 = fieldNorm(doc=6007)
          0.28480923 = weight(abstract_txt:distributions in 6007) [ClassicSimilarity], result of:
            0.28480923 = score(doc=6007,freq=3.0), product of:
              0.30853996 = queryWeight, product of:
                3.0110326 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.015021177 = queryNorm
              0.923087 = fieldWeight in 6007, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.078125 = fieldNorm(doc=6007)
          0.72141594 = weight(abstract_txt:logarithmic in 6007) [ClassicSimilarity], result of:
            0.72141594 = score(doc=6007,freq=3.0), product of:
              0.5733228 = queryWeight, product of:
                4.104491 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.015021177 = queryNorm
              1.2583067 = fieldWeight in 6007, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.078125 = fieldNorm(doc=6007)
        0.2 = coord(5/25)
    
  2. Bodoff, D.; Wu, B.; Wong, K.Y.M.: Relevance data for language models using maximum likelihood (2003) 0.13
    0.12592159 = sum of:
      0.12592159 = product of:
        0.52467334 = sum of:
          0.01626338 = weight(abstract_txt:also in 1822) [ClassicSimilarity], result of:
            0.01626338 = score(doc=1822,freq=1.0), product of:
              0.05104718 = queryWeight, product of:
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.015021177 = queryNorm
              0.31859508 = fieldWeight in 1822, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
          0.11131793 = weight(abstract_txt:maximum in 1822) [ClassicSimilarity], result of:
            0.11131793 = score(doc=1822,freq=2.0), product of:
              0.1159279 = queryWeight, product of:
                1.0655973 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.015021177 = queryNorm
              0.9602342 = fieldWeight in 1822, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
          0.008819709 = weight(abstract_txt:information in 1822) [ClassicSimilarity], result of:
            0.008819709 = score(doc=1822,freq=1.0), product of:
              0.03885955 = queryWeight, product of:
                1.068584 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.015021177 = queryNorm
              0.22696373 = fieldWeight in 1822, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
          0.12550513 = weight(abstract_txt:likelihood in 1822) [ClassicSimilarity], result of:
            0.12550513 = score(doc=1822,freq=2.0), product of:
              0.12557954 = queryWeight, product of:
                1.109069 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.015021177 = queryNorm
              0.99940753 = fieldWeight in 1822, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
          0.06544562 = weight(abstract_txt:method in 1822) [ClassicSimilarity], result of:
            0.06544562 = score(doc=1822,freq=3.0), product of:
              0.08954565 = queryWeight, product of:
                1.3244525 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.015021177 = queryNorm
              0.73086315 = fieldWeight in 1822, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
          0.19732162 = weight(abstract_txt:distributions in 1822) [ClassicSimilarity], result of:
            0.19732162 = score(doc=1822,freq=1.0), product of:
              0.30853996 = queryWeight, product of:
                3.0110326 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.015021177 = queryNorm
              0.63953346 = fieldWeight in 1822, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.09375 = fieldNorm(doc=1822)
        0.24 = coord(6/25)
    
  3. Payne, N.; Thelwall, M.: Mathematical models for academic webs : linear relationship or non-linear power law? (2005) 0.09
    0.08912491 = sum of:
      0.08912491 = product of:
        0.7427076 = sum of:
          0.037785046 = weight(abstract_txt:method in 1066) [ClassicSimilarity], result of:
            0.037785046 = score(doc=1066,freq=1.0), product of:
              0.08954565 = queryWeight, product of:
                1.3244525 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.015021177 = queryNorm
              0.42196405 = fieldWeight in 1066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.09375 = fieldNorm(doc=1066)
          0.20511092 = weight(abstract_txt:power in 1066) [ClassicSimilarity], result of:
            0.20511092 = score(doc=1066,freq=3.0), product of:
              0.21952319 = queryWeight, product of:
                2.5398026 = boost
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.015021177 = queryNorm
              0.9343474 = fieldWeight in 1066, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.09375 = fieldNorm(doc=1066)
          0.49981162 = weight(abstract_txt:logarithmic in 1066) [ClassicSimilarity], result of:
            0.49981162 = score(doc=1066,freq=1.0), product of:
              0.5733228 = queryWeight, product of:
                4.104491 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.015021177 = queryNorm
              0.8717805 = fieldWeight in 1066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.09375 = fieldNorm(doc=1066)
        0.12 = coord(3/25)
    
  4. Waltman, L.; Eck, N.J. van; Raan, A.F.J. van: Universality of citation distributions revisited (2012) 0.09
    0.08792765 = sum of:
      0.08792765 = product of:
        0.5495478 = sum of:
          0.01626338 = weight(abstract_txt:also in 4963) [ClassicSimilarity], result of:
            0.01626338 = score(doc=4963,freq=1.0), product of:
              0.05104718 = queryWeight, product of:
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.015021177 = queryNorm
              0.31859508 = fieldWeight in 4963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.09375 = fieldNorm(doc=4963)
          0.033728138 = weight(abstract_txt:science in 4963) [ClassicSimilarity], result of:
            0.033728138 = score(doc=4963,freq=2.0), product of:
              0.065889485 = queryWeight, product of:
                1.1361147 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.015021177 = queryNorm
              0.5118895 = fieldWeight in 4963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.09375 = fieldNorm(doc=4963)
          0.1577852 = weight(abstract_txt:warranted in 4963) [ClassicSimilarity], result of:
            0.1577852 = score(doc=4963,freq=1.0), product of:
              0.18430285 = queryWeight, product of:
                1.343585 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.015021177 = queryNorm
              0.85611916 = fieldWeight in 4963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.09375 = fieldNorm(doc=4963)
          0.34177107 = weight(abstract_txt:distributions in 4963) [ClassicSimilarity], result of:
            0.34177107 = score(doc=4963,freq=3.0), product of:
              0.30853996 = queryWeight, product of:
                3.0110326 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.015021177 = queryNorm
              1.1077044 = fieldWeight in 4963, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.09375 = fieldNorm(doc=4963)
        0.16 = coord(4/25)
    
  5. Ronda-Pupo, G.A.; Katz, J.S.: ¬The scaling relationship between citation-based performance and coauthorship patterns in natural sciences (2017) 0.08
    0.0797194 = sum of:
      0.0797194 = product of:
        0.66432834 = sum of:
          0.07688998 = weight(abstract_txt:cumulative in 3603) [ClassicSimilarity], result of:
            0.07688998 = score(doc=3603,freq=1.0), product of:
              0.12888089 = queryWeight, product of:
                1.1235526 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.015021177 = queryNorm
              0.5965972 = fieldWeight in 3603, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=3603)
          0.17092578 = weight(abstract_txt:power in 3603) [ClassicSimilarity], result of:
            0.17092578 = score(doc=3603,freq=3.0), product of:
              0.21952319 = queryWeight, product of:
                2.5398026 = boost
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.015021177 = queryNorm
              0.77862287 = fieldWeight in 3603, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.078125 = fieldNorm(doc=3603)
          0.4165126 = weight(abstract_txt:exponent in 3603) [ClassicSimilarity], result of:
            0.4165126 = score(doc=3603,freq=3.0), product of:
              0.34726715 = queryWeight, product of:
                2.6082306 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.015021177 = queryNorm
              1.1994011 = fieldWeight in 3603, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.078125 = fieldNorm(doc=3603)
        0.12 = coord(3/25)