Document (#31008)

Author
Leydesdorff, L.
Bensman, S.
Title
Classification and Powerlaws : the logarithmic transformation
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.11, S.1470-1486
Year
2006
Abstract
Logarithmic transformation of the data has been recommended by the literature in the case of highly skewed distributions such as those commonly found in information science. The purpose of the transformation is to make the data conform to the lognormal law of error for inferential purposes. How does this transformation affect the analysis? We factor analyze and visualize the citation environment of the Journal of the American Chemical Society (JACS) before and after a logarithmic transformation. The transformation strongly reduces the variance necessary for classificatory purposes and therefore is counterproductive to the purposes of the descriptive statistics. We recommend against the logarithmic transformation when sets cannot be defined unambiguously. The intellectual organization of the sciences is reflected in the curvilinear parts of the citation distributions while negative powerlaws fit excellently to the tails of the distributions.
Theme
Informetrie

Similar documents (author)

  1. Bensman, S.J.; Leydesdorff, L.: Definition and identification of journals as bibliographic and subject entities : librarianship versus ISI Journal Citation Reports methods and their effect on citation measures (2009) 5.81
    5.8140373 = sum of:
      5.8140373 = sum of:
        1.8892446 = weight(author_txt:leydesdorff in 2840) [ClassicSimilarity], result of:
          1.8892446 = score(doc=2840,freq=1.0), product of:
            0.523369 = queryWeight, product of:
              7.2195506 = idf(docFreq=87, maxDocs=44218)
              0.07249329 = queryNorm
            3.6097753 = fieldWeight in 2840, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2195506 = idf(docFreq=87, maxDocs=44218)
              0.5 = fieldNorm(doc=2840)
        3.9247925 = weight(author_txt:bensman in 2840) [ClassicSimilarity], result of:
          3.9247925 = score(doc=2840,freq=1.0), product of:
            0.8521061 = queryWeight, product of:
              1.275977 = boost
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.07249329 = queryNorm
            4.6059904 = fieldWeight in 2840, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.211981 = idf(docFreq=11, maxDocs=44218)
              0.5 = fieldNorm(doc=2840)
    
  2. Bensman, S.J.: Garfield and the impact factors (2007) 2.45
    2.4529953 = sum of:
      2.4529953 = product of:
        4.9059906 = sum of:
          4.9059906 = weight(author_txt:bensman in 4680) [ClassicSimilarity], result of:
            4.9059906 = score(doc=4680,freq=1.0), product of:
              0.8521061 = queryWeight, product of:
                1.275977 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07249329 = queryNorm
              5.7574883 = fieldWeight in 4680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=4680)
        0.5 = coord(1/2)
    
  3. Bensman, S.J.: Probability distributions in library and information science : a historical and practitioner viewpoint (2000) 2.45
    2.4529953 = sum of:
      2.4529953 = product of:
        4.9059906 = sum of:
          4.9059906 = weight(author_txt:bensman in 4859) [ClassicSimilarity], result of:
            4.9059906 = score(doc=4859,freq=1.0), product of:
              0.8521061 = queryWeight, product of:
                1.275977 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07249329 = queryNorm
              5.7574883 = fieldWeight in 4859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=4859)
        0.5 = coord(1/2)
    
  4. Bensman, S.J.: Urquhart's and Garfield's laws : the British controversy over their validity (2001) 2.45
    2.4529953 = sum of:
      2.4529953 = product of:
        4.9059906 = sum of:
          4.9059906 = weight(author_txt:bensman in 6026) [ClassicSimilarity], result of:
            4.9059906 = score(doc=6026,freq=1.0), product of:
              0.8521061 = queryWeight, product of:
                1.275977 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07249329 = queryNorm
              5.7574883 = fieldWeight in 6026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=6026)
        0.5 = coord(1/2)
    
  5. Bensman, S.J.: Urquhart and probability : the transition from librarianship to library and information science (2005) 2.45
    2.4529953 = sum of:
      2.4529953 = product of:
        4.9059906 = sum of:
          4.9059906 = weight(author_txt:bensman in 3311) [ClassicSimilarity], result of:
            4.9059906 = score(doc=3311,freq=1.0), product of:
              0.8521061 = queryWeight, product of:
                1.275977 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.07249329 = queryNorm
              5.7574883 = fieldWeight in 3311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.625 = fieldNorm(doc=3311)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Milojevic, S.: Power law distributions in information science : making the case for logarithmic binning (2010) 0.18
    0.18245822 = sum of:
      0.18245822 = product of:
        1.1403639 = sum of:
          0.012348272 = weight(abstract_txt:data in 4113) [ClassicSimilarity], result of:
            0.012348272 = score(doc=4113,freq=1.0), product of:
              0.039478768 = queryWeight, product of:
                1.0631486 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01113008 = queryNorm
              0.31278262 = fieldWeight in 4113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=4113)
          0.14789093 = weight(abstract_txt:tails in 4113) [ClassicSimilarity], result of:
            0.14789093 = score(doc=4113,freq=1.0), product of:
              0.16402517 = queryWeight, product of:
                1.5323305 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.01113008 = queryNorm
              0.9016355 = fieldWeight in 4113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.09375 = fieldNorm(doc=4113)
          0.22391048 = weight(abstract_txt:distributions in 4113) [ClassicSimilarity], result of:
            0.22391048 = score(doc=4113,freq=2.0), product of:
              0.24756895 = queryWeight, product of:
                3.260663 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01113008 = queryNorm
              0.9044368 = fieldWeight in 4113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.09375 = fieldNorm(doc=4113)
          0.75621426 = weight(abstract_txt:logarithmic in 4113) [ClassicSimilarity], result of:
            0.75621426 = score(doc=4113,freq=2.0), product of:
              0.61337024 = queryWeight, product of:
                5.9263673 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.01113008 = queryNorm
              1.2328838 = fieldWeight in 4113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.09375 = fieldNorm(doc=4113)
        0.16 = coord(4/25)
    
  2. Leydesdorff, L.: Similarity measures, author cocitation Analysis, and information theory (2005) 0.15
    0.14670281 = sum of:
      0.14670281 = product of:
        1.2225235 = sum of:
          0.012348272 = weight(abstract_txt:data in 3471) [ClassicSimilarity], result of:
            0.012348272 = score(doc=3471,freq=1.0), product of:
              0.039478768 = queryWeight, product of:
                1.0631486 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01113008 = queryNorm
              0.31278262 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=3471)
          0.75621426 = weight(abstract_txt:logarithmic in 3471) [ClassicSimilarity], result of:
            0.75621426 = score(doc=3471,freq=2.0), product of:
              0.61337024 = queryWeight, product of:
                5.9263673 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.01113008 = queryNorm
              1.2328838 = fieldWeight in 3471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.09375 = fieldNorm(doc=3471)
          0.45396087 = weight(abstract_txt:transformation in 3471) [ClassicSimilarity], result of:
            0.45396087 = score(doc=3471,freq=2.0), product of:
              0.5259984 = queryWeight, product of:
                7.2600303 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.01113008 = queryNorm
              0.86304605 = fieldWeight in 3471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.09375 = fieldNorm(doc=3471)
        0.12 = coord(3/25)
    
  3. Bensman, S.J.; Smolinsky, L.J.; Pudovkin, A.I.: Mean citation rate per article in mathematics journals : differences from the scientific model (2010) 0.10
    0.10431664 = sum of:
      0.10431664 = product of:
        0.37255943 = sum of:
          0.034421597 = weight(abstract_txt:negative in 3595) [ClassicSimilarity], result of:
            0.034421597 = score(doc=3595,freq=2.0), product of:
              0.070558436 = queryWeight, product of:
                1.005013 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.01113008 = queryNorm
              0.4878452 = fieldWeight in 3595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.0072031585 = weight(abstract_txt:data in 3595) [ClassicSimilarity], result of:
            0.0072031585 = score(doc=3595,freq=1.0), product of:
              0.039478768 = queryWeight, product of:
                1.0631486 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01113008 = queryNorm
              0.18245652 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.031207878 = weight(abstract_txt:error in 3595) [ClassicSimilarity], result of:
            0.031207878 = score(doc=3595,freq=1.0), product of:
              0.08327496 = queryWeight, product of:
                1.0918285 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.01113008 = queryNorm
              0.37475705 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.06680789 = weight(abstract_txt:variance in 3595) [ClassicSimilarity], result of:
            0.06680789 = score(doc=3595,freq=2.0), product of:
              0.10978595 = queryWeight, product of:
                1.2536335 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.01113008 = queryNorm
              0.60852855 = fieldWeight in 3595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.056758054 = weight(abstract_txt:skewed in 3595) [ClassicSimilarity], result of:
            0.056758054 = score(doc=3595,freq=1.0), product of:
              0.12407662 = queryWeight, product of:
                1.33273 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.01113008 = queryNorm
              0.4574436 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.0455464 = weight(abstract_txt:citation in 3595) [ClassicSimilarity], result of:
            0.0455464 = score(doc=3595,freq=4.0), product of:
              0.085041516 = queryWeight, product of:
                1.5603703 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.01113008 = queryNorm
              0.5355784 = fieldWeight in 3595, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
          0.13061446 = weight(abstract_txt:distributions in 3595) [ClassicSimilarity], result of:
            0.13061446 = score(doc=3595,freq=2.0), product of:
              0.24756895 = queryWeight, product of:
                3.260663 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01113008 = queryNorm
              0.5275882 = fieldWeight in 3595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3595)
        0.28 = coord(7/25)
    
  4. Leydesdorff, L.; Zhou, P.; Bornmann, L.: How can journal impact factors be normalized across fields of science? : An assessment in terms of percentile ranks and fractional counts (2013) 0.10
    0.09809198 = sum of:
      0.09809198 = product of:
        0.49045992 = sum of:
          0.053988926 = weight(abstract_txt:variance in 532) [ClassicSimilarity], result of:
            0.053988926 = score(doc=532,freq=1.0), product of:
              0.10978595 = queryWeight, product of:
                1.2536335 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.01113008 = queryNorm
              0.49176535 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=532)
          0.06486635 = weight(abstract_txt:skewed in 532) [ClassicSimilarity], result of:
            0.06486635 = score(doc=532,freq=1.0), product of:
              0.12407662 = queryWeight, product of:
                1.33273 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.01113008 = queryNorm
              0.5227927 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=532)
          0.05205303 = weight(abstract_txt:citation in 532) [ClassicSimilarity], result of:
            0.05205303 = score(doc=532,freq=4.0), product of:
              0.085041516 = queryWeight, product of:
                1.5603703 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.01113008 = queryNorm
              0.61208963 = fieldWeight in 532, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=532)
          0.10555241 = weight(abstract_txt:distributions in 532) [ClassicSimilarity], result of:
            0.10555241 = score(doc=532,freq=1.0), product of:
              0.24756895 = queryWeight, product of:
                3.260663 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01113008 = queryNorm
              0.42635563 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=532)
          0.21399921 = weight(abstract_txt:transformation in 532) [ClassicSimilarity], result of:
            0.21399921 = score(doc=532,freq=1.0), product of:
              0.5259984 = queryWeight, product of:
                7.2600303 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.01113008 = queryNorm
              0.40684384 = fieldWeight in 532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.0625 = fieldNorm(doc=532)
        0.2 = coord(5/25)
    
  5. Bensman, S.J.: Distributional differences of the impact factor in the sciences versus the social sciences : an analysis of the probabilistic structure of the 2005 journal citation reports (2008) 0.10
    0.09756412 = sum of:
      0.09756412 = product of:
        0.4878206 = sum of:
          0.034771062 = weight(abstract_txt:negative in 1953) [ClassicSimilarity], result of:
            0.034771062 = score(doc=1953,freq=1.0), product of:
              0.070558436 = queryWeight, product of:
                1.005013 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.01113008 = queryNorm
              0.4927981 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1953)
          0.095439844 = weight(abstract_txt:variance in 1953) [ClassicSimilarity], result of:
            0.095439844 = score(doc=1953,freq=2.0), product of:
              0.10978595 = queryWeight, product of:
                1.2536335 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.01113008 = queryNorm
              0.86932653 = fieldWeight in 1953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.078125 = fieldNorm(doc=1953)
          0.114668585 = weight(abstract_txt:skewed in 1953) [ClassicSimilarity], result of:
            0.114668585 = score(doc=1953,freq=2.0), product of:
              0.12407662 = queryWeight, product of:
                1.33273 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.01113008 = queryNorm
              0.9241756 = fieldWeight in 1953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.078125 = fieldNorm(doc=1953)
          0.056349054 = weight(abstract_txt:citation in 1953) [ClassicSimilarity], result of:
            0.056349054 = score(doc=1953,freq=3.0), product of:
              0.085041516 = queryWeight, product of:
                1.5603703 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.01113008 = queryNorm
              0.6626064 = fieldWeight in 1953, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.078125 = fieldNorm(doc=1953)
          0.18659207 = weight(abstract_txt:distributions in 1953) [ClassicSimilarity], result of:
            0.18659207 = score(doc=1953,freq=2.0), product of:
              0.24756895 = queryWeight, product of:
                3.260663 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.01113008 = queryNorm
              0.7536974 = fieldWeight in 1953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.078125 = fieldNorm(doc=1953)
        0.2 = coord(5/25)