Document (#38135)

Author
Serpa, F.G.
Graves, A.M.
Javier, A.
Title
Statistical common author networks
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.12, pp.2507-2512
Year
2013
Abstract
A new method for visualizing the relatedness of scientific areas has been developed that is based on measuring the overlap of researchers between areas. It is found that closely related areas have a high propensity to share a larger number of common authors. A method for comparing areas of vastly different sizes and for handling name homonymy is constructed, allowing for the robust deployment of this method on real data sets. A statistical analysis of the probability distributions of the common author overlap that accounts for noise is carried out, along with the production of network maps with weighted links proportional to the overlap strength. This is demonstrated on 2 case studies, complexity science and neutrino physics, where the level of relatedness of areas within each area is expected to vary greatly. It is found that the results returned by this method closely match the intuitive expectation that the broad, multidisciplinary area of complexity science possesses areas that are weakly related to each other, whereas the much narrower area of neutrino physics shows very strongly related areas.
Theme
Informetrics

Similar documents (content)

  1. Braun, T.; Glänzel, W.; Grupp, H.: The scientometric weight of 50 nations in 27 scientific areas, 1989-1993 : Pt.1: All fields combined, mathematics, engineering, chemistry and physics (1995) 0.17
    0.17430583 = sum of:
      0.17430583 = product of:
        0.6225208 = sum of:
          0.0468154 = weight(abstract_txt:science in 830) [ClassicSimilarity], result of:
            0.0468154 = score(doc=830,freq=3.0), product of:
              0.073838875 = queryWeight, product of:
                1.0022362 = boost
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.018868752 = queryNorm
              0.63402104 = fieldWeight in 830, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.03205474 = weight(abstract_txt:each in 830) [ClassicSimilarity], result of:
            0.03205474 = score(doc=830,freq=1.0), product of:
              0.082729645 = queryWeight, product of:
                1.0608603 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.018868752 = queryNorm
              0.38746378 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.041626498 = weight(abstract_txt:found in 830) [ClassicSimilarity], result of:
            0.041626498 = score(doc=830,freq=1.0), product of:
              0.09847204 = queryWeight, product of:
                1.1574016 = boost
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.018868752 = queryNorm
              0.42272407 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.13571957 = weight(abstract_txt:physics in 830) [ClassicSimilarity], result of:
            0.13571957 = score(doc=830,freq=1.0), product of:
              0.21651833 = queryWeight, product of:
                1.7162278 = boost
                6.6861567 = idf(docFreq=144, maxDocs=42740)
                0.018868752 = queryNorm
              0.6268272 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6861567 = idf(docFreq=144, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.018705308 = weight(abstract_txt:that in 830) [ClassicSimilarity], result of:
            0.018705308 = score(doc=830,freq=1.0), product of:
              0.08332014 = queryWeight, product of:
                1.8440098 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.018868752 = queryNorm
              0.22449924 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.08611559 = weight(abstract_txt:area in 830) [ClassicSimilarity], result of:
            0.08611559 = score(doc=830,freq=1.0), product of:
              0.18301412 = queryWeight, product of:
                1.9324824 = boost
                5.0191007 = idf(docFreq=767, maxDocs=42740)
                0.018868752 = queryNorm
              0.4705407 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0191007 = idf(docFreq=767, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
          0.2614837 = weight(abstract_txt:areas in 830) [ClassicSimilarity], result of:
            0.2614837 = score(doc=830,freq=2.0), product of:
              0.40399447 = queryWeight, product of:
                4.385805 = boost
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.018868752 = queryNorm
              0.64724576 = fieldWeight in 830, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.09375 = fieldNorm(doc=830)
        0.28 = coord(7/25)
    
  2. Wang, F.; Wolfram, D.: Assessment of journal similarity based on citing discipline analysis (2015) 0.16
    0.1600692 = sum of:
      0.1600692 = product of:
        0.50021625 = sum of:
          0.044137985 = weight(abstract_txt:science in 3850) [ClassicSimilarity], result of:
            0.044137985 = score(doc=3850,freq=6.0), product of:
              0.073838875 = queryWeight, product of:
                1.0022362 = boost
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.018868752 = queryNorm
              0.5977608 = fieldWeight in 3850, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.021369828 = weight(abstract_txt:each in 3850) [ClassicSimilarity], result of:
            0.021369828 = score(doc=3850,freq=1.0), product of:
              0.082729645 = queryWeight, product of:
                1.0608603 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.018868752 = queryNorm
              0.2583092 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.04890278 = weight(abstract_txt:related in 3850) [ClassicSimilarity], result of:
            0.04890278 = score(doc=3850,freq=2.0), product of:
              0.13052788 = queryWeight, product of:
                1.6320177 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.018868752 = queryNorm
              0.3746539 = fieldWeight in 3850, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.12756346 = weight(abstract_txt:closely in 3850) [ClassicSimilarity], result of:
            0.12756346 = score(doc=3850,freq=2.0), product of:
              0.21607342 = queryWeight, product of:
                1.7144636 = boost
                6.679284 = idf(docFreq=145, maxDocs=42740)
                0.018868752 = queryNorm
              0.5903709 = fieldWeight in 3850, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.679284 = idf(docFreq=145, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.02159903 = weight(abstract_txt:that in 3850) [ClassicSimilarity], result of:
            0.02159903 = score(doc=3850,freq=3.0), product of:
              0.08332014 = queryWeight, product of:
                1.8440098 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.018868752 = queryNorm
              0.2592294 = fieldWeight in 3850, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.057410393 = weight(abstract_txt:area in 3850) [ClassicSimilarity], result of:
            0.057410393 = score(doc=3850,freq=1.0), product of:
              0.18301412 = queryWeight, product of:
                1.9324824 = boost
                5.0191007 = idf(docFreq=767, maxDocs=42740)
                0.018868752 = queryNorm
              0.3136938 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0191007 = idf(docFreq=767, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.05596817 = weight(abstract_txt:method in 3850) [ClassicSimilarity], result of:
            0.05596817 = score(doc=3850,freq=1.0), product of:
              0.19804531 = queryWeight, product of:
                2.3212657 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.018868752 = queryNorm
              0.28260285 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
          0.123264596 = weight(abstract_txt:areas in 3850) [ClassicSimilarity], result of:
            0.123264596 = score(doc=3850,freq=1.0), product of:
              0.40399447 = queryWeight, product of:
                4.385805 = boost
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.018868752 = queryNorm
              0.30511457 = fieldWeight in 3850, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.0625 = fieldNorm(doc=3850)
        0.32 = coord(8/25)
    
  3. Boyack, K.W.; Wylie, B.N.; Davidson, G.S.: Domain visualization using VxInsight® for science and technology management (2002) 0.15
    0.14894046 = sum of:
      0.14894046 = product of:
        0.46543896 = sum of:
          0.027308984 = weight(abstract_txt:science in 245) [ClassicSimilarity], result of:
            0.027308984 = score(doc=245,freq=3.0), product of:
              0.073838875 = queryWeight, product of:
                1.0022362 = boost
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.018868752 = queryNorm
              0.3698456 = fieldWeight in 245, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.904557 = idf(docFreq=2340, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.03238692 = weight(abstract_txt:each in 245) [ClassicSimilarity], result of:
            0.03238692 = score(doc=245,freq=3.0), product of:
              0.082729645 = queryWeight, product of:
                1.0608603 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.018868752 = queryNorm
              0.39147905 = fieldWeight in 245, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.030257052 = weight(abstract_txt:related in 245) [ClassicSimilarity], result of:
            0.030257052 = score(doc=245,freq=1.0), product of:
              0.13052788 = queryWeight, product of:
                1.6320177 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.018868752 = queryNorm
              0.23180528 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.07916975 = weight(abstract_txt:physics in 245) [ClassicSimilarity], result of:
            0.07916975 = score(doc=245,freq=1.0), product of:
              0.21651833 = queryWeight, product of:
                1.7162278 = boost
                6.6861567 = idf(docFreq=144, maxDocs=42740)
                0.018868752 = queryNorm
              0.3656492 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6861567 = idf(docFreq=144, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.010911429 = weight(abstract_txt:that in 245) [ClassicSimilarity], result of:
            0.010911429 = score(doc=245,freq=1.0), product of:
              0.08332014 = queryWeight, product of:
                1.8440098 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.018868752 = queryNorm
              0.13095789 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.044495694 = weight(abstract_txt:common in 245) [ClassicSimilarity], result of:
            0.044495694 = score(doc=245,freq=1.0), product of:
              0.1687968 = queryWeight, product of:
                1.8559031 = boost
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.018868752 = queryNorm
              0.2636051 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.13305263 = weight(abstract_txt:overlap in 245) [ClassicSimilarity], result of:
            0.13305263 = score(doc=245,freq=1.0), product of:
              0.35034928 = queryWeight, product of:
                2.6737688 = boost
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.018868752 = queryNorm
              0.3797714 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9443917 = idf(docFreq=111, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
          0.10785653 = weight(abstract_txt:areas in 245) [ClassicSimilarity], result of:
            0.10785653 = score(doc=245,freq=1.0), product of:
              0.40399447 = queryWeight, product of:
                4.385805 = boost
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.018868752 = queryNorm
              0.26697525 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.0546875 = fieldNorm(doc=245)
        0.32 = coord(8/25)
    
  4. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.14311679 = sum of:
      0.14311679 = product of:
        0.5111314 = sum of:
          0.108683884 = weight(abstract_txt:weakly in 602) [ClassicSimilarity], result of:
            0.108683884 = score(doc=602,freq=1.0), product of:
              0.19419019 = queryWeight, product of:
                1.149281 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.018868752 = queryNorm
              0.55967754 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.027750999 = weight(abstract_txt:found in 602) [ClassicSimilarity], result of:
            0.027750999 = score(doc=602,freq=1.0), product of:
              0.09847204 = queryWeight, product of:
                1.1574016 = boost
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.018868752 = queryNorm
              0.28181604 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5090566 = idf(docFreq=1278, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.03457949 = weight(abstract_txt:related in 602) [ClassicSimilarity], result of:
            0.03457949 = score(doc=602,freq=1.0), product of:
              0.13052788 = queryWeight, product of:
                1.6320177 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.018868752 = queryNorm
              0.26492032 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.012470205 = weight(abstract_txt:that in 602) [ClassicSimilarity], result of:
            0.012470205 = score(doc=602,freq=1.0), product of:
              0.08332014 = queryWeight, product of:
                1.8440098 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.018868752 = queryNorm
              0.14966616 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.05085222 = weight(abstract_txt:common in 602) [ClassicSimilarity], result of:
            0.05085222 = score(doc=602,freq=1.0), product of:
              0.1687968 = queryWeight, product of:
                1.8559031 = boost
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.018868752 = queryNorm
              0.30126294 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.16485828 = weight(abstract_txt:relatedness in 602) [ClassicSimilarity], result of:
            0.16485828 = score(doc=602,freq=1.0), product of:
              0.32299888 = queryWeight, product of:
                2.0961778 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.018868752 = queryNorm
              0.5103989 = fieldWeight in 602, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
          0.11193634 = weight(abstract_txt:method in 602) [ClassicSimilarity], result of:
            0.11193634 = score(doc=602,freq=4.0), product of:
              0.19804531 = queryWeight, product of:
                2.3212657 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.018868752 = queryNorm
              0.5652057 = fieldWeight in 602, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.0625 = fieldNorm(doc=602)
        0.28 = coord(7/25)
    
  5. Shibata, N.; Kajikawa, Y.; Sakata, I.: Measuring relatedness between communities in a citation network (2011) 0.13
    0.12971824 = sum of:
      0.12971824 = product of:
        0.6485912 = sum of:
          0.04322436 = weight(abstract_txt:related in 1485) [ClassicSimilarity], result of:
            0.04322436 = score(doc=1485,freq=1.0), product of:
              0.13052788 = queryWeight, product of:
                1.6320177 = boost
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.018868752 = queryNorm
              0.3311504 = fieldWeight in 1485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.238725 = idf(docFreq=1675, maxDocs=42740)
                0.078125 = fieldNorm(doc=1485)
          0.08989487 = weight(abstract_txt:common in 1485) [ClassicSimilarity], result of:
            0.08989487 = score(doc=1485,freq=2.0), product of:
              0.1687968 = queryWeight, product of:
                1.8559031 = boost
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.018868752 = queryNorm
              0.5325627 = fieldWeight in 1485, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.820207 = idf(docFreq=936, maxDocs=42740)
                0.078125 = fieldNorm(doc=1485)
          0.29143104 = weight(abstract_txt:relatedness in 1485) [ClassicSimilarity], result of:
            0.29143104 = score(doc=1485,freq=2.0), product of:
              0.32299888 = queryWeight, product of:
                2.0961778 = boost
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.018868752 = queryNorm
              0.9022664 = fieldWeight in 1485, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.166383 = idf(docFreq=32, maxDocs=42740)
                0.078125 = fieldNorm(doc=1485)
          0.06996021 = weight(abstract_txt:method in 1485) [ClassicSimilarity], result of:
            0.06996021 = score(doc=1485,freq=1.0), product of:
              0.19804531 = queryWeight, product of:
                2.3212657 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.018868752 = queryNorm
              0.35325354 = fieldWeight in 1485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.078125 = fieldNorm(doc=1485)
          0.15408075 = weight(abstract_txt:areas in 1485) [ClassicSimilarity], result of:
            0.15408075 = score(doc=1485,freq=1.0), product of:
              0.40399447 = queryWeight, product of:
                4.385805 = boost
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.018868752 = queryNorm
              0.3813932 = fieldWeight in 1485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.881833 = idf(docFreq=880, maxDocs=42740)
                0.078125 = fieldNorm(doc=1485)
        0.2 = coord(5/25)
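
How to read the score breakdowns: each listing above is a ClassicSimilarity (TF-IDF) explanation tree. The figures are consistent with tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication of the summed term scores by coord (matched terms / query terms). The following is a minimal Python sketch that recomputes the score of entry 5 (doc=1485) from the values printed in its tree. The function and variable names are illustrative assumptions, not part of the catalog software, and the code only mirrors the arithmetic shown in the listing.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 42740          # maxDocs
QUERY_NORM = 0.018868752  # queryNorm (identical for every term)
FIELD_NORM_1485 = 0.078125  # fieldNorm(doc=1485)
QUERY_TERMS = 25          # denominator of coord(5/25)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as it appears in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, boost, doc_freq, field_norm, query_norm=QUERY_NORM):
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# term -> (freq in doc 1485, boost, docFreq), all read from entry 5.
TERMS_1485 = {
    "related": (1.0, 1.6320177, 1675),
    "common": (2.0, 1.8559031, 936),
    "relatedness": (2.0, 2.0961778, 32),
    "method": (1.0, 2.3212657, 1262),
    "areas": (1.0, 4.385805, 880),
}


def document_score(terms, field_norm, query_terms=QUERY_TERMS):
    """Sum the per-term scores, then scale by coord = matched / total."""
    raw = sum(term_score(f, b, df, field_norm) for f, b, df in terms.values())
    coord = len(terms) / query_terms
    return raw * coord


score_1485 = document_score(TERMS_1485, FIELD_NORM_1485)
print(f"doc 1485: {score_1485:.6f}")
```

Under these assumptions the result should agree with the 0.12971824 printed for entry 5 up to floating-point rounding (the listing uses single precision). The same functions can be applied to the other entries by substituting their freq, boost, docFreq, fieldNorm, and coord values.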