Document (#36486)

Author
Shibata, N.
Kajikawa, Y.
Sakata, I.
Title
Measuring relatedness between communities in a citation network
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.7, p.1360-1369
Year
2011
Abstract
As academic disciplines become segmented and specialized, it becomes more difficult to capture relevant research areas precisely with common retrieval strategies that use either keywords or journal categories. This paper proposes a method of measuring the relatedness among sets of academic papers in order to detect communities that are unrelated to the target topic. A citation network, extracted by given keywords, is divided into communities based on the density of links. We measured and compared four measures of relatedness between two communities in a citation network for three large-scale citation datasets. We used both link and semantic similarities. The topological distance from the center in a citation network is a more efficient measure for removing the unrelated communities than the other three measures: the ratio of the number of intercluster links to all links, the ratio of the number of common terms to all terms, and the cosine similarity of tf-idf vectors.
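
As a rough illustration of three of the measures named in the abstract, the following Python sketch computes an intercluster link ratio, a common-term ratio, and a cosine similarity. The record does not give the paper's exact definitions, so the denominators chosen here (all links touching either community, and the union of both term sets) are assumptions, as are all function names. The topological-distance measure is left out because the abstract does not say how the center of the network is defined.

```python
import math


def intercluster_link_ratio(edges, comm_a, comm_b):
    """Links joining A and B, divided by all links touching A or B.

    The denominator is an assumption; the abstract only says "all links".
    """
    a, b = set(comm_a), set(comm_b)
    both = a | b
    between = 0
    touching = 0
    for u, v in edges:
        if u in both or v in both:
            touching += 1
        if (u in a and v in b) or (u in b and v in a):
            between += 1
    return between / touching if touching else 0.0


def common_term_ratio(terms_a, terms_b):
    """Shared terms divided by the union of both term sets (assumed)."""
    a, b = set(terms_a), set(terms_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine_similarity(vec_a, vec_b):
    """Cosine of two sparse weight vectors given as {term: weight} dicts."""
    dot = sum(w * vec_b.get(t, 0.0) for t, w in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

The cosine function expects tf-idf weights that have already been computed for each community; how the paper aggregates terms per community is not stated in this record.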

Similar documents (author)

  1. Shibata, N.; Kajikawa, Y.; Matsushima, K.: Topological analysis of citation networks to discover the future core articles (2007) 3.70
    3.697847 = sum of:
      3.697847 = weight(author_txt:kajikawa in 2287) [ClassicSimilarity], result of:
        3.697847 = fieldWeight in 2287, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.375 = fieldNorm(doc=2287)
    
  2. Shibata, N.; Kajikawa, Y.; Sakata, I.: Link prediction in citation networks (2012) 3.70
    3.697847 = sum of:
      3.697847 = weight(author_txt:kajikawa in 1965) [ClassicSimilarity], result of:
        3.697847 = fieldWeight in 1965, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.375 = fieldNorm(doc=1965)
    
  3. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 3.08
    3.081539 = sum of:
      3.081539 = weight(author_txt:kajikawa in 563) [ClassicSimilarity], result of:
        3.081539 = fieldWeight in 563, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.3125 = fieldNorm(doc=563)
    
  4. Tashiro, H.; Lau, A.; Mori, J.; Fujii, N.; Kajikawa, Y.: E-mail networks and leadership performance (2012) 3.08
    3.081539 = sum of:
      3.081539 = weight(author_txt:kajikawa in 2078) [ClassicSimilarity], result of:
        3.081539 = fieldWeight in 2078, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.860925 = idf(docFreq=5, maxDocs=42306)
          0.3125 = fieldNorm(doc=2078)
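
The author-similarity scores above can be reproduced by hand. The sketch below assumes the ClassicSimilarity variant that matches the numbers in this record: tf is the square root of the term frequency, and idf is 1 + ln(maxDocs / (docFreq + 1)). The fieldNorm value is taken as given from the explain output, and the function names are hypothetical.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency, in the form that matches this record."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values read from the first author match (doc 2287): one occurrence of
# "kajikawa", docFreq=5, maxDocs=42306, fieldNorm=0.375.
score = field_weight(1.0, 5, 42306, 0.375)
print(round(score, 4))
```

Under these assumptions the result should agree with the listed 3.697847 to within floating-point rounding; with fieldNorm=0.3125 the same call should give roughly the 3.081539 shown for results 3 and 4.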
    

Similar documents (content)

  1. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.20
    0.19782655 = sum of:
      0.19782655 = product of:
        0.9891327 = sum of:
          0.021366745 = weight(abstract_txt:between in 253) [ClassicSimilarity], result of:
            0.021366745 = score(doc=253,freq=3.0), product of:
              0.056422856 = queryWeight, product of:
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.016129185 = queryNorm
              0.37868953 = fieldWeight in 253, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=253)
          0.09524999 = weight(abstract_txt:cosine in 253) [ClassicSimilarity], result of:
            0.09524999 = score(doc=253,freq=2.0), product of:
              0.13885447 = queryWeight, product of:
                1.1092703 = boost
                7.760864 = idf(docFreq=48, maxDocs=42306)
                0.016129185 = queryNorm
              0.6859699 = fieldWeight in 253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.760864 = idf(docFreq=48, maxDocs=42306)
                0.0625 = fieldNorm(doc=253)
          0.12288379 = weight(abstract_txt:measures in 253) [ClassicSimilarity], result of:
            0.12288379 = score(doc=253,freq=7.0), product of:
              0.1365527 = queryWeight, product of:
                1.5556884 = boost
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.016129185 = queryNorm
              0.8999001 = fieldWeight in 253, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.0625 = fieldNorm(doc=253)
          0.086277366 = weight(abstract_txt:measuring in 253) [ClassicSimilarity], result of:
            0.086277366 = score(doc=253,freq=1.0), product of:
              0.20634843 = queryWeight, product of:
                1.912375 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.016129185 = queryNorm
              0.41811496 = fieldWeight in 253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=253)
          0.6633548 = weight(abstract_txt:relatedness in 253) [ClassicSimilarity], result of:
            0.6633548 = score(doc=253,freq=8.0), product of:
              0.46008095 = queryWeight, product of:
                3.4973187 = boost
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.016129185 = queryNorm
              1.4418219 = fieldWeight in 253, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.0625 = fieldNorm(doc=253)
        0.2 = coord(5/25)
    
  2. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 0.19
    0.19165546 = sum of:
      0.19165546 = product of:
        0.68448377 = sum of:
          0.074922144 = weight(abstract_txt:detect in 563) [ClassicSimilarity], result of:
            0.074922144 = score(doc=563,freq=2.0), product of:
              0.11831961 = queryWeight, product of:
                1.0239667 = boost
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.016129185 = queryNorm
              0.63321835 = fieldWeight in 563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1640477 = idf(docFreq=88, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.06844272 = weight(abstract_txt:density in 563) [ClassicSimilarity], result of:
            0.06844272 = score(doc=563,freq=1.0), product of:
              0.14034967 = queryWeight, product of:
                1.1152267 = boost
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.016129185 = queryNorm
              0.48765853 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8025365 = idf(docFreq=46, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.076497324 = weight(abstract_txt:topological in 563) [ClassicSimilarity], result of:
            0.076497324 = score(doc=563,freq=1.0), product of:
              0.15115555 = queryWeight, product of:
                1.1573628 = boost
                8.097336 = idf(docFreq=34, maxDocs=42306)
                0.016129185 = queryNorm
              0.5060835 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.097336 = idf(docFreq=34, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.044543784 = weight(abstract_txt:three in 563) [ClassicSimilarity], result of:
            0.044543784 = score(doc=563,freq=3.0), product of:
              0.09207765 = queryWeight, product of:
                1.2774667 = boost
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.016129185 = queryNorm
              0.48376325 = fieldWeight in 563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4688134 = idf(docFreq=1317, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.04644571 = weight(abstract_txt:measures in 563) [ClassicSimilarity], result of:
            0.04644571 = score(doc=563,freq=1.0), product of:
              0.1365527 = queryWeight, product of:
                1.5556884 = boost
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.016129185 = queryNorm
              0.34013027 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.11559639 = weight(abstract_txt:network in 563) [ClassicSimilarity], result of:
            0.11559639 = score(doc=563,freq=4.0), product of:
              0.19904721 = queryWeight, product of:
                2.656229 = boost
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.016129185 = queryNorm
              0.5807486 = fieldWeight in 563, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
          0.25803575 = weight(abstract_txt:citation in 563) [ClassicSimilarity], result of:
            0.25803575 = score(doc=563,freq=9.0), product of:
              0.27948317 = queryWeight, product of:
                3.5190084 = boost
                4.9240556 = idf(docFreq=835, maxDocs=42306)
                0.016129185 = queryNorm
              0.92326045 = fieldWeight in 563, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.9240556 = idf(docFreq=835, maxDocs=42306)
                0.0625 = fieldNorm(doc=563)
        0.28 = coord(7/25)
    
  3. Serpa, F.G.; Graves, A.M.; Javier, A.: Statistical common author networks (2013) 0.17
    0.17250045 = sum of:
      0.17250045 = product of:
        0.616073 = sum of:
          0.012336096 = weight(abstract_txt:between in 3134) [ClassicSimilarity], result of:
            0.012336096 = score(doc=3134,freq=1.0), product of:
              0.056422856 = queryWeight, product of:
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.016129185 = queryNorm
              0.2186365 = fieldWeight in 3134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.020232843 = weight(abstract_txt:number in 3134) [ClassicSimilarity], result of:
            0.020232843 = score(doc=3134,freq=1.0), product of:
              0.07847076 = queryWeight, product of:
                1.1793057 = boost
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.016129185 = queryNorm
              0.25783926 = fieldWeight in 3134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.125428 = idf(docFreq=1857, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.045934338 = weight(abstract_txt:common in 3134) [ClassicSimilarity], result of:
            0.045934338 = score(doc=3134,freq=2.0), product of:
              0.10758496 = queryWeight, product of:
                1.3808556 = boost
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.016129185 = queryNorm
              0.42695874 = fieldWeight in 3134, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.830487 = idf(docFreq=917, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.086277366 = weight(abstract_txt:measuring in 3134) [ClassicSimilarity], result of:
            0.086277366 = score(doc=3134,freq=1.0), product of:
              0.20634843 = queryWeight, product of:
                1.912375 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.016129185 = queryNorm
              0.41811496 = fieldWeight in 3134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.061816722 = weight(abstract_txt:links in 3134) [ClassicSimilarity], result of:
            0.061816722 = score(doc=3134,freq=1.0), product of:
              0.1891346 = queryWeight, product of:
                2.242351 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.016129185 = queryNorm
              0.32683983 = fieldWeight in 3134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.057798196 = weight(abstract_txt:network in 3134) [ClassicSimilarity], result of:
            0.057798196 = score(doc=3134,freq=1.0), product of:
              0.19904721 = queryWeight, product of:
                2.656229 = boost
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.016129185 = queryNorm
              0.2903743 = fieldWeight in 3134, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.645989 = idf(docFreq=1103, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
          0.3316774 = weight(abstract_txt:relatedness in 3134) [ClassicSimilarity], result of:
            0.3316774 = score(doc=3134,freq=2.0), product of:
              0.46008095 = queryWeight, product of:
                3.4973187 = boost
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.016129185 = queryNorm
              0.72091097 = fieldWeight in 3134, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.0625 = fieldNorm(doc=3134)
        0.28 = coord(7/25)
    
  4. Thelwall, M.: Extracting macroscopic information from Web links (2001) 0.17
    0.1689764 = sum of:
      0.1689764 = product of:
        0.52805126 = sum of:
          0.012336096 = weight(abstract_txt:between in 852) [ClassicSimilarity], result of:
            0.012336096 = score(doc=852,freq=1.0), product of:
              0.056422856 = queryWeight, product of:
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.016129185 = queryNorm
              0.2186365 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.022414813 = weight(abstract_txt:over in 852) [ClassicSimilarity], result of:
            0.022414813 = score(doc=852,freq=1.0), product of:
              0.08401561 = queryWeight, product of:
                1.2202603 = boost
                4.268695 = idf(docFreq=1609, maxDocs=42306)
                0.016129185 = queryNorm
              0.26679343 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.268695 = idf(docFreq=1609, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.06103035 = weight(abstract_txt:academic in 852) [ClassicSimilarity], result of:
            0.06103035 = score(doc=852,freq=4.0), product of:
              0.10320019 = queryWeight, product of:
                1.3524235 = boost
                4.731026 = idf(docFreq=1013, maxDocs=42306)
                0.016129185 = queryNorm
              0.5913783 = fieldWeight in 852, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.731026 = idf(docFreq=1013, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.04644571 = weight(abstract_txt:measures in 852) [ClassicSimilarity], result of:
            0.04644571 = score(doc=852,freq=1.0), product of:
              0.1365527 = queryWeight, product of:
                1.5556884 = boost
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.016129185 = queryNorm
              0.34013027 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.086277366 = weight(abstract_txt:measuring in 852) [ClassicSimilarity], result of:
            0.086277366 = score(doc=852,freq=1.0), product of:
              0.20634843 = queryWeight, product of:
                1.912375 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.016129185 = queryNorm
              0.41811496 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.12611298 = weight(abstract_txt:ratio in 852) [ClassicSimilarity], result of:
            0.12611298 = score(doc=852,freq=1.0), product of:
              0.26577234 = queryWeight, product of:
                2.170338 = boost
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.016129185 = queryNorm
              0.47451508 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5922413 = idf(docFreq=57, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.08742204 = weight(abstract_txt:links in 852) [ClassicSimilarity], result of:
            0.08742204 = score(doc=852,freq=2.0), product of:
              0.1891346 = queryWeight, product of:
                2.242351 = boost
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.016129185 = queryNorm
              0.46222132 = fieldWeight in 852, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2294374 = idf(docFreq=615, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
          0.08601192 = weight(abstract_txt:citation in 852) [ClassicSimilarity], result of:
            0.08601192 = score(doc=852,freq=1.0), product of:
              0.27948317 = queryWeight, product of:
                3.5190084 = boost
                4.9240556 = idf(docFreq=835, maxDocs=42306)
                0.016129185 = queryNorm
              0.30775347 = fieldWeight in 852, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9240556 = idf(docFreq=835, maxDocs=42306)
                0.0625 = fieldNorm(doc=852)
        0.32 = coord(8/25)
    
  5. Macias-Galindo, D.; Cavedon, L.; Thangarajah, J.; Wong, W.: Effects of domain on measures of semantic relatedness (2015) 0.16
    0.15791434 = sum of:
      0.15791434 = product of:
        0.7895717 = sum of:
          0.033398643 = weight(abstract_txt:terms in 4221) [ClassicSimilarity], result of:
            0.033398643 = score(doc=4221,freq=3.0), product of:
              0.07599448 = queryWeight, product of:
                1.160549 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.016129185 = queryNorm
              0.43948776 = fieldWeight in 4221, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.0625 = fieldNorm(doc=4221)
          0.03169933 = weight(abstract_txt:over in 4221) [ClassicSimilarity], result of:
            0.03169933 = score(doc=4221,freq=2.0), product of:
              0.08401561 = queryWeight, product of:
                1.2202603 = boost
                4.268695 = idf(docFreq=1609, maxDocs=42306)
                0.016129185 = queryNorm
              0.37730289 = fieldWeight in 4221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.268695 = idf(docFreq=1609, maxDocs=42306)
                0.0625 = fieldNorm(doc=4221)
          0.11376829 = weight(abstract_txt:measures in 4221) [ClassicSimilarity], result of:
            0.11376829 = score(doc=4221,freq=6.0), product of:
              0.1365527 = queryWeight, product of:
                1.5556884 = boost
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.016129185 = queryNorm
              0.8331456 = fieldWeight in 4221, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.4420843 = idf(docFreq=497, maxDocs=42306)
                0.0625 = fieldNorm(doc=4221)
          0.086277366 = weight(abstract_txt:measuring in 4221) [ClassicSimilarity], result of:
            0.086277366 = score(doc=4221,freq=1.0), product of:
              0.20634843 = queryWeight, product of:
                1.912375 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.016129185 = queryNorm
              0.41811496 = fieldWeight in 4221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=4221)
          0.52442807 = weight(abstract_txt:relatedness in 4221) [ClassicSimilarity], result of:
            0.52442807 = score(doc=4221,freq=5.0), product of:
              0.46008095 = queryWeight, product of:
                3.4973187 = boost
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.016129185 = queryNorm
              1.1398604 = fieldWeight in 4221, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.156177 = idf(docFreq=32, maxDocs=42306)
                0.0625 = fieldNorm(doc=4221)
        0.2 = coord(5/25)
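
Each content-similarity score above follows the same pattern: for every matching term, queryWeight (boost * idf * queryNorm) is multiplied by fieldWeight (tf * idf * fieldNorm), the per-term scores are summed, and the sum is scaled by the coord factor (matched terms / query terms). The Python sketch below recomputes the first result under that reading. The tf and idf formulas are the ones that reproduce the numbers in this record, and the function names and tuple layout are hypothetical.

```python
import math


def classic_idf(doc_freq, max_docs):
    """idf in the form that matches the explain output of this record."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, max_docs, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, total_query_terms, max_docs, query_norm, field_norm):
    """Sum of term scores scaled by coord = matched / total query terms."""
    total = sum(
        term_score(freq, doc_freq, boost, max_docs, query_norm, field_norm)
        for freq, doc_freq, boost in terms
    )
    coord = len(terms) / total_query_terms
    return total * coord


# (freq, docFreq, boost) read from result 1 (doc 253). "between" shows no
# boost line in the explain tree, so a boost of 1.0 is assumed for it.
klavans_terms = [
    (3.0, 3478, 1.0),        # between
    (2.0, 48, 1.1092703),    # cosine
    (7.0, 497, 1.5556884),   # measures
    (1.0, 142, 1.912375),    # measuring
    (8.0, 32, 3.4973187),    # relatedness
]
score = document_score(klavans_terms, 25, 42306, 0.016129185, 0.0625)
print(round(score, 4))
```

The printed value should be close to the 0.19782655 reported for that result; small differences are expected because Lucene computes in single precision.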