Document (#36485)

Author
Shibata, N.
Kajikawa, Y.
Sakata, I.
Title
Measuring relatedness between communities in a citation network
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.7, S.1360-1369
Year
2011
Abstract
As academic disciplines are segmented and specialized, it becomes more difficult to capture relevant research areas precisely by common retrieval strategies using either keywords or journal categories. This paper proposes a method of measuring the relatedness among sets of academic papers in order to detect unrelated communities which are not related to target topic. A citation network, extracted by given keywords, is divided into communities based on the density of links. We measured and compared four measures of relatedness between two communities in a citation network for three large-scale citation datasets. We used both link and semantic similarities. The topological distance from the center in a citation network is a more efficient measure for removing the unrelated communities than the other three measures: the ratio of the number of intercluster links over the all links, the ratio of the number of common terms over all terms, cosine similarity of tf-idf vectors.

Similar documents (author)

  1. Shibata, N.; Kajikawa, Y.; Matsushima, K.: Topological analysis of citation networks to discover the future core articles (2007) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:kajikawa in 286) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 286, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=286)
    
  2. Shibata, N.; Kajikawa, Y.; Sakata, I.: Link prediction in citation networks (2012) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:kajikawa in 4964) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 4964, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=4964)
    
  3. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 3.10
    3.0953524 = sum of:
      3.0953524 = weight(author_txt:kajikawa in 2743) [ClassicSimilarity], result of:
        3.0953524 = fieldWeight in 2743, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.3125 = fieldNorm(doc=2743)
    
  4. Tashiro, H.; Lau, A.; Mori, J.; Fujii, N.; Kajikawa, Y.: E-mail networks and leadership performance (2012) 3.10
    3.0953524 = sum of:
      3.0953524 = weight(author_txt:kajikawa in 77) [ClassicSimilarity], result of:
        3.0953524 = fieldWeight in 77, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.3125 = fieldNorm(doc=77)
    

Similar documents (content)

  1. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.20
    0.19919376 = sum of:
      0.19919376 = product of:
        0.99596876 = sum of:
          0.020903893 = weight(abstract_txt:between in 5252) [ClassicSimilarity], result of:
            0.020903893 = score(doc=5252,freq=3.0), product of:
              0.05575526 = queryWeight, product of:
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.016098492 = queryNorm
              0.37492234 = fieldWeight in 5252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.09617988 = weight(abstract_txt:cosine in 5252) [ClassicSimilarity], result of:
            0.09617988 = score(doc=5252,freq=2.0), product of:
              0.14013426 = queryWeight, product of:
                1.1210222 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016098492 = queryNorm
              0.6863409 = fieldWeight in 5252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.12342612 = weight(abstract_txt:measures in 5252) [ClassicSimilarity], result of:
            0.12342612 = score(doc=5252,freq=7.0), product of:
              0.13732414 = queryWeight, product of:
                1.5693886 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.016098492 = queryNorm
              0.89879405 = fieldWeight in 5252, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.083177716 = weight(abstract_txt:measuring in 5252) [ClassicSimilarity], result of:
            0.083177716 = score(doc=5252,freq=1.0), product of:
              0.20192008 = queryWeight, product of:
                1.9030352 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.016098492 = queryNorm
              0.41193387 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
          0.67228115 = weight(abstract_txt:relatedness in 5252) [ClassicSimilarity], result of:
            0.67228115 = score(doc=5252,freq=8.0), product of:
              0.46545303 = queryWeight, product of:
                3.5386746 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016098492 = queryNorm
              1.4443587 = fieldWeight in 5252, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.0625 = fieldNorm(doc=5252)
        0.2 = coord(5/25)
    
  2. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 0.19
    0.19087762 = sum of:
      0.19087762 = product of:
        0.6817058 = sum of:
          0.07357816 = weight(abstract_txt:detect in 2743) [ClassicSimilarity], result of:
            0.07357816 = score(doc=2743,freq=2.0), product of:
              0.11721614 = queryWeight, product of:
                1.0252641 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.016098492 = queryNorm
              0.6277135 = fieldWeight in 2743, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.069066025 = weight(abstract_txt:density in 2743) [ClassicSimilarity], result of:
            0.069066025 = score(doc=2743,freq=1.0), product of:
              0.14158192 = queryWeight, product of:
                1.1267978 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.016098492 = queryNorm
              0.4878167 = fieldWeight in 2743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.078388825 = weight(abstract_txt:topological in 2743) [ClassicSimilarity], result of:
            0.078388825 = score(doc=2743,freq=1.0), product of:
              0.1540521 = queryWeight, product of:
                1.1753734 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.016098492 = queryNorm
              0.5088462 = fieldWeight in 2743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.043337956 = weight(abstract_txt:three in 2743) [ClassicSimilarity], result of:
            0.043337956 = score(doc=2743,freq=3.0), product of:
              0.09065255 = queryWeight, product of:
                1.2751083 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.016098492 = queryNorm
              0.4780666 = fieldWeight in 2743, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.04665069 = weight(abstract_txt:measures in 2743) [ClassicSimilarity], result of:
            0.04665069 = score(doc=2743,freq=1.0), product of:
              0.13732414 = queryWeight, product of:
                1.5693886 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.016098492 = queryNorm
              0.33971223 = fieldWeight in 2743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.11486038 = weight(abstract_txt:network in 2743) [ClassicSimilarity], result of:
            0.11486038 = score(doc=2743,freq=4.0), product of:
              0.19873682 = queryWeight, product of:
                2.6699998 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.016098492 = queryNorm
              0.5779522 = fieldWeight in 2743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
          0.25582373 = weight(abstract_txt:citation in 2743) [ClassicSimilarity], result of:
            0.25582373 = score(doc=2743,freq=9.0), product of:
              0.27863428 = queryWeight, product of:
                3.5346332 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.016098492 = queryNorm
              0.91813445 = fieldWeight in 2743, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=2743)
        0.28 = coord(7/25)
    
  3. Serpa, F.G.; Graves, A.M.; Javier, A.: Statistical common author networks (2013) 0.17
    0.17292298 = sum of:
      0.17292298 = product of:
        0.6175821 = sum of:
          0.012068868 = weight(abstract_txt:between in 1133) [ClassicSimilarity], result of:
            0.012068868 = score(doc=1133,freq=1.0), product of:
              0.05575526 = queryWeight, product of:
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.016098492 = queryNorm
              0.21646151 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.020504544 = weight(abstract_txt:number in 1133) [ClassicSimilarity], result of:
            0.020504544 = score(doc=1133,freq=1.0), product of:
              0.07938557 = queryWeight, product of:
                1.1932402 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.016098492 = queryNorm
              0.25829056 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.045614854 = weight(abstract_txt:common in 1133) [ClassicSimilarity], result of:
            0.045614854 = score(doc=1133,freq=2.0), product of:
              0.1073748 = queryWeight, product of:
                1.3877405 = boost
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.016098492 = queryNorm
              0.424819 = fieldWeight in 1133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.083177716 = weight(abstract_txt:measuring in 1133) [ClassicSimilarity], result of:
            0.083177716 = score(doc=1133,freq=1.0), product of:
              0.20192008 = queryWeight, product of:
                1.9030352 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.016098492 = queryNorm
              0.41193387 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.06264534 = weight(abstract_txt:links in 1133) [ClassicSimilarity], result of:
            0.06264534 = score(doc=1133,freq=1.0), product of:
              0.19133647 = queryWeight, product of:
                2.2688282 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.016098492 = queryNorm
              0.3274093 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.05743019 = weight(abstract_txt:network in 1133) [ClassicSimilarity], result of:
            0.05743019 = score(doc=1133,freq=1.0), product of:
              0.19873682 = queryWeight, product of:
                2.6699998 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.016098492 = queryNorm
              0.2889761 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
          0.33614057 = weight(abstract_txt:relatedness in 1133) [ClassicSimilarity], result of:
            0.33614057 = score(doc=1133,freq=2.0), product of:
              0.46545303 = queryWeight, product of:
                3.5386746 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016098492 = queryNorm
              0.72217935 = fieldWeight in 1133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.0625 = fieldNorm(doc=1133)
        0.28 = coord(7/25)
    
  4. Thelwall, M.: Extracting macroscopic information from Web links (2001) 0.17
    0.16806524 = sum of:
      0.16806524 = product of:
        0.5252039 = sum of:
          0.012068868 = weight(abstract_txt:between in 6851) [ClassicSimilarity], result of:
            0.012068868 = score(doc=6851,freq=1.0), product of:
              0.05575526 = queryWeight, product of:
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.016098492 = queryNorm
              0.21646151 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.022214653 = weight(abstract_txt:over in 6851) [ClassicSimilarity], result of:
            0.022214653 = score(doc=6851,freq=1.0), product of:
              0.0837403 = queryWeight, product of:
                1.2255311 = boost
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.016098492 = queryNorm
              0.2652803 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.05956749 = weight(abstract_txt:academic in 6851) [ClassicSimilarity], result of:
            0.05956749 = score(doc=6851,freq=4.0), product of:
              0.10181873 = queryWeight, product of:
                1.3513596 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.016098492 = queryNorm
              0.58503467 = fieldWeight in 6851, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.04665069 = weight(abstract_txt:measures in 6851) [ClassicSimilarity], result of:
            0.04665069 = score(doc=6851,freq=1.0), product of:
              0.13732414 = queryWeight, product of:
                1.5693886 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.016098492 = queryNorm
              0.33971223 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.083177716 = weight(abstract_txt:measuring in 6851) [ClassicSimilarity], result of:
            0.083177716 = score(doc=6851,freq=1.0), product of:
              0.20192008 = queryWeight, product of:
                1.9030352 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.016098492 = queryNorm
              0.41193387 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.12765598 = weight(abstract_txt:ratio in 6851) [ClassicSimilarity], result of:
            0.12765598 = score(doc=6851,freq=1.0), product of:
              0.26865956 = queryWeight, product of:
                2.1951196 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.016098492 = queryNorm
              0.47515893 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.088593885 = weight(abstract_txt:links in 6851) [ClassicSimilarity], result of:
            0.088593885 = score(doc=6851,freq=2.0), product of:
              0.19133647 = queryWeight, product of:
                2.2688282 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.016098492 = queryNorm
              0.46302667 = fieldWeight in 6851, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
          0.08527458 = weight(abstract_txt:citation in 6851) [ClassicSimilarity], result of:
            0.08527458 = score(doc=6851,freq=1.0), product of:
              0.27863428 = queryWeight, product of:
                3.5346332 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.016098492 = queryNorm
              0.30604482 = fieldWeight in 6851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=6851)
        0.32 = coord(8/25)
    
  5. Macias-Galindo, D.; Cavedon, L.; Thangarajah, J.; Wong, W.: Effects of domain on measures of semantic relatedness (2015) 0.16
    0.15872483 = sum of:
      0.15872483 = product of:
        0.7936241 = sum of:
          0.033274814 = weight(abstract_txt:terms in 2220) [ClassicSimilarity], result of:
            0.033274814 = score(doc=2220,freq=3.0), product of:
              0.0760113 = queryWeight, product of:
                1.1676055 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016098492 = queryNorm
              0.4377614 = fieldWeight in 2220, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2220)
          0.031416263 = weight(abstract_txt:over in 2220) [ClassicSimilarity], result of:
            0.031416263 = score(doc=2220,freq=2.0), product of:
              0.0837403 = queryWeight, product of:
                1.2255311 = boost
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.016098492 = queryNorm
              0.375163 = fieldWeight in 2220, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.244485 = idf(docFreq=1723, maxDocs=44218)
                0.0625 = fieldNorm(doc=2220)
          0.11427039 = weight(abstract_txt:measures in 2220) [ClassicSimilarity], result of:
            0.11427039 = score(doc=2220,freq=6.0), product of:
              0.13732414 = queryWeight, product of:
                1.5693886 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.016098492 = queryNorm
              0.8321217 = fieldWeight in 2220, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=2220)
          0.083177716 = weight(abstract_txt:measuring in 2220) [ClassicSimilarity], result of:
            0.083177716 = score(doc=2220,freq=1.0), product of:
              0.20192008 = queryWeight, product of:
                1.9030352 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.016098492 = queryNorm
              0.41193387 = fieldWeight in 2220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=2220)
          0.5314849 = weight(abstract_txt:relatedness in 2220) [ClassicSimilarity], result of:
            0.5314849 = score(doc=2220,freq=5.0), product of:
              0.46545303 = queryWeight, product of:
                3.5386746 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016098492 = queryNorm
              1.1418658 = fieldWeight in 2220, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.0625 = fieldNorm(doc=2220)
        0.2 = coord(5/25)