Document (#32332)

Author
Liben-Nowell, D.
Kleinberg, J.
Title
The link-prediction problem for social networks
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.7, pp. 1019-1031
Year
2007
Abstract
Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the "proximity" of nodes in a network. Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures.
Theme
Internet
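
To make the abstract's idea of scoring node pairs by "proximity" concrete, here is a minimal sketch. The abstract does not name specific measures, so the shared-neighbor count used here is an assumption chosen for illustration, and the toy edge list and the function name common_neighbor_scores are hypothetical.

```python
from itertools import combinations


def common_neighbor_scores(edges):
    """Score each currently unlinked node pair by its number of shared neighbors."""
    neighbors = {}
    for u, v in edges:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    scores = {}
    for u, v in combinations(sorted(neighbors), 2):
        if v in neighbors[u]:
            continue  # already linked in the snapshot; nothing to predict
        shared = len(neighbors[u] & neighbors[v])
        if shared:
            scores[(u, v)] = shared
    return scores


# Hypothetical toy coauthorship snapshot (not data from the paper).
edges = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("d", "e")]
scores = common_neighbor_scores(edges)

# Rank candidate links: highest score first, ties broken alphabetically.
ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
for pair, score in ranked:
    print(pair, score)
```

Pairs near the top of the ranking are the predicted future links; a real evaluation would compare them against edges that appear in a later snapshot.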

Similar documents (author)

  1. Kleinberg, I.: Making the case for professional indexers : where is the proof? (1993) 6.17
    6.169457 = sum of:
      6.169457 = weight(author_txt:kleinberg in 7766) [ClassicSimilarity], result of:
        6.169457 = fieldWeight in 7766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.625 = fieldNorm(doc=7766)
    
  2. Kleinberg, I.: For want of an alphabetical index : some notes toward a history of the back-of-the-book index in nineteenth century America (1997) 6.17
    6.169457 = sum of:
      6.169457 = weight(author_txt:kleinberg in 4735) [ClassicSimilarity], result of:
        6.169457 = fieldWeight in 4735, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.625 = fieldNorm(doc=4735)
    
  3. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment (1998) 6.17
    6.169457 = sum of:
      6.169457 = weight(author_txt:kleinberg in 2006) [ClassicSimilarity], result of:
        6.169457 = fieldWeight in 2006, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.625 = fieldNorm(doc=2006)
    
  4. Chakrabarti, S.; Dom, B.; Kumar, S.R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.; Kleinberg, J.M.; Gibson, D.: Neue Pfade durch den Internet-Dschungel : Die zweite Generation von Web-Suchmaschinen (1999) 2.47
    2.4677827 = sum of:
      2.4677827 = weight(author_txt:kleinberg in 2004) [ClassicSimilarity], result of:
        2.4677827 = fieldWeight in 2004, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.25 = fieldNorm(doc=2004)
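
The author-match scores above can be reproduced by hand. The sketch below assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), formulas inferred from the numbers in this listing rather than stated in it; the function names are hypothetical, and small differences in the last digits are expected because the listing appears to use single-precision arithmetic.

```python
import math


def classic_tf(freq):
    # Assumed: term-frequency factor is the square root of the raw frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Assumed: idf = 1 + ln(maxDocs / (docFreq + 1)), which matches the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as shown in each explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entries 1-3: single author, fieldNorm = 0.625
single_author = field_weight(freq=1.0, doc_freq=5, max_docs=42740, field_norm=0.625)

# Entry 4: eight authors, fieldNorm = 0.25
many_authors = field_weight(freq=1.0, doc_freq=5, max_docs=42740, field_norm=0.25)

print(round(single_author, 4), round(many_authors, 4))
```

The only factor that differs between entries 1 to 3 and entry 4 is fieldNorm, which is smaller for the record with the longer author field and so lowers its score.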
    

Similar documents (content)

  1. Hu, D.; Kaza, S.; Chen, H.: Identifying significant facilitators of dark network evolution (2009) 0.21
    0.214417 = sum of:
      0.214417 = product of:
        0.76577497 = sum of:
          0.061473366 = weight(abstract_txt:nodes in 4754) [ClassicSimilarity], result of:
            0.061473366 = score(doc=4754,freq=1.0), product of:
              0.13994753 = queryWeight, product of:
                1.1329705 = boost
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.017575389 = queryNorm
              0.4392601 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.047641408 = weight(abstract_txt:future in 4754) [ClassicSimilarity], result of:
            0.047641408 = score(doc=4754,freq=3.0), product of:
              0.10314936 = queryWeight, product of:
                1.3755748 = boost
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.017575389 = queryNorm
              0.46186817 = fieldWeight in 4754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.062393565 = weight(abstract_txt:social in 4754) [ClassicSimilarity], result of:
            0.062393565 = score(doc=4754,freq=5.0), product of:
              0.104140684 = queryWeight, product of:
                1.382169 = boost
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.017575389 = queryNorm
              0.59912765 = fieldWeight in 4754, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.10833386 = weight(abstract_txt:networks in 4754) [ClassicSimilarity], result of:
            0.10833386 = score(doc=4754,freq=5.0), product of:
              0.15044233 = queryWeight, product of:
                1.6612538 = boost
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.017575389 = queryNorm
              0.72010225 = fieldWeight in 4754, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.106383845 = weight(abstract_txt:network in 4754) [ClassicSimilarity], result of:
            0.106383845 = score(doc=4754,freq=4.0), product of:
              0.18327847 = queryWeight, product of:
                2.245703 = boost
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.017575389 = queryNorm
              0.5804492 = fieldWeight in 4754, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.17126611 = weight(abstract_txt:link in 4754) [ClassicSimilarity], result of:
            0.17126611 = score(doc=4754,freq=3.0), product of:
              0.27709043 = queryWeight, product of:
                2.7612603 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.017575389 = queryNorm
              0.6180874 = fieldWeight in 4754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
          0.20828286 = weight(abstract_txt:prediction in 4754) [ClassicSimilarity], result of:
            0.20828286 = score(doc=4754,freq=1.0), product of:
              0.45531997 = queryWeight, product of:
                3.539606 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.017575389 = queryNorm
              0.45744282 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.0625 = fieldNorm(doc=4754)
        0.28 = coord(7/25)
    
  2. Yan, E.; Ding, Y.: Applying centrality measures to impact analysis : a coauthorship network analysis (2009) 0.15
    0.15385221 = sum of:
      0.15385221 = product of:
        0.769261 = sum of:
          0.19258484 = weight(abstract_txt:coauthorship in 84) [ClassicSimilarity], result of:
            0.19258484 = score(doc=84,freq=3.0), product of:
              0.17903648 = queryWeight, product of:
                1.2814649 = boost
                7.9493184 = idf(docFreq=40, maxDocs=42740)
                0.017575389 = queryNorm
              1.0756737 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.9493184 = idf(docFreq=40, maxDocs=42740)
                0.078125 = fieldNorm(doc=84)
          0.12965508 = weight(abstract_txt:topology in 84) [ClassicSimilarity], result of:
            0.12965508 = score(doc=84,freq=1.0), product of:
              0.1983476 = queryWeight, product of:
                1.3488058 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.017575389 = queryNorm
              0.6536761 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.078125 = fieldNorm(doc=84)
          0.06056047 = weight(abstract_txt:networks in 84) [ClassicSimilarity], result of:
            0.06056047 = score(doc=84,freq=1.0), product of:
              0.15044233 = queryWeight, product of:
                1.6612538 = boost
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.017575389 = queryNorm
              0.4025494 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.078125 = fieldNorm(doc=84)
          0.14867595 = weight(abstract_txt:network in 84) [ClassicSimilarity], result of:
            0.14867595 = score(doc=84,freq=5.0), product of:
              0.18327847 = queryWeight, product of:
                2.245703 = boost
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.017575389 = queryNorm
              0.81120247 = fieldWeight in 84, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.078125 = fieldNorm(doc=84)
          0.2377846 = weight(abstract_txt:measures in 84) [ClassicSimilarity], result of:
            0.2377846 = score(doc=84,freq=5.0), product of:
              0.25065333 = queryWeight, product of:
                2.6262333 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.017575389 = queryNorm
              0.94865924 = fieldWeight in 84, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.078125 = fieldNorm(doc=84)
        0.2 = coord(5/25)
    
  3. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.13
    0.13423215 = sum of:
      0.13423215 = product of:
        0.47940052 = sum of:
          0.042269904 = weight(abstract_txt:extracted in 2968) [ClassicSimilarity], result of:
            0.042269904 = score(doc=2968,freq=1.0), product of:
              0.1090255 = queryWeight, product of:
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.017575389 = queryNorm
              0.38770658 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2033052 = idf(docFreq=234, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.060237892 = weight(abstract_txt:near in 2968) [ClassicSimilarity], result of:
            0.060237892 = score(doc=2968,freq=1.0), product of:
              0.13806611 = queryWeight, product of:
                1.125329 = boost
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.017575389 = queryNorm
              0.43629745 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.08182382 = weight(abstract_txt:outperform in 2968) [ClassicSimilarity], result of:
            0.08182382 = score(doc=2968,freq=1.0), product of:
              0.16934033 = queryWeight, product of:
                1.2462815 = boost
                7.731065 = idf(docFreq=50, maxDocs=42740)
                0.017575389 = queryNorm
              0.48319155 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.731065 = idf(docFreq=50, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.027505778 = weight(abstract_txt:future in 2968) [ClassicSimilarity], result of:
            0.027505778 = score(doc=2968,freq=1.0), product of:
              0.10314936 = queryWeight, product of:
                1.3755748 = boost
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.017575389 = queryNorm
              0.2666597 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.027903248 = weight(abstract_txt:social in 2968) [ClassicSimilarity], result of:
            0.027903248 = score(doc=2968,freq=1.0), product of:
              0.104140684 = queryWeight, product of:
                1.382169 = boost
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.017575389 = queryNorm
              0.26793802 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.031376995 = weight(abstract_txt:problem in 2968) [ClassicSimilarity], result of:
            0.031376995 = score(doc=2968,freq=1.0), product of:
              0.11261376 = queryWeight, product of:
                1.4372976 = boost
                4.457998 = idf(docFreq=1345, maxDocs=42740)
                0.017575389 = queryNorm
              0.27862486 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.457998 = idf(docFreq=1345, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
          0.20828286 = weight(abstract_txt:prediction in 2968) [ClassicSimilarity], result of:
            0.20828286 = score(doc=2968,freq=1.0), product of:
              0.45531997 = queryWeight, product of:
                3.539606 = boost
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.017575389 = queryNorm
              0.45744282 = fieldWeight in 2968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319085 = idf(docFreq=76, maxDocs=42740)
                0.0625 = fieldNorm(doc=2968)
        0.28 = coord(7/25)
    
  4. Zhao, S.X.; Ye, F.Y.: Power-law link strength distribution in paper cocitation networks (2013) 0.13
    0.13348754 = sum of:
      0.13348754 = product of:
        0.6674377 = sum of:
          0.10867059 = weight(abstract_txt:nodes in 2974) [ClassicSimilarity], result of:
            0.10867059 = score(doc=2974,freq=2.0), product of:
              0.13994753 = queryWeight, product of:
                1.1329705 = boost
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.017575389 = queryNorm
              0.7765095 = fieldWeight in 2974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0281615 = idf(docFreq=102, maxDocs=42740)
                0.078125 = fieldNorm(doc=2974)
          0.14575978 = weight(abstract_txt:node in 2974) [ClassicSimilarity], result of:
            0.14575978 = score(doc=2974,freq=2.0), product of:
              0.17020895 = queryWeight, product of:
                1.2494738 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.017575389 = queryNorm
              0.85635793 = fieldWeight in 2974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.078125 = fieldNorm(doc=2974)
          0.10489381 = weight(abstract_txt:networks in 2974) [ClassicSimilarity], result of:
            0.10489381 = score(doc=2974,freq=3.0), product of:
              0.15044233 = queryWeight, product of:
                1.6612538 = boost
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.017575389 = queryNorm
              0.697236 = fieldWeight in 2974, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.078125 = fieldNorm(doc=2974)
          0.094030924 = weight(abstract_txt:network in 2974) [ClassicSimilarity], result of:
            0.094030924 = score(doc=2974,freq=2.0), product of:
              0.18327847 = queryWeight, product of:
                2.245703 = boost
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.017575389 = queryNorm
              0.5130495 = fieldWeight in 2974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.078125 = fieldNorm(doc=2974)
          0.21408263 = weight(abstract_txt:link in 2974) [ClassicSimilarity], result of:
            0.21408263 = score(doc=2974,freq=3.0), product of:
              0.27709043 = queryWeight, product of:
                2.7612603 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.017575389 = queryNorm
              0.77260923 = fieldWeight in 2974, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.078125 = fieldNorm(doc=2974)
        0.2 = coord(5/25)
    
  5. Zhang, J.; Zhai, S.; Liu, H.; Stevenson, J.A.: Social network analysis on a topic-based navigation guidance system in a public health portal (2016) 0.13
    0.12779133 = sum of:
      0.12779133 = product of:
        0.5324639 = sum of:
          0.11660783 = weight(abstract_txt:node in 4888) [ClassicSimilarity], result of:
            0.11660783 = score(doc=4888,freq=2.0), product of:
              0.17020895 = queryWeight, product of:
                1.2494738 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.017575389 = queryNorm
              0.68508637 = fieldWeight in 4888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
          0.027505778 = weight(abstract_txt:future in 4888) [ClassicSimilarity], result of:
            0.027505778 = score(doc=4888,freq=1.0), product of:
              0.10314936 = queryWeight, product of:
                1.3755748 = boost
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.017575389 = queryNorm
              0.2666597 = fieldWeight in 4888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2665553 = idf(docFreq=1629, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
          0.027903248 = weight(abstract_txt:social in 4888) [ClassicSimilarity], result of:
            0.027903248 = score(doc=4888,freq=1.0), product of:
              0.104140684 = queryWeight, product of:
                1.382169 = boost
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.017575389 = queryNorm
              0.26793802 = fieldWeight in 4888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2870083 = idf(docFreq=1596, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
          0.048448376 = weight(abstract_txt:networks in 4888) [ClassicSimilarity], result of:
            0.048448376 = score(doc=4888,freq=1.0), product of:
              0.15044233 = queryWeight, product of:
                1.6612538 = boost
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.017575389 = queryNorm
              0.3220395 = fieldWeight in 4888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.152632 = idf(docFreq=671, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
          0.1407326 = weight(abstract_txt:network in 4888) [ClassicSimilarity], result of:
            0.1407326 = score(doc=4888,freq=7.0), product of:
              0.18327847 = queryWeight, product of:
                2.245703 = boost
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.017575389 = queryNorm
              0.76786214 = fieldWeight in 4888, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.643594 = idf(docFreq=1117, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
          0.17126611 = weight(abstract_txt:link in 4888) [ClassicSimilarity], result of:
            0.17126611 = score(doc=4888,freq=3.0), product of:
              0.27709043 = queryWeight, product of:
                2.7612603 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.017575389 = queryNorm
              0.6180874 = fieldWeight in 4888, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.0625 = fieldNorm(doc=4888)
        0.24 = coord(6/25)
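
The content-similarity scores above combine several matching terms. As a rough check, the sketch below recomputes the score of entry 2 (doc=84) from the boost, docFreq, and freq values listed for it. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are inferred from the listed numbers; the function and variable names are hypothetical.

```python
import math

MAX_DOCS = 42740
QUERY_NORM = 0.017575389  # as listed in every explain tree above


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula, consistent with the idf values in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    # score = queryWeight * fieldWeight
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, query_terms):
    # Final score = coord(matched / query_terms) * sum of per-term scores.
    total = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    return total * (matched / query_terms)


# (boost, docFreq, freq) copied from the explain tree for doc=84.
doc_84_terms = [
    (1.2814649, 40, 3.0),    # coauthorship
    (1.3488058, 26, 1.0),    # topology
    (1.6612538, 671, 1.0),   # networks
    (2.245703, 1117, 5.0),   # network
    (2.6262333, 508, 5.0),   # measures
]

score_84 = document_score(doc_84_terms, field_norm=0.078125, matched=5, query_terms=25)
print(round(score_84, 4))
```

The coord factor scales the sum by the fraction of query terms that matched, which is why entries matching 7 of 25 terms are multiplied by 0.28 and those matching 5 of 25 by 0.2.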