Document (#39566)

Author
Boldi, P.
Santini, M.
Vigna, S.
Title
PageRank as a function of the damping factor
Source
http://vigna.di.unimi.it/ftp/papers/PageRankAsFunction.pdf [Proceedings of the ACM World Wide Web Conference (WWW), 2005]
Year
2005
Abstract
PageRank is defined as the stationary state of a Markov chain. The chain is obtained by perturbing the transition matrix induced by a web graph with a damping factor alpha that spreads uniformly part of the rank. The choice of alpha is eminently empirical, and in most cases the original suggestion alpha=0.85 by Brin and Page is still used. Recently, however, the behaviour of PageRank with respect to changes in alpha was discovered to be useful in link-spam detection. Moreover, an analytical justification of the value chosen for alpha is still missing. In this paper, we give the first mathematical analysis of PageRank when alpha changes. In particular, we show that, contrarily to popular belief, for real-world graphs values of alpha close to 1 do not give a more meaningful ranking. Then, we give closed-form formulae for PageRank derivatives of any order, and an extension of the Power Method that approximates them with convergence O(t**k*alpha**t) for the k-th derivative. Finally, we show a tight connection between iterated computation and analytical behaviour by proving that the k-th iteration of the Power Method gives exactly the PageRank value obtained using a Maclaurin polynomial of degree k. The latter result paves the way towards the application of analytical methods to the study of PageRank.
Theme
Suchmaschinen
Object
PageRank

Similar documents (content)

  1. Bressan, M.; Peserico, E.: Choose the damping, choose the ranking? (2010) 0.27
    0.2744388 = sum of:
      0.2744388 = product of:
        0.85762125 = sum of:
          0.011870759 = weight(abstract_txt:show in 4564) [ClassicSimilarity], result of:
            0.011870759 = score(doc=4564,freq=1.0), product of:
              0.048731737 = queryWeight, product of:
                1.0020561 = boost
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.010917955 = queryNorm
              0.243594 = fieldWeight in 4564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.010433948 = weight(abstract_txt:that in 4564) [ClassicSimilarity], result of:
            0.010433948 = score(doc=4564,freq=8.0), product of:
              0.028169036 = queryWeight, product of:
                1.0774251 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010917955 = queryNorm
              0.37040484 = fieldWeight in 4564, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.06459719 = weight(abstract_txt:0.85 in 4564) [ClassicSimilarity], result of:
            0.06459719 = score(doc=4564,freq=1.0), product of:
              0.119662665 = queryWeight, product of:
                1.1103258 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.010917955 = queryNorm
              0.53982747 = fieldWeight in 4564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.017103184 = weight(abstract_txt:changes in 4564) [ClassicSimilarity], result of:
            0.017103184 = score(doc=4564,freq=1.0), product of:
              0.06216475 = queryWeight, product of:
                1.13177 = boost
                5.0308886 = idf(docFreq=758, maxDocs=42740)
                0.010917955 = queryNorm
              0.27512673 = fieldWeight in 4564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0308886 = idf(docFreq=758, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.05936787 = weight(abstract_txt:factor in 4564) [ClassicSimilarity], result of:
            0.05936787 = score(doc=4564,freq=5.0), product of:
              0.08334327 = queryWeight, product of:
                1.3104527 = boost
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.010917955 = queryNorm
              0.7123295 = fieldWeight in 4564, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.34856343 = weight(abstract_txt:damping in 4564) [ClassicSimilarity], result of:
            0.34856343 = score(doc=4564,freq=8.0), product of:
              0.23190892 = queryWeight, product of:
                2.1859732 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.010917955 = queryNorm
              1.5030186 = fieldWeight in 4564, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.057438895 = weight(abstract_txt:analytical in 4564) [ClassicSimilarity], result of:
            0.057438895 = score(doc=4564,freq=1.0), product of:
              0.15958571 = queryWeight, product of:
                2.2208986 = boost
                6.581486 = idf(docFreq=160, maxDocs=42740)
                0.010917955 = queryNorm
              0.35992503 = fieldWeight in 4564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.581486 = idf(docFreq=160, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
          0.28824598 = weight(abstract_txt:pagerank in 4564) [ClassicSimilarity], result of:
            0.28824598 = score(doc=4564,freq=2.0), product of:
              0.49243367 = queryWeight, product of:
                5.9592824 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.010917955 = queryNorm
              0.58534986 = fieldWeight in 4564, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4564)
        0.32 = coord(8/25)
    
  2. Ding, Y.; Yan, E.; Frazho, A.; Caverlee, J.: PageRank for ranking authors in co-citation networks (2009) 0.16
    0.15665 = sum of:
      0.15665 = product of:
        0.97906256 = sum of:
          0.0059622554 = weight(abstract_txt:that in 162) [ClassicSimilarity], result of:
            0.0059622554 = score(doc=162,freq=2.0), product of:
              0.028169036 = queryWeight, product of:
                1.0774251 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010917955 = queryNorm
              0.21165991 = fieldWeight in 162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.030342992 = weight(abstract_txt:factor in 162) [ClassicSimilarity], result of:
            0.030342992 = score(doc=162,freq=1.0), product of:
              0.08334327 = queryWeight, product of:
                1.3104527 = boost
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.010917955 = queryNorm
              0.3640725 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.24394359 = weight(abstract_txt:damping in 162) [ClassicSimilarity], result of:
            0.24394359 = score(doc=162,freq=3.0), product of:
              0.23190892 = queryWeight, product of:
                2.1859732 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.010917955 = queryNorm
              1.051894 = fieldWeight in 162, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.69881374 = weight(abstract_txt:pagerank in 162) [ClassicSimilarity], result of:
            0.69881374 = score(doc=162,freq=9.0), product of:
              0.49243367 = queryWeight, product of:
                5.9592824 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.010917955 = queryNorm
              1.4191023 = fieldWeight in 162, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
        0.16 = coord(4/25)
    
  3. Yan, E.; Ding, Y.: Discovering author impact : a PageRank perspective (2011) 0.15
    0.14987981 = sum of:
      0.14987981 = product of:
        0.93674886 = sum of:
          0.020349873 = weight(abstract_txt:show in 4705) [ClassicSimilarity], result of:
            0.020349873 = score(doc=4705,freq=1.0), product of:
              0.048731737 = queryWeight, product of:
                1.0020561 = boost
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.010917955 = queryNorm
              0.41758972 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.006323927 = weight(abstract_txt:that in 4705) [ClassicSimilarity], result of:
            0.006323927 = score(doc=4705,freq=1.0), product of:
              0.028169036 = queryWeight, product of:
                1.0774251 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010917955 = queryNorm
              0.22449924 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.21126135 = weight(abstract_txt:damping in 4705) [ClassicSimilarity], result of:
            0.21126135 = score(doc=4705,freq=1.0), product of:
              0.23190892 = queryWeight, product of:
                2.1859732 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.010917955 = queryNorm
              0.9109669 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.69881374 = weight(abstract_txt:pagerank in 4705) [ClassicSimilarity], result of:
            0.69881374 = score(doc=4705,freq=4.0), product of:
              0.49243367 = queryWeight, product of:
                5.9592824 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.010917955 = queryNorm
              1.4191023 = fieldWeight in 4705, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
        0.16 = coord(4/25)
    
  4. Dominich, S.; Skrop, A.: PageRank and interaction information retrieval (2005) 0.14
    0.14211996 = sum of:
      0.14211996 = product of:
        0.88824975 = sum of:
          0.03966609 = weight(abstract_txt:method in 4269) [ClassicSimilarity], result of:
            0.03966609 = score(doc=4269,freq=5.0), product of:
              0.050216664 = queryWeight, product of:
                1.0172086 = boost
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.010917955 = queryNorm
              0.789899 = fieldWeight in 4269, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5216455 = idf(docFreq=1262, maxDocs=42740)
                0.078125 = fieldNorm(doc=4269)
          0.0052699395 = weight(abstract_txt:that in 4269) [ClassicSimilarity], result of:
            0.0052699395 = score(doc=4269,freq=1.0), product of:
              0.028169036 = queryWeight, product of:
                1.0774251 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010917955 = queryNorm
              0.18708271 = fieldWeight in 4269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.078125 = fieldNorm(doc=4269)
          0.072943926 = weight(abstract_txt:chain in 4269) [ClassicSimilarity], result of:
            0.072943926 = score(doc=4269,freq=1.0), product of:
              0.12888955 = queryWeight, product of:
                1.6296523 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.010917955 = queryNorm
              0.5659414 = fieldWeight in 4269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.078125 = fieldNorm(doc=4269)
          0.77036977 = weight(abstract_txt:pagerank in 4269) [ClassicSimilarity], result of:
            0.77036977 = score(doc=4269,freq=7.0), product of:
              0.49243367 = queryWeight, product of:
                5.9592824 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.010917955 = queryNorm
              1.5644133 = fieldWeight in 4269, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.078125 = fieldNorm(doc=4269)
        0.16 = coord(4/25)
    
  5. Bauckhage, C.: Marginalizing over the PageRank damping factor (2014) 0.13
    0.12529962 = sum of:
      0.12529962 = product of:
        0.78312266 = sum of:
          0.027133163 = weight(abstract_txt:show in 928) [ClassicSimilarity], result of:
            0.027133163 = score(doc=928,freq=1.0), product of:
              0.048731737 = queryWeight, product of:
                1.0020561 = boost
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.010917955 = queryNorm
              0.5567863 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.008431903 = weight(abstract_txt:that in 928) [ClassicSimilarity], result of:
            0.008431903 = score(doc=928,freq=1.0), product of:
              0.028169036 = queryWeight, product of:
                1.0774251 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.010917955 = queryNorm
              0.29933232 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.28168178 = weight(abstract_txt:damping in 928) [ClassicSimilarity], result of:
            0.28168178 = score(doc=928,freq=1.0), product of:
              0.23190892 = queryWeight, product of:
                2.1859732 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.010917955 = queryNorm
              1.2146225 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.46587583 = weight(abstract_txt:pagerank in 928) [ClassicSimilarity], result of:
            0.46587583 = score(doc=928,freq=1.0), product of:
              0.49243367 = queryWeight, product of:
                5.9592824 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.010917955 = queryNorm
              0.9460682 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
        0.16 = coord(4/25)