Document (#39565)

Author
Bressan, M.
Peserico, E.
Title
Choose the damping, choose the ranking?
Source
Journal of discrete algorithms. 8(2010) no.2, S.199-213
Year
2010
Abstract
To what extent can changes in PageRank's damping factor affect node ranking? We prove that, at least on some graphs, the top k nodes assume all possible k! orderings as the damping factor varies, even if it varies within an arbitrarily small interval (e.g. [0.84999,0.85001][0.84999,0.85001]). Thus, the rank of a node for a given (finite set of discrete) damping factor(s) provides very little information about the rank of that node as the damping factor varies over a continuous interval. We bypass this problem introducing lineage analysis and proving that there is a simple condition, with a "natural" interpretation independent of PageRank, that allows one to verify "in one shot" if a node outperforms another simultaneously for all damping factors and all damping variables (informally, time variant damping factors). The novel notions of strong rank and weak rank of a node provide a measure of the fuzziness of the rank of that node, of the objective orderability of a graph's nodes, and of the quality of results returned by different ranking algorithms based on the random surfer model. We deploy our analytical tools on a 41M node snapshot of the .it Web domain and on a 0.7M node snapshot of the CiteSeer citation graph. Among other findings, we show that rank is indeed relatively stable in both graphs; that "classic" PageRank (d=0.85) marginally outperforms Weighted In-degree (d->0), mainly due to its ability to ferret out "niche" items; and that, for both the Web and CiteSeer, the ideal damping factor appears to be 0.8-0.9 to obtain those items of high importance to at least one (model of randomly surfing) user, but only 0.5-0.6 to obtain those items important to every (model of randomly surfing) user.
Content
This paper addresses the fundamental question of how the ranking induced by PageRank can be affected by variations of the damping factor. This introduction briefly reviews the PageRank algorithm (Section 1.1) and the crucial difference between score and rank (Section 1.2) before presenting an overview of our results and the organization of the rest of the paper (Section 1.3). Vgl. auch: doi:10.1016/j.jda.2009.11.001. http://www.sciencedirect.com/science/article/pii/S1570866709000926.
Theme
Suchmaschinen
Object
PageRank

Similar documents (content)

  1. Ding, Y.; Yan, E.; Frazho, A.; Caverlee, J.: PageRank for ranking authors in co-citation networks (2009) 0.34
    0.3385644 = sum of:
      0.3385644 = product of:
        1.2091585 = sum of:
          0.023932785 = weight(abstract_txt:factors in 162) [ClassicSimilarity], result of:
            0.023932785 = score(doc=162,freq=3.0), product of:
              0.044139184 = queryWeight, product of:
                1.0702082 = boost
                5.008738 = idf(docFreq=775, maxDocs=42740)
                0.008234319 = queryNorm
              0.5422118 = fieldWeight in 162, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.008738 = idf(docFreq=775, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.14302313 = weight(abstract_txt:pagerank in 162) [ClassicSimilarity], result of:
            0.14302313 = score(doc=162,freq=9.0), product of:
              0.10078423 = queryWeight, product of:
                1.6171578 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.008234319 = queryNorm
              1.4191023 = fieldWeight in 162, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.040945843 = weight(abstract_txt:ranking in 162) [ClassicSimilarity], result of:
            0.040945843 = score(doc=162,freq=2.0), product of:
              0.08273631 = queryWeight, product of:
                1.7945263 = boost
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.008234319 = queryNorm
              0.49489567 = fieldWeight in 162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.008541879 = weight(abstract_txt:that in 162) [ClassicSimilarity], result of:
            0.008541879 = score(doc=162,freq=2.0), product of:
              0.040356625 = queryWeight, product of:
                2.04665 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008234319 = queryNorm
              0.21165991 = fieldWeight in 162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.054338954 = weight(abstract_txt:factor in 162) [ClassicSimilarity], result of:
            0.054338954 = score(doc=162,freq=1.0), product of:
              0.14925312 = queryWeight, product of:
                3.1116292 = boost
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.008234319 = queryNorm
              0.3640725 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.15202795 = weight(abstract_txt:rank in 162) [ClassicSimilarity], result of:
            0.15202795 = score(doc=162,freq=3.0), product of:
              0.21834914 = queryWeight, product of:
                4.1228023 = boost
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.008234319 = queryNorm
              0.6962608 = fieldWeight in 162, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
          0.786348 = weight(abstract_txt:damping in 162) [ClassicSimilarity], result of:
            0.786348 = score(doc=162,freq=3.0), product of:
              0.7475544 = queryWeight, product of:
                9.342945 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.008234319 = queryNorm
              1.051894 = fieldWeight in 162, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0625 = fieldNorm(doc=162)
        0.28 = coord(7/25)
    
  2. Boldi, P.; Santini, M.; Vigna, S.: PageRank as a function of the damping factor (2005) 0.27
    0.27155402 = sum of:
      0.27155402 = product of:
        0.84860635 = sum of:
          0.052883286 = weight(abstract_txt:0.85 in 4565) [ClassicSimilarity], result of:
            0.052883286 = score(doc=4565,freq=1.0), product of:
              0.085717894 = queryWeight, product of:
                1.0545735 = boost
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.008234319 = queryNorm
              0.6169457 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.871131 = idf(docFreq=5, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.041801512 = weight(abstract_txt:graphs in 4565) [ClassicSimilarity], result of:
            0.041801512 = score(doc=4565,freq=1.0), product of:
              0.092327386 = queryWeight, product of:
                1.5478234 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.008234319 = queryNorm
              0.45275313 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.116777904 = weight(abstract_txt:pagerank in 4565) [ClassicSimilarity], result of:
            0.116777904 = score(doc=4565,freq=6.0), product of:
              0.10078423 = queryWeight, product of:
                1.6171578 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.008234319 = queryNorm
              1.1586922 = fieldWeight in 4565, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.028953083 = weight(abstract_txt:ranking in 4565) [ClassicSimilarity], result of:
            0.028953083 = score(doc=4565,freq=1.0), product of:
              0.08273631 = queryWeight, product of:
                1.7945263 = boost
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.008234319 = queryNorm
              0.34994408 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.012080043 = weight(abstract_txt:that in 4565) [ClassicSimilarity], result of:
            0.012080043 = score(doc=4565,freq=4.0), product of:
              0.040356625 = queryWeight, product of:
                2.04665 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008234319 = queryNorm
              0.29933232 = fieldWeight in 4565, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.054338954 = weight(abstract_txt:factor in 4565) [ClassicSimilarity], result of:
            0.054338954 = score(doc=4565,freq=1.0), product of:
              0.14925312 = queryWeight, product of:
                3.1116292 = boost
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.008234319 = queryNorm
              0.3640725 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.82516 = idf(docFreq=342, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.087773375 = weight(abstract_txt:rank in 4565) [ClassicSimilarity], result of:
            0.087773375 = score(doc=4565,freq=1.0), product of:
              0.21834914 = queryWeight, product of:
                4.1228023 = boost
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.008234319 = queryNorm
              0.40198636 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
          0.4539982 = weight(abstract_txt:damping in 4565) [ClassicSimilarity], result of:
            0.4539982 = score(doc=4565,freq=1.0), product of:
              0.7475544 = queryWeight, product of:
                9.342945 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.008234319 = queryNorm
              0.60731125 = fieldWeight in 4565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0625 = fieldNorm(doc=4565)
        0.32 = coord(8/25)
    
  3. Baeza-Yates, R.; Boldi, P.; Castillo, C.: Generalizing PageRank : damping functions for linkbased ranking algorithms (2006) 0.21
    0.20754766 = sum of:
      0.20754766 = product of:
        1.0377383 = sum of:
          0.09534875 = weight(abstract_txt:pagerank in 4566) [ClassicSimilarity], result of:
            0.09534875 = score(doc=4566,freq=4.0), product of:
              0.10078423 = queryWeight, product of:
                1.6171578 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.008234319 = queryNorm
              0.9460682 = fieldWeight in 4566, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.0625 = fieldNorm(doc=4566)
          0.05014821 = weight(abstract_txt:ranking in 4566) [ClassicSimilarity], result of:
            0.05014821 = score(doc=4566,freq=3.0), product of:
              0.08273631 = queryWeight, product of:
                1.7945263 = boost
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.008234319 = queryNorm
              0.60612094 = fieldWeight in 4566, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5991054 = idf(docFreq=429, maxDocs=42740)
                0.0625 = fieldNorm(doc=4566)
          0.018120063 = weight(abstract_txt:that in 4566) [ClassicSimilarity], result of:
            0.018120063 = score(doc=4566,freq=9.0), product of:
              0.040356625 = queryWeight, product of:
                2.04665 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008234319 = queryNorm
              0.44899848 = fieldWeight in 4566, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.0625 = fieldNorm(doc=4566)
          0.087773375 = weight(abstract_txt:rank in 4566) [ClassicSimilarity], result of:
            0.087773375 = score(doc=4566,freq=1.0), product of:
              0.21834914 = queryWeight, product of:
                4.1228023 = boost
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.008234319 = queryNorm
              0.40198636 = fieldWeight in 4566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.431782 = idf(docFreq=186, maxDocs=42740)
                0.0625 = fieldNorm(doc=4566)
          0.786348 = weight(abstract_txt:damping in 4566) [ClassicSimilarity], result of:
            0.786348 = score(doc=4566,freq=3.0), product of:
              0.7475544 = queryWeight, product of:
                9.342945 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.008234319 = queryNorm
              1.051894 = fieldWeight in 4566, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.0625 = fieldNorm(doc=4566)
        0.2 = coord(5/25)
    
  4. Bauckhage, C.: Marginalizing over the PageRank damping factor (2014) 0.17
    0.1713734 = sum of:
      0.1713734 = product of:
        1.0710838 = sum of:
          0.05565855 = weight(abstract_txt:obtain in 928) [ClassicSimilarity], result of:
            0.05565855 = score(doc=928,freq=1.0), product of:
              0.07039424 = queryWeight, product of:
                1.351527 = boost
                6.3253527 = idf(docFreq=207, maxDocs=42740)
                0.008234319 = queryNorm
              0.7906691 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3253527 = idf(docFreq=207, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.09534875 = weight(abstract_txt:pagerank in 928) [ClassicSimilarity], result of:
            0.09534875 = score(doc=928,freq=1.0), product of:
              0.10078423 = queryWeight, product of:
                1.6171578 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.008234319 = queryNorm
              0.9460682 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.012080043 = weight(abstract_txt:that in 928) [ClassicSimilarity], result of:
            0.012080043 = score(doc=928,freq=1.0), product of:
              0.040356625 = queryWeight, product of:
                2.04665 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008234319 = queryNorm
              0.29933232 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
          0.9079964 = weight(abstract_txt:damping in 928) [ClassicSimilarity], result of:
            0.9079964 = score(doc=928,freq=1.0), product of:
              0.7475544 = queryWeight, product of:
                9.342945 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.008234319 = queryNorm
              1.2146225 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.125 = fieldNorm(doc=928)
        0.16 = coord(4/25)
    
  5. Yan, E.; Ding, Y.: Discovering author impact : a PageRank perspective (2011) 0.14
    0.13660909 = sum of:
      0.13660909 = product of:
        0.85380685 = sum of:
          0.020726401 = weight(abstract_txt:factors in 4705) [ClassicSimilarity], result of:
            0.020726401 = score(doc=4705,freq=1.0), product of:
              0.044139184 = queryWeight, product of:
                1.0702082 = boost
                5.008738 = idf(docFreq=775, maxDocs=42740)
                0.008234319 = queryNorm
              0.4695692 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.008738 = idf(docFreq=775, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.14302313 = weight(abstract_txt:pagerank in 4705) [ClassicSimilarity], result of:
            0.14302313 = score(doc=4705,freq=4.0), product of:
              0.10078423 = queryWeight, product of:
                1.6171578 = boost
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.008234319 = queryNorm
              1.4191023 = fieldWeight in 4705, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.568546 = idf(docFreq=59, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.009060032 = weight(abstract_txt:that in 4705) [ClassicSimilarity], result of:
            0.009060032 = score(doc=4705,freq=1.0), product of:
              0.040356625 = queryWeight, product of:
                2.04665 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.008234319 = queryNorm
              0.22449924 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
          0.6809973 = weight(abstract_txt:damping in 4705) [ClassicSimilarity], result of:
            0.6809973 = score(doc=4705,freq=1.0), product of:
              0.7475544 = queryWeight, product of:
                9.342945 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.008234319 = queryNorm
              0.9109669 = fieldWeight in 4705, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.09375 = fieldNorm(doc=4705)
        0.16 = coord(4/25)