Document (#32007)

Author
Kleinberg, J.M.
Title
Authoritative sources in a hyperlinked environment
Source
Journal of the Association for Computing Machinery. 46(1998) no.5, S.604-632
Year
1998
Abstract
The network structure of a hyperlinked environment can be a rich source of information about the content of the environment, provided we have effective means for understanding it. We develop a set of algorithmic tools for extracting information from the link structures of such environments, and report on experiments that demonstrate their effectiveness in a variety of contexts on the World Wide Web. The central issue we address within our framework is the distillation of broad search topics, through the discovery of "authoritative" information sources on such topics. We propose and test an algorithmic formulation of the notion of authority, based on the relationship between a set of relevant authoritative pages and the set of "hub pages" that join them together in the link structure. Our formulation has connections to the eigenvectors of certain matrices associated with the link graph; these connections in turn motivate additional heuristics for link-based analysis.
Content
Vorversionen auch in: Proceedings of the ACM-SIAM Symposium on Discrete Algorithms, 1998, und als IBM Research Report RJ 10076, May 1997.
Theme
Retrievalalgorithmen
Object
HITS-Algorithmus

Similar documents (author)

  1. Kleinberg, I.: Making the case for professional indexers : where is the proof? (1993) 6.17
    6.169457 = sum of:
      6.169457 = weight(author_txt:kleinberg in 7766) [ClassicSimilarity], result of:
        6.169457 = fieldWeight in 7766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.625 = fieldNorm(doc=7766)
    
  2. Kleinberg, I.: For want of an alphabetical index : some notes toward a history of the back-of-the-book index in nineteenth century America (1997) 6.17
    6.169457 = sum of:
      6.169457 = weight(author_txt:kleinberg in 4735) [ClassicSimilarity], result of:
        6.169457 = fieldWeight in 4735, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.625 = fieldNorm(doc=4735)
    
  3. Liben-Nowell, D.; Kleinberg, J.: ¬The link-prediction problem for social networks (2007) 4.32
    4.3186197 = sum of:
      4.3186197 = weight(author_txt:kleinberg in 2331) [ClassicSimilarity], result of:
        4.3186197 = fieldWeight in 2331, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.4375 = fieldNorm(doc=2331)
    
  4. Chakrabarti, S.; Dom, B.; Kumar, S.R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.; Kleinberg, J.M.; Gibson, D.: Neue Pfade durch den Internet-Dschungel : Die zweite Generation von Web-Suchmaschinen (1999) 2.47
    2.4677827 = sum of:
      2.4677827 = weight(author_txt:kleinberg in 2004) [ClassicSimilarity], result of:
        2.4677827 = fieldWeight in 2004, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.25 = fieldNorm(doc=2004)
    

Similar documents (content)

  1. Lempel, R.; Moran, S.: SALSA: the stochastic approach for link-structure analysis (2001) 0.21
    0.21363078 = sum of:
      0.21363078 = product of:
        0.6675962 = sum of:
          0.035796653 = weight(abstract_txt:notion in 2011) [ClassicSimilarity], result of:
            0.035796653 = score(doc=2011,freq=1.0), product of:
              0.09461537 = queryWeight, product of:
                1.0397046 = boost
                6.0534186 = idf(docFreq=272, maxDocs=42740)
                0.015033186 = queryNorm
              0.37833866 = fieldWeight in 2011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0534186 = idf(docFreq=272, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.010663962 = weight(abstract_txt:based in 2011) [ClassicSimilarity], result of:
            0.010663962 = score(doc=2011,freq=1.0), product of:
              0.053172752 = queryWeight, product of:
                1.1022717 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.015033186 = queryNorm
              0.20055313 = fieldWeight in 2011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.018761672 = weight(abstract_txt:such in 2011) [ClassicSimilarity], result of:
            0.018761672 = score(doc=2011,freq=2.0), product of:
              0.06150557 = queryWeight, product of:
                1.1854993 = boost
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.015033186 = queryNorm
              0.3050402 = fieldWeight in 2011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.012033633 = weight(abstract_txt:information in 2011) [ClassicSimilarity], result of:
            0.012033633 = score(doc=2011,freq=3.0), product of:
              0.045743696 = queryWeight, product of:
                1.2521471 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015033186 = queryNorm
              0.26306647 = fieldWeight in 2011, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.04673578 = weight(abstract_txt:structure in 2011) [ClassicSimilarity], result of:
            0.04673578 = score(doc=2011,freq=3.0), product of:
              0.09873459 = queryWeight, product of:
                1.5020305 = boost
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.015033186 = queryNorm
              0.47334757 = fieldWeight in 2011, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.112467684 = weight(abstract_txt:pages in 2011) [ClassicSimilarity], result of:
            0.112467684 = score(doc=2011,freq=4.0), product of:
              0.16109248 = queryWeight, product of:
                1.918588 = boost
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.015033186 = queryNorm
              0.69815606 = fieldWeight in 2011, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.19083448 = weight(abstract_txt:authoritative in 2011) [ClassicSimilarity], result of:
            0.19083448 = score(doc=2011,freq=1.0), product of:
              0.41643292 = queryWeight, product of:
                3.7780025 = boost
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.015033186 = queryNorm
              0.45825982 = fieldWeight in 2011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
          0.24030238 = weight(abstract_txt:link in 2011) [ClassicSimilarity], result of:
            0.24030238 = score(doc=2011,freq=4.0), product of:
              0.33669665 = queryWeight, product of:
                3.9226403 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.015033186 = queryNorm
              0.7137059 = fieldWeight in 2011, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.0625 = fieldNorm(doc=2011)
        0.32 = coord(8/25)
    
  2. Menczer, F.: Lexical and semantic clustering by Web links (2004) 0.19
    0.19423816 = sum of:
      0.19423816 = product of:
        0.6937077 = sum of:
          0.05956161 = weight(abstract_txt:graph in 4091) [ClassicSimilarity], result of:
            0.05956161 = score(doc=4091,freq=1.0), product of:
              0.11449092 = queryWeight, product of:
                1.1437066 = boost
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.015033186 = queryNorm
              0.52023 = fieldWeight in 4091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.02345209 = weight(abstract_txt:such in 4091) [ClassicSimilarity], result of:
            0.02345209 = score(doc=4091,freq=2.0), product of:
              0.06150557 = queryWeight, product of:
                1.1854993 = boost
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.015033186 = queryNorm
              0.38130027 = fieldWeight in 4091, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.008684527 = weight(abstract_txt:information in 4091) [ClassicSimilarity], result of:
            0.008684527 = score(doc=4091,freq=1.0), product of:
              0.045743696 = queryWeight, product of:
                1.2521471 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015033186 = queryNorm
              0.18985188 = fieldWeight in 4091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.03372864 = weight(abstract_txt:structure in 4091) [ClassicSimilarity], result of:
            0.03372864 = score(doc=4091,freq=1.0), product of:
              0.09873459 = queryWeight, product of:
                1.5020305 = boost
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.015033186 = queryNorm
              0.34160918 = fieldWeight in 4091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.12174984 = weight(abstract_txt:pages in 4091) [ClassicSimilarity], result of:
            0.12174984 = score(doc=4091,freq=3.0), product of:
              0.16109248 = queryWeight, product of:
                1.918588 = boost
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.015033186 = queryNorm
              0.7557761 = fieldWeight in 4091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.11069816 = weight(abstract_txt:connections in 4091) [ClassicSimilarity], result of:
            0.11069816 = score(doc=4091,freq=1.0), product of:
              0.2180538 = queryWeight, product of:
                2.2321632 = boost
                6.4981046 = idf(docFreq=174, maxDocs=42740)
                0.015033186 = queryNorm
              0.50766444 = fieldWeight in 4091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4981046 = idf(docFreq=174, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
          0.3358328 = weight(abstract_txt:link in 4091) [ClassicSimilarity], result of:
            0.3358328 = score(doc=4091,freq=5.0), product of:
              0.33669665 = queryWeight, product of:
                3.9226403 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.015033186 = queryNorm
              0.9974343 = fieldWeight in 4091, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.078125 = fieldNorm(doc=4091)
        0.28 = coord(7/25)
    
  3. Rauter, J.: ¬Die Bündelung von Kleinbergs authorities und hubs in van Rijsbergens Effektivitätsmaß (2006) 0.16
    0.16026074 = sum of:
      0.16026074 = product of:
        1.0016296 = sum of:
          0.06971274 = weight(abstract_txt:sources in 1201) [ClassicSimilarity], result of:
            0.06971274 = score(doc=1201,freq=1.0), product of:
              0.11711114 = queryWeight, product of:
                1.6358489 = boost
                4.76216 = idf(docFreq=992, maxDocs=42740)
                0.015033186 = queryNorm
              0.59527 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76216 = idf(docFreq=992, maxDocs=42740)
                0.125 = fieldNorm(doc=1201)
          0.0966722 = weight(abstract_txt:environment in 1201) [ClassicSimilarity], result of:
            0.0966722 = score(doc=1201,freq=1.0), product of:
              0.16670741 = queryWeight, product of:
                2.3903813 = boost
                4.6391315 = idf(docFreq=1122, maxDocs=42740)
                0.015033186 = queryNorm
              0.57989144 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6391315 = idf(docFreq=1122, maxDocs=42740)
                0.125 = fieldNorm(doc=1201)
          0.4535757 = weight(abstract_txt:hyperlinked in 1201) [ClassicSimilarity], result of:
            0.4535757 = score(doc=1201,freq=1.0), product of:
              0.40815327 = queryWeight, product of:
                3.0539064 = boost
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.015033186 = queryNorm
              1.1112877 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.125 = fieldNorm(doc=1201)
          0.38166896 = weight(abstract_txt:authoritative in 1201) [ClassicSimilarity], result of:
            0.38166896 = score(doc=1201,freq=1.0), product of:
              0.41643292 = queryWeight, product of:
                3.7780025 = boost
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.015033186 = queryNorm
              0.91651964 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.332157 = idf(docFreq=75, maxDocs=42740)
                0.125 = fieldNorm(doc=1201)
        0.16 = coord(4/25)
    
  4. Krause, J.: Shell Model, Semantic Web and Web Information Retrieval (2006) 0.11
    0.114322804 = sum of:
      0.114322804 = product of:
        0.40829572 = sum of:
          0.010663962 = weight(abstract_txt:based in 1062) [ClassicSimilarity], result of:
            0.010663962 = score(doc=1062,freq=1.0), product of:
              0.053172752 = queryWeight, product of:
                1.1022717 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.015033186 = queryNorm
              0.20055313 = fieldWeight in 1062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.013266506 = weight(abstract_txt:such in 1062) [ClassicSimilarity], result of:
            0.013266506 = score(doc=1062,freq=1.0), product of:
              0.06150557 = queryWeight, product of:
                1.1854993 = boost
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.015033186 = queryNorm
              0.215696 = fieldWeight in 1062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.451136 = idf(docFreq=3683, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.017018128 = weight(abstract_txt:information in 1062) [ClassicSimilarity], result of:
            0.017018128 = score(doc=1062,freq=6.0), product of:
              0.045743696 = queryWeight, product of:
                1.2521471 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015033186 = queryNorm
              0.3720322 = fieldWeight in 1062, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.026982915 = weight(abstract_txt:structure in 1062) [ClassicSimilarity], result of:
            0.026982915 = score(doc=1062,freq=1.0), product of:
              0.09873459 = queryWeight, product of:
                1.5020305 = boost
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.015033186 = queryNorm
              0.27328736 = fieldWeight in 1062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.03485637 = weight(abstract_txt:sources in 1062) [ClassicSimilarity], result of:
            0.03485637 = score(doc=1062,freq=1.0), product of:
              0.11711114 = queryWeight, product of:
                1.6358489 = boost
                4.76216 = idf(docFreq=992, maxDocs=42740)
                0.015033186 = queryNorm
              0.297635 = fieldWeight in 1062, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76216 = idf(docFreq=992, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.097399876 = weight(abstract_txt:pages in 1062) [ClassicSimilarity], result of:
            0.097399876 = score(doc=1062,freq=3.0), product of:
              0.16109248 = queryWeight, product of:
                1.918588 = boost
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.015033186 = queryNorm
              0.6046209 = fieldWeight in 1062, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5852485 = idf(docFreq=435, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
          0.20810796 = weight(abstract_txt:link in 1062) [ClassicSimilarity], result of:
            0.20810796 = score(doc=1062,freq=3.0), product of:
              0.33669665 = queryWeight, product of:
                3.9226403 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.015033186 = queryNorm
              0.6180874 = fieldWeight in 1062, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.0625 = fieldNorm(doc=1062)
        0.28 = coord(7/25)
    
  5. Yang, P.; Gao, W.; Tan, Q.; Wong, K.-F.: ¬A link-bridged topic model for cross-domain document classification (2013) 0.11
    0.10810737 = sum of:
      0.10810737 = product of:
        0.4504474 = sum of:
          0.010663962 = weight(abstract_txt:based in 4707) [ClassicSimilarity], result of:
            0.010663962 = score(doc=4707,freq=1.0), product of:
              0.053172752 = queryWeight, product of:
                1.1022717 = boost
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.015033186 = queryNorm
              0.20055313 = fieldWeight in 4707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2088501 = idf(docFreq=4693, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
          0.04764929 = weight(abstract_txt:graph in 4707) [ClassicSimilarity], result of:
            0.04764929 = score(doc=4707,freq=1.0), product of:
              0.11449092 = queryWeight, product of:
                1.1437066 = boost
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.015033186 = queryNorm
              0.416184 = fieldWeight in 4707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
          0.009825421 = weight(abstract_txt:information in 4707) [ClassicSimilarity], result of:
            0.009825421 = score(doc=4707,freq=2.0), product of:
              0.045743696 = queryWeight, product of:
                1.2521471 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015033186 = queryNorm
              0.21479288 = fieldWeight in 4707, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
          0.026982915 = weight(abstract_txt:structure in 4707) [ClassicSimilarity], result of:
            0.026982915 = score(doc=4707,freq=1.0), product of:
              0.09873459 = queryWeight, product of:
                1.5020305 = boost
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.015033186 = queryNorm
              0.27328736 = fieldWeight in 4707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3725977 = idf(docFreq=1465, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
          0.08665959 = weight(abstract_txt:topics in 4707) [ClassicSimilarity], result of:
            0.08665959 = score(doc=4707,freq=4.0), product of:
              0.13539454 = queryWeight, product of:
                1.7589144 = boost
                5.1204185 = idf(docFreq=693, maxDocs=42740)
                0.015033186 = queryNorm
              0.6400523 = fieldWeight in 4707, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1204185 = idf(docFreq=693, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
          0.26866624 = weight(abstract_txt:link in 4707) [ClassicSimilarity], result of:
            0.26866624 = score(doc=4707,freq=5.0), product of:
              0.33669665 = queryWeight, product of:
                3.9226403 = boost
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.015033186 = queryNorm
              0.79794747 = fieldWeight in 4707, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.709647 = idf(docFreq=384, maxDocs=42740)
                0.0625 = fieldNorm(doc=4707)
        0.24 = coord(6/25)