Document (#28277)

Author
Kanaeva, Z.
Title
Ranking: Google und CiteSeer
Source
Information - Wissenschaft und Praxis. 56(2005) H.2, S.87-92
Year
2005
Abstract
Im Rahmen des klassischen Information Retrieval wurden verschiedene Verfahren für das Ranking sowie die Suche in einer homogenen strukturlosen Dokumentenmenge entwickelt. Die Erfolge der Suchmaschine Google haben gezeigt dass die Suche in einer zwar inhomogenen aber zusammenhängenden Dokumentenmenge wie dem Internet unter Berücksichtigung der Dokumentenverbindungen (Links) sehr effektiv sein kann. Unter den von der Suchmaschine Google realisierten Konzepten ist ein Verfahren zum Ranking von Suchergebnissen (PageRank), das in diesem Artikel kurz erklärt wird. Darüber hinaus wird auf die Konzepte eines Systems namens CiteSeer eingegangen, welches automatisch bibliographische Angaben indexiert (engl. Autonomous Citation Indexing, ACI). Letzteres erzeugt aus einer Menge von nicht vernetzten wissenschaftlichen Dokumenten eine zusammenhängende Dokumentenmenge und ermöglicht den Einsatz von Banking-Verfahren, die auf den von Google genutzten Verfahren basieren.
Theme
Suchmaschinen
Retrievalalgorithmen
Object
Google
CiteSeer

Similar documents (content)

  1. Schöch, V.C.: ¬Die Suchmaschine Google (2001) 0.40
    0.39549142 = sum of:
      0.39549142 = product of:
        1.0985873 = sum of:
          0.09738293 = weight(abstract_txt:konzepten in 180) [ClassicSimilarity], result of:
            0.09738293 = score(doc=180,freq=1.0), product of:
              0.13722393 = queryWeight, product of:
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.018127928 = queryNorm
              0.70966434 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.09865393 = weight(abstract_txt:pagerank in 180) [ClassicSimilarity], result of:
            0.09865393 = score(doc=180,freq=1.0), product of:
              0.13841534 = queryWeight, product of:
                1.0043317 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.018127928 = queryNorm
              0.7127384 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.14309296 = weight(abstract_txt:suchergebnissen in 180) [ClassicSimilarity], result of:
            0.14309296 = score(doc=180,freq=1.0), product of:
              0.1773591 = queryWeight, product of:
                1.1368726 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018127928 = queryNorm
              0.8067979 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.04916778 = weight(abstract_txt:unter in 180) [ClassicSimilarity], result of:
            0.04916778 = score(doc=180,freq=1.0), product of:
              0.10962384 = queryWeight, product of:
                1.264016 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.018127928 = queryNorm
              0.44851354 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.055797014 = weight(abstract_txt:einer in 180) [ClassicSimilarity], result of:
            0.055797014 = score(doc=180,freq=2.0), product of:
              0.10836251 = queryWeight, product of:
                1.5391653 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.018127928 = queryNorm
              0.5149107 = fieldWeight in 180, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.16793405 = weight(abstract_txt:suchmaschine in 180) [ClassicSimilarity], result of:
            0.16793405 = score(doc=180,freq=2.0), product of:
              0.19733334 = queryWeight, product of:
                1.6959002 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.018127928 = queryNorm
              0.8510171 = fieldWeight in 180, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.11820802 = weight(abstract_txt:ranking in 180) [ClassicSimilarity], result of:
            0.11820802 = score(doc=180,freq=1.0), product of:
              0.22520585 = queryWeight, product of:
                2.218889 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.018127928 = queryNorm
              0.52488875 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.19655354 = weight(abstract_txt:google in 180) [ClassicSimilarity], result of:
            0.19655354 = score(doc=180,freq=2.0), product of:
              0.2761247 = queryWeight, product of:
                2.8370545 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.018127928 = queryNorm
              0.71182895 = fieldWeight in 180, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
          0.17179711 = weight(abstract_txt:verfahren in 180) [ClassicSimilarity], result of:
            0.17179711 = score(doc=180,freq=1.0), product of:
              0.31803277 = queryWeight, product of:
                3.0447457 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018127928 = queryNorm
              0.5401868 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.09375 = fieldNorm(doc=180)
        0.36 = coord(9/25)
    
  2. Korves, J.: Seiten bewerten : Googles PageRank (2005) 0.16
    0.15829717 = sum of:
      0.15829717 = product of:
        0.565347 = sum of:
          0.11029845 = weight(abstract_txt:pagerank in 866) [ClassicSimilarity], result of:
            0.11029845 = score(doc=866,freq=5.0), product of:
              0.13841534 = queryWeight, product of:
                1.0043317 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.018127928 = queryNorm
              0.7968658 = fieldWeight in 866, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.07154648 = weight(abstract_txt:namens in 866) [ClassicSimilarity], result of:
            0.07154648 = score(doc=866,freq=1.0), product of:
              0.1773591 = queryWeight, product of:
                1.1368726 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.018127928 = queryNorm
              0.40339896 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.03416856 = weight(abstract_txt:einer in 866) [ClassicSimilarity], result of:
            0.03416856 = score(doc=866,freq=3.0), product of:
              0.10836251 = queryWeight, product of:
                1.5391653 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.018127928 = queryNorm
              0.31531715 = fieldWeight in 866, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.08396702 = weight(abstract_txt:suchmaschine in 866) [ClassicSimilarity], result of:
            0.08396702 = score(doc=866,freq=2.0), product of:
              0.19733334 = queryWeight, product of:
                1.6959002 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.018127928 = queryNorm
              0.42550856 = fieldWeight in 866, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.05910401 = weight(abstract_txt:ranking in 866) [ClassicSimilarity], result of:
            0.05910401 = score(doc=866,freq=1.0), product of:
              0.22520585 = queryWeight, product of:
                2.218889 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.018127928 = queryNorm
              0.26244438 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.12036398 = weight(abstract_txt:google in 866) [ClassicSimilarity], result of:
            0.12036398 = score(doc=866,freq=3.0), product of:
              0.2761247 = queryWeight, product of:
                2.8370545 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.018127928 = queryNorm
              0.43590444 = fieldWeight in 866, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
          0.085898556 = weight(abstract_txt:verfahren in 866) [ClassicSimilarity], result of:
            0.085898556 = score(doc=866,freq=1.0), product of:
              0.31803277 = queryWeight, product of:
                3.0447457 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.018127928 = queryNorm
              0.2700934 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.046875 = fieldNorm(doc=866)
        0.28 = coord(7/25)
    
  3. Lewandowski, D.: Spezialsuche für wissenschaftliche Informationen (2004) 0.15
    0.14635977 = sum of:
      0.14635977 = product of:
        0.91474855 = sum of:
          0.06555703 = weight(abstract_txt:unter in 3298) [ClassicSimilarity], result of:
            0.06555703 = score(doc=3298,freq=1.0), product of:
              0.10962384 = queryWeight, product of:
                1.264016 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.018127928 = queryNorm
              0.59801805 = fieldWeight in 3298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.125 = fieldNorm(doc=3298)
          0.10571125 = weight(abstract_txt:suche in 3298) [ClassicSimilarity], result of:
            0.10571125 = score(doc=3298,freq=1.0), product of:
              0.15074386 = queryWeight, product of:
                1.4822446 = boost
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.018127928 = queryNorm
              0.7012641 = fieldWeight in 3298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.125 = fieldNorm(doc=3298)
          0.48140886 = weight(abstract_txt:citeseer in 3298) [ClassicSimilarity], result of:
            0.48140886 = score(doc=3298,freq=1.0), product of:
              0.41416004 = queryWeight, product of:
                2.4568813 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.018127928 = queryNorm
              1.162374 = fieldWeight in 3298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.125 = fieldNorm(doc=3298)
          0.2620714 = weight(abstract_txt:google in 3298) [ClassicSimilarity], result of:
            0.2620714 = score(doc=3298,freq=2.0), product of:
              0.2761247 = queryWeight, product of:
                2.8370545 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.018127928 = queryNorm
              0.94910526 = fieldWeight in 3298, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.125 = fieldNorm(doc=3298)
        0.16 = coord(4/25)
    
  4. Hosbach, W.: Gates gegen Google : Neue Suchmaschine von MSN (2005) 0.14
    0.1388924 = sum of:
      0.1388924 = product of:
        0.86807746 = sum of:
          0.15856688 = weight(abstract_txt:suche in 3221) [ClassicSimilarity], result of:
            0.15856688 = score(doc=3221,freq=1.0), product of:
              0.15074386 = queryWeight, product of:
                1.4822446 = boost
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.018127928 = queryNorm
              1.0518961 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.1875 = fieldNorm(doc=3221)
          0.078908905 = weight(abstract_txt:einer in 3221) [ClassicSimilarity], result of:
            0.078908905 = score(doc=3221,freq=1.0), product of:
              0.10836251 = queryWeight, product of:
                1.5391653 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.018127928 = queryNorm
              0.72819376 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.1875 = fieldNorm(doc=3221)
          0.23749459 = weight(abstract_txt:suchmaschine in 3221) [ClassicSimilarity], result of:
            0.23749459 = score(doc=3221,freq=1.0), product of:
              0.19733334 = queryWeight, product of:
                1.6959002 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.018127928 = queryNorm
              1.2035198 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.1875 = fieldNorm(doc=3221)
          0.3931071 = weight(abstract_txt:google in 3221) [ClassicSimilarity], result of:
            0.3931071 = score(doc=3221,freq=2.0), product of:
              0.2761247 = queryWeight, product of:
                2.8370545 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.018127928 = queryNorm
              1.4236579 = fieldWeight in 3221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.1875 = fieldNorm(doc=3221)
        0.16 = coord(4/25)
    
  5. tz: Mein Freund Google und ich (2006) 0.11
    0.11249884 = sum of:
      0.11249884 = product of:
        0.5624942 = sum of:
          0.046355825 = weight(abstract_txt:unter in 2144) [ClassicSimilarity], result of:
            0.046355825 = score(doc=2144,freq=2.0), product of:
              0.10962384 = queryWeight, product of:
                1.264016 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.018127928 = queryNorm
              0.42286262 = fieldWeight in 2144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.0625 = fieldNorm(doc=2144)
          0.07474914 = weight(abstract_txt:suche in 2144) [ClassicSimilarity], result of:
            0.07474914 = score(doc=2144,freq=2.0), product of:
              0.15074386 = queryWeight, product of:
                1.4822446 = boost
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.018127928 = queryNorm
              0.4958686 = fieldWeight in 2144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6101127 = idf(docFreq=439, maxDocs=44218)
                0.0625 = fieldNorm(doc=2144)
          0.026302967 = weight(abstract_txt:einer in 2144) [ClassicSimilarity], result of:
            0.026302967 = score(doc=2144,freq=1.0), product of:
              0.10836251 = queryWeight, product of:
                1.5391653 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.018127928 = queryNorm
              0.24273124 = fieldWeight in 2144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0625 = fieldNorm(doc=2144)
          0.13711756 = weight(abstract_txt:suchmaschine in 2144) [ClassicSimilarity], result of:
            0.13711756 = score(doc=2144,freq=3.0), product of:
              0.19733334 = queryWeight, product of:
                1.6959002 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.018127928 = queryNorm
              0.69485253 = fieldWeight in 2144, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.0625 = fieldNorm(doc=2144)
          0.2779687 = weight(abstract_txt:google in 2144) [ClassicSimilarity], result of:
            0.2779687 = score(doc=2144,freq=9.0), product of:
              0.2761247 = queryWeight, product of:
                2.8370545 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.018127928 = queryNorm
              1.0066782 = fieldWeight in 2144, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.0625 = fieldNorm(doc=2144)
        0.2 = coord(5/25)