Document (#21063)

Author
Wätjen, H.-J.
Title
GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web
Source
B.I.T.online. 1(1998) H.4, S.279-290
Year
1998
Abstract
Die intellektuelle Erschließung des Internet befindet sich in einer Krise. Yahoo und andere Dienste können mit dem Wachstum des Web nicht mithalten. GERHARD ist derzeit weltweit der einzige Such- und Navigationsdienst, der die mit einem Roboter gesammelten Internetressourcen mit computerlinguistischen und statistischen Verfahren auch automatisch vollständig klassifiziert. Weit über eine Million HTML-Dokumente von wissenschaftlich relevanten Servern in Deutschland können wie bei anderen Suchmaschinen in der Datenbank gesucht, aber auch über die Navigation in der dreisprachigen Universalen Dezimalklassifikation (ETH-Bibliothek Zürich) recherchiert werden
Footnote
Vgl. auch: http://www.gerhard.de/info/Dokumente/Bericht/bericht.pdf
Theme
Automatisches Klassifizieren
Internet
Klassifikationssysteme im Online-Retrieval
Object
GERHARD
DK
Harvest
UDC

Similar documents (author)

  1. Wätjen, H.-J.: ORBIS, der Oldenburger Online-Benutzerkatalog (1991) 4.60
    4.5984483 = sum of:
      4.5984483 = weight(author_txt:wätjen in 2068) [ClassicSimilarity], result of:
        4.5984483 = fieldWeight in 2068, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.196897 = idf(docFreq=11, maxDocs=43556)
          0.5 = fieldNorm(doc=2068)
    
  2. Wätjen, H.-J.: Mensch oder Maschine? : Auswahl und Erschließung vonm Informationsressourcen im Internet (1996) 4.60
    4.5984483 = sum of:
      4.5984483 = weight(author_txt:wätjen in 3227) [ClassicSimilarity], result of:
        4.5984483 = fieldWeight in 3227, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.196897 = idf(docFreq=11, maxDocs=43556)
          0.5 = fieldNorm(doc=3227)
    
  3. Wätjen, H.-J.: Hypertextbasierte OPACs im World-wide Web (1996) 4.60
    4.5984483 = sum of:
      4.5984483 = weight(author_txt:wätjen in 5522) [ClassicSimilarity], result of:
        4.5984483 = fieldWeight in 5522, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.196897 = idf(docFreq=11, maxDocs=43556)
          0.5 = fieldNorm(doc=5522)
    
  4. Wätjen, H.-J.: Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web : das DFG-Projekt GERHARD (1998) 4.60
    4.5984483 = sum of:
      4.5984483 = weight(author_txt:wätjen in 4064) [ClassicSimilarity], result of:
        4.5984483 = fieldWeight in 4064, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.196897 = idf(docFreq=11, maxDocs=43556)
          0.5 = fieldNorm(doc=4064)
    
  5. Wätjen, H.-J.: Zur Realität virtueller Bibliotheken : Möglichkeiten, Aufgaben, Probleme (1999) 4.60
    4.5984483 = sum of:
      4.5984483 = weight(author_txt:wätjen in 5123) [ClassicSimilarity], result of:
        4.5984483 = fieldWeight in 5123, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.196897 = idf(docFreq=11, maxDocs=43556)
          0.5 = fieldNorm(doc=5123)
    

Similar documents (content)

  1. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.34
    0.34463185 = sum of:
      0.34463185 = product of:
        1.230828 = sum of:
          0.07836996 = weight(abstract_txt:einzige in 2775) [ClassicSimilarity], result of:
            0.07836996 = score(doc=2775,freq=1.0), product of:
              0.16010912 = queryWeight, product of:
                1.0180476 = boost
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.020081421 = queryNorm
              0.48947844 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.07968266 = weight(abstract_txt:intellektuelle in 2775) [ClassicSimilarity], result of:
            0.07968266 = score(doc=2775,freq=1.0), product of:
              0.16189206 = queryWeight, product of:
                1.0237002 = boost
                7.87514 = idf(docFreq=44, maxDocs=43556)
                0.020081421 = queryNorm
              0.49219626 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.87514 = idf(docFreq=44, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.10184147 = weight(abstract_txt:servern in 2775) [ClassicSimilarity], result of:
            0.10184147 = score(doc=2775,freq=1.0), product of:
              0.19066285 = queryWeight, product of:
                1.1109463 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.020081421 = queryNorm
              0.5341443 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.15120773 = weight(abstract_txt:internetressourcen in 2775) [ClassicSimilarity], result of:
            0.15120773 = score(doc=2775,freq=2.0), product of:
              0.19694984 = queryWeight, product of:
                1.1291142 = boost
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.020081421 = queryNorm
              0.7677474 = fieldWeight in 2775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.10882539 = weight(abstract_txt:klassifiziert in 2775) [ClassicSimilarity], result of:
            0.10882539 = score(doc=2775,freq=1.0), product of:
              0.1992828 = queryWeight, product of:
                1.1357819 = boost
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.020081421 = queryNorm
              0.54608524 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.737364 = idf(docFreq=18, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.13479842 = weight(abstract_txt:wissenschaftlich in 2775) [ClassicSimilarity], result of:
            0.13479842 = score(doc=2775,freq=1.0), product of:
              0.28958952 = queryWeight, product of:
                1.9362724 = boost
                7.447696 = idf(docFreq=68, maxDocs=43556)
                0.020081421 = queryNorm
              0.465481 = fieldWeight in 2775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.447696 = idf(docFreq=68, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
          0.5761024 = weight(abstract_txt:gerhard in 2775) [ClassicSimilarity], result of:
            0.5761024 = score(doc=2775,freq=8.0), product of:
              0.3813257 = queryWeight, product of:
                2.2218926 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.020081421 = queryNorm
              1.5107882 = fieldWeight in 2775, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.0625 = fieldNorm(doc=2775)
        0.28 = coord(7/25)
    
  2. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.13
    0.13218094 = sum of:
      0.13218094 = product of:
        1.1015079 = sum of:
          0.17868166 = weight(abstract_txt:klassifizieren in 1161) [ClassicSimilarity], result of:
            0.17868166 = score(doc=1161,freq=3.0), product of:
              0.21021184 = queryWeight, product of:
                1.1665105 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.020081421 = queryNorm
              0.85000753 = fieldWeight in 1161, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1161)
          0.8252952 = weight(title_txt:automatisches in 1161) [ClassicSimilarity], result of:
            0.8252952 = score(doc=1161,freq=1.0), product of:
              0.21021184 = queryWeight, product of:
                1.1665105 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.020081421 = queryNorm
              3.9260168 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.4375 = fieldNorm(doc=1161)
          0.09753114 = weight(abstract_txt:relevanten in 1161) [ClassicSimilarity], result of:
            0.09753114 = score(doc=1161,freq=1.0), product of:
              0.2551231 = queryWeight, product of:
                1.8173975 = boost
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.020081421 = queryNorm
              0.38229048 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.0546875 = fieldNorm(doc=1161)
        0.12 = coord(3/25)
    
  3. Gödert, W.; Lepsky, K.; Nagelschmidt, M.: Informationserschließung und Automatisches Indexieren : ein Lehr- und Arbeitsbuch (2011) 0.12
    0.1222411 = sum of:
      0.1222411 = product of:
        0.7640069 = sum of:
          0.036099296 = weight(abstract_txt:über in 4548) [ClassicSimilarity], result of:
            0.036099296 = score(doc=4548,freq=2.0), product of:
              0.08229538 = queryWeight, product of:
                1.0321974 = boost
                3.9702537 = idf(docFreq=2233, maxDocs=43556)
                0.020081421 = queryNorm
              0.4386552 = fieldWeight in 4548, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9702537 = idf(docFreq=2233, maxDocs=43556)
                0.078125 = fieldNorm(doc=4548)
          0.10224417 = weight(abstract_txt:indexieren in 4548) [ClassicSimilarity], result of:
            0.10224417 = score(doc=4548,freq=1.0), product of:
              0.16474111 = queryWeight, product of:
                1.0326687 = boost
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.020081421 = queryNorm
              0.6206354 = fieldWeight in 4548, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.078125 = fieldNorm(doc=4548)
          0.03616684 = weight(abstract_txt:können in 4548) [ClassicSimilarity], result of:
            0.03616684 = score(doc=4548,freq=1.0), product of:
              0.103814974 = queryWeight, product of:
                1.1593245 = boost
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.020081421 = queryNorm
              0.34837785 = fieldWeight in 4548, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.078125 = fieldNorm(doc=4548)
          0.5894966 = weight(title_txt:automatisches in 4548) [ClassicSimilarity], result of:
            0.5894966 = score(doc=4548,freq=1.0), product of:
              0.21021184 = queryWeight, product of:
                1.1665105 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.020081421 = queryNorm
              2.804298 = fieldWeight in 4548, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.3125 = fieldNorm(doc=4548)
        0.16 = coord(4/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.12
    0.118033044 = sum of:
      0.118033044 = product of:
        0.9836087 = sum of:
          0.17868166 = weight(abstract_txt:klassifizieren in 3485) [ClassicSimilarity], result of:
            0.17868166 = score(doc=3485,freq=3.0), product of:
              0.21021184 = queryWeight, product of:
                1.1665105 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.020081421 = queryNorm
              0.85000753 = fieldWeight in 3485, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3485)
          0.7073959 = weight(title_txt:automatisches in 3485) [ClassicSimilarity], result of:
            0.7073959 = score(doc=3485,freq=1.0), product of:
              0.21021184 = queryWeight, product of:
                1.1665105 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.020081421 = queryNorm
              3.3651574 = fieldWeight in 3485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.375 = fieldNorm(doc=3485)
          0.09753114 = weight(abstract_txt:relevanten in 3485) [ClassicSimilarity], result of:
            0.09753114 = score(doc=3485,freq=1.0), product of:
              0.2551231 = queryWeight, product of:
                1.8173975 = boost
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.020081421 = queryNorm
              0.38229048 = fieldWeight in 3485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3485)
        0.12 = coord(3/25)
    
  5. Summann, F.; Wolf, S.: Suchmaschinentechnologie und wissenschaftliche Suchumgebung : Warum braucht man eine wissenschaftliche Suchmaschine? (2006) 0.11
    0.107317455 = sum of:
      0.107317455 = product of:
        0.44715607 = sum of:
          0.026527457 = weight(abstract_txt:über in 956) [ClassicSimilarity], result of:
            0.026527457 = score(doc=956,freq=3.0), product of:
              0.08229538 = queryWeight, product of:
                1.0321974 = boost
                3.9702537 = idf(docFreq=2233, maxDocs=43556)
                0.020081421 = queryNorm
              0.32234442 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9702537 = idf(docFreq=2233, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
          0.0613465 = weight(abstract_txt:indexieren in 956) [ClassicSimilarity], result of:
            0.0613465 = score(doc=956,freq=1.0), product of:
              0.16474111 = queryWeight, product of:
                1.0326687 = boost
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.020081421 = queryNorm
              0.37238124 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9441333 = idf(docFreq=41, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
          0.0763811 = weight(abstract_txt:servern in 956) [ClassicSimilarity], result of:
            0.0763811 = score(doc=956,freq=1.0), product of:
              0.19066285 = queryWeight, product of:
                1.1109463 = boost
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.020081421 = queryNorm
              0.4006082 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5463085 = idf(docFreq=22, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
          0.021700105 = weight(abstract_txt:können in 956) [ClassicSimilarity], result of:
            0.021700105 = score(doc=956,freq=1.0), product of:
              0.103814974 = queryWeight, product of:
                1.1593245 = boost
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.020081421 = queryNorm
              0.20902672 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4592366 = idf(docFreq=1369, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
          0.1182256 = weight(abstract_txt:relevanten in 956) [ClassicSimilarity], result of:
            0.1182256 = score(doc=956,freq=2.0), product of:
              0.2551231 = queryWeight, product of:
                1.8173975 = boost
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.020081421 = queryNorm
              0.46340606 = fieldWeight in 956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9904547 = idf(docFreq=108, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
          0.14297532 = weight(abstract_txt:wissenschaftlich in 956) [ClassicSimilarity], result of:
            0.14297532 = score(doc=956,freq=2.0), product of:
              0.28958952 = queryWeight, product of:
                1.9362724 = boost
                7.447696 = idf(docFreq=68, maxDocs=43556)
                0.020081421 = queryNorm
              0.49371716 = fieldWeight in 956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.447696 = idf(docFreq=68, maxDocs=43556)
                0.046875 = fieldNorm(doc=956)
        0.24 = coord(6/25)