Document (#21066)

Author
Wätjen, H.-J.
Title
GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web
Source
B.I.T.online. 1(1998) H.4, S.279-290
Year
1998
Abstract
Die intellektuelle Erschließung des Internet befindet sich in einer Krise. Yahoo und andere Dienste können mit dem Wachstum des Web nicht mithalten. GERHARD ist derzeit weltweit der einzige Such- und Navigationsdienst, der die mit einem Roboter gesammelten Internetressourcen mit computerlinguistischen und statistischen Verfahren auch automatisch vollständig klassifiziert. Weit über eine Million HTML-Dokumente von wissenschaftlich relevanten Servern in Deutschland können wie bei anderen Suchmaschinen in der Datenbank gesucht, aber auch über die Navigation in der dreisprachigen Universalen Dezimalklassifikation (ETH-Bibliothek Zürich) recherchiert werden
Footnote
Vgl. auch: http://www.gerhard.de/info/Dokumente/Bericht/bericht.pdf
Theme
Automatisches Klassifizieren
Internet
Klassifikationssysteme im Online-Retrieval
Object
GERHARD
DK
Harvest
UDC

Similar documents (author)

  1. Wätjen, H.-J.: ORBIS, der Oldenburger Online-Benutzerkatalog (1991) 4.59
    4.5889916 = sum of:
      4.5889916 = weight(author_txt:wätjen in 2068) [ClassicSimilarity], result of:
        4.5889916 = score(doc=2068,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.10895638 = queryNorm
          4.588992 = fieldWeight in 2068, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.5 = fieldNorm(doc=2068)
    
  2. Wätjen, H.-J.: Mensch oder Maschine? : Auswahl und Erschließung vonm Informationsressourcen im Internet (1996) 4.59
    4.5889916 = sum of:
      4.5889916 = weight(author_txt:wätjen in 3230) [ClassicSimilarity], result of:
        4.5889916 = score(doc=3230,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.10895638 = queryNorm
          4.588992 = fieldWeight in 3230, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.5 = fieldNorm(doc=3230)
    
  3. Wätjen, H.-J.: Hypertextbasierte OPACs im World-wide Web (1996) 4.59
    4.5889916 = sum of:
      4.5889916 = weight(author_txt:wätjen in 5525) [ClassicSimilarity], result of:
        4.5889916 = score(doc=5525,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.10895638 = queryNorm
          4.588992 = fieldWeight in 5525, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.5 = fieldNorm(doc=5525)
    
  4. Wätjen, H.-J.: Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web : das DFG-Projekt GERHARD (1998) 4.59
    4.5889916 = sum of:
      4.5889916 = weight(author_txt:wätjen in 4067) [ClassicSimilarity], result of:
        4.5889916 = score(doc=4067,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.10895638 = queryNorm
          4.588992 = fieldWeight in 4067, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.5 = fieldNorm(doc=4067)
    
  5. Wätjen, H.-J.: Zur Realität virtueller Bibliotheken : Möglichkeiten, Aufgaben, Probleme (1999) 4.59
    4.5889916 = sum of:
      4.5889916 = weight(author_txt:wätjen in 5126) [ClassicSimilarity], result of:
        4.5889916 = score(doc=5126,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.10895638 = queryNorm
          4.588992 = fieldWeight in 5126, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.177984 = idf(docFreq=11, maxDocs=42740)
            0.5 = fieldNorm(doc=5126)
    

Similar documents (content)

  1. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.35
    0.34855986 = sum of:
      0.34855986 = product of:
        1.2448566 = sum of:
          0.078440025 = weight(abstract_txt:einzige in 2778) [ClassicSimilarity], result of:
            0.078440025 = score(doc=2778,freq=1.0), product of:
              0.15975101 = queryWeight, product of:
                1.0187484 = boost
                7.856228 = idf(docFreq=44, maxDocs=42740)
                0.019960094 = queryNorm
              0.49101424 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.856228 = idf(docFreq=44, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.07911508 = weight(abstract_txt:intellektuelle in 2778) [ClassicSimilarity], result of:
            0.07911508 = score(doc=2778,freq=1.0), product of:
              0.16066624 = queryWeight, product of:
                1.0216625 = boost
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.019960094 = queryNorm
              0.4924188 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.100310124 = weight(abstract_txt:servern in 2778) [ClassicSimilarity], result of:
            0.100310124 = score(doc=2778,freq=1.0), product of:
              0.18821244 = queryWeight, product of:
                1.1057814 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.019960094 = queryNorm
              0.53296226 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.14895004 = weight(abstract_txt:internetressourcen in 2778) [ClassicSimilarity], result of:
            0.14895004 = score(doc=2778,freq=2.0), product of:
              0.19443251 = queryWeight, product of:
                1.123905 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.019960094 = queryNorm
              0.7660758 = fieldWeight in 2778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.10720462 = weight(abstract_txt:klassifiziert in 2778) [ClassicSimilarity], result of:
            0.10720462 = score(doc=2778,freq=1.0), product of:
              0.19674067 = queryWeight, product of:
                1.1305563 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.019960094 = queryNorm
              0.5449032 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.1350366 = weight(abstract_txt:wissenschaftlich in 2778) [ClassicSimilarity], result of:
            0.1350366 = score(doc=2778,freq=1.0), product of:
              0.2891098 = queryWeight, product of:
                1.9381685 = boost
                7.4732356 = idf(docFreq=65, maxDocs=42740)
                0.019960094 = queryNorm
              0.46707723 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4732356 = idf(docFreq=65, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
          0.59580016 = weight(abstract_txt:gerhard in 2778) [ClassicSimilarity], result of:
            0.59580016 = score(doc=2778,freq=8.0), product of:
              0.38886502 = queryWeight, product of:
                2.24781 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.019960094 = queryNorm
              1.5321516 = fieldWeight in 2778, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.0625 = fieldNorm(doc=2778)
        0.28 = coord(7/25)
    
  2. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.13
    0.13035275 = sum of:
      0.13035275 = product of:
        1.086273 = sum of:
          0.17605068 = weight(abstract_txt:klassifizieren in 1164) [ClassicSimilarity], result of:
            0.17605068 = score(doc=1164,freq=3.0), product of:
              0.20755403 = queryWeight, product of:
                1.1612098 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.019960094 = queryNorm
              0.8482162 = fieldWeight in 1164, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1164)
          0.81314325 = weight(title_txt:automatisches in 1164) [ClassicSimilarity], result of:
            0.81314325 = score(doc=1164,freq=1.0), product of:
              0.20755403 = queryWeight, product of:
                1.1612098 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.019960094 = queryNorm
              3.9177427 = fieldWeight in 1164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.4375 = fieldNorm(doc=1164)
          0.09707907 = weight(abstract_txt:relevanten in 1164) [ClassicSimilarity], result of:
            0.09707907 = score(doc=1164,freq=1.0), product of:
              0.25361416 = queryWeight, product of:
                1.8152936 = boost
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.019960094 = queryNorm
              0.38278252 = fieldWeight in 1164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.0546875 = fieldNorm(doc=1164)
        0.12 = coord(3/25)
    
  3. Gödert, W.; Lepsky, K.; Nagelschmidt, M.: Informationserschließung und Automatisches Indexieren : ein Lehr- und Arbeitsbuch (2011) 0.12
    0.12063837 = sum of:
      0.12063837 = product of:
        0.7539898 = sum of:
          0.10065599 = weight(abstract_txt:indexieren in 4551) [ClassicSimilarity], result of:
            0.10065599 = score(doc=4551,freq=1.0), product of:
              0.16256917 = queryWeight, product of:
                1.027695 = boost
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.019960094 = queryNorm
              0.6191579 = fieldWeight in 4551, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.078125 = fieldNorm(doc=4551)
          0.036232527 = weight(abstract_txt:über in 4551) [ClassicSimilarity], result of:
            0.036232527 = score(doc=4551,freq=2.0), product of:
              0.0822642 = queryWeight, product of:
                1.0338691 = boost
                3.9864168 = idf(docFreq=2156, maxDocs=42740)
                0.019960094 = queryNorm
              0.440441 = fieldWeight in 4551, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9864168 = idf(docFreq=2156, maxDocs=42740)
                0.078125 = fieldNorm(doc=4551)
          0.036284644 = weight(abstract_txt:können in 4551) [ClassicSimilarity], result of:
            0.036284644 = score(doc=4551,freq=1.0), product of:
              0.103745766 = queryWeight, product of:
                1.1610351 = boost
                4.476746 = idf(docFreq=1320, maxDocs=42740)
                0.019960094 = queryNorm
              0.34974578 = fieldWeight in 4551, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.476746 = idf(docFreq=1320, maxDocs=42740)
                0.078125 = fieldNorm(doc=4551)
          0.5808166 = weight(title_txt:automatisches in 4551) [ClassicSimilarity], result of:
            0.5808166 = score(doc=4551,freq=1.0), product of:
              0.20755403 = queryWeight, product of:
                1.1612098 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.019960094 = queryNorm
              2.7983878 = fieldWeight in 4551, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.3125 = fieldNorm(doc=4551)
        0.16 = coord(4/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.12
    0.11641316 = sum of:
      0.11641316 = product of:
        0.9701097 = sum of:
          0.17605068 = weight(abstract_txt:klassifizieren in 3488) [ClassicSimilarity], result of:
            0.17605068 = score(doc=3488,freq=3.0), product of:
              0.20755403 = queryWeight, product of:
                1.1612098 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.019960094 = queryNorm
              0.8482162 = fieldWeight in 3488, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3488)
          0.69697994 = weight(title_txt:automatisches in 3488) [ClassicSimilarity], result of:
            0.69697994 = score(doc=3488,freq=1.0), product of:
              0.20755403 = queryWeight, product of:
                1.1612098 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.019960094 = queryNorm
              3.3580651 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.375 = fieldNorm(doc=3488)
          0.09707907 = weight(abstract_txt:relevanten in 3488) [ClassicSimilarity], result of:
            0.09707907 = score(doc=3488,freq=1.0), product of:
              0.25361416 = queryWeight, product of:
                1.8152936 = boost
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.019960094 = queryNorm
              0.38278252 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3488)
        0.12 = coord(3/25)
    
  5. Summann, F.; Wolf, S.: Suchmaschinentechnologie und wissenschaftliche Suchumgebung : Warum braucht man eine wissenschaftliche Suchmaschine? (2006) 0.11
    0.10678269 = sum of:
      0.10678269 = product of:
        0.44492787 = sum of:
          0.06039359 = weight(abstract_txt:indexieren in 959) [ClassicSimilarity], result of:
            0.06039359 = score(doc=959,freq=1.0), product of:
              0.16256917 = queryWeight, product of:
                1.027695 = boost
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.019960094 = queryNorm
              0.37149474 = fieldWeight in 959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
          0.02662536 = weight(abstract_txt:über in 959) [ClassicSimilarity], result of:
            0.02662536 = score(doc=959,freq=3.0), product of:
              0.0822642 = queryWeight, product of:
                1.0338691 = boost
                3.9864168 = idf(docFreq=2156, maxDocs=42740)
                0.019960094 = queryNorm
              0.3236567 = fieldWeight in 959, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9864168 = idf(docFreq=2156, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
          0.075232595 = weight(abstract_txt:servern in 959) [ClassicSimilarity], result of:
            0.075232595 = score(doc=959,freq=1.0), product of:
              0.18821244 = queryWeight, product of:
                1.1057814 = boost
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.019960094 = queryNorm
              0.39972168 = fieldWeight in 959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.527396 = idf(docFreq=22, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
          0.021770788 = weight(abstract_txt:können in 959) [ClassicSimilarity], result of:
            0.021770788 = score(doc=959,freq=1.0), product of:
              0.103745766 = queryWeight, product of:
                1.1610351 = boost
                4.476746 = idf(docFreq=1320, maxDocs=42740)
                0.019960094 = queryNorm
              0.20984748 = fieldWeight in 959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.476746 = idf(docFreq=1320, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
          0.117677584 = weight(abstract_txt:relevanten in 959) [ClassicSimilarity], result of:
            0.117677584 = score(doc=959,freq=2.0), product of:
              0.25361416 = queryWeight, product of:
                1.8152936 = boost
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.019960094 = queryNorm
              0.46400243 = fieldWeight in 959, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
          0.14322795 = weight(abstract_txt:wissenschaftlich in 959) [ClassicSimilarity], result of:
            0.14322795 = score(doc=959,freq=2.0), product of:
              0.2891098 = queryWeight, product of:
                1.9381685 = boost
                7.4732356 = idf(docFreq=65, maxDocs=42740)
                0.019960094 = queryNorm
              0.4954102 = fieldWeight in 959, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4732356 = idf(docFreq=65, maxDocs=42740)
                0.046875 = fieldNorm(doc=959)
        0.24 = coord(6/25)