Document (#28156)

Author
Klein, H.
Title
Web Content Mining
Source
Wissensorganisation und Edutainment: Wissen im Spannungsfeld von Gesellschaft, Gestaltung und Industrie. Proceedings der 7. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Berlin, 21.-23.3.2001. Ed. by C. Lehner, H.P. Ohly and G. Rahmstorf
Imprint
Würzburg : Ergon Verlag
Year
2004
Pages
pp. 217-221
Series
Fortschritte in der Wissensorganisation; vol. 7
Abstract
Web mining is a buzzword that is read and heard more and more often as the Internet spreads. Current research, however, deals mainly with the usage behaviour of Internet users, and a look at the programmes of relevant conferences (e.g. GOR - German Online Research) shows that the analysis of content is hardly a topic. At GOR 1999, two talks were given on this subject; at the follow-up conference in 2001, not a single one. Web mining is the umbrella term for two types of mining: web usage mining and web content mining. Web usage mining is the analysis of data that arise when the WWW is used and that are logged by the servers. One can determine which pages were requested how often, how long visitors stayed on them, and much more. Web content mining examines the content of web pages, which can include not only text but also images, video and audio. Software for analysing web pages exists in its basic form, but most web pages first have to be prepared for the analysis software in question. First, the relevant websites containing the content sought have to be identified. This is usually done with search engines, of which there are now hundreds. However, one cannot assume that search engines cover all existing web pages. That is impossible, because the rapid growth of the Internet adds thousands of web pages every day, and existing ones change or are deleted. Often it is also unknown how the search engines work, since that is among the operators' trade secrets. One must therefore assume that search engines do not (and cannot) find all relevant websites. The next step is downloading the websites; software for this can be found under the names OfflineReader or Webspider.
The aim of these programs is to download a website in a form that allows it to be viewed offline. The structure of the website is usually retained. Anyone who wants to analyse the content of a website must therefore be able to process all of its files with their analysis software. Content analysis software assumes that only textual information in a single file is processed. QDA software (qualitative data analysis), by contrast, also processes audio and video content as well as Internet-specific communication such as chats.
Theme
Data Mining
Internet

Similar documents (author)

  1. Klein, W.: Organisation des Wissens durch Sprache : Konsequenzen für die maschinelle Sprachanalyse (1977) 4.97
    4.9683237 = sum of:
      4.9683237 = weight(author_txt:klein in 1748) [ClassicSimilarity], result of:
        4.9683237 = score(doc=1748,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.12579694 = queryNorm
          4.968324 = fieldWeight in 1748, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.625 = fieldNorm(doc=1748)
    
  2. Klein, H.: GENIOS jetzt mit Thesaurus-Suche (1993) 4.97
    4.9683237 = sum of:
      4.9683237 = weight(author_txt:klein in 7537) [ClassicSimilarity], result of:
        4.9683237 = score(doc=7537,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.12579694 = queryNorm
          4.968324 = fieldWeight in 7537, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.625 = fieldNorm(doc=7537)
    
  3. Klein, R.D.: The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 4.97
    4.9683237 = sum of:
      4.9683237 = weight(author_txt:klein in 936) [ClassicSimilarity], result of:
        4.9683237 = score(doc=936,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.12579694 = queryNorm
          4.968324 = fieldWeight in 936, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.625 = fieldNorm(doc=936)
    
  4. Klein, G.M.: Is there a standard default keyword operator? : a bibliometric analysis of processing options chosen by libraries to execute keyword searches in online public access catalogs (1994) 4.97
    4.9683237 = sum of:
      4.9683237 = weight(author_txt:klein in 2269) [ClassicSimilarity], result of:
        4.9683237 = score(doc=2269,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.12579694 = queryNorm
          4.968324 = fieldWeight in 2269, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.625 = fieldNorm(doc=2269)
    
  5. Klein, J.T.: Interdisciplinary needs : the current context (1996) 4.97
    4.9683237 = sum of:
      4.9683237 = weight(author_txt:klein in 246) [ClassicSimilarity], result of:
        4.9683237 = score(doc=246,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.12579694 = queryNorm
          4.968324 = fieldWeight in 246, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.9493184 = idf(docFreq=40, maxDocs=42740)
            0.625 = fieldNorm(doc=246)
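
The score breakdowns in the listing above follow the pattern of Lucene's ClassicSimilarity (TF-IDF). As a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the following Python reproduces the per-term arithmetic. The function names (classic_idf, classic_tf, term_score) are illustrative and not part of Lucene; the input values are copied from the first entry of the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as assumed here: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor as assumed here: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq: float, doc_freq: int, max_docs: int,
               field_norm: float, query_norm: float, boost: float = 1.0) -> float:
    """Score of one query term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm              # "queryWeight" in the listing
    field_weight = classic_tf(freq) * idf * field_norm   # "fieldWeight" in the listing
    return query_weight * field_weight


# Values taken from the first entry (doc=1748, term "klein" in author_txt).
score = term_score(freq=1.0, doc_freq=40, max_docs=42740,
                   field_norm=0.625, query_norm=0.12579694)
print(f"idf   = {classic_idf(40, 42740):.7f}")
print(f"score = {score:.7f}")
```

Lucene computes these values with 32-bit floats, so the last digits printed here can differ slightly from the listing. Where several terms match, the listing additionally multiplies the sum of the term scores by a coord factor (matched terms divided by query terms).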
    

Similar documents (content)

  1. Blittkowsky, R.: Das World Wide Web gleicht einer Fliege : Studien versuchen zu erklären, warum Suchmaschinen nicht immer fündig werden (2001) 0.28
    0.28378156 = sum of:
      0.28378156 = product of:
        0.7094539 = sum of:
          0.017803645 = weight(abstract_txt:einer in 2091) [ClassicSimilarity], result of:
            0.017803645 = score(doc=2091,freq=4.0), product of:
              0.07304304 = queryWeight, product of:
                1.0067317 = boost
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.018604374 = queryNorm
              0.24374183 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.082749575 = weight(abstract_txt:seiten in 2091) [ClassicSimilarity], result of:
            0.082749575 = score(doc=2091,freq=10.0), product of:
              0.1309402 = queryWeight, product of:
                1.1005638 = boost
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.018604374 = queryNorm
              0.6319646 = fieldWeight in 2091, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.028163856 = weight(abstract_txt:kann in 2091) [ClassicSimilarity], result of:
            0.028163856 = score(doc=2091,freq=4.0), product of:
              0.09916711 = queryWeight, product of:
                1.1730274 = boost
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.018604374 = queryNorm
              0.284004 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.017524015 = weight(abstract_txt:werden in 2091) [ClassicSimilarity], result of:
            0.017524015 = score(doc=2091,freq=4.0), product of:
              0.07955025 = queryWeight, product of:
                1.2131499 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.018604374 = queryNorm
              0.22028862 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.03969569 = weight(abstract_txt:alle in 2091) [ClassicSimilarity], result of:
            0.03969569 = score(doc=2091,freq=4.0), product of:
              0.1246623 = queryWeight, product of:
                1.3152003 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.018604374 = queryNorm
              0.31842577 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.025074003 = weight(abstract_txt:nicht in 2091) [ClassicSimilarity], result of:
            0.025074003 = score(doc=2091,freq=4.0), product of:
              0.1010109 = queryWeight, product of:
                1.3670293 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.018604374 = queryNorm
              0.24823065 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.050260544 = weight(abstract_txt:verarbeitet in 2091) [ClassicSimilarity], result of:
            0.050260544 = score(doc=2091,freq=1.0), product of:
              0.20232394 = queryWeight, product of:
                1.3680512 = boost
                7.9493184 = idf(docFreq=40, maxDocs=42740)
                0.018604374 = queryNorm
              0.2484162 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9493184 = idf(docFreq=40, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.059067782 = weight(abstract_txt:inhalte in 2091) [ClassicSimilarity], result of:
            0.059067782 = score(doc=2091,freq=3.0), product of:
              0.17883517 = queryWeight, product of:
                1.5752547 = boost
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.018604374 = queryNorm
              0.33029175 = fieldWeight in 2091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.08132468 = weight(abstract_txt:suchmaschinen in 2091) [ClassicSimilarity], result of:
            0.08132468 = score(doc=2091,freq=4.0), product of:
              0.22132683 = queryWeight, product of:
                2.023535 = boost
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.018604374 = queryNorm
              0.3674416 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
          0.30779004 = weight(abstract_txt:webseiten in 2091) [ClassicSimilarity], result of:
            0.30779004 = score(doc=2091,freq=11.0), product of:
              0.4132834 = queryWeight, product of:
                3.0915246 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.018604374 = queryNorm
              0.7447433 = fieldWeight in 2091, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.03125 = fieldNorm(doc=2091)
        0.4 = coord(10/25)
    
  2. Schweibenz, W.: Proactive Web design : Maßnahmen zur Verbesserung der Auffindbarkeit von Webseiten durch Suchmaschinen (1999) 0.24
    0.24232967 = sum of:
      0.24232967 = product of:
        1.009707 = sum of:
          0.03560729 = weight(abstract_txt:einer in 5066) [ClassicSimilarity], result of:
            0.03560729 = score(doc=5066,freq=1.0), product of:
              0.07304304 = queryWeight, product of:
                1.0067317 = boost
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.018604374 = queryNorm
              0.48748365 = fieldWeight in 5066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
          0.10467085 = weight(abstract_txt:seiten in 5066) [ClassicSimilarity], result of:
            0.10467085 = score(doc=5066,freq=1.0), product of:
              0.1309402 = queryWeight, product of:
                1.1005638 = boost
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.018604374 = queryNorm
              0.79937905 = fieldWeight in 5066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
          0.03504803 = weight(abstract_txt:werden in 5066) [ClassicSimilarity], result of:
            0.03504803 = score(doc=5066,freq=1.0), product of:
              0.07955025 = queryWeight, product of:
                1.2131499 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.018604374 = queryNorm
              0.44057724 = fieldWeight in 5066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
          0.07939138 = weight(abstract_txt:alle in 5066) [ClassicSimilarity], result of:
            0.07939138 = score(doc=5066,freq=1.0), product of:
              0.1246623 = queryWeight, product of:
                1.3152003 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.018604374 = queryNorm
              0.63685155 = fieldWeight in 5066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
          0.23002093 = weight(abstract_txt:suchmaschinen in 5066) [ClassicSimilarity], result of:
            0.23002093 = score(doc=5066,freq=2.0), product of:
              0.22132683 = queryWeight, product of:
                2.023535 = boost
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.018604374 = queryNorm
              1.0392817 = fieldWeight in 5066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
          0.52496845 = weight(abstract_txt:webseiten in 5066) [ClassicSimilarity], result of:
            0.52496845 = score(doc=5066,freq=2.0), product of:
              0.4132834 = queryWeight, product of:
                3.0915246 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.018604374 = queryNorm
              1.2702384 = fieldWeight in 5066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.125 = fieldNorm(doc=5066)
        0.24 = coord(6/25)
    
  3. Charlier, M.: Pingpong mit Pingback : Lass mich Deine Suchmaschine sein: Webseiten finden neue Wege der Vernetzung (2003) 0.24
    0.24106501 = sum of:
      0.24106501 = product of:
        0.6026625 = sum of:
          0.012589077 = weight(abstract_txt:einer in 2476) [ClassicSimilarity], result of:
            0.012589077 = score(doc=2476,freq=2.0), product of:
              0.07304304 = queryWeight, product of:
                1.0067317 = boost
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.018604374 = queryNorm
              0.1723515 = fieldWeight in 2476, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.06923325 = weight(abstract_txt:seiten in 2476) [ClassicSimilarity], result of:
            0.06923325 = score(doc=2476,freq=7.0), product of:
              0.1309402 = queryWeight, product of:
                1.1005638 = boost
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.018604374 = queryNorm
              0.5287395 = fieldWeight in 2476, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.024390614 = weight(abstract_txt:kann in 2476) [ClassicSimilarity], result of:
            0.024390614 = score(doc=2476,freq=3.0), product of:
              0.09916711 = queryWeight, product of:
                1.1730274 = boost
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.018604374 = queryNorm
              0.24595468 = fieldWeight in 2476, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.017524015 = weight(abstract_txt:werden in 2476) [ClassicSimilarity], result of:
            0.017524015 = score(doc=2476,freq=4.0), product of:
              0.07955025 = queryWeight, product of:
                1.2131499 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.018604374 = queryNorm
              0.22028862 = fieldWeight in 2476, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.04158053 = weight(abstract_txt:nicht in 2476) [ClassicSimilarity], result of:
            0.04158053 = score(doc=2476,freq=11.0), product of:
              0.1010109 = queryWeight, product of:
                1.3670293 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.018604374 = queryNorm
              0.41164398 = fieldWeight in 2476, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.02849692 = weight(abstract_txt:software in 2476) [ClassicSimilarity], result of:
            0.02849692 = score(doc=2476,freq=3.0), product of:
              0.12107766 = queryWeight, product of:
                1.4966688 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.018604374 = queryNorm
              0.23536068 = fieldWeight in 2476, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.04676762 = weight(abstract_txt:dass in 2476) [ClassicSimilarity], result of:
            0.04676762 = score(doc=2476,freq=6.0), product of:
              0.1337064 = queryWeight, product of:
                1.5727866 = boost
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.018604374 = queryNorm
              0.34977844 = fieldWeight in 2476, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.06433323 = weight(abstract_txt:davon in 2476) [ClassicSimilarity], result of:
            0.06433323 = score(doc=2476,freq=3.0), product of:
              0.18931109 = queryWeight, product of:
                1.6207362 = boost
                6.2783957 = idf(docFreq=217, maxDocs=42740)
                0.018604374 = queryNorm
              0.33982813 = fieldWeight in 2476, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2783957 = idf(docFreq=217, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.07042924 = weight(abstract_txt:suchmaschinen in 2476) [ClassicSimilarity], result of:
            0.07042924 = score(doc=2476,freq=3.0), product of:
              0.22132683 = queryWeight, product of:
                2.023535 = boost
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.018604374 = queryNorm
              0.31821376 = fieldWeight in 2476, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
          0.22731802 = weight(abstract_txt:webseiten in 2476) [ClassicSimilarity], result of:
            0.22731802 = score(doc=2476,freq=6.0), product of:
              0.4132834 = queryWeight, product of:
                3.0915246 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.018604374 = queryNorm
              0.5500294 = fieldWeight in 2476, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.03125 = fieldNorm(doc=2476)
        0.4 = coord(10/25)
    
  4. James, M.: Suchmaschine mit Mehrwert : Mirago (2004) 0.23
    0.2313993 = sum of:
      0.2313993 = product of:
        0.57849824 = sum of:
          0.022254555 = weight(abstract_txt:einer in 3318) [ClassicSimilarity], result of:
            0.022254555 = score(doc=3318,freq=4.0), product of:
              0.07304304 = queryWeight, product of:
                1.0067317 = boost
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.018604374 = queryNorm
              0.30467728 = fieldWeight in 3318, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.07314098 = weight(abstract_txt:seiten in 3318) [ClassicSimilarity], result of:
            0.07314098 = score(doc=3318,freq=5.0), product of:
              0.1309402 = queryWeight, product of:
                1.1005638 = boost
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.018604374 = queryNorm
              0.5585831 = fieldWeight in 3318, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.04140772 = weight(abstract_txt:enthalten in 3318) [ClassicSimilarity], result of:
            0.04140772 = score(doc=3318,freq=1.0), product of:
              0.15322985 = queryWeight, product of:
                1.1905575 = boost
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.018604374 = queryNorm
              0.27023274 = fieldWeight in 3318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.04288837 = weight(abstract_txt:relevanten in 3318) [ClassicSimilarity], result of:
            0.04288837 = score(doc=3318,freq=1.0), product of:
              0.15686119 = queryWeight, product of:
                1.2045822 = boost
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.018604374 = queryNorm
              0.27341607 = fieldWeight in 3318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9994516 = idf(docFreq=105, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.024490556 = weight(abstract_txt:werden in 3318) [ClassicSimilarity], result of:
            0.024490556 = score(doc=3318,freq=5.0), product of:
              0.07955025 = queryWeight, product of:
                1.2131499 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.018604374 = queryNorm
              0.3078627 = fieldWeight in 3318, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.022162495 = weight(abstract_txt:nicht in 3318) [ClassicSimilarity], result of:
            0.022162495 = score(doc=3318,freq=2.0), product of:
              0.1010109 = queryWeight, product of:
                1.3670293 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.018604374 = queryNorm
              0.21940696 = fieldWeight in 3318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.02056588 = weight(abstract_txt:software in 3318) [ClassicSimilarity], result of:
            0.02056588 = score(doc=3318,freq=1.0), product of:
              0.12107766 = queryWeight, product of:
                1.4966688 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.018604374 = queryNorm
              0.16985694 = fieldWeight in 3318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.0426285 = weight(abstract_txt:inhalte in 3318) [ClassicSimilarity], result of:
            0.0426285 = score(doc=3318,freq=1.0), product of:
              0.17883517 = queryWeight, product of:
                1.5752547 = boost
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.018604374 = queryNorm
              0.23836754 = fieldWeight in 3318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.08803655 = weight(abstract_txt:suchmaschinen in 3318) [ClassicSimilarity], result of:
            0.08803655 = score(doc=3318,freq=3.0), product of:
              0.22132683 = queryWeight, product of:
                2.023535 = boost
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.018604374 = queryNorm
              0.3977672 = fieldWeight in 3318, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
          0.20092262 = weight(abstract_txt:webseiten in 3318) [ClassicSimilarity], result of:
            0.20092262 = score(doc=3318,freq=3.0), product of:
              0.4132834 = queryWeight, product of:
                3.0915246 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.018604374 = queryNorm
              0.48616186 = fieldWeight in 3318, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3318)
        0.4 = coord(10/25)
    
  5. Jörn, F.: Wie Google für uns nach der ominösen Gluonenkraft stöbert : Software-Krabbler machen sich vor der Anfrage auf die Suche - Das Netz ist etwa fünfhundertmal größer als alles Durchforschte (2001) 0.23
    0.23058698 = sum of:
      0.23058698 = product of:
        0.48038954 = sum of:
          0.009636505 = weight(abstract_txt:einer in 685) [ClassicSimilarity], result of:
            0.009636505 = score(doc=685,freq=3.0), product of:
              0.07304304 = queryWeight, product of:
                1.0067317 = boost
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.018604374 = queryNorm
              0.13192913 = fieldWeight in 685, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8998692 = idf(docFreq=2351, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.061194137 = weight(abstract_txt:seiten in 685) [ClassicSimilarity], result of:
            0.061194137 = score(doc=685,freq=14.0), product of:
              0.1309402 = queryWeight, product of:
                1.1005638 = boost
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.018604374 = queryNorm
              0.46734416 = fieldWeight in 685, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                6.3950324 = idf(docFreq=193, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.01760241 = weight(abstract_txt:kann in 685) [ClassicSimilarity], result of:
            0.01760241 = score(doc=685,freq=4.0), product of:
              0.09916711 = queryWeight, product of:
                1.1730274 = boost
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.018604374 = queryNorm
              0.1775025 = fieldWeight in 685, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.544064 = idf(docFreq=1234, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.02927968 = weight(abstract_txt:enthalten in 685) [ClassicSimilarity], result of:
            0.02927968 = score(doc=685,freq=2.0), product of:
              0.15322985 = queryWeight, product of:
                1.1905575 = boost
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.018604374 = queryNorm
              0.1910834 = fieldWeight in 685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.018162683 = weight(abstract_txt:werden in 685) [ClassicSimilarity], result of:
            0.018162683 = score(doc=685,freq=11.0), product of:
              0.07955025 = queryWeight, product of:
                1.2131499 = boost
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.018604374 = queryNorm
              0.2283171 = fieldWeight in 685, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.012404903 = weight(abstract_txt:alle in 685) [ClassicSimilarity], result of:
            0.012404903 = score(doc=685,freq=1.0), product of:
              0.1246623 = queryWeight, product of:
                1.3152003 = boost
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.018604374 = queryNorm
              0.099508055 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0948124 = idf(docFreq=711, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.03590735 = weight(abstract_txt:nicht in 685) [ClassicSimilarity], result of:
            0.03590735 = score(doc=685,freq=21.0), product of:
              0.1010109 = queryWeight, product of:
                1.3670293 = boost
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.018604374 = queryNorm
              0.35547996 = fieldWeight in 685, product of:
                4.582576 = tf(freq=21.0), with freq of:
                  21.0 = termFreq=21.0
                3.9716904 = idf(docFreq=2188, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.01028294 = weight(abstract_txt:software in 685) [ClassicSimilarity], result of:
            0.01028294 = score(doc=685,freq=1.0), product of:
              0.12107766 = queryWeight, product of:
                1.4966688 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.018604374 = queryNorm
              0.08492847 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.0602858 = weight(abstract_txt:inhalte in 685) [ClassicSimilarity], result of:
            0.0602858 = score(doc=685,freq=8.0), product of:
              0.17883517 = queryWeight, product of:
                1.5752547 = boost
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.018604374 = queryNorm
              0.3371026 = fieldWeight in 685, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.102209 = idf(docFreq=259, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.032829914 = weight(abstract_txt:davon in 685) [ClassicSimilarity], result of:
            0.032829914 = score(doc=685,freq=2.0), product of:
              0.18931109 = queryWeight, product of:
                1.6207362 = boost
                6.2783957 = idf(docFreq=217, maxDocs=42740)
                0.018604374 = queryNorm
              0.1734178 = fieldWeight in 685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2783957 = idf(docFreq=217, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.11077691 = weight(abstract_txt:suchmaschinen in 685) [ClassicSimilarity], result of:
            0.11077691 = score(doc=685,freq=19.0), product of:
              0.22132683 = queryWeight, product of:
                2.023535 = boost
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.018604374 = queryNorm
              0.5005128 = fieldWeight in 685, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                5.8790655 = idf(docFreq=324, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
          0.08202632 = weight(abstract_txt:webseiten in 685) [ClassicSimilarity], result of:
            0.08202632 = score(doc=685,freq=2.0), product of:
              0.4132834 = queryWeight, product of:
                3.0915246 = boost
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.018604374 = queryNorm
              0.19847475 = fieldWeight in 685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1855536 = idf(docFreq=87, maxDocs=42740)
                0.01953125 = fieldNorm(doc=685)
        0.48 = coord(12/25)
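
The per-term arithmetic in the tree above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas that the printed values are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and score = queryWeight * fieldWeight. The function names (`tf`, `idf`, `term_score`) are hypothetical helpers for illustration, not part of any Lucene API. The boost, queryNorm, and fieldNorm values are copied from the explain output rather than derived.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # query-side factor
    field_weight = tf(freq) * term_idf * field_norm   # document-side factor
    return query_weight * field_weight


# Values taken from the explain entries for doc 685 (hypothetical variable names).
QUERY_NORM = 0.018604374
FIELD_NORM = 0.01953125
MAX_DOCS = 42740

kann = term_score(freq=4.0, doc_freq=1234, max_docs=MAX_DOCS,
                  boost=1.1730274, query_norm=QUERY_NORM, field_norm=FIELD_NORM)
suchmaschinen = term_score(freq=19.0, doc_freq=324, max_docs=MAX_DOCS,
                           boost=2.023535, query_norm=QUERY_NORM,
                           field_norm=FIELD_NORM)

print(f"abstract_txt:kann          {kann:.8f}")
print(f"abstract_txt:suchmaschinen {suchmaschinen:.8f}")
```

Small differences in the last digits are to be expected, since the explain output uses single-precision floats. The final line of the tree, coord(12/25) = 0.48, is a separate factor: it scales the sum of all matching term scores by the fraction of query terms (12 of 25) that matched the document.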