Search (39 results, page 1 of 2)

  • × type_ss:"a"
  • × theme_ss:"Data Mining"
  1. Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.01
    0.014178077 = product of:
      0.04253423 = sum of:
        0.03788 = weight(_text_:suchmaschinen in 1605) [ClassicSimilarity], result of:
          0.03788 = score(doc=1605,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.24681194 = fieldWeight in 1605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1605)
        0.0046542287 = product of:
          0.023271143 = sum of:
            0.023271143 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
              0.023271143 = score(doc=1605,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.19345059 = fieldWeight in 1605, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1605)
          0.2 = coord(1/5)
      0.33333334 = coord(2/6)
    
    Source
    Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
    Theme
    Suchmaschinen
  2. Hölzig, C.: Google spürt Grippewellen auf : Die neue Anwendung ist bisher auf die USA beschränkt (2008) 0.01
    0.011342461 = product of:
      0.034027383 = sum of:
        0.030304 = weight(_text_:suchmaschinen in 2403) [ClassicSimilarity], result of:
          0.030304 = score(doc=2403,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.19744955 = fieldWeight in 2403, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03125 = fieldNorm(doc=2403)
        0.003723383 = product of:
          0.018616915 = sum of:
            0.018616915 = weight(_text_:22 in 2403) [ClassicSimilarity], result of:
              0.018616915 = score(doc=2403,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.15476047 = fieldWeight in 2403, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=2403)
          0.2 = coord(1/5)
      0.33333334 = coord(2/6)
    
    Date
    3. 5.1997 8:44:22
    Theme
    Suchmaschinen
  3. Klein, H.: Web Content Mining (2004) 0.01
    0.010101333 = product of:
      0.060608 = sum of:
        0.060608 = weight(_text_:suchmaschinen in 3154) [ClassicSimilarity], result of:
          0.060608 = score(doc=3154,freq=8.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.3948991 = fieldWeight in 3154, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03125 = fieldNorm(doc=3154)
      0.16666667 = coord(1/6)
    
    Abstract
    Web Mining - ein Schlagwort, das mit der Verbreitung des Internets immer öfter zu lesen und zu hören ist. Die gegenwärtige Forschung beschäftigt sich aber eher mit dem Nutzungsverhalten der Internetnutzer, und ein Blick in Tagungsprogramme einschlägiger Konferenzen (z.B. GOR - German Online Research) zeigt, dass die Analyse der Inhalte kaum Thema ist. Auf der GOR wurden 1999 zwei Vorträge zu diesem Thema gehalten, auf der Folgekonferenz 2001 kein einziger. Web Mining ist der Oberbegriff für zwei Typen von Web Mining: Web Usage Mining und Web Content Mining. Unter Web Usage Mining versteht man das Analysieren von Daten, wie sie bei der Nutzung des WWW anfallen und von den Servern protokolliert wenden. Man kann ermitteln, welche Seiten wie oft aufgerufen wurden, wie lange auf den Seiten verweilt wurde und vieles andere mehr. Beim Web Content Mining wird der Inhalt der Webseiten untersucht, der nicht nur Text, sondern auf Bilder, Video- und Audioinhalte enthalten kann. Die Software für die Analyse von Webseiten ist in den Grundzügen vorhanden, doch müssen die meisten Webseiten für die entsprechende Analysesoftware erst aufbereitet werden. Zuerst müssen die relevanten Websites ermittelt werden, die die gesuchten Inhalte enthalten. Das geschieht meist mit Suchmaschinen, von denen es mittlerweile Hunderte gibt. Allerdings kann man nicht davon ausgehen, dass die Suchmaschinen alle existierende Webseiten erfassen. Das ist unmöglich, denn durch das schnelle Wachstum des Internets kommen täglich Tausende von Webseiten hinzu, und bereits bestehende ändern sich der werden gelöscht. Oft weiß man auch nicht, wie die Suchmaschinen arbeiten, denn das gehört zu den Geschäftsgeheimnissen der Betreiber. Man muss also davon ausgehen, dass die Suchmaschinen nicht alle relevanten Websites finden (können). Der nächste Schritt ist das Herunterladen der Websites, dafür gibt es Software, die unter den Bezeichnungen OfflineReader oder Webspider zu finden ist. Das Ziel dieser Programme ist, die Website in einer Form herunterzuladen, die es erlaubt, die Website offline zu betrachten. Die Struktur der Website wird in der Regel beibehalten. Wer die Inhalte einer Website analysieren will, muss also alle Dateien mit seiner Analysesoftware verarbeiten können. Software für Inhaltsanalyse geht davon aus, dass nur Textinformationen in einer einzigen Datei verarbeitet werden. QDA Software (qualitative data analysis) verarbeitet dagegen auch Audiound Videoinhalte sowie internetspezifische Kommunikation wie z.B. Chats.
  4. Search tools (1997) 0.01
    0.008838667 = product of:
      0.053032 = sum of:
        0.053032 = weight(_text_:suchmaschinen in 3834) [ClassicSimilarity], result of:
          0.053032 = score(doc=3834,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.3455367 = fieldWeight in 3834, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3834)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  5. Baeza-Yates, R.; Hurtado, C.; Mendoza, M.: Improving search engines by query clustering (2007) 0.01
    0.008838667 = product of:
      0.053032 = sum of:
        0.053032 = weight(_text_:suchmaschinen in 601) [ClassicSimilarity], result of:
          0.053032 = score(doc=601,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.3455367 = fieldWeight in 601, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0546875 = fieldNorm(doc=601)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  6. Whittle, M.; Eaglestone, B.; Ford, N.; Gillet, V.J.; Madden, A.: Data mining of search engine logs (2007) 0.01
    0.007576 = product of:
      0.045456 = sum of:
        0.045456 = weight(_text_:suchmaschinen in 1330) [ClassicSimilarity], result of:
          0.045456 = score(doc=1330,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.29617432 = fieldWeight in 1330, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.046875 = fieldNorm(doc=1330)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  7. Shi, X.; Yang, C.C.: Mining related queries from Web search engine query logs using an improved association rule mining model (2007) 0.01
    0.0063133333 = product of:
      0.03788 = sum of:
        0.03788 = weight(_text_:suchmaschinen in 597) [ClassicSimilarity], result of:
          0.03788 = score(doc=597,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.24681194 = fieldWeight in 597, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0390625 = fieldNorm(doc=597)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  8. Liu, Y.; Zhang, M.; Cen, R.; Ru, L.; Ma, S.: Data cleansing for Web information retrieval using query independent features (2007) 0.01
    0.0063133333 = product of:
      0.03788 = sum of:
        0.03788 = weight(_text_:suchmaschinen in 607) [ClassicSimilarity], result of:
          0.03788 = score(doc=607,freq=2.0), product of:
            0.15347718 = queryWeight, product of:
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.03435205 = queryNorm
            0.24681194 = fieldWeight in 607, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4677734 = idf(docFreq=1378, maxDocs=44218)
              0.0390625 = fieldNorm(doc=607)
      0.16666667 = coord(1/6)
    
    Theme
    Suchmaschinen
  9. Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997) 0.00
    0.004987043 = product of:
      0.029922256 = sum of:
        0.029922256 = product of:
          0.07480564 = sum of:
            0.03757181 = weight(_text_:29 in 1270) [ClassicSimilarity], result of:
              0.03757181 = score(doc=1270,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31092256 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
            0.03723383 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
              0.03723383 = score(doc=1270,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.30952093 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1270)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    5. 4.1996 15:29:15
    Source
    Information systems. 22(1997) nos.5/6, S.333-347
  10. Wiegmann, S.: Hättest du die Titanic überlebt? : Eine kurze Einführung in das Data Mining mit freier Software (2023) 0.00
    0.004464585 = product of:
      0.026787508 = sum of:
        0.026787508 = product of:
          0.06696877 = sum of:
            0.034093432 = weight(_text_:28 in 876) [ClassicSimilarity], result of:
              0.034093432 = score(doc=876,freq=2.0), product of:
                0.12305808 = queryWeight, product of:
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27705154 = fieldWeight in 876, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5822632 = idf(docFreq=3342, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=876)
            0.032875333 = weight(_text_:29 in 876) [ClassicSimilarity], result of:
              0.032875333 = score(doc=876,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27205724 = fieldWeight in 876, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=876)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    28. 1.2022 11:05:29
  11. Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.00
    0.004363662 = product of:
      0.026181972 = sum of:
        0.026181972 = product of:
          0.06545493 = sum of:
            0.032875333 = weight(_text_:29 in 2908) [ClassicSimilarity], result of:
              0.032875333 = score(doc=2908,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.27205724 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
            0.0325796 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
              0.0325796 = score(doc=2908,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.2708308 = fieldWeight in 2908, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2908)
          0.4 = coord(2/5)
      0.16666667 = coord(1/6)
    
    Date
    5. 4.1996 15:29:15
    Source
    Information systems. 22(1997) nos.5/6, S.349-385
  12. Budzik, J.; Hammond, K.J.; Birnbaum, L.: Information access in context (2001) 0.00
    0.0021916889 = product of:
      0.013150133 = sum of:
        0.013150133 = product of:
          0.065750666 = sum of:
            0.065750666 = weight(_text_:29 in 3835) [ClassicSimilarity], result of:
              0.065750666 = score(doc=3835,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.5441145 = fieldWeight in 3835, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3835)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    29. 3.2002 17:31:17
  13. Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.00
    0.0021719735 = product of:
      0.01303184 = sum of:
        0.01303184 = product of:
          0.0651592 = sum of:
            0.0651592 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
              0.0651592 = score(doc=4577,freq=2.0), product of:
                0.120295025 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03435205 = queryNorm
                0.5416616 = fieldWeight in 4577, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4577)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    2. 4.2000 18:01:22
  14. Keim, D.A.: Data Mining mit bloßem Auge (2002) 0.00
    0.0018785906 = product of:
      0.011271543 = sum of:
        0.011271543 = product of:
          0.056357715 = sum of:
            0.056357715 = weight(_text_:29 in 1086) [ClassicSimilarity], result of:
              0.056357715 = score(doc=1086,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.46638384 = fieldWeight in 1086, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1086)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    31.12.1996 19:29:41
  15. Kruse, R.; Borgelt, C.: Suche im Datendschungel (2002) 0.00
    0.0018785906 = product of:
      0.011271543 = sum of:
        0.011271543 = product of:
          0.056357715 = sum of:
            0.056357715 = weight(_text_:29 in 1087) [ClassicSimilarity], result of:
              0.056357715 = score(doc=1087,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.46638384 = fieldWeight in 1087, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1087)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    31.12.1996 19:29:41
  16. Wrobel, S.: Lern- und Entdeckungsverfahren (2002) 0.00
    0.0018785906 = product of:
      0.011271543 = sum of:
        0.011271543 = product of:
          0.056357715 = sum of:
            0.056357715 = weight(_text_:29 in 1105) [ClassicSimilarity], result of:
              0.056357715 = score(doc=1105,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.46638384 = fieldWeight in 1105, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1105)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    31.12.1996 19:29:41
  17. Borgelt, C.; Kruse, R.: Unsicheres Wissen nutzen (2002) 0.00
    0.0015654922 = product of:
      0.0093929535 = sum of:
        0.0093929535 = product of:
          0.046964765 = sum of:
            0.046964765 = weight(_text_:29 in 1104) [ClassicSimilarity], result of:
              0.046964765 = score(doc=1104,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.38865322 = fieldWeight in 1104, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1104)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    31.12.1996 19:29:41
  18. Cardie, C.: Empirical methods in information extraction (1997) 0.00
    0.0012523937 = product of:
      0.007514362 = sum of:
        0.007514362 = product of:
          0.03757181 = sum of:
            0.03757181 = weight(_text_:29 in 3246) [ClassicSimilarity], result of:
              0.03757181 = score(doc=3246,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31092256 = fieldWeight in 3246, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3246)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    6. 3.1999 13:50:29
  19. Tiefschürfen in Datenbanken (2002) 0.00
    0.0012523937 = product of:
      0.007514362 = sum of:
        0.007514362 = product of:
          0.03757181 = sum of:
            0.03757181 = weight(_text_:29 in 996) [ClassicSimilarity], result of:
              0.03757181 = score(doc=996,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31092256 = fieldWeight in 996, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=996)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    31.12.1996 19:29:41
  20. Bath, P.A.: Data mining in health and medical information (2003) 0.00
    0.0012523937 = product of:
      0.007514362 = sum of:
        0.007514362 = product of:
          0.03757181 = sum of:
            0.03757181 = weight(_text_:29 in 4263) [ClassicSimilarity], result of:
              0.03757181 = score(doc=4263,freq=2.0), product of:
                0.12083977 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03435205 = queryNorm
                0.31092256 = fieldWeight in 4263, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4263)
          0.2 = coord(1/5)
      0.16666667 = coord(1/6)
    
    Date
    23.10.2005 18:29:03