Document (#26612)

Author
Fenstermacher, K.D.
Ginsburg, M.
Title
Client-side monitoring for Web mining
Source
Journal of the American Society for Information Science and Technology. 54(2003) no.7, pp.625-637
Year
2003
Abstract
"Garbage in, garbage out" is a well-known phrase in computer analysis, and one that comes to mind when mining Web data to draw conclusions about Web users. The challenge is that data analysts wish to infer patterns of client-side behavior from server-side data. However, because only a fraction of the user's actions ever reaches the Web server, analysts must rely on incomplete data. In this paper, we propose a client-side monitoring system that is unobtrusive and supports flexible data collection. Moreover, the proposed framework encompasses client-side applications beyond the Web browser. Expanding monitoring beyond the browser to incorporate standard office productivity tools enables analysts to derive a much richer and more accurate picture of user behavior on the Web.
Footnote
Part of a special issue: "Web retrieval and mining: A machine learning perspective"
Theme
Data Mining
Internet
Object
WWW

Similar documents (content)

  1. Gilmour, R.: Serving XML : practical techniques for the dissemination of structured electronic information (2001) 0.16
    0.15748814 = sum of:
      0.15748814 = product of:
        0.9843009 = sum of:
          0.10058003 = weight(abstract_txt:server in 4795) [ClassicSimilarity], result of:
            0.10058003 = score(doc=4795,freq=1.0), product of:
              0.15213904 = queryWeight, product of:
                1.7701231 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.014219495 = queryNorm
              0.661106 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.109375 = fieldNorm(doc=4795)
          0.07324319 = weight(abstract_txt:data in 4795) [ClassicSimilarity], result of:
            0.07324319 = score(doc=4795,freq=3.0), product of:
              0.11588235 = queryWeight, product of:
                2.4426532 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014219495 = queryNorm
              0.6320479 = fieldWeight in 4795, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.109375 = fieldNorm(doc=4795)
          0.24562359 = weight(abstract_txt:client in 4795) [ClassicSimilarity], result of:
            0.24562359 = score(doc=4795,freq=1.0), product of:
              0.34760782 = queryWeight, product of:
                3.7839284 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014219495 = queryNorm
              0.7066112 = fieldWeight in 4795, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.109375 = fieldNorm(doc=4795)
          0.5648541 = weight(abstract_txt:side in 4795) [ClassicSimilarity], result of:
            0.5648541 = score(doc=4795,freq=2.0), product of:
              0.517798 = queryWeight, product of:
                5.163371 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.014219495 = queryNorm
              1.0908773 = fieldWeight in 4795, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.109375 = fieldNorm(doc=4795)
        0.16 = coord(4/25)
    
  2. Catledge, L.D.; Pitkow, J.E.: Characterizing browsing strategies in the World-Wide Web (1995) 0.15
    0.15417503 = sum of:
      0.15417503 = product of:
        0.96359396 = sum of:
          0.009088672 = weight(abstract_txt:that in 2213) [ClassicSimilarity], result of:
            0.009088672 = score(doc=2213,freq=1.0), product of:
              0.035069548 = queryWeight, product of:
                1.0408636 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014219495 = queryNorm
              0.25916135 = fieldWeight in 2213, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.109375 = fieldNorm(doc=2213)
          0.04228698 = weight(abstract_txt:data in 2213) [ClassicSimilarity], result of:
            0.04228698 = score(doc=2213,freq=1.0), product of:
              0.11588235 = queryWeight, product of:
                2.4426532 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014219495 = queryNorm
              0.36491305 = fieldWeight in 2213, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.109375 = fieldNorm(doc=2213)
          0.34736422 = weight(abstract_txt:client in 2213) [ClassicSimilarity], result of:
            0.34736422 = score(doc=2213,freq=2.0), product of:
              0.34760782 = queryWeight, product of:
                3.7839284 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014219495 = queryNorm
              0.99929917 = fieldWeight in 2213, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.109375 = fieldNorm(doc=2213)
          0.5648541 = weight(abstract_txt:side in 2213) [ClassicSimilarity], result of:
            0.5648541 = score(doc=2213,freq=2.0), product of:
              0.517798 = queryWeight, product of:
                5.163371 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.014219495 = queryNorm
              1.0908773 = fieldWeight in 2213, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.109375 = fieldNorm(doc=2213)
        0.16 = coord(4/25)
    
  3. Berghel, H.; Berleant, D.; Foy, T.; McGuire, M.: Cyberbrowsing : information customization on the Web (1999) 0.15
    0.14711525 = sum of:
      0.14711525 = product of:
        0.9194703 = sum of:
          0.10536567 = weight(abstract_txt:reaches in 3664) [ClassicSimilarity], result of:
            0.10536567 = score(doc=3664,freq=1.0), product of:
              0.15587421 = queryWeight, product of:
                1.2669377 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.014219495 = queryNorm
              0.675966 = fieldWeight in 3664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=3664)
          0.07184288 = weight(abstract_txt:server in 3664) [ClassicSimilarity], result of:
            0.07184288 = score(doc=3664,freq=1.0), product of:
              0.15213904 = queryWeight, product of:
                1.7701231 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.014219495 = queryNorm
              0.47221857 = fieldWeight in 3664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.078125 = fieldNorm(doc=3664)
          0.24811731 = weight(abstract_txt:client in 3664) [ClassicSimilarity], result of:
            0.24811731 = score(doc=3664,freq=2.0), product of:
              0.34760782 = queryWeight, product of:
                3.7839284 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014219495 = queryNorm
              0.7137852 = fieldWeight in 3664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.078125 = fieldNorm(doc=3664)
          0.4941444 = weight(abstract_txt:side in 3664) [ClassicSimilarity], result of:
            0.4941444 = score(doc=3664,freq=3.0), product of:
              0.517798 = queryWeight, product of:
                5.163371 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.014219495 = queryNorm
              0.9543189 = fieldWeight in 3664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.078125 = fieldNorm(doc=3664)
        0.16 = coord(4/25)
    
  4. Tudhope, D.; Binding, C.: Toward terminology services : experiences with a pilot Web service thesaurus browser (2006) 0.14
    0.13872366 = sum of:
      0.13872366 = product of:
        0.57801527 = sum of:
          0.008709808 = weight(abstract_txt:that in 1955) [ClassicSimilarity], result of:
            0.008709808 = score(doc=1955,freq=5.0), product of:
              0.035069548 = queryWeight, product of:
                1.0408636 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014219495 = queryNorm
              0.24835816 = fieldWeight in 1955, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
          0.08621146 = weight(abstract_txt:server in 1955) [ClassicSimilarity], result of:
            0.08621146 = score(doc=1955,freq=4.0), product of:
              0.15213904 = queryWeight, product of:
                1.7701231 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.014219495 = queryNorm
              0.5666623 = fieldWeight in 1955, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
          0.06075337 = weight(abstract_txt:browser in 1955) [ClassicSimilarity], result of:
            0.06075337 = score(doc=1955,freq=1.0), product of:
              0.19124831 = queryWeight, product of:
                1.984641 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.014219495 = queryNorm
              0.31766748 = fieldWeight in 1955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
          0.03138994 = weight(abstract_txt:data in 1955) [ClassicSimilarity], result of:
            0.03138994 = score(doc=1955,freq=3.0), product of:
              0.11588235 = queryWeight, product of:
                2.4426532 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014219495 = queryNorm
              0.27087766 = fieldWeight in 1955, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
          0.14887038 = weight(abstract_txt:client in 1955) [ClassicSimilarity], result of:
            0.14887038 = score(doc=1955,freq=2.0), product of:
              0.34760782 = queryWeight, product of:
                3.7839284 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014219495 = queryNorm
              0.42827109 = fieldWeight in 1955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
          0.24208033 = weight(abstract_txt:side in 1955) [ClassicSimilarity], result of:
            0.24208033 = score(doc=1955,freq=2.0), product of:
              0.517798 = queryWeight, product of:
                5.163371 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.014219495 = queryNorm
              0.46751887 = fieldWeight in 1955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.046875 = fieldNorm(doc=1955)
        0.24 = coord(6/25)
    
  5. Cooper, M.D.: Design considerations in instrumenting and monitoring Web-based information retrieval systems (1998) 0.12
    0.11667609 = sum of:
      0.11667609 = product of:
        0.7292256 = sum of:
          0.15206271 = weight(abstract_txt:server in 1793) [ClassicSimilarity], result of:
            0.15206271 = score(doc=1793,freq=7.0), product of:
              0.15213904 = queryWeight, product of:
                1.7701231 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.014219495 = queryNorm
              0.9994983 = fieldWeight in 1793, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.0625 = fieldNorm(doc=1793)
          0.041853257 = weight(abstract_txt:data in 1793) [ClassicSimilarity], result of:
            0.041853257 = score(doc=1793,freq=3.0), product of:
              0.11588235 = queryWeight, product of:
                2.4426532 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014219495 = queryNorm
              0.36117023 = fieldWeight in 1793, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1793)
          0.2922053 = weight(abstract_txt:monitoring in 1793) [ClassicSimilarity], result of:
            0.2922053 = score(doc=1793,freq=5.0), product of:
              0.3011323 = queryWeight, product of:
                3.0500534 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.014219495 = queryNorm
              0.9703553 = fieldWeight in 1793, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=1793)
          0.2431043 = weight(abstract_txt:client in 1793) [ClassicSimilarity], result of:
            0.2431043 = score(doc=1793,freq=3.0), product of:
              0.34760782 = queryWeight, product of:
                3.7839284 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014219495 = queryNorm
              0.6993637 = fieldWeight in 1793, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.0625 = fieldNorm(doc=1793)
        0.16 = coord(4/25)
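
The score trees above follow the usual Lucene ClassicSimilarity (TF-IDF) arithmetic. The sketch below recomputes the score of the first similar document (doc 4795) from the values printed in its tree. It assumes the standard ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function and variable names are illustrative only and are not part of Lucene or of this catalog.

```python
import math

# Constants copied from the explain tree of doc 4795.
MAX_DOCS = 44218
QUERY_NORM = 0.014219495
FIELD_NORM = 0.109375


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming the ClassicSimilarity formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost):
    """Score of one term: queryWeight * fieldWeight, as in the tree."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# (term, freq in doc 4795, docFreq, boost), all read from the listing.
terms = [
    ("server", 1.0, 284, 1.7701231),
    ("data", 3.0, 4274, 2.4426532),
    ("client", 1.0, 187, 3.7839284),
    ("side", 2.0, 103, 5.163371),
]

term_sum = sum(term_score(freq, df, boost) for _, freq, df, boost in terms)
coord = 4 / 25  # 4 of the 25 query terms matched
score = term_sum * coord

# Should land close to the 0.15748814 shown for doc 4795.
print(f"sum of term scores: {term_sum:.7f}")
print(f"final score:        {score:.7f}")
```

Small differences in the last digits are expected, because Lucene computes in 32-bit floats while this sketch uses Python's 64-bit floats.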