Document (#23442)

Author
Boley, D.
Gini, M.
Hastings, K.
Mobasher, B.
Moore, J.
Title
¬A client side Web agent for document categorization
Source
Internet research. Electronic networking applications and policy. 8(1998) no.5, pp. 387-399
Year
1998
Abstract
Proposes a client-side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web with a standard Web browser, the agent aids the user by classifying the documents the user finds most interesting into clusters. The agent carries out this task automatically and autonomously, with as little intervention as the user desires. The principal novel components that make this possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. Describes the overall architecture of the agent and discusses the details of the algorithms within its key components.
Theme
Internet

Similar documents (author)

  1. Hastings, S.K.: ¬An exploratory study of intellectual access to digitized art images : the information industry and the role of the Internet (1995) 2.48
    2.484309 = sum of:
      2.484309 = product of:
        4.968618 = sum of:
          4.968618 = weight(author_txt:hastings in 3186) [ClassicSimilarity], result of:
            4.968618 = score(doc=3186,freq=1.0), product of:
              0.8266008 = queryWeight, product of:
                1.2119235 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07091872 = queryNorm
              6.010904 = fieldWeight in 3186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=3186)
        0.5 = coord(1/2)
    
  2. Hastings, S.K.: Evaluation of image retrieval systems : role of user feedback (1999) 2.48
    2.484309 = sum of:
      2.484309 = product of:
        4.968618 = sum of:
          4.968618 = weight(author_txt:hastings in 845) [ClassicSimilarity], result of:
            4.968618 = score(doc=845,freq=1.0), product of:
              0.8266008 = queryWeight, product of:
                1.2119235 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07091872 = queryNorm
              6.010904 = fieldWeight in 845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=845)
        0.5 = coord(1/2)
    
  3. Christian, E.J.; Hastings, M.: ¬The virtual library : a selective bibliography for exploration (1994) 1.99
    1.9874471 = sum of:
      1.9874471 = product of:
        3.9748943 = sum of:
          3.9748943 = weight(author_txt:hastings in 1402) [ClassicSimilarity], result of:
            3.9748943 = score(doc=1402,freq=1.0), product of:
              0.8266008 = queryWeight, product of:
                1.2119235 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07091872 = queryNorm
              4.808723 = fieldWeight in 1402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.5 = fieldNorm(doc=1402)
        0.5 = coord(1/2)
    
  4. Lunin, L.F.; Martin, K.; Hastings, S.K.: Design: information technologies and creative practices (2009) 1.49
    1.4905853 = sum of:
      1.4905853 = product of:
        2.9811707 = sum of:
          2.9811707 = weight(author_txt:hastings in 4889) [ClassicSimilarity], result of:
            2.9811707 = score(doc=4889,freq=1.0), product of:
              0.8266008 = queryWeight, product of:
                1.2119235 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07091872 = queryNorm
              3.606542 = fieldWeight in 4889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=4889)
        0.5 = coord(1/2)
    
  5. Chung, E.-K.; Miksa, S.; Hastings, S.K.: ¬A framework of automatic subject term assignment for text categorization : an indexing conception-based approach (2010) 1.49
    1.4905853 = sum of:
      1.4905853 = product of:
        2.9811707 = sum of:
          2.9811707 = weight(author_txt:hastings in 3434) [ClassicSimilarity], result of:
            2.9811707 = score(doc=3434,freq=1.0), product of:
              0.8266008 = queryWeight, product of:
                1.2119235 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.07091872 = queryNorm
              3.606542 = fieldWeight in 3434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=3434)
        0.5 = coord(1/2)
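
The score trees above are labeled ClassicSimilarity, Lucene's TF-IDF scoring. As a minimal sketch, assuming the classic definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the following Python recomputes the first author match (doc 3186, term "hastings"). The function names are illustrative and not part of Lucene. The boost, queryNorm and fieldNorm values are copied from the explanation, not derived.

```python
import math

MAX_DOCS = 44218  # maxDocs as reported in the explanations above


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Classic TF-IDF inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float,
               query_norm: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight, mirroring the tree layout above."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm             # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm   # fieldWeight
    return query_weight * field_weight


# Values taken from the first author match (doc 3186).
raw = term_score(freq=1.0, doc_freq=7, boost=1.2119235,
                 query_norm=0.07091872, field_norm=0.625)
final = raw * (1 / 2)  # coord(1/2): one of the two query clauses matched

print(f"idf={idf(7):.6f} raw={raw:.6f} final={final:.6f}")
```

The printed values should agree with the 9.617446, 4.968618 and 2.484309 shown for that entry, up to small differences between Lucene's single-precision arithmetic and Python's double precision. Entries 3 to 5 differ only in fieldNorm (0.5 and 0.375), which is why their scores are lower for the same term.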
    

Similar documents (content)

  1. Park, J.S.; O'Brien, J.C.; Cai, C.J.; Ringel Morris, M.; Liang, P.; Bernstein, M.S.: Generative agents : interactive simulacra of human behavior (2023) 0.13
    0.1306218 = sum of:
      0.1306218 = product of:
        0.653109 = sum of:
          0.0074994704 = weight(abstract_txt:this in 972) [ClassicSimilarity], result of:
            0.0074994704 = score(doc=972,freq=2.0), product of:
              0.040185284 = queryWeight, product of:
                1.1111186 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014988086 = queryNorm
              0.1866223 = fieldWeight in 972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=972)
          0.107854284 = weight(abstract_txt:autonomously in 972) [ClassicSimilarity], result of:
            0.107854284 = score(doc=972,freq=1.0), product of:
              0.20760661 = queryWeight, product of:
                1.4580984 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.014988086 = queryNorm
              0.5195128 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0546875 = fieldNorm(doc=972)
          0.04389632 = weight(abstract_txt:components in 972) [ClassicSimilarity], result of:
            0.04389632 = score(doc=972,freq=1.0), product of:
              0.14365199 = queryWeight, product of:
                1.715288 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.014988086 = queryNorm
              0.30557406 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.0546875 = fieldNorm(doc=972)
          0.03143986 = weight(abstract_txt:user in 972) [ClassicSimilarity], result of:
            0.03143986 = score(doc=972,freq=1.0), product of:
              0.15607259 = queryWeight, product of:
                2.8269267 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014988086 = queryNorm
              0.20144382 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0546875 = fieldNorm(doc=972)
          0.46241906 = weight(abstract_txt:agent in 972) [ClassicSimilarity], result of:
            0.46241906 = score(doc=972,freq=3.0), product of:
              0.69031936 = queryWeight, product of:
                6.5127873 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.014988086 = queryNorm
              0.6698625 = fieldWeight in 972, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.0546875 = fieldNorm(doc=972)
        0.2 = coord(5/25)
    
  2. Barrueco, J.M.; Inglada, V.J.: Reference linking in economics : the Citec project (2003) 0.12
    0.11648052 = sum of:
      0.11648052 = product of:
        0.97067106 = sum of:
          0.010605852 = weight(abstract_txt:this in 2718) [ClassicSimilarity], result of:
            0.010605852 = score(doc=2718,freq=1.0), product of:
              0.040185284 = queryWeight, product of:
                1.1111186 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014988086 = queryNorm
              0.2639238 = fieldWeight in 2718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.109375 = fieldNorm(doc=2718)
          0.03522706 = weight(abstract_txt:documents in 2718) [ClassicSimilarity], result of:
            0.03522706 = score(doc=2718,freq=1.0), product of:
              0.07814907 = queryWeight, product of:
                1.2651533 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.014988086 = queryNorm
              0.45076746 = fieldWeight in 2718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.109375 = fieldNorm(doc=2718)
          0.9248381 = weight(abstract_txt:agent in 2718) [ClassicSimilarity], result of:
            0.9248381 = score(doc=2718,freq=3.0), product of:
              0.69031936 = queryWeight, product of:
                6.5127873 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.014988086 = queryNorm
              1.339725 = fieldWeight in 2718, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.109375 = fieldNorm(doc=2718)
        0.12 = coord(3/25)
    
  3. Cheung, D.W.; Kao, B.; Lee, J.: Discovering user access patterns on the World Wide Web (1998) 0.12
    0.11517679 = sum of:
      0.11517679 = product of:
        0.71985495 = sum of:
          0.03522706 = weight(abstract_txt:documents in 332) [ClassicSimilarity], result of:
            0.03522706 = score(doc=332,freq=1.0), product of:
              0.07814907 = queryWeight, product of:
                1.2651533 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.014988086 = queryNorm
              0.45076746 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.109375 = fieldNorm(doc=332)
          0.08779264 = weight(abstract_txt:components in 332) [ClassicSimilarity], result of:
            0.08779264 = score(doc=332,freq=1.0), product of:
              0.14365199 = queryWeight, product of:
                1.715288 = boost
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.014988086 = queryNorm
              0.6111481 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.109375 = fieldNorm(doc=332)
          0.06287972 = weight(abstract_txt:user in 332) [ClassicSimilarity], result of:
            0.06287972 = score(doc=332,freq=1.0), product of:
              0.15607259 = queryWeight, product of:
                2.8269267 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014988086 = queryNorm
              0.40288764 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.109375 = fieldNorm(doc=332)
          0.5339555 = weight(abstract_txt:agent in 332) [ClassicSimilarity], result of:
            0.5339555 = score(doc=332,freq=1.0), product of:
              0.69031936 = queryWeight, product of:
                6.5127873 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.014988086 = queryNorm
              0.7734906 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.109375 = fieldNorm(doc=332)
        0.16 = coord(4/25)
    
  4. Shafique, M.; Chaudhry, A.S.: Intelligent agent-based online information retrieval (1995) 0.11
    0.11295906 = sum of:
      0.11295906 = product of:
        0.9413255 = sum of:
          0.04358218 = weight(abstract_txt:documents in 3851) [ClassicSimilarity], result of:
            0.04358218 = score(doc=3851,freq=3.0), product of:
              0.07814907 = queryWeight, product of:
                1.2651533 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.014988086 = queryNorm
              0.5576801 = fieldWeight in 3851, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=3851)
          0.04491408 = weight(abstract_txt:user in 3851) [ClassicSimilarity], result of:
            0.04491408 = score(doc=3851,freq=1.0), product of:
              0.15607259 = queryWeight, product of:
                2.8269267 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014988086 = queryNorm
              0.2877769 = fieldWeight in 3851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=3851)
          0.8528292 = weight(abstract_txt:agent in 3851) [ClassicSimilarity], result of:
            0.8528292 = score(doc=3851,freq=5.0), product of:
              0.69031936 = queryWeight, product of:
                6.5127873 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.014988086 = queryNorm
              1.2354126 = fieldWeight in 3851, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.078125 = fieldNorm(doc=3851)
        0.12 = coord(3/25)
    
  5. Fenstermacher, K.D.; Ginsburg, M.: Client-side monitoring for Web mining (2003) 0.11
    0.11033038 = sum of:
      0.11033038 = product of:
        0.5516519 = sum of:
          0.079108715 = weight(abstract_txt:browser in 1611) [ClassicSimilarity], result of:
            0.079108715 = score(doc=1611,freq=2.0), product of:
              0.105654456 = queryWeight, product of:
                1.0401839 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.014988086 = queryNorm
              0.74874943 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.007575609 = weight(abstract_txt:this in 1611) [ClassicSimilarity], result of:
            0.007575609 = score(doc=1611,freq=1.0), product of:
              0.040185284 = queryWeight, product of:
                1.1111186 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014988086 = queryNorm
              0.18851699 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.16787764 = weight(abstract_txt:client in 1611) [ClassicSimilarity], result of:
            0.16787764 = score(doc=1611,freq=3.0), product of:
              0.19203472 = queryWeight, product of:
                1.983221 = boost
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.014988086 = queryNorm
              0.87420464 = fieldWeight in 1611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4604454 = idf(docFreq=187, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.25217584 = weight(abstract_txt:side in 1611) [ClassicSimilarity], result of:
            0.25217584 = score(doc=1611,freq=4.0), product of:
              0.22884455 = queryWeight, product of:
                2.1649683 = boost
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.014988086 = queryNorm
              1.1019526 = fieldWeight in 1611, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0524964 = idf(docFreq=103, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.04491408 = weight(abstract_txt:user in 1611) [ClassicSimilarity], result of:
            0.04491408 = score(doc=1611,freq=1.0), product of:
              0.15607259 = queryWeight, product of:
                2.8269267 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.014988086 = queryNorm
              0.2877769 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
        0.2 = coord(5/25)
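
For the content matches, several term scores are summed and then scaled by the coord factor (matched terms divided by the number of query terms, 25 here). A hedged sketch for the first content match (doc 972) follows. It assumes the same classic TF-IDF definitions, the tuple layout and helper names are hypothetical, and all numeric inputs are copied from the explanation above.

```python
import math

MAX_DOCS = 44218          # maxDocs from the explanations
QUERY_NORM = 0.014988086  # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.0546875    # fieldNorm(doc=972)
QUERY_TERMS = 25          # denominator of coord(5/25)

# (term, freq in doc 972, docFreq, boost), copied from the explanation.
MATCHES = [
    ("this", 2.0, 10762, 1.1111186),
    ("autonomously", 1.0, 8, 1.4580984),
    ("components", 1.0, 449, 1.715288),
    ("user", 1.0, 3020, 2.8269267),
    ("agent", 3.0, 101, 6.5127873),
]


def idf(doc_freq: int) -> float:
    """Classic TF-IDF inverse document frequency."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float) -> float:
    """queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


per_term = {term: term_score(freq, df, boost) for term, freq, df, boost in MATCHES}
total = sum(per_term.values())
coord = len(MATCHES) / QUERY_TERMS  # 5 of 25 query terms matched
score = total * coord

for term, value in per_term.items():
    print(f"{term:>13}: {value:.6f}")
print(f"sum={total:.6f} coord={coord:.2f} score={score:.6f}")
```

In this breakdown the term "agent" contributes the largest share of the sum, because it combines the highest boost with a high idf. The coord factor then penalizes documents that match only a few of the 25 query terms.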