Document (#28384)

Author
Prabowo, R.
Jackson, M.
Burden, P.
Knoell, H.-D.
Title
Ontology-based automatic classification for the Web pages : design, implementation and evaluation
Source
http://csdl.computer.org/comp/proceedings/wise/2002/1766/00/17660182abs.htm
Year
2002
Abstract
In recent years, we have witnessed the continual growth in the use of ontologies in order to provide a mechanism to enable machine reasoning. This paper describes an automatic classifier, which focuses on the use of ontologies for classifying Web pages with respect to the Dewey Decimal Classification (DDC) and Library of Congress Classification (LCC) schemes. Firstly, we explain how these ontologies can be built in a modular fashion, and mapped into DDC and LCC. Secondly, we propose the formal definition of a DDC-LCC and an ontology-classification-scheme mapping. Thirdly, we explain the way the classifier uses these ontologies to assist classification. Finally, an experiment in which the accuracy of the classifier was evaluated is presented. The experiment shows that our approach results an improved classification in terms of accuracy. This improvement, however, comes at a cost in a low overage ratio due to the incompleteness of the ontologies used
Content
Beitrag bei: The Third International Conference on Web Information Systems Engineering (WISE'00) Dec., 12-14, 2002, Singapore, S.182.
Theme
Automatisches Klassifizieren
Object
DDC
LCC

Similar documents (author)

  1. Jackson, P.: ¬A thesaurus for enhanced geographic access (1991) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:jackson in 2298) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 2298, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=2298)
    
  2. Jackson, S.L.: Dziatzko, Karl (1984) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:jackson in 2685) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 2685, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=2685)
    
  3. Jackson, S.L.: Drtina, Jaroslav (1984) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:jackson in 5796) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 5796, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=5796)
    
  4. Jackson, J.N.: Every-name indexing (1992) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:jackson in 6356) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 6356, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=6356)
    
  5. Jackson, K.: Easy and rapid access to national bibliographies and catalogs with software from On-line Computer Systems (1990) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:jackson in 3598) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 3598, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=3598)
    

Similar documents (content)

  1. Farazi, M.: Faceted lightweight ontologies : a formalization and some experiments (2010) 0.19
    0.18890029 = sum of:
      0.18890029 = product of:
        0.7870846 = sum of:
          0.015084337 = weight(abstract_txt:these in 4997) [ClassicSimilarity], result of:
            0.015084337 = score(doc=4997,freq=2.0), product of:
              0.05352976 = queryWeight, product of:
                1.0365019 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.01619904 = queryNorm
              0.28179348 = fieldWeight in 4997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
          0.042715322 = weight(abstract_txt:reasoning in 4997) [ClassicSimilarity], result of:
            0.042715322 = score(doc=4997,freq=1.0), product of:
              0.107143775 = queryWeight, product of:
                1.0369097 = boost
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.01619904 = queryNorm
              0.39867294 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
          0.1364585 = weight(abstract_txt:ontology in 4997) [ClassicSimilarity], result of:
            0.1364585 = score(doc=4997,freq=6.0), product of:
              0.1611404 = queryWeight, product of:
                1.7983519 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.01619904 = queryNorm
              0.8468299 = fieldWeight in 4997, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
          0.07003821 = weight(abstract_txt:accuracy in 4997) [ClassicSimilarity], result of:
            0.07003821 = score(doc=4997,freq=1.0), product of:
              0.18770584 = queryWeight, product of:
                1.9409367 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.01619904 = queryNorm
              0.37312746 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
          0.06282331 = weight(abstract_txt:classification in 4997) [ClassicSimilarity], result of:
            0.06282331 = score(doc=4997,freq=1.0), product of:
              0.2517921 = queryWeight, product of:
                3.8936253 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01619904 = queryNorm
              0.2495047 = fieldWeight in 4997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
          0.4599649 = weight(abstract_txt:ontologies in 4997) [ClassicSimilarity], result of:
            0.4599649 = score(doc=4997,freq=8.0), product of:
              0.44670513 = queryWeight, product of:
                4.734269 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01619904 = queryNorm
              1.0296835 = fieldWeight in 4997, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=4997)
        0.24 = coord(6/25)
    
  2. Chung, Y.-M.; Noh, Y.-H.: Developing a specialized directory system by automatically classifying Web documents (2003) 0.15
    0.15145822 = sum of:
      0.15145822 = product of:
        0.6310759 = sum of:
          0.047121152 = weight(abstract_txt:classifying in 1566) [ClassicSimilarity], result of:
            0.047121152 = score(doc=1566,freq=1.0), product of:
              0.11439009 = queryWeight, product of:
                1.0714 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.01619904 = queryNorm
              0.41193387 = fieldWeight in 1566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
          0.07231861 = weight(abstract_txt:ratio in 1566) [ClassicSimilarity], result of:
            0.07231861 = score(doc=1566,freq=1.0), product of:
              0.15219879 = queryWeight, product of:
                1.2358423 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.01619904 = queryNorm
              0.47515893 = fieldWeight in 1566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
          0.0461647 = weight(abstract_txt:automatic in 1566) [ClassicSimilarity], result of:
            0.0461647 = score(doc=1566,freq=1.0), product of:
              0.14216559 = queryWeight, product of:
                1.6891557 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.01619904 = queryNorm
              0.32472485 = fieldWeight in 1566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
          0.059727766 = weight(abstract_txt:experiment in 1566) [ClassicSimilarity], result of:
            0.059727766 = score(doc=1566,freq=1.0), product of:
              0.16879982 = queryWeight, product of:
                1.8405958 = boost
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.01619904 = queryNorm
              0.35383785 = fieldWeight in 1566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
          0.26526648 = weight(abstract_txt:classifier in 1566) [ClassicSimilarity], result of:
            0.26526648 = score(doc=1566,freq=2.0), product of:
              0.41437778 = queryWeight, product of:
                3.5319643 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.01619904 = queryNorm
              0.64015615 = fieldWeight in 1566, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
          0.14047721 = weight(abstract_txt:classification in 1566) [ClassicSimilarity], result of:
            0.14047721 = score(doc=1566,freq=5.0), product of:
              0.2517921 = queryWeight, product of:
                3.8936253 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01619904 = queryNorm
              0.5579095 = fieldWeight in 1566, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1566)
        0.24 = coord(6/25)
    
  3. Giunchiglia, F.; Zaihrayeu, I.; Farazi, F.: Converting classifications into OWL ontologies (2009) 0.12
    0.12431085 = sum of:
      0.12431085 = product of:
        0.62155426 = sum of:
          0.013332797 = weight(abstract_txt:these in 4690) [ClassicSimilarity], result of:
            0.013332797 = score(doc=4690,freq=1.0), product of:
              0.05352976 = queryWeight, product of:
                1.0365019 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.01619904 = queryNorm
              0.24907261 = fieldWeight in 4690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.078125 = fieldNorm(doc=4690)
          0.07551074 = weight(abstract_txt:reasoning in 4690) [ClassicSimilarity], result of:
            0.07551074 = score(doc=4690,freq=2.0), product of:
              0.107143775 = queryWeight, product of:
                1.0369097 = boost
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.01619904 = queryNorm
              0.70476085 = fieldWeight in 4690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.078125 = fieldNorm(doc=4690)
          0.06963619 = weight(abstract_txt:ontology in 4690) [ClassicSimilarity], result of:
            0.06963619 = score(doc=4690,freq=1.0), product of:
              0.1611404 = queryWeight, product of:
                1.7983519 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.01619904 = queryNorm
              0.43214604 = fieldWeight in 4690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.078125 = fieldNorm(doc=4690)
          0.1755965 = weight(abstract_txt:classification in 4690) [ClassicSimilarity], result of:
            0.1755965 = score(doc=4690,freq=5.0), product of:
              0.2517921 = queryWeight, product of:
                3.8936253 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01619904 = queryNorm
              0.69738686 = fieldWeight in 4690, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.078125 = fieldNorm(doc=4690)
          0.28747806 = weight(abstract_txt:ontologies in 4690) [ClassicSimilarity], result of:
            0.28747806 = score(doc=4690,freq=2.0), product of:
              0.44670513 = queryWeight, product of:
                4.734269 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01619904 = queryNorm
              0.6435522 = fieldWeight in 4690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=4690)
        0.2 = coord(5/25)
    
  4. Prieto-Díaz, R.: ¬A faceted approach to building ontologies (2002) 0.12
    0.11819506 = sum of:
      0.11819506 = product of:
        0.5909753 = sum of:
          0.010666238 = weight(abstract_txt:these in 2259) [ClassicSimilarity], result of:
            0.010666238 = score(doc=2259,freq=1.0), product of:
              0.05352976 = queryWeight, product of:
                1.0365019 = boost
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.01619904 = queryNorm
              0.19925809 = fieldWeight in 2259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1881294 = idf(docFreq=4957, maxDocs=44218)
                0.0625 = fieldNorm(doc=2259)
          0.078784354 = weight(abstract_txt:ontology in 2259) [ClassicSimilarity], result of:
            0.078784354 = score(doc=2259,freq=2.0), product of:
              0.1611404 = queryWeight, product of:
                1.7983519 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.01619904 = queryNorm
              0.48891744 = fieldWeight in 2259, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=2259)
          0.07506717 = weight(abstract_txt:explain in 2259) [ClassicSimilarity], result of:
            0.07506717 = score(doc=2259,freq=1.0), product of:
              0.19658686 = queryWeight, product of:
                1.9863222 = boost
                6.1096387 = idf(docFreq=266, maxDocs=44218)
                0.01619904 = queryNorm
              0.38185242 = fieldWeight in 2259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1096387 = idf(docFreq=266, maxDocs=44218)
                0.0625 = fieldNorm(doc=2259)
          0.06282331 = weight(abstract_txt:classification in 2259) [ClassicSimilarity], result of:
            0.06282331 = score(doc=2259,freq=1.0), product of:
              0.2517921 = queryWeight, product of:
                3.8936253 = boost
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.01619904 = queryNorm
              0.2495047 = fieldWeight in 2259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2259)
          0.3636342 = weight(abstract_txt:ontologies in 2259) [ClassicSimilarity], result of:
            0.3636342 = score(doc=2259,freq=5.0), product of:
              0.44670513 = queryWeight, product of:
                4.734269 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01619904 = queryNorm
              0.8140363 = fieldWeight in 2259, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=2259)
        0.2 = coord(5/25)
    
  5. Almeida Campos, M.L. de; Machado Campos, M.L.; Dávila, A.M.R.; Espanha Gomes, H.; Campos, L.M.; Lira e Oliveira, L. de: Information sciences methodological aspects applied to ontology reuse tools : a study based on genomic annotations in the domain of trypanosomatides (2013) 0.12
    0.11668906 = sum of:
      0.11668906 = product of:
        0.5834453 = sum of:
          0.0383143 = weight(abstract_txt:improvement in 635) [ClassicSimilarity], result of:
            0.0383143 = score(doc=635,freq=1.0), product of:
              0.09965178 = queryWeight, product of:
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.01619904 = queryNorm
              0.38448185 = fieldWeight in 635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.0625 = fieldNorm(doc=635)
          0.0461647 = weight(abstract_txt:automatic in 635) [ClassicSimilarity], result of:
            0.0461647 = score(doc=635,freq=1.0), product of:
              0.14216559 = queryWeight, product of:
                1.6891557 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.01619904 = queryNorm
              0.32472485 = fieldWeight in 635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=635)
          0.15756871 = weight(abstract_txt:ontology in 635) [ClassicSimilarity], result of:
            0.15756871 = score(doc=635,freq=8.0), product of:
              0.1611404 = queryWeight, product of:
                1.7983519 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.01619904 = queryNorm
              0.9778349 = fieldWeight in 635, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=635)
          0.059727766 = weight(abstract_txt:experiment in 635) [ClassicSimilarity], result of:
            0.059727766 = score(doc=635,freq=1.0), product of:
              0.16879982 = queryWeight, product of:
                1.8405958 = boost
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.01619904 = queryNorm
              0.35383785 = fieldWeight in 635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614056 = idf(docFreq=417, maxDocs=44218)
                0.0625 = fieldNorm(doc=635)
          0.28166983 = weight(abstract_txt:ontologies in 635) [ClassicSimilarity], result of:
            0.28166983 = score(doc=635,freq=3.0), product of:
              0.44670513 = queryWeight, product of:
                4.734269 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01619904 = queryNorm
              0.6305498 = fieldWeight in 635, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=635)
        0.2 = coord(5/25)