Document (#36693)

Author
Hepp, M.
Bruijn, J. de
Title
GenTax : a generic methodology for deriving OWL and RDF-S ontologies from hierarchical classifications, thesauri, and inconsistent taxonomies
Source
Proceedings of the 4th European Semantic Web Conference (ESWC 2007), June 3-7, Innsbruck, Austria
Imprint
Berlin : Springer
Year
2007
Pages
S.129-144
Series
Lecture notes in computer science; vol. 4519
Abstract
Hierarchical classifications, thesauri, and informal taxonomies are likely the most valuable input for creating, at reasonable cost, non-toy ontologies in many domains. They contain, readily available, a wealth of category definitions plus a hierarchy, and they reflect some degree of community consensus. However, their transformation into useful ontologies is not as straightforward as it appears. In this paper, we show that (1) it often depends on the context of usage whether an informal hierarchical categorization schema is a classification, a thesaurus, or a taxonomy, and (2) present a novel methodology for automatically deriving consistent RDF-S and OWL ontologies from such schemas. Finally, we (3) demonstrate the usefulness of this approach by transforming the two e-business categorization standards eCl@ss and UNSPSC into ontologies that overcome the limitations of earlier prototypes. Our approach allows for the script-based creation of meaningful ontology classes for a particular context while preserving the original hierarchy, even if the latter is not a real subsumption hierarchy in this particular context. Human intervention in the transformation is limited to checking some conceptual properties and identifying frequent anomalies, and the only input required is an informal categorization plus a notion of the target context. In particular, the approach does not require instance data, as ontology learning approaches would usually do.
Content
Vgl. unter: http://www.heppnetz.de/files/hepp-de-bruijn-ESWC2007-gentax-CRC.pdf.
Theme
Wissensrepräsentation
Object
GenTax-Algorithmus
OWL
SKOS2OWL

Similar documents (content)

  1. SKOS2OWL : Online tool for deriving OWL ontologies from SKOS categorization schemas (2007) 0.16
    0.16333641 = sum of:
      0.16333641 = product of:
        0.81668204 = sum of:
          0.03010655 = weight(abstract_txt:they in 4691) [ClassicSimilarity], result of:
            0.03010655 = score(doc=4691,freq=1.0), product of:
              0.06419238 = queryWeight, product of:
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.017108656 = queryNorm
              0.46900508 = fieldWeight in 4691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.125 = fieldNorm(doc=4691)
          0.09646725 = weight(abstract_txt:ontology in 4691) [ClassicSimilarity], result of:
            0.09646725 = score(doc=4691,freq=1.0), product of:
              0.13951772 = queryWeight, product of:
                1.4742563 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.017108656 = queryNorm
              0.69143367 = fieldWeight in 4691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.125 = fieldNorm(doc=4691)
          0.13095433 = weight(abstract_txt:classifications in 4691) [ClassicSimilarity], result of:
            0.13095433 = score(doc=4691,freq=1.0), product of:
              0.17104983 = queryWeight, product of:
                1.6323738 = boost
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.017108656 = queryNorm
              0.7655916 = fieldWeight in 4691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.125 = fieldNorm(doc=4691)
          0.1609096 = weight(abstract_txt:hierarchical in 4691) [ClassicSimilarity], result of:
            0.1609096 = score(doc=4691,freq=1.0), product of:
              0.2246266 = queryWeight, product of:
                2.2910497 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017108656 = queryNorm
              0.71634257 = fieldWeight in 4691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.125 = fieldNorm(doc=4691)
          0.39824432 = weight(abstract_txt:ontologies in 4691) [ClassicSimilarity], result of:
            0.39824432 = score(doc=4691,freq=2.0), product of:
              0.38676384 = queryWeight, product of:
                3.8810678 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017108656 = queryNorm
              1.0296835 = fieldWeight in 4691, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.125 = fieldNorm(doc=4691)
        0.2 = coord(5/25)
    
  2. Hoffmann, P.; Médini and , L.; Ghodous, P.: Using context to improve semantic interoperability (2006) 0.16
    0.15926376 = sum of:
      0.15926376 = product of:
        0.79631877 = sum of:
          0.1364253 = weight(abstract_txt:ontology in 4434) [ClassicSimilarity], result of:
            0.1364253 = score(doc=4434,freq=2.0), product of:
              0.13951772 = queryWeight, product of:
                1.4742563 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.017108656 = queryNorm
              0.9778349 = fieldWeight in 4434, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.125 = fieldNorm(doc=4434)
          0.044917874 = weight(abstract_txt:approach in 4434) [ClassicSimilarity], result of:
            0.044917874 = score(doc=4434,freq=1.0), product of:
              0.09594433 = queryWeight, product of:
                1.4973164 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017108656 = queryNorm
              0.468166 = fieldWeight in 4434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.125 = fieldNorm(doc=4434)
          0.0931855 = weight(abstract_txt:context in 4434) [ClassicSimilarity], result of:
            0.0931855 = score(doc=4434,freq=1.0), product of:
              0.17177172 = queryWeight, product of:
                2.3133914 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.017108656 = queryNorm
              0.54249614 = fieldWeight in 4434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.125 = fieldNorm(doc=4434)
          0.24018879 = weight(abstract_txt:hierarchy in 4434) [ClassicSimilarity], result of:
            0.24018879 = score(doc=4434,freq=1.0), product of:
              0.29338756 = queryWeight, product of:
                2.6183324 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.017108656 = queryNorm
              0.8186741 = fieldWeight in 4434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.125 = fieldNorm(doc=4434)
          0.28160128 = weight(abstract_txt:ontologies in 4434) [ClassicSimilarity], result of:
            0.28160128 = score(doc=4434,freq=1.0), product of:
              0.38676384 = queryWeight, product of:
                3.8810678 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017108656 = queryNorm
              0.7280962 = fieldWeight in 4434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.125 = fieldNorm(doc=4434)
        0.2 = coord(5/25)
    
  3. Abbas, J.: Structures for organizing knowledge : exploring taxonomies, ontologies, and other schemas (2010) 0.16
    0.15544522 = sum of:
      0.15544522 = product of:
        0.5551615 = sum of:
          0.029452631 = weight(abstract_txt:they in 480) [ClassicSimilarity], result of:
            0.029452631 = score(doc=480,freq=5.0), product of:
              0.06419238 = queryWeight, product of:
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.017108656 = queryNorm
              0.4588182 = fieldWeight in 480, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.04220442 = weight(abstract_txt:ontology in 480) [ClassicSimilarity], result of:
            0.04220442 = score(doc=480,freq=1.0), product of:
              0.13951772 = queryWeight, product of:
                1.4742563 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.017108656 = queryNorm
              0.30250221 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.01965157 = weight(abstract_txt:approach in 480) [ClassicSimilarity], result of:
            0.01965157 = score(doc=480,freq=1.0), product of:
              0.09594433 = queryWeight, product of:
                1.4973164 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.017108656 = queryNorm
              0.20482263 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.057292514 = weight(abstract_txt:classifications in 480) [ClassicSimilarity], result of:
            0.057292514 = score(doc=480,freq=1.0), product of:
              0.17104983 = queryWeight, product of:
                1.6323738 = boost
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.017108656 = queryNorm
              0.33494633 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.16694124 = weight(abstract_txt:taxonomies in 480) [ClassicSimilarity], result of:
            0.16694124 = score(doc=480,freq=4.0), product of:
              0.2198264 = queryWeight, product of:
                1.8505388 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.017108656 = queryNorm
              0.7594231 = fieldWeight in 480, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.11641854 = weight(abstract_txt:informal in 480) [ClassicSimilarity], result of:
            0.11641854 = score(doc=480,freq=1.0), product of:
              0.31412506 = queryWeight, product of:
                2.7092881 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.017108656 = queryNorm
              0.37061208 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
          0.123200566 = weight(abstract_txt:ontologies in 480) [ClassicSimilarity], result of:
            0.123200566 = score(doc=480,freq=1.0), product of:
              0.38676384 = queryWeight, product of:
                3.8810678 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017108656 = queryNorm
              0.3185421 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0546875 = fieldNorm(doc=480)
        0.28 = coord(7/25)
    
  4. Campos, L.M.: Princípios teóricos usados na elaboracao de ontologias e sua influência na recuperacao da informacao com uso de de inferências [Theoretical principles used in ontology building and their influence on information retrieval using inferences] (2021) 0.15
    0.15107796 = sum of:
      0.15107796 = product of:
        0.53956413 = sum of:
          0.015053275 = weight(abstract_txt:they in 826) [ClassicSimilarity], result of:
            0.015053275 = score(doc=826,freq=1.0), product of:
              0.06419238 = queryWeight, product of:
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.017108656 = queryNorm
              0.23450254 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.10349251 = weight(abstract_txt:subsumption in 826) [ClassicSimilarity], result of:
            0.10349251 = score(doc=826,freq=1.0), product of:
              0.18421517 = queryWeight, product of:
                1.1978598 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.017108656 = queryNorm
              0.5618023 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.027315708 = weight(abstract_txt:methodology in 826) [ClassicSimilarity], result of:
            0.027315708 = score(doc=826,freq=1.0), product of:
              0.09550023 = queryWeight, product of:
                1.219721 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.017108656 = queryNorm
              0.28602767 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.048233625 = weight(abstract_txt:ontology in 826) [ClassicSimilarity], result of:
            0.048233625 = score(doc=826,freq=1.0), product of:
              0.13951772 = queryWeight, product of:
                1.4742563 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.017108656 = queryNorm
              0.34571683 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.0804548 = weight(abstract_txt:hierarchical in 826) [ClassicSimilarity], result of:
            0.0804548 = score(doc=826,freq=1.0), product of:
              0.2246266 = queryWeight, product of:
                2.2910497 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017108656 = queryNorm
              0.35817128 = fieldWeight in 826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.0658921 = weight(abstract_txt:context in 826) [ClassicSimilarity], result of:
            0.0658921 = score(doc=826,freq=2.0), product of:
              0.17177172 = queryWeight, product of:
                2.3133914 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.017108656 = queryNorm
              0.3836027 = fieldWeight in 826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
          0.19912216 = weight(abstract_txt:ontologies in 826) [ClassicSimilarity], result of:
            0.19912216 = score(doc=826,freq=2.0), product of:
              0.38676384 = queryWeight, product of:
                3.8810678 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017108656 = queryNorm
              0.51484174 = fieldWeight in 826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=826)
        0.28 = coord(7/25)
    
  5. Tzitzikas, Y.; Spyratos, N.; Constantopoulos, P.; Analyti, A.: Extended faceted ontologies (2002) 0.14
    0.14312313 = sum of:
      0.14312313 = product of:
        0.71561563 = sum of:
          0.12936564 = weight(abstract_txt:subsumption in 2280) [ClassicSimilarity], result of:
            0.12936564 = score(doc=2280,freq=1.0), product of:
              0.18421517 = queryWeight, product of:
                1.1978598 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.017108656 = queryNorm
              0.7022529 = fieldWeight in 2280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=2280)
          0.06029203 = weight(abstract_txt:ontology in 2280) [ClassicSimilarity], result of:
            0.06029203 = score(doc=2280,freq=1.0), product of:
              0.13951772 = queryWeight, product of:
                1.4742563 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.017108656 = queryNorm
              0.43214604 = fieldWeight in 2280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.078125 = fieldNorm(doc=2280)
          0.17648675 = weight(abstract_txt:deriving in 2280) [ClassicSimilarity], result of:
            0.17648675 = score(doc=2280,freq=1.0), product of:
              0.28549433 = queryWeight, product of:
                2.108905 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.017108656 = queryNorm
              0.6181795 = fieldWeight in 2280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.078125 = fieldNorm(doc=2280)
          0.100568496 = weight(abstract_txt:hierarchical in 2280) [ClassicSimilarity], result of:
            0.100568496 = score(doc=2280,freq=1.0), product of:
              0.2246266 = queryWeight, product of:
                2.2910497 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.017108656 = queryNorm
              0.4477141 = fieldWeight in 2280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.078125 = fieldNorm(doc=2280)
          0.24890271 = weight(abstract_txt:ontologies in 2280) [ClassicSimilarity], result of:
            0.24890271 = score(doc=2280,freq=2.0), product of:
              0.38676384 = queryWeight, product of:
                3.8810678 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.017108656 = queryNorm
              0.6435522 = fieldWeight in 2280, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=2280)
        0.2 = coord(5/25)