Document (#36694)

Author
Hepp, M.
Bruijn, J. de
Title
GenTax : a generic methodology for deriving OWL and RDF-S ontologies from hierarchical classifications, thesauri, and inconsistent taxonomies
Source
Proceedings of the 4th European Semantic Web Conference (ESWC 2007), June 3-7, Innsbruck, Austria
Imprint
Berlin : Springer
Year
2007
Pages
S.129-144
Series
Lecture notes in computer science; vol. 4519
Abstract
Hierarchical classifications, thesauri, and informal taxonomies are likely the most valuable input for creating, at reasonable cost, non-toy ontologies in many domains. They contain, readily available, a wealth of category definitions plus a hierarchy, and they reflect some degree of community consensus. However, their transformation into useful ontologies is not as straightforward as it appears. In this paper, we show that (1) it often depends on the context of usage whether an informal hierarchical categorization schema is a classification, a thesaurus, or a taxonomy, and (2) present a novel methodology for automatically deriving consistent RDF-S and OWL ontologies from such schemas. Finally, we (3) demonstrate the usefulness of this approach by transforming the two e-business categorization standards eCl@ss and UNSPSC into ontologies that overcome the limitations of earlier prototypes. Our approach allows for the script-based creation of meaningful ontology classes for a particular context while preserving the original hierarchy, even if the latter is not a real subsumption hierarchy in this particular context. Human intervention in the transformation is limited to checking some conceptual properties and identifying frequent anomalies, and the only input required is an informal categorization plus a notion of the target context. In particular, the approach does not require instance data, as ontology learning approaches would usually do.
Content
Vgl. unter: http://www.heppnetz.de/files/hepp-de-bruijn-ESWC2007-gentax-CRC.pdf.
Theme
Wissensrepräsentation
Object
GenTax-Algorithmus
OWL
SKOS2OWL

Similar documents (content)

  1. Hoffmann, P.; Médini and , L.; Ghodous, P.: Using context to improve semantic interoperability (2006) 0.16
    0.16058002 = sum of:
      0.16058002 = product of:
        0.8029001 = sum of:
          0.13845742 = weight(abstract_txt:ontology in 1435) [ClassicSimilarity], result of:
            0.13845742 = score(doc=1435,freq=2.0), product of:
              0.14008257 = queryWeight, product of:
                1.4728792 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.017010218 = queryNorm
              0.9883986 = fieldWeight in 1435, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.125 = fieldNorm(doc=1435)
          0.04570422 = weight(abstract_txt:approach in 1435) [ClassicSimilarity], result of:
            0.04570422 = score(doc=1435,freq=1.0), product of:
              0.09649791 = queryWeight, product of:
                1.4971994 = boost
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.017010218 = queryNorm
              0.47362912 = fieldWeight in 1435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.789033 = idf(docFreq=2600, maxDocs=42306)
                0.125 = fieldNorm(doc=1435)
          0.09523724 = weight(abstract_txt:context in 1435) [ClassicSimilarity], result of:
            0.09523724 = score(doc=1435,freq=1.0), product of:
              0.1732731 = queryWeight, product of:
                2.3166244 = boost
                4.397093 = idf(docFreq=1415, maxDocs=42306)
                0.017010218 = queryNorm
              0.5496366 = fieldWeight in 1435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.397093 = idf(docFreq=1415, maxDocs=42306)
                0.125 = fieldNorm(doc=1435)
          0.2377348 = weight(abstract_txt:hierarchy in 1435) [ClassicSimilarity], result of:
            0.2377348 = score(doc=1435,freq=1.0), product of:
              0.2896958 = queryWeight, product of:
                2.5941303 = boost
                6.565088 = idf(docFreq=161, maxDocs=42306)
                0.017010218 = queryNorm
              0.820636 = fieldWeight in 1435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.565088 = idf(docFreq=161, maxDocs=42306)
                0.125 = fieldNorm(doc=1435)
          0.2857664 = weight(abstract_txt:ontologies in 1435) [ClassicSimilarity], result of:
            0.2857664 = score(doc=1435,freq=1.0), product of:
              0.38830298 = queryWeight, product of:
                3.877309 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.017010218 = queryNorm
              0.73593664 = fieldWeight in 1435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.125 = fieldNorm(doc=1435)
        0.2 = coord(5/25)
    
  2. Tzitzikas, Y.; Spyratos, N.; Constantopoulos, P.; Analyti, A.: Extended faceted ontologies (2002) 0.14
    0.14379823 = sum of:
      0.14379823 = product of:
        0.71899116 = sum of:
          0.1313702 = weight(abstract_txt:subsumption in 4281) [ClassicSimilarity], result of:
            0.1313702 = score(doc=4281,freq=1.0), product of:
              0.18503384 = queryWeight, product of:
                1.1969765 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.017010218 = queryNorm
              0.7099793 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.078125 = fieldNorm(doc=4281)
          0.061190113 = weight(abstract_txt:ontology in 4281) [ClassicSimilarity], result of:
            0.061190113 = score(doc=4281,freq=1.0), product of:
              0.14008257 = queryWeight, product of:
                1.4728792 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.017010218 = queryNorm
              0.4368146 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.078125 = fieldNorm(doc=4281)
          0.17358616 = weight(abstract_txt:deriving in 4281) [ClassicSimilarity], result of:
            0.17358616 = score(doc=4281,freq=1.0), product of:
              0.28072 = queryWeight, product of:
                2.0850272 = boost
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.017010218 = queryNorm
              0.6183605 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9150147 = idf(docFreq=41, maxDocs=42306)
                0.078125 = fieldNorm(doc=4281)
          0.10026048 = weight(abstract_txt:hierarchical in 4281) [ClassicSimilarity], result of:
            0.10026048 = score(doc=4281,freq=1.0), product of:
              0.22286756 = queryWeight, product of:
                2.2753286 = boost
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.017010218 = queryNorm
              0.44986573 = fieldWeight in 4281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.078125 = fieldNorm(doc=4281)
          0.2525842 = weight(abstract_txt:ontologies in 4281) [ClassicSimilarity], result of:
            0.2525842 = score(doc=4281,freq=2.0), product of:
              0.38830298 = queryWeight, product of:
                3.877309 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.017010218 = queryNorm
              0.65048224 = fieldWeight in 4281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.078125 = fieldNorm(doc=4281)
        0.2 = coord(5/25)
    
  3. Amarger, F.; Chanet, J.-P.; Haemmerlé, O.; Hernandez, N.; Roussey, C.: SKOS sources transformations for ontology engineering : agronomical taxonomy use case (2014) 0.13
    0.13327132 = sum of:
      0.13327132 = product of:
        0.5552972 = sum of:
          0.049911752 = weight(abstract_txt:methodology in 3594) [ClassicSimilarity], result of:
            0.049911752 = score(doc=3594,freq=2.0), product of:
              0.097063325 = queryWeight, product of:
                1.2260344 = boost
                4.6541743 = idf(docFreq=1094, maxDocs=42306)
                0.017010218 = queryNorm
              0.51421845 = fieldWeight in 3594, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6541743 = idf(docFreq=1094, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
          0.056112777 = weight(abstract_txt:thesauri in 3594) [ClassicSimilarity], result of:
            0.056112777 = score(doc=3594,freq=1.0), product of:
              0.13222222 = queryWeight, product of:
                1.4309593 = boost
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.017010218 = queryNorm
              0.42438236 = fieldWeight in 3594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
          0.14988457 = weight(abstract_txt:ontology in 3594) [ClassicSimilarity], result of:
            0.14988457 = score(doc=3594,freq=6.0), product of:
              0.14008257 = queryWeight, product of:
                1.4728792 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.017010218 = queryNorm
              1.069973 = fieldWeight in 3594, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
          0.08068657 = weight(abstract_txt:input in 3594) [ClassicSimilarity], result of:
            0.08068657 = score(doc=3594,freq=1.0), product of:
              0.16844732 = queryWeight, product of:
                1.6151286 = boost
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.017010218 = queryNorm
              0.47900182 = fieldWeight in 3594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.131223 = idf(docFreq=249, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
          0.10106905 = weight(abstract_txt:transformation in 3594) [ClassicSimilarity], result of:
            0.10106905 = score(doc=3594,freq=1.0), product of:
              0.1957381 = queryWeight, product of:
                1.7410561 = boost
                6.609259 = idf(docFreq=154, maxDocs=42306)
                0.017010218 = queryNorm
              0.51634836 = fieldWeight in 3594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.609259 = idf(docFreq=154, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
          0.1176325 = weight(abstract_txt:taxonomies in 3594) [ClassicSimilarity], result of:
            0.1176325 = score(doc=3594,freq=1.0), product of:
              0.21657825 = queryWeight, product of:
                1.8313969 = boost
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.017010218 = queryNorm
              0.5431409 = fieldWeight in 3594, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9522038 = idf(docFreq=109, maxDocs=42306)
                0.078125 = fieldNorm(doc=3594)
        0.24 = coord(6/25)
    
  4. Fischer, D.H.: From thesauri towards ontologies? (1998) 0.13
    0.12999797 = sum of:
      0.12999797 = product of:
        0.64998984 = sum of:
          0.18578553 = weight(abstract_txt:subsumption in 3177) [ClassicSimilarity], result of:
            0.18578553 = score(doc=3177,freq=2.0), product of:
              0.18503384 = queryWeight, product of:
                1.1969765 = boost
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.017010218 = queryNorm
              1.0040624 = fieldWeight in 3177, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.087735 = idf(docFreq=12, maxDocs=42306)
                0.078125 = fieldNorm(doc=3177)
          0.07935545 = weight(abstract_txt:thesauri in 3177) [ClassicSimilarity], result of:
            0.07935545 = score(doc=3177,freq=2.0), product of:
              0.13222222 = queryWeight, product of:
                1.4309593 = boost
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.017010218 = queryNorm
              0.6001673 = fieldWeight in 3177, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.432094 = idf(docFreq=502, maxDocs=42306)
                0.078125 = fieldNorm(doc=3177)
          0.105984375 = weight(abstract_txt:ontology in 3177) [ClassicSimilarity], result of:
            0.105984375 = score(doc=3177,freq=3.0), product of:
              0.14008257 = queryWeight, product of:
                1.4728792 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.017010218 = queryNorm
              0.75658506 = fieldWeight in 3177, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.078125 = fieldNorm(doc=3177)
          0.10026048 = weight(abstract_txt:hierarchical in 3177) [ClassicSimilarity], result of:
            0.10026048 = score(doc=3177,freq=1.0), product of:
              0.22286756 = queryWeight, product of:
                2.2753286 = boost
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.017010218 = queryNorm
              0.44986573 = fieldWeight in 3177, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.078125 = fieldNorm(doc=3177)
          0.17860399 = weight(abstract_txt:ontologies in 3177) [ClassicSimilarity], result of:
            0.17860399 = score(doc=3177,freq=1.0), product of:
              0.38830298 = queryWeight, product of:
                3.877309 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.017010218 = queryNorm
              0.4599604 = fieldWeight in 3177, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.078125 = fieldNorm(doc=3177)
        0.2 = coord(5/25)
    
  5. SKOS2OWL : Online tool for deriving OWL ontologies from SKOS categorization schemas (2007) 0.13
    0.12706399 = sum of:
      0.12706399 = product of:
        0.79415 = sum of:
          0.09790418 = weight(abstract_txt:ontology in 1692) [ClassicSimilarity], result of:
            0.09790418 = score(doc=1692,freq=1.0), product of:
              0.14008257 = queryWeight, product of:
                1.4728792 = boost
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.017010218 = queryNorm
              0.6989034 = fieldWeight in 1692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.591227 = idf(docFreq=428, maxDocs=42306)
                0.125 = fieldNorm(doc=1692)
          0.13169436 = weight(abstract_txt:classifications in 1692) [ClassicSimilarity], result of:
            0.13169436 = score(doc=1692,freq=1.0), product of:
              0.17069785 = queryWeight, product of:
                1.6258823 = boost
                6.172045 = idf(docFreq=239, maxDocs=42306)
                0.017010218 = queryNorm
              0.77150565 = fieldWeight in 1692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.172045 = idf(docFreq=239, maxDocs=42306)
                0.125 = fieldNorm(doc=1692)
          0.16041677 = weight(abstract_txt:hierarchical in 1692) [ClassicSimilarity], result of:
            0.16041677 = score(doc=1692,freq=1.0), product of:
              0.22286756 = queryWeight, product of:
                2.2753286 = boost
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.017010218 = queryNorm
              0.71978515 = fieldWeight in 1692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.758281 = idf(docFreq=362, maxDocs=42306)
                0.125 = fieldNorm(doc=1692)
          0.40413472 = weight(abstract_txt:ontologies in 1692) [ClassicSimilarity], result of:
            0.40413472 = score(doc=1692,freq=2.0), product of:
              0.38830298 = queryWeight, product of:
                3.877309 = boost
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.017010218 = queryNorm
              1.0407716 = fieldWeight in 1692, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.887493 = idf(docFreq=318, maxDocs=42306)
                0.125 = fieldNorm(doc=1692)
        0.16 = coord(4/25)