Document (#36886)

Author
Schulz, S.
Schober, D.
Tudose, I.
Stenzhorn, H.
Title
¬The pitfalls of thesaurus ontologization : the case of the NCI thesaurus
Issue
Published online 2010 November 13.
Source
AMIA Annual Symposium 2010: Improving Health: Informatics and IT Changing the World: Proceedings
Year
2010
Pages
S.727-731
Abstract
Thesauri that are "ontologized" into OWL-DL semantics are highly amenable to modeling errors resulting from falsely interpreting existential restrictions. We investigated the OWL-DL representation of the NCI Thesaurus (NCIT) in order to assess the correctness of existential restrictions. A random sample of 354 axioms using the someValuesFrom operator was taken. According to a rating performed by two domain experts, roughly half of these examples, and in consequence more than 76,000 axioms in the OWL-DL version, make incorrect assertions if interpreted according to description logics semantics. These axioms therefore constitute a huge source for unintended models, rendering most logic-based reasoning unreliable. After identifying typical error patterns we discuss some possible improvements. Our recommendation is to either amend the problematic axioms in the OWL-DL formalization or to consider some less strict representational format.
Content
Vgl.: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041372/.
Theme
Wissensrepräsentation
Field
Medizin
Object
NCI Thesaurus

Similar documents (author)

  1. Schulz, H.: Zur Charakterisierung der BBK/A (1988) 4.66
    4.664238 = sum of:
      4.664238 = weight(author_txt:schulz in 90) [ClassicSimilarity], result of:
        4.664238 = fieldWeight in 90, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.462781 = idf(docFreq=68, maxDocs=44218)
          0.625 = fieldNorm(doc=90)
    
  2. Schulz, U.: Was ist eine sinnvolle Schlagwortsyntax (eine Polemik) (1991) 4.66
    4.664238 = sum of:
      4.664238 = weight(author_txt:schulz in 129) [ClassicSimilarity], result of:
        4.664238 = fieldWeight in 129, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.462781 = idf(docFreq=68, maxDocs=44218)
          0.625 = fieldNorm(doc=129)
    
  3. Schulz, U.: Einführung in die Grundlagen der inhaltlichen Erschließung mit BISMAS am Fachbereich BID der FHS Hannover (1991) 4.66
    4.664238 = sum of:
      4.664238 = weight(author_txt:schulz in 458) [ClassicSimilarity], result of:
        4.664238 = fieldWeight in 458, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.462781 = idf(docFreq=68, maxDocs=44218)
          0.625 = fieldNorm(doc=458)
    
  4. Schulz, U.: ¬Die niederländische Basisklassifikation: eine Alternative für die "Sachgruppen" im Fremddatenangebot der Deutschen Bibliothek (1991) 4.66
    4.664238 = sum of:
      4.664238 = weight(author_txt:schulz in 949) [ClassicSimilarity], result of:
        4.664238 = fieldWeight in 949, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.462781 = idf(docFreq=68, maxDocs=44218)
          0.625 = fieldNorm(doc=949)
    
  5. Schulz, H.: ¬Die Adaption der BBK : Ergänzungen zur Methodik (1983) 4.66
    4.664238 = sum of:
      4.664238 = weight(author_txt:schulz in 1125) [ClassicSimilarity], result of:
        4.664238 = fieldWeight in 1125, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.462781 = idf(docFreq=68, maxDocs=44218)
          0.625 = fieldNorm(doc=1125)
    

Similar documents (content)

  1. Blanco, E.; Moldovan, D.: ¬A model for composing semantic relations (2011) 0.11
    0.113964975 = sum of:
      0.113964975 = product of:
        0.94970816 = sum of:
          0.06833148 = weight(abstract_txt:semantics in 4762) [ClassicSimilarity], result of:
            0.06833148 = score(doc=4762,freq=1.0), product of:
              0.14512064 = queryWeight, product of:
                1.638694 = boost
                6.027006 = idf(docFreq=289, maxDocs=44218)
                0.014693649 = queryNorm
              0.47085986 = fieldWeight in 4762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.027006 = idf(docFreq=289, maxDocs=44218)
                0.078125 = fieldNorm(doc=4762)
          0.14293043 = weight(abstract_txt:restrictions in 4762) [ClassicSimilarity], result of:
            0.14293043 = score(doc=4762,freq=1.0), product of:
              0.23735502 = queryWeight, product of:
                2.0957165 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.014693649 = queryNorm
              0.60217994 = fieldWeight in 4762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.078125 = fieldNorm(doc=4762)
          0.73844624 = weight(abstract_txt:axioms in 4762) [ClassicSimilarity], result of:
            0.73844624 = score(doc=4762,freq=3.0), product of:
              0.6196752 = queryWeight, product of:
                4.7888403 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.014693649 = queryNorm
              1.1916666 = fieldWeight in 4762, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.078125 = fieldNorm(doc=4762)
        0.12 = coord(3/25)
    
  2. Das, S.; Naskar, D.; Roy, S.: Reorganizing educational institutional domain using faceted ontological principles (2022) 0.09
    0.09268109 = sum of:
      0.09268109 = product of:
        0.7723424 = sum of:
          0.010869891 = weight(abstract_txt:some in 1098) [ClassicSimilarity], result of:
            0.010869891 = score(doc=1098,freq=1.0), product of:
              0.054042246 = queryWeight, product of:
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.014693649 = queryNorm
              0.20113693 = fieldWeight in 1098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1098)
          0.09414145 = weight(abstract_txt:formalization in 1098) [ClassicSimilarity], result of:
            0.09414145 = score(doc=1098,freq=2.0), product of:
              0.14357665 = queryWeight, product of:
                1.1525512 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.014693649 = queryNorm
              0.65568775 = fieldWeight in 1098, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1098)
          0.66733104 = weight(abstract_txt:axioms in 1098) [ClassicSimilarity], result of:
            0.66733104 = score(doc=1098,freq=5.0), product of:
              0.6196752 = queryWeight, product of:
                4.7888403 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.014693649 = queryNorm
              1.0769045 = fieldWeight in 1098, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1098)
        0.12 = coord(3/25)
    
  3. Martínez-González, M.M.; Alvite-Díez, M.L.: Thesauri and Semantic Web : discussion of the evolution of thesauri toward their integration with the Semantic Web (2019) 0.05
    0.0511827 = sum of:
      0.0511827 = product of:
        0.4265225 = sum of:
          0.012422733 = weight(abstract_txt:some in 5997) [ClassicSimilarity], result of:
            0.012422733 = score(doc=5997,freq=1.0), product of:
              0.054042246 = queryWeight, product of:
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.014693649 = queryNorm
              0.22987078 = fieldWeight in 5997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=5997)
          0.07302604 = weight(abstract_txt:thesaurus in 5997) [ClassicSimilarity], result of:
            0.07302604 = score(doc=5997,freq=2.0), product of:
              0.15992911 = queryWeight, product of:
                2.106894 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.014693649 = queryNorm
              0.45661503 = fieldWeight in 5997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=5997)
          0.34107372 = weight(abstract_txt:axioms in 5997) [ClassicSimilarity], result of:
            0.34107372 = score(doc=5997,freq=1.0), product of:
              0.6196752 = queryWeight, product of:
                4.7888403 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.014693649 = queryNorm
              0.55040723 = fieldWeight in 5997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=5997)
        0.12 = coord(3/25)
    
  4. Fischer, D.H.: Converting a thesaurus to OWL : Notes on the paper "The National Cancer Institute's Thesaurus and Ontology" (2004) 0.05
    0.05050368 = sum of:
      0.05050368 = product of:
        0.2525184 = sum of:
          0.04167345 = weight(abstract_txt:strict in 2362) [ClassicSimilarity], result of:
            0.04167345 = score(doc=2362,freq=1.0), product of:
              0.13149168 = queryWeight, product of:
                1.1029794 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.014693649 = queryNorm
              0.31692845 = fieldWeight in 2362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2362)
          0.059421666 = weight(abstract_txt:assertions in 2362) [ClassicSimilarity], result of:
            0.059421666 = score(doc=2362,freq=1.0), product of:
              0.1665796 = queryWeight, product of:
                1.24145 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.014693649 = queryNorm
              0.35671633 = fieldWeight in 2362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2362)
          0.03187053 = weight(abstract_txt:according in 2362) [ClassicSimilarity], result of:
            0.03187053 = score(doc=2362,freq=2.0), product of:
              0.10996424 = queryWeight, product of:
                1.4264581 = boost
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.014693649 = queryNorm
              0.2898263 = fieldWeight in 2362, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2362)
          0.03416574 = weight(abstract_txt:semantics in 2362) [ClassicSimilarity], result of:
            0.03416574 = score(doc=2362,freq=1.0), product of:
              0.14512064 = queryWeight, product of:
                1.638694 = boost
                6.027006 = idf(docFreq=289, maxDocs=44218)
                0.014693649 = queryNorm
              0.23542993 = fieldWeight in 2362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.027006 = idf(docFreq=289, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2362)
          0.08538699 = weight(abstract_txt:thesaurus in 2362) [ClassicSimilarity], result of:
            0.08538699 = score(doc=2362,freq=7.0), product of:
              0.15992911 = queryWeight, product of:
                2.106894 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.014693649 = queryNorm
              0.53390527 = fieldWeight in 2362, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2362)
        0.2 = coord(5/25)
    
  5. Gnoli, C.: ISKO News (2007) 0.05
    0.048619915 = sum of:
      0.048619915 = product of:
        0.6077489 = sum of:
          0.010869891 = weight(abstract_txt:some in 1092) [ClassicSimilarity], result of:
            0.010869891 = score(doc=1092,freq=1.0), product of:
              0.054042246 = queryWeight, product of:
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.014693649 = queryNorm
              0.20113693 = fieldWeight in 1092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
          0.596879 = weight(abstract_txt:axioms in 1092) [ClassicSimilarity], result of:
            0.596879 = score(doc=1092,freq=4.0), product of:
              0.6196752 = queryWeight, product of:
                4.7888403 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.014693649 = queryNorm
              0.96321267 = fieldWeight in 1092, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
        0.08 = coord(2/25)