Document (#32923)

Author
Kim, J.-M.
Shin, H.
Kim, H.-J.
Title
Schema and constraints-based matching and merging of Topic Maps
Source
Information processing and management. 43(2007) no.4, pp.930-945
Year
2007
Abstract
In this paper, we propose a multi-strategic matching and merging approach that finds correspondences between ontologies based on the syntactic or semantic characteristics and constraints of Topic Maps. The matching approach consists of a linguistic module and a Topic Map constraints-based module. The linguistic module computes similarities between concepts using morphological analysis, string normalization, tokenization, and language-dependent heuristics. The Topic Map constraints-based module takes advantage of several Topic Maps-dependent techniques, such as topic property-based matching, hierarchy-based matching, and association-based matching. This composite matching procedure need not generate a cross-pair of all topics from the ontologies, because unmatched pairs of topics can be removed using the characteristics and constraints of the Topic Maps. Merging between Topic Maps follows the matching operations. We set up the MERGE function to integrate two Topic Maps into a new Topic Map that satisfies such merge requirements as entity preservation, property preservation, relation preservation, and conflict resolution. For our experiments, we used oriental philosophy ontologies, western philosophy ontologies, the Yahoo western philosophy dictionary, and the Wikipedia philosophy ontology as input ontologies. Our experiments show that the automatically generated matching results conform to the outputs generated manually by domain experts and can be of great benefit to the subsequent merging operations.
Theme
Semantic interoperability
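
The abstract describes two ideas that a small example may make more concrete: a linguistic comparison of topic names after normalization and tokenization, and the pruning of candidate pairs so that the full cross product of topics is never generated. The following Python fragment is only an illustrative sketch and not the authors' implementation. The Jaccard measure, the 0.6 threshold, the use of a "type" field as the pruning constraint, and the sample topics are all assumptions made for this example.

```python
import re

def tokens(name):
    """Normalize a topic name: split camelCase, lowercase, keep alphanumeric tokens."""
    spaced = re.sub(r"([a-z])([A-Z])", r"\1 \2", name)
    return set(re.findall(r"[a-z0-9]+", spaced.lower()))

def name_similarity(a, b):
    """Jaccard overlap of the two token sets (a stand-in for the linguistic module)."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

def candidate_pairs(map_a, map_b):
    """Yield only pairs whose topic types agree, rather than every cross pair."""
    by_type = {}
    for topic in map_b:
        by_type.setdefault(topic["type"], []).append(topic)
    for ta in map_a:
        for tb in by_type.get(ta["type"], []):
            yield ta, tb

def match(map_a, map_b, threshold=0.6):
    """Return (name_a, name_b, similarity) for candidate pairs above the threshold."""
    result = []
    for ta, tb in candidate_pairs(map_a, map_b):
        sim = name_similarity(ta["name"], tb["name"])
        if sim >= threshold:
            result.append((ta["name"], tb["name"], sim))
    return result

# Hypothetical sample topics, invented for illustration only.
map_a = [{"name": "Theory of Knowledge", "type": "concept"},
         {"name": "Confucius", "type": "philosopher"}]
map_b = [{"name": "knowledge_theory", "type": "concept"},
         {"name": "Theory of Forms", "type": "concept"},
         {"name": "Knowledge", "type": "philosopher"}]

pairs = match(map_a, map_b)
for a, b, sim in pairs:
    print(f"{a} <-> {b}: {sim:.2f}")
```

In this sketch the type constraint means "Theory of Knowledge" is never compared with the philosopher-typed topic "Knowledge"; a hierarchy-based or association-based constraint could be plugged into candidate_pairs in the same way.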

Similar documents (author)

  1. Shin, H.-s.: Quality of Korean cataloging records in shared databases (2003) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:shin in 5498) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 5498, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=5498)
    
  2. Shin, D.-H.: Next generation of information infrastructure : a comparative case study of Korea versus the United States of America (2008) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:shin in 2365) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 2365, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=2365)
    
  3. Leydesdorff, L.; Shin, J.C.: How to evaluate universities in terms of their relative citation impacts : fractional counting of citations and the normalization of differences among disciplines (2011) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:shin in 4466) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 4466, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=4466)
    
  4. Keselman, A.; Rosemblat, G.; Kilicoglu, H.; Fiszman, M.; Jin, H.; Shin, D.; Rindflesch, T.C.: Adapting semantic natural language processing technology to address information overload in influenza epidemic management (2010) 2.44
    2.4377444 = sum of:
      2.4377444 = weight(author_txt:shin in 1312) [ClassicSimilarity], result of:
        2.4377444 = fieldWeight in 1312, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.25 = fieldNorm(doc=1312)
    
  5. Rosemblat, G.; Resnick, M.P.; Auston, I.; Shin, D.; Sneiderman, C.; Fiszman, M.; Rindflesch, T.C.: Extending SemRep to the public health domain (2013) 2.44
    2.4377444 = sum of:
      2.4377444 = weight(author_txt:shin in 2096) [ClassicSimilarity], result of:
        2.4377444 = fieldWeight in 2096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.25 = fieldNorm(doc=2096)
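
The author scores listed above can be reproduced by hand. The sketch below assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are consistent with the tf and idf values printed in the explanations, but they are inferred from those values rather than stated on this page. The fieldNorm values are taken from the listing as given, and the function names are chosen for this example.

```python
import math

def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency, in the form that matches the listed idf values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explanation trees."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the listing: author_txt:shin, docFreq=6, maxDocs=44218.
weight_norm_half = field_weight(1.0, 6, 44218, 0.5)      # entries 1 to 3
weight_norm_quarter = field_weight(1.0, 6, 44218, 0.25)  # entries 4 and 5

print(round(weight_norm_half, 4), round(weight_norm_quarter, 4))
```

The only factor that differs between the two groups is fieldNorm (0.5 versus 0.25), which is why entries 4 and 5, with their longer author lists, score half as much as entries 1 to 3.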
    

Similar documents (content)

  1. Widhalm, R.; Mueck, T.A.: Merging topics in well-formed XML topic maps (2003) 0.24
    0.240029 = sum of:
      0.240029 = product of:
        1.200145 = sum of:
          0.028222688 = weight(abstract_txt:topics in 2186) [ClassicSimilarity], result of:
            0.028222688 = score(doc=2186,freq=1.0), product of:
              0.07102572 = queryWeight, product of:
                1.1316683 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.012339679 = queryNorm
              0.3973587 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.078125 = fieldNorm(doc=2186)
          0.36749327 = weight(abstract_txt:merging in 2186) [ClassicSimilarity], result of:
            0.36749327 = score(doc=2186,freq=4.0), product of:
              0.31201324 = queryWeight, product of:
                3.3543844 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.012339679 = queryNorm
              1.177813 = fieldWeight in 2186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.078125 = fieldNorm(doc=2186)
          0.23603144 = weight(abstract_txt:constraints in 2186) [ClassicSimilarity], result of:
            0.23603144 = score(doc=2186,freq=2.0), product of:
              0.3152342 = queryWeight, product of:
                3.7696235 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.012339679 = queryNorm
              0.74874943 = fieldWeight in 2186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=2186)
          0.22759907 = weight(abstract_txt:maps in 2186) [ClassicSimilarity], result of:
            0.22759907 = score(doc=2186,freq=3.0), product of:
              0.28562558 = queryWeight, product of:
                3.9307053 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.012339679 = queryNorm
              0.7968441 = fieldWeight in 2186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.078125 = fieldNorm(doc=2186)
          0.3407986 = weight(abstract_txt:topic in 2186) [ClassicSimilarity], result of:
            0.3407986 = score(doc=2186,freq=6.0), product of:
              0.3517938 = queryWeight, product of:
                5.6317115 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.012339679 = queryNorm
              0.9687453 = fieldWeight in 2186, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=2186)
        0.2 = coord(5/25)
    
  2. Baião Salgado Silva, G.; Lima, G.Â. Borém de Oliveira: Using topic maps in establishing compatibility of semantically structured hypertext contents (2012) 0.22
    0.22236252 = sum of:
      0.22236252 = product of:
        0.79415184 = sum of:
          0.019943904 = weight(abstract_txt:characteristics in 633) [ClassicSimilarity], result of:
            0.019943904 = score(doc=633,freq=1.0), product of:
              0.06538782 = queryWeight, product of:
                1.0858248 = boost
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.012339679 = queryNorm
              0.30500945 = fieldWeight in 633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8801513 = idf(docFreq=912, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.015122325 = weight(abstract_txt:between in 633) [ClassicSimilarity], result of:
            0.015122325 = score(doc=633,freq=2.0), product of:
              0.049399536 = queryWeight, product of:
                1.155895 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.012339679 = queryNorm
              0.3061228 = fieldWeight in 633, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.046733424 = weight(abstract_txt:property in 633) [ClassicSimilarity], result of:
            0.046733424 = score(doc=633,freq=1.0), product of:
              0.11535644 = queryWeight, product of:
                1.4422224 = boost
                6.481951 = idf(docFreq=183, maxDocs=44218)
                0.012339679 = queryNorm
              0.40512195 = fieldWeight in 633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.481951 = idf(docFreq=183, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.18651089 = weight(abstract_txt:merge in 633) [ClassicSimilarity], result of:
            0.18651089 = score(doc=633,freq=3.0), product of:
              0.20124224 = queryWeight, product of:
                1.9048944 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012339679 = queryNorm
              0.9267979 = fieldWeight in 633, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.019458251 = weight(abstract_txt:based in 633) [ClassicSimilarity], result of:
            0.019458251 = score(doc=633,freq=1.0), product of:
              0.097659685 = queryWeight, product of:
                2.4825785 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.012339679 = queryNorm
              0.19924548 = fieldWeight in 633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.25749895 = weight(abstract_txt:maps in 633) [ClassicSimilarity], result of:
            0.25749895 = score(doc=633,freq=6.0), product of:
              0.28562558 = queryWeight, product of:
                3.9307053 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.012339679 = queryNorm
              0.9015263 = fieldWeight in 633, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
          0.24888408 = weight(abstract_txt:topic in 633) [ClassicSimilarity], result of:
            0.24888408 = score(doc=633,freq=5.0), product of:
              0.3517938 = queryWeight, product of:
                5.6317115 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.012339679 = queryNorm
              0.7074715 = fieldWeight in 633, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=633)
        0.28 = coord(7/25)
    
  3. Green, R.: Topical relevance relationships : 2: an exploratory study and preliminary typology (1995) 0.20
    0.19881834 = sum of:
      0.19881834 = product of:
        0.8284098 = sum of:
          0.039912906 = weight(abstract_txt:topics in 3724) [ClassicSimilarity], result of:
            0.039912906 = score(doc=3724,freq=2.0), product of:
              0.07102572 = queryWeight, product of:
                1.1316683 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.012339679 = queryNorm
              0.56195 = fieldWeight in 3724, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.018902905 = weight(abstract_txt:between in 3724) [ClassicSimilarity], result of:
            0.018902905 = score(doc=3724,freq=2.0), product of:
              0.049399536 = queryWeight, product of:
                1.155895 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.012339679 = queryNorm
              0.3826535 = fieldWeight in 3724, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.036097597 = weight(abstract_txt:generated in 3724) [ClassicSimilarity], result of:
            0.036097597 = score(doc=3724,freq=1.0), product of:
              0.083689116 = queryWeight, product of:
                1.2284169 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.012339679 = queryNorm
              0.43132967 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.16689944 = weight(abstract_txt:constraints in 3724) [ClassicSimilarity], result of:
            0.16689944 = score(doc=3724,freq=1.0), product of:
              0.3152342 = queryWeight, product of:
                3.7696235 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.012339679 = queryNorm
              0.5294458 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.19676013 = weight(abstract_txt:topic in 3724) [ClassicSimilarity], result of:
            0.19676013 = score(doc=3724,freq=2.0), product of:
              0.3517938 = queryWeight, product of:
                5.6317115 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.012339679 = queryNorm
              0.5593053 = fieldWeight in 3724, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.3698368 = weight(abstract_txt:matching in 3724) [ClassicSimilarity], result of:
            0.3698368 = score(doc=3724,freq=3.0), product of:
              0.45191208 = queryWeight, product of:
                6.0554237 = boost
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.012339679 = queryNorm
              0.8183822 = fieldWeight in 3724, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
        0.24 = coord(6/25)
    
  4. Cregan, A.: An OWL DL construction for the ISO Topic Map Data Model (2005) 0.17
    0.16552477 = sum of:
      0.16552477 = product of:
        0.8276238 = sum of:
          0.015122325 = weight(abstract_txt:between in 4718) [ClassicSimilarity], result of:
            0.015122325 = score(doc=4718,freq=2.0), product of:
              0.049399536 = queryWeight, product of:
                1.155895 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.012339679 = queryNorm
              0.3061228 = fieldWeight in 4718, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=4718)
          0.019458251 = weight(abstract_txt:based in 4718) [ClassicSimilarity], result of:
            0.019458251 = score(doc=4718,freq=1.0), product of:
              0.097659685 = queryWeight, product of:
                2.4825785 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.012339679 = queryNorm
              0.19924548 = fieldWeight in 4718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4718)
          0.18882516 = weight(abstract_txt:constraints in 4718) [ClassicSimilarity], result of:
            0.18882516 = score(doc=4718,freq=2.0), product of:
              0.3152342 = queryWeight, product of:
                3.7696235 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.012339679 = queryNorm
              0.59899956 = fieldWeight in 4718, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=4718)
          0.23506331 = weight(abstract_txt:maps in 4718) [ClassicSimilarity], result of:
            0.23506331 = score(doc=4718,freq=5.0), product of:
              0.28562558 = queryWeight, product of:
                3.9307053 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.012339679 = queryNorm
              0.8229771 = fieldWeight in 4718, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=4718)
          0.36915475 = weight(abstract_txt:topic in 4718) [ClassicSimilarity], result of:
            0.36915475 = score(doc=4718,freq=11.0), product of:
              0.3517938 = queryWeight, product of:
                5.6317115 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.012339679 = queryNorm
              1.0493498 = fieldWeight in 4718, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=4718)
        0.2 = coord(5/25)
    
  5. Pepper, S.; Groenmo, G.O.: Towards a general theory of scope (2002) 0.15
    0.15183854 = sum of:
      0.15183854 = product of:
        0.7591927 = sum of:
          0.02257815 = weight(abstract_txt:topics in 539) [ClassicSimilarity], result of:
            0.02257815 = score(doc=539,freq=1.0), product of:
              0.07102572 = queryWeight, product of:
                1.1316683 = boost
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.012339679 = queryNorm
              0.31788695 = fieldWeight in 539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.086191 = idf(docFreq=742, maxDocs=44218)
                0.0625 = fieldNorm(doc=539)
          0.10768212 = weight(abstract_txt:merge in 539) [ClassicSimilarity], result of:
            0.10768212 = score(doc=539,freq=1.0), product of:
              0.20124224 = queryWeight, product of:
                1.9048944 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012339679 = queryNorm
              0.53508705 = fieldWeight in 539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=539)
          0.019458251 = weight(abstract_txt:based in 539) [ClassicSimilarity], result of:
            0.019458251 = score(doc=539,freq=1.0), product of:
              0.097659685 = queryWeight, product of:
                2.4825785 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.012339679 = queryNorm
              0.19924548 = fieldWeight in 539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=539)
          0.25749895 = weight(abstract_txt:maps in 539) [ClassicSimilarity], result of:
            0.25749895 = score(doc=539,freq=6.0), product of:
              0.28562558 = queryWeight, product of:
                3.9307053 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.012339679 = queryNorm
              0.9015263 = fieldWeight in 539, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=539)
          0.35197526 = weight(abstract_txt:topic in 539) [ClassicSimilarity], result of:
            0.35197526 = score(doc=539,freq=10.0), product of:
              0.3517938 = queryWeight, product of:
                5.6317115 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.012339679 = queryNorm
              1.0005158 = fieldWeight in 539, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=539)
        0.2 = coord(5/25)
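
The content scores combine more factors than the author scores. As a worked check, the sketch below recomputes the score of the first entry (doc 2186) from the boost, freq, docFreq, fieldNorm, queryNorm, and coord values printed in its explanation. It assumes the same tf and idf formulas inferred from the listed values (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are chosen for this example.

```python
import math

QUERY_NORM = 0.012339679  # copied from the explanations
MAX_DOCS = 44218

def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency in the form that matches the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def term_score(boost, freq, doc_freq, field_norm):
    """Per-term score = queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

# (boost, freq, docFreq) of the five query terms matched in doc 2186.
terms_2186 = [
    (1.1316683, 1.0, 742),  # topics
    (3.3543844, 4.0, 63),   # merging
    (3.7696235, 2.0, 136),  # constraints
    (3.9307053, 3.0, 332),  # maps
    (5.6317115, 6.0, 760),  # topic
]

raw_sum = sum(term_score(b, f, df, 0.078125) for b, f, df in terms_2186)
coord = 5 / 25  # 5 of the 25 query terms matched
score = raw_sum * coord

print(round(raw_sum, 4), round(score, 4))
```

The coord factor explains why entry 2, with 7 of 25 terms matched, can rank close to entry 1 even though its raw term sum is lower.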