Document (#32924)

Author
Kim, J.-M.
Shin, H.
Kim, H.-J.
Title
Schema and constraints-based matching and merging of Topic Maps
Source
Information processing and management. 43(2007) no.4, pp.930-945
Year
2007
Abstract
In this paper, we propose a multi-strategic matching and merging approach that finds correspondences between ontologies based on the syntactic and semantic characteristics and constraints of Topic Maps. Our multi-strategic matching approach consists of a linguistic module and a Topic Map constraints-based module. The linguistic module computes similarities between concepts using morphological analysis, string normalization and tokenization, and language-dependent heuristics. The Topic Map constraints-based module uses several Topic Maps-dependent techniques, such as topic property-based matching, hierarchy-based matching, and association-based matching. This composite matching procedure need not generate every cross-pair of topics from the ontologies, because pairs that cannot match are removed using the characteristics and constraints of the Topic Maps. Merging of Topic Maps follows the matching operations. We define a MERGE function that integrates two Topic Maps into a new Topic Map satisfying merge requirements such as entity preservation, property preservation, relation preservation, and conflict resolution. For our experiments, we used oriental philosophy ontologies, western philosophy ontologies, the Yahoo western philosophy dictionary, and the Wikipedia philosophy ontology as input ontologies. Our experiments show that the automatically generated matching results conform to the outputs generated manually by domain experts and can greatly benefit the subsequent merging operations.
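The two-stage idea in the abstract (a linguistic similarity plus constraint-based pruning of candidate pairs) can be illustrated with a small sketch. This is not the authors' implementation: the token-overlap (Jaccard) measure, the use of a single "topic type" as the pruning constraint, the threshold, and all function names and sample topics are assumptions made only for illustration.

```python
import re
from itertools import product

def tokens(name):
    """Normalize a topic name to a set of lowercase alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", name.lower()))

def name_similarity(a, b):
    """Jaccard overlap of token sets (an assumed stand-in for the linguistic module)."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

def candidate_pairs(map_a, map_b):
    """Yield only pairs whose topic types agree (an assumed constraint),
    so the full cross-product is never scored."""
    for (name_a, type_a), (name_b, type_b) in product(map_a, map_b):
        if type_a == type_b:
            yield name_a, name_b

def match(map_a, map_b, threshold=0.5):
    """Return (name_a, name_b, similarity) for surviving pairs above the threshold."""
    results = []
    for name_a, name_b in candidate_pairs(map_a, map_b):
        sim = name_similarity(name_a, name_b)
        if sim >= threshold:
            results.append((name_a, name_b, sim))
    return results

# Hypothetical topics given as (name, topic type) pairs.
map_a = [("Confucian Ethics", "school"), ("Mencius", "person")]
map_b = [("Ethics, Confucian", "school"),
         ("Mencius (philosopher)", "person"),
         ("Ethics", "person")]
matches = match(map_a, map_b)
print(matches)
```

In this sketch the pair ("Confucian Ethics", "Ethics") is never scored because the types differ, which is the kind of saving the abstract attributes to Topic Map constraints.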
Theme
Semantic interoperability

Similar documents (author)

  1. Shin, H.-s.: Quality of Korean cataloging records in shared databases (2003) 4.85
    4.853387 = sum of:
      4.853387 = weight(author_txt:shin in 499) [ClassicSimilarity], result of:
        4.853387 = fieldWeight in 499, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.5 = fieldNorm(doc=499)
    
  2. Shin, D.-H.: Next generation of information infrastructure : a comparative case study of Korea versus the United States of America (2008) 4.85
    4.853387 = sum of:
      4.853387 = weight(author_txt:shin in 185) [ClassicSimilarity], result of:
        4.853387 = fieldWeight in 185, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.5 = fieldNorm(doc=185)
    
  3. Leydesdorff, L.; Shin, J.C.: How to evaluate universities in terms of their relative citation impacts : fractional counting of citations and the normalization of differences among disciplines (2011) 4.85
    4.853387 = sum of:
      4.853387 = weight(author_txt:shin in 1467) [ClassicSimilarity], result of:
        4.853387 = fieldWeight in 1467, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.5 = fieldNorm(doc=1467)
    
  4. Keselman, A.; Rosemblat, G.; Kilicoglu, H.; Fiszman, M.; Jin, H.; Shin, D.; Rindflesch, T.C.: Adapting semantic natural language processing technology to address information overload in influenza epidemic management (2010) 2.43
    2.4266934 = sum of:
      2.4266934 = weight(author_txt:shin in 3313) [ClassicSimilarity], result of:
        2.4266934 = fieldWeight in 3313, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.25 = fieldNorm(doc=3313)
    
  5. Rosemblat, G.; Resnick, M.P.; Auston, I.; Shin, D.; Sneiderman, C.; Fiszman, M.; Rindflesch, T.C.: Extending SemRep to the public health domain (2013) 2.43
    2.4266934 = sum of:
      2.4266934 = weight(author_txt:shin in 4097) [ClassicSimilarity], result of:
        2.4266934 = fieldWeight in 4097, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.706774 = idf(docFreq=6, maxDocs=42306)
          0.25 = fieldNorm(doc=4097)
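
The author scores above follow the usual ClassicSimilarity arithmetic: fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq). The sketch below recomputes them in Python. The idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption, chosen because it reproduces the idf values printed in the listing; the function names are illustrative only.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Assumed idf formula; it reproduces idf(docFreq=6, maxDocs=42306) above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entries 1-3 use fieldNorm 0.5; entries 4-5 use fieldNorm 0.25.
high = field_weight(1.0, 6, 42306, 0.5)
low = field_weight(1.0, 6, 42306, 0.25)
print(round(high, 2), round(low, 2))
```

Because tf and idf are the same for all five hits, the ranking here is decided by fieldNorm alone: the records with longer author lists have the smaller norm and therefore half the score.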
    

Similar documents (content)

  1. Widhalm, R.; Mueck, T.A.: Merging topics in well-formed XML topic maps (2003) 0.24
    0.24150108 = sum of:
      0.24150108 = product of:
        1.2075053 = sum of:
          0.028707333 = weight(abstract_txt:topics in 3187) [ClassicSimilarity], result of:
            0.028707333 = score(doc=3187,freq=1.0), product of:
              0.071620174 = queryWeight, product of:
                1.1384077 = boost
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.012262248 = queryNorm
              0.40082747 = fieldWeight in 3187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.078125 = fieldNorm(doc=3187)
          0.36959293 = weight(abstract_txt:merging in 3187) [ClassicSimilarity], result of:
            0.36959293 = score(doc=3187,freq=4.0), product of:
              0.3122573 = queryWeight, product of:
                3.361642 = boost
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.012262248 = queryNorm
              1.1836166 = fieldWeight in 3187, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5751467 = idf(docFreq=58, maxDocs=42306)
                0.078125 = fieldNorm(doc=3187)
          0.23719159 = weight(abstract_txt:constraints in 3187) [ClassicSimilarity], result of:
            0.23719159 = score(doc=3187,freq=2.0), product of:
              0.31531382 = queryWeight, product of:
                3.7767797 = boost
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.012262248 = queryNorm
              0.75223976 = fieldWeight in 3187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.078125 = fieldNorm(doc=3187)
          0.22576585 = weight(abstract_txt:maps in 3187) [ClassicSimilarity], result of:
            0.22576585 = score(doc=3187,freq=3.0), product of:
              0.28323418 = queryWeight, product of:
                3.9211514 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.012262248 = queryNorm
              0.7970996 = fieldWeight in 3187, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.078125 = fieldNorm(doc=3187)
          0.3462477 = weight(abstract_txt:topic in 3187) [ClassicSimilarity], result of:
            0.3462477 = score(doc=3187,freq=6.0), product of:
              0.35446307 = queryWeight, product of:
                5.6630535 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.012262248 = queryNorm
              0.97682303 = fieldWeight in 3187, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.078125 = fieldNorm(doc=3187)
        0.2 = coord(5/25)
    
  2. Baião Salgado Silva, G.; Lima, G.Â. Borém de Oliveira: Using topic maps in establishing compatibility of semantically structured hypertext contents (2012) 0.22
    0.22278625 = sum of:
      0.22278625 = product of:
        0.79566514 = sum of:
          0.020112228 = weight(abstract_txt:characteristics in 2634) [ClassicSimilarity], result of:
            0.020112228 = score(doc=2634,freq=1.0), product of:
              0.06555718 = queryWeight, product of:
                1.0891565 = boost
                4.908625 = idf(docFreq=848, maxDocs=42306)
                0.012262248 = queryNorm
              0.30678907 = fieldWeight in 2634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.908625 = idf(docFreq=848, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.015442397 = weight(abstract_txt:between in 2634) [ClassicSimilarity], result of:
            0.015442397 = score(doc=2634,freq=2.0), product of:
              0.04994328 = queryWeight, product of:
                1.1642984 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.012262248 = queryNorm
              0.3091987 = fieldWeight in 2634, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.047193248 = weight(abstract_txt:property in 2634) [ClassicSimilarity], result of:
            0.047193248 = score(doc=2634,freq=1.0), product of:
              0.11576219 = queryWeight, product of:
                1.4473165 = boost
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.012262248 = queryNorm
              0.4076741 = fieldWeight in 2634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5227857 = idf(docFreq=168, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.18484697 = weight(abstract_txt:merge in 2634) [ClassicSimilarity], result of:
            0.18484697 = score(doc=2634,freq=3.0), product of:
              0.19944109 = queryWeight, product of:
                1.8997107 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.012262248 = queryNorm
              0.92682487 = fieldWeight in 2634, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.01978181 = weight(abstract_txt:based in 2634) [ClassicSimilarity], result of:
            0.01978181 = score(doc=2634,freq=1.0), product of:
              0.09844195 = queryWeight, product of:
                2.496918 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.012262248 = queryNorm
              0.20094898 = fieldWeight in 2634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.25542492 = weight(abstract_txt:maps in 2634) [ClassicSimilarity], result of:
            0.25542492 = score(doc=2634,freq=6.0), product of:
              0.28323418 = queryWeight, product of:
                3.9211514 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.012262248 = queryNorm
              0.9018153 = fieldWeight in 2634, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
          0.25286356 = weight(abstract_txt:topic in 2634) [ClassicSimilarity], result of:
            0.25286356 = score(doc=2634,freq=5.0), product of:
              0.35446307 = queryWeight, product of:
                5.6630535 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.012262248 = queryNorm
              0.7133707 = fieldWeight in 2634, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.0625 = fieldNorm(doc=2634)
        0.28 = coord(7/25)
    
  3. Green, R.: Topical relevance relationships : 2: an exploratory study and preliminary typology (1995) 0.20
    0.19976115 = sum of:
      0.19976115 = product of:
        0.83233815 = sum of:
          0.040598296 = weight(abstract_txt:topics in 3793) [ClassicSimilarity], result of:
            0.040598296 = score(doc=3793,freq=2.0), product of:
              0.071620174 = queryWeight, product of:
                1.1384077 = boost
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.012262248 = queryNorm
              0.5668556 = fieldWeight in 3793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
          0.019302998 = weight(abstract_txt:between in 3793) [ClassicSimilarity], result of:
            0.019302998 = score(doc=3793,freq=2.0), product of:
              0.04994328 = queryWeight, product of:
                1.1642984 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.012262248 = queryNorm
              0.3864984 = fieldWeight in 3793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
          0.036473617 = weight(abstract_txt:generated in 3793) [ClassicSimilarity], result of:
            0.036473617 = score(doc=3793,freq=1.0), product of:
              0.08401549 = queryWeight, product of:
                1.2329907 = boost
                5.5568595 = idf(docFreq=443, maxDocs=42306)
                0.012262248 = queryNorm
              0.43412966 = fieldWeight in 3793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5568595 = idf(docFreq=443, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
          0.16771978 = weight(abstract_txt:constraints in 3793) [ClassicSimilarity], result of:
            0.16771978 = score(doc=3793,freq=1.0), product of:
              0.31531382 = queryWeight, product of:
                3.7767797 = boost
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.012262248 = queryNorm
              0.5319138 = fieldWeight in 3793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
          0.19990619 = weight(abstract_txt:topic in 3793) [ClassicSimilarity], result of:
            0.19990619 = score(doc=3793,freq=2.0), product of:
              0.35446307 = queryWeight, product of:
                5.6630535 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.012262248 = queryNorm
              0.563969 = fieldWeight in 3793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
          0.36833727 = weight(abstract_txt:matching in 3793) [ClassicSimilarity], result of:
            0.36833727 = score(doc=3793,freq=3.0), product of:
              0.44933236 = queryWeight, product of:
                6.0488143 = boost
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.012262248 = queryNorm
              0.8197435 = fieldWeight in 3793, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.057973 = idf(docFreq=268, maxDocs=42306)
                0.078125 = fieldNorm(doc=3793)
        0.24 = coord(6/25)
    
  4. Cregan, A.: An OWL DL construction for the ISO Topic Map Data Model (2005) 0.17
    0.16664095 = sum of:
      0.16664095 = product of:
        0.83320475 = sum of:
          0.015442397 = weight(abstract_txt:between in 1719) [ClassicSimilarity], result of:
            0.015442397 = score(doc=1719,freq=2.0), product of:
              0.04994328 = queryWeight, product of:
                1.1642984 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.012262248 = queryNorm
              0.3091987 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=1719)
          0.01978181 = weight(abstract_txt:based in 1719) [ClassicSimilarity], result of:
            0.01978181 = score(doc=1719,freq=1.0), product of:
              0.09844195 = queryWeight, product of:
                2.496918 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.012262248 = queryNorm
              0.20094898 = fieldWeight in 1719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=1719)
          0.18975326 = weight(abstract_txt:constraints in 1719) [ClassicSimilarity], result of:
            0.18975326 = score(doc=1719,freq=2.0), product of:
              0.31531382 = queryWeight, product of:
                3.7767797 = boost
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.012262248 = queryNorm
              0.6017918 = fieldWeight in 1719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.0625 = fieldNorm(doc=1719)
          0.23316997 = weight(abstract_txt:maps in 1719) [ClassicSimilarity], result of:
            0.23316997 = score(doc=1719,freq=5.0), product of:
              0.28323418 = queryWeight, product of:
                3.9211514 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.012262248 = queryNorm
              0.82324094 = fieldWeight in 1719, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=1719)
          0.37505728 = weight(abstract_txt:topic in 1719) [ClassicSimilarity], result of:
            0.37505728 = score(doc=1719,freq=11.0), product of:
              0.35446307 = queryWeight, product of:
                5.6630535 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.012262248 = queryNorm
              1.0580997 = fieldWeight in 1719, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.0625 = fieldNorm(doc=1719)
        0.2 = coord(5/25)
    
  5. Pepper, S.; Groenmo, G.O.: Towards a general theory of scope (2002) 0.15
    0.15249942 = sum of:
      0.15249942 = product of:
        0.7624971 = sum of:
          0.022965865 = weight(abstract_txt:topics in 2540) [ClassicSimilarity], result of:
            0.022965865 = score(doc=2540,freq=1.0), product of:
              0.071620174 = queryWeight, product of:
                1.1384077 = boost
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.012262248 = queryNorm
              0.32066196 = fieldWeight in 2540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1305914 = idf(docFreq=679, maxDocs=42306)
                0.0625 = fieldNorm(doc=2540)
          0.106721446 = weight(abstract_txt:merge in 2540) [ClassicSimilarity], result of:
            0.106721446 = score(doc=2540,freq=1.0), product of:
              0.19944109 = queryWeight, product of:
                1.8997107 = boost
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.012262248 = queryNorm
              0.5351026 = fieldWeight in 2540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561642 = idf(docFreq=21, maxDocs=42306)
                0.0625 = fieldNorm(doc=2540)
          0.01978181 = weight(abstract_txt:based in 2540) [ClassicSimilarity], result of:
            0.01978181 = score(doc=2540,freq=1.0), product of:
              0.09844195 = queryWeight, product of:
                2.496918 = boost
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.012262248 = queryNorm
              0.20094898 = fieldWeight in 2540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2151837 = idf(docFreq=4616, maxDocs=42306)
                0.0625 = fieldNorm(doc=2540)
          0.25542492 = weight(abstract_txt:maps in 2540) [ClassicSimilarity], result of:
            0.25542492 = score(doc=2540,freq=6.0), product of:
              0.28323418 = queryWeight, product of:
                3.9211514 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.012262248 = queryNorm
              0.9018153 = fieldWeight in 2540, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=2540)
          0.35760307 = weight(abstract_txt:topic in 2540) [ClassicSimilarity], result of:
            0.35760307 = score(doc=2540,freq=10.0), product of:
              0.35446307 = queryWeight, product of:
                5.6630535 = boost
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.012262248 = queryNorm
              1.0088584 = fieldWeight in 2540, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.104465 = idf(docFreq=697, maxDocs=42306)
                0.0625 = fieldNorm(doc=2540)
        0.2 = coord(5/25)
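
The content scores above combine, for each matching query term, queryWeight (boost * idf * queryNorm) with fieldWeight (tf * idf * fieldNorm), sum the results, and scale by coord (matched terms / query terms). The sketch below recomputes the score of the first hit (doc 3187) from the boosts, document frequencies, and term frequencies printed in its explain tree. The idf formula is an assumption that agrees with the listed idf values; queryNorm and the boosts are copied from the listing rather than derived, and the function names are illustrative.

```python
import math

MAX_DOCS = 42306
QUERY_NORM = 0.012262248   # copied from the listing
QUERY_TERMS = 25           # denominator of coord(5/25)
FIELD_NORM = 0.078125      # fieldNorm(doc=3187)

def idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq, field_norm):
    """Per-term score = queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# term -> (boost, docFreq, freq) as printed for doc 3187
matched = {
    "topics": (1.1384077, 679, 1.0),
    "merging": (3.361642, 58, 4.0),
    "constraints": (3.7767797, 126, 2.0),
    "maps": (3.9211514, 317, 3.0),
    "topic": (5.6630535, 697, 6.0),
}

raw = sum(term_score(b, df, f, FIELD_NORM) for b, df, f in matched.values())
coord = len(matched) / QUERY_TERMS
score = raw * coord
print(round(raw, 4), coord, round(score, 4))
```

The coord factor explains why the second hit, which matches 7 of 25 terms, can rank close to the first despite a lower raw sum.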