Document (#32924)

Author
Kim, J.-M.
Shin, H.
Kim, H.-J.
Title
Schema and constraints-based matching and merging of Topic Maps
Source
Information processing and management. 43(2007) no.4, S.930-945
Year
2007
Abstract
In this paper, we propose a multi-strategic matching and merging approach to find correspondences between ontologies based on the syntactic or semantic characteristics and constraints of the Topic Maps. Our multi-strategic matching approach consists of a linguistic module and a Topic Map constraints-based module. A linguistic module computes similarities between concepts using morphological analysis, string normalization and tokenization and language-dependent heuristics. A Topic Map constraints-based module takes advantage of several Topic Maps-dependent techniques such as a topic property-based matching, a hierarchy-based matching, and an association-based matching. This is a composite matching procedure and need not generate a cross-pair of all topics from the ontologies because unmatched pairs of topics can be removed by characteristics and constraints of the Topic Maps. Merging between Topic Maps follows the matching operations. We set up the MERGE function to integrate two Topic Maps into a new Topic Map, which satisfies such merge requirements as entity preservation, property preservation, relation preservation, and conflict resolution. For our experiments, we used oriental philosophy ontologies, western philosophy ontologies, Yahoo western philosophy dictionary, and Wikipedia philosophy ontology as input ontologies. Our experiments show that the automatically generated matching results conform to the outputs generated manually by domain experts and can be of great benefit to the following merging operations.
Theme
Semantische Interoperabilität

Similar documents (author)

  1. Shin, H.-s.: Quality of Korean cataloging records in shared databases (2003) 4.86
    4.8644676 = sum of:
      4.8644676 = weight(author_txt:shin in 499) [ClassicSimilarity], result of:
        4.8644676 = fieldWeight in 499, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.5 = fieldNorm(doc=499)
    
  2. Shin, D.-H.: Next generation of information infrastructure : a comparative case study of Korea versus the United States of America (2008) 4.86
    4.8644676 = sum of:
      4.8644676 = weight(author_txt:shin in 4366) [ClassicSimilarity], result of:
        4.8644676 = fieldWeight in 4366, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.5 = fieldNorm(doc=4366)
    
  3. Leydesdorff, L.; Shin, J.C.: How to evaluate universities in terms of their relative citation impacts : fractional counting of citations and the normalization of differences among disciplines (2011) 4.86
    4.8644676 = sum of:
      4.8644676 = weight(author_txt:shin in 931) [ClassicSimilarity], result of:
        4.8644676 = fieldWeight in 931, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.5 = fieldNorm(doc=931)
    
  4. Keselman, A.; Rosemblat, G.; Kilicoglu, H.; Fiszman, M.; Jin, H.; Shin, D.; Rindflesch, T.C.: Adapting semantic natural language processing technology to address information overload in influenza epidemic management (2010) 2.43
    2.4322338 = sum of:
      2.4322338 = weight(author_txt:shin in 2777) [ClassicSimilarity], result of:
        2.4322338 = fieldWeight in 2777, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.25 = fieldNorm(doc=2777)
    
  5. Rosemblat, G.; Resnick, M.P.; Auston, I.; Shin, D.; Sneiderman, C.; Fizsman, M.; Rindflesch, T.C.: Extending SemRep to the public health domain (2013) 2.43
    2.4322338 = sum of:
      2.4322338 = weight(author_txt:shin in 3561) [ClassicSimilarity], result of:
        2.4322338 = fieldWeight in 3561, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.25 = fieldNorm(doc=3561)
    

Similar documents (content)

  1. Widhalm, R.; Mueck, T.A.: Merging topics in well-formed XML topic maps (2003) 0.24
    0.24070568 = sum of:
      0.24070568 = product of:
        1.2035284 = sum of:
          0.028466746 = weight(abstract_txt:topics in 4187) [ClassicSimilarity], result of:
            0.028466746 = score(doc=4187,freq=1.0), product of:
              0.07133164 = queryWeight, product of:
                1.1306535 = boost
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.012350574 = queryNorm
              0.39907598 = fieldWeight in 4187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.078125 = fieldNorm(doc=4187)
          0.3673219 = weight(abstract_txt:merging in 4187) [ClassicSimilarity], result of:
            0.3673219 = score(doc=4187,freq=4.0), product of:
              0.31146666 = queryWeight, product of:
                3.3412519 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.012350574 = queryNorm
              1.1793298 = fieldWeight in 4187, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.078125 = fieldNorm(doc=4187)
          0.23738615 = weight(abstract_txt:constraints in 4187) [ClassicSimilarity], result of:
            0.23738615 = score(doc=4187,freq=2.0), product of:
              0.31598318 = queryWeight, product of:
                3.7626202 = boost
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.012350574 = queryNorm
              0.751262 = fieldWeight in 4187, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.078125 = fieldNorm(doc=4187)
          0.22652432 = weight(abstract_txt:maps in 4187) [ClassicSimilarity], result of:
            0.22652432 = score(doc=4187,freq=3.0), product of:
              0.28431532 = queryWeight, product of:
                3.9097517 = boost
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.012350574 = queryNorm
              0.79673624 = fieldWeight in 4187, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.078125 = fieldNorm(doc=4187)
          0.3438293 = weight(abstract_txt:topic in 4187) [ClassicSimilarity], result of:
            0.3438293 = score(doc=4187,freq=6.0), product of:
              0.35336635 = queryWeight, product of:
                5.627118 = boost
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.012350574 = queryNorm
              0.9730109 = fieldWeight in 4187, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.078125 = fieldNorm(doc=4187)
        0.2 = coord(5/25)
    
  2. Baião Salgado Silva, G.; Lima, G.Â. Borém de Oliveira: Using topic maps in establishing compatibility of semantically structured hypertext contents (2012) 0.22
    0.22212622 = sum of:
      0.22212622 = product of:
        0.7933079 = sum of:
          0.02010872 = weight(abstract_txt:characteristics in 2098) [ClassicSimilarity], result of:
            0.02010872 = score(doc=2098,freq=1.0), product of:
              0.06565281 = queryWeight, product of:
                1.0847136 = boost
                4.900621 = idf(docFreq=874, maxDocs=43254)
                0.012350574 = queryNorm
              0.3062888 = fieldWeight in 2098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.900621 = idf(docFreq=874, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.015261089 = weight(abstract_txt:between in 2098) [ClassicSimilarity], result of:
            0.015261089 = score(doc=2098,freq=2.0), product of:
              0.049629644 = queryWeight, product of:
                1.1550602 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.012350574 = queryNorm
              0.30749947 = fieldWeight in 2098, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.046650365 = weight(abstract_txt:property in 2098) [ClassicSimilarity], result of:
            0.046650365 = score(doc=2098,freq=1.0), product of:
              0.115053646 = queryWeight, product of:
                1.4359477 = boost
                6.487459 = idf(docFreq=178, maxDocs=43254)
                0.012350574 = queryNorm
              0.4054662 = fieldWeight in 2098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.487459 = idf(docFreq=178, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.18427494 = weight(abstract_txt:merge in 2098) [ClassicSimilarity], result of:
            0.18427494 = score(doc=2098,freq=3.0), product of:
              0.19934292 = queryWeight, product of:
                1.8901176 = boost
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.012350574 = queryNorm
              0.9244118 = fieldWeight in 2098, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.01963234 = weight(abstract_txt:based in 2098) [ClassicSimilarity], result of:
            0.01963234 = score(doc=2098,freq=1.0), product of:
              0.098099716 = queryWeight, product of:
                2.4805975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.012350574 = queryNorm
              0.20012636 = fieldWeight in 2098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.25628302 = weight(abstract_txt:maps in 2098) [ClassicSimilarity], result of:
            0.25628302 = score(doc=2098,freq=6.0), product of:
              0.28431532 = queryWeight, product of:
                3.9097517 = boost
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.012350574 = queryNorm
              0.9014042 = fieldWeight in 2098, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
          0.2510974 = weight(abstract_txt:topic in 2098) [ClassicSimilarity], result of:
            0.2510974 = score(doc=2098,freq=5.0), product of:
              0.35336635 = queryWeight, product of:
                5.627118 = boost
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.012350574 = queryNorm
              0.71058667 = fieldWeight in 2098, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.0625 = fieldNorm(doc=2098)
        0.28 = coord(7/25)
    
  3. Green, R.: Topical relevance relationships : 2: an exploratory study and preliminary typology (1995) 0.20
    0.19970408 = sum of:
      0.19970408 = product of:
        0.83210033 = sum of:
          0.040258054 = weight(abstract_txt:topics in 4793) [ClassicSimilarity], result of:
            0.040258054 = score(doc=4793,freq=2.0), product of:
              0.07133164 = queryWeight, product of:
                1.1306535 = boost
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.012350574 = queryNorm
              0.5643786 = fieldWeight in 4793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
          0.01907636 = weight(abstract_txt:between in 4793) [ClassicSimilarity], result of:
            0.01907636 = score(doc=4793,freq=2.0), product of:
              0.049629644 = queryWeight, product of:
                1.1550602 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.012350574 = queryNorm
              0.38437432 = fieldWeight in 4793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
          0.036299538 = weight(abstract_txt:generated in 4793) [ClassicSimilarity], result of:
            0.036299538 = score(doc=4793,freq=1.0), product of:
              0.08387987 = queryWeight, product of:
                1.2260758 = boost
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.012350574 = queryNorm
              0.43275625 = fieldWeight in 4793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.53928 = idf(docFreq=461, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
          0.16785736 = weight(abstract_txt:constraints in 4793) [ClassicSimilarity], result of:
            0.16785736 = score(doc=4793,freq=1.0), product of:
              0.31598318 = queryWeight, product of:
                3.7626202 = boost
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.012350574 = queryNorm
              0.53122246 = fieldWeight in 4793, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
          0.19850993 = weight(abstract_txt:topic in 4793) [ClassicSimilarity], result of:
            0.19850993 = score(doc=4793,freq=2.0), product of:
              0.35336635 = queryWeight, product of:
                5.627118 = boost
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.012350574 = queryNorm
              0.56176805 = fieldWeight in 4793, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
          0.37009907 = weight(abstract_txt:matching in 4793) [ClassicSimilarity], result of:
            0.37009907 = score(doc=4793,freq=3.0), product of:
              0.45147407 = queryWeight, product of:
                6.03408 = boost
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.012350574 = queryNorm
              0.8197571 = fieldWeight in 4793, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.078125 = fieldNorm(doc=4793)
        0.24 = coord(6/25)
    
  4. Cregan, A.: ¬An OWL DL construction for the ISO Topic Map Data Model (2005) 0.17
    0.16623867 = sum of:
      0.16623867 = product of:
        0.8311933 = sum of:
          0.015261089 = weight(abstract_txt:between in 1183) [ClassicSimilarity], result of:
            0.015261089 = score(doc=1183,freq=2.0), product of:
              0.049629644 = queryWeight, product of:
                1.1550602 = boost
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.012350574 = queryNorm
              0.30749947 = fieldWeight in 1183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4789596 = idf(docFreq=3625, maxDocs=43254)
                0.0625 = fieldNorm(doc=1183)
          0.01963234 = weight(abstract_txt:based in 1183) [ClassicSimilarity], result of:
            0.01963234 = score(doc=1183,freq=1.0), product of:
              0.098099716 = queryWeight, product of:
                2.4805975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.012350574 = queryNorm
              0.20012636 = fieldWeight in 1183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=1183)
          0.18990892 = weight(abstract_txt:constraints in 1183) [ClassicSimilarity], result of:
            0.18990892 = score(doc=1183,freq=2.0), product of:
              0.31598318 = queryWeight, product of:
                3.7626202 = boost
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.012350574 = queryNorm
              0.6010096 = fieldWeight in 1183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.0625 = fieldNorm(doc=1183)
          0.23395333 = weight(abstract_txt:maps in 1183) [ClassicSimilarity], result of:
            0.23395333 = score(doc=1183,freq=5.0), product of:
              0.28431532 = queryWeight, product of:
                3.9097517 = boost
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.012350574 = queryNorm
              0.8228657 = fieldWeight in 1183, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.0625 = fieldNorm(doc=1183)
          0.37243766 = weight(abstract_txt:topic in 1183) [ClassicSimilarity], result of:
            0.37243766 = score(doc=1183,freq=11.0), product of:
              0.35336635 = queryWeight, product of:
                5.627118 = boost
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.012350574 = queryNorm
              1.0539703 = fieldWeight in 1183, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.0625 = fieldNorm(doc=1183)
        0.2 = coord(5/25)
    
  5. Pepper, S.; Groenmo, G.O.: Towards a general theory of scope (2002) 0.15
    0.15203707 = sum of:
      0.15203707 = product of:
        0.76018536 = sum of:
          0.022773396 = weight(abstract_txt:topics in 2004) [ClassicSimilarity], result of:
            0.022773396 = score(doc=2004,freq=1.0), product of:
              0.07133164 = queryWeight, product of:
                1.1306535 = boost
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.012350574 = queryNorm
              0.31926078 = fieldWeight in 2004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1081724 = idf(docFreq=710, maxDocs=43254)
                0.0625 = fieldNorm(doc=2004)
          0.10639119 = weight(abstract_txt:merge in 2004) [ClassicSimilarity], result of:
            0.10639119 = score(doc=2004,freq=1.0), product of:
              0.19934292 = queryWeight, product of:
                1.8901176 = boost
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.012350574 = queryNorm
              0.5337094 = fieldWeight in 2004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.0625 = fieldNorm(doc=2004)
          0.01963234 = weight(abstract_txt:based in 2004) [ClassicSimilarity], result of:
            0.01963234 = score(doc=2004,freq=1.0), product of:
              0.098099716 = queryWeight, product of:
                2.4805975 = boost
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.012350574 = queryNorm
              0.20012636 = fieldWeight in 2004, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2020218 = idf(docFreq=4782, maxDocs=43254)
                0.0625 = fieldNorm(doc=2004)
          0.25628302 = weight(abstract_txt:maps in 2004) [ClassicSimilarity], result of:
            0.25628302 = score(doc=2004,freq=6.0), product of:
              0.28431532 = queryWeight, product of:
                3.9097517 = boost
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.012350574 = queryNorm
              0.9014042 = fieldWeight in 2004, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8879476 = idf(docFreq=325, maxDocs=43254)
                0.0625 = fieldNorm(doc=2004)
          0.35510537 = weight(abstract_txt:topic in 2004) [ClassicSimilarity], result of:
            0.35510537 = score(doc=2004,freq=10.0), product of:
              0.35336635 = queryWeight, product of:
                5.627118 = boost
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.012350574 = queryNorm
              1.0049213 = fieldWeight in 2004, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.084544 = idf(docFreq=727, maxDocs=43254)
                0.0625 = fieldNorm(doc=2004)
        0.2 = coord(5/25)