Document (#18596)

Author
Scheuermann, P.
Li, W.-S.
Clifton, C.
Title
Multidatabase query processing with uncertainty in global keys and attribute values
Source
Journal of the American Society for Information Science. 49(1998) no.3, S.283-301
Year
1998
Abstract
Semantic integration and data integration are 2 main processes that multidatabase systems need to employ in order to support interoperability. Both these processes involve uncertainty when attribute correspondences and global IDs are unknown or imprecise. The role-set approach is a new conceptual framework for data integration in multidatabase systems that maintains the materialization autonomy of local database systems by presenting the answer to a query as a set of sets representing the ddistinct intersections between the relations corresponding to the various roles played by an entity. In this article, we present an approach for dynamic database integration and query processing in the absence of information about attribute correspondences and global IDs. We define different types of equivalence conditions for the construction of global IDs. We propose a strategy based on ranked role-sets that makes use of an automated semantic integration procedure based on neural networks to determine candidate global IDs. The data integration and query processing stepts then produce a number a role-sets, ranked by the similarity of the candidate IDs

Similar documents (content)

  1. Gyseghem, N. van; Caluwe, R. de: Imprecision and uncertainty in the UFO database model (1998) 0.18
    0.18467495 = sum of:
      0.18467495 = product of:
        0.5771092 = sum of:
          0.117708795 = weight(abstract_txt:imprecise in 591) [ClassicSimilarity], result of:
            0.117708795 = score(doc=591,freq=2.0), product of:
              0.15920784 = queryWeight, product of:
                1.1640204 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.016351378 = queryNorm
              0.7393405 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.0589967 = weight(abstract_txt:database in 591) [ClassicSimilarity], result of:
            0.0589967 = score(doc=591,freq=7.0), product of:
              0.08336126 = queryWeight, product of:
                1.1911749 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.016351378 = queryNorm
              0.7077232 = fieldWeight in 591, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.025477514 = weight(abstract_txt:semantic in 591) [ClassicSimilarity], result of:
            0.025477514 = score(doc=591,freq=1.0), product of:
              0.0911066 = queryWeight, product of:
                1.2452837 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.016351378 = queryNorm
              0.2796451 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.015844578 = weight(abstract_txt:data in 591) [ClassicSimilarity], result of:
            0.015844578 = score(doc=591,freq=1.0), product of:
              0.07598526 = queryWeight, product of:
                1.3928479 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016351378 = queryNorm
              0.20852174 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.016945224 = weight(abstract_txt:systems in 591) [ClassicSimilarity], result of:
            0.016945224 = score(doc=591,freq=1.0), product of:
              0.079464614 = queryWeight, product of:
                1.4243802 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016351378 = queryNorm
              0.2132424 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.052682143 = weight(abstract_txt:role in 591) [ClassicSimilarity], result of:
            0.052682143 = score(doc=591,freq=2.0), product of:
              0.13435109 = queryWeight, product of:
                1.8520794 = boost
                4.4363647 = idf(docFreq=1422, maxDocs=44218)
                0.016351378 = queryNorm
              0.39212292 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4363647 = idf(docFreq=1422, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.13464421 = weight(abstract_txt:uncertainty in 591) [ClassicSimilarity], result of:
            0.13464421 = score(doc=591,freq=2.0), product of:
              0.21939512 = queryWeight, product of:
                1.9324437 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016351378 = queryNorm
              0.6137065 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.15481006 = weight(abstract_txt:attribute in 591) [ClassicSimilarity], result of:
            0.15481006 = score(doc=591,freq=1.0), product of:
              0.34727618 = queryWeight, product of:
                2.9776697 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016351378 = queryNorm
              0.44578367 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
        0.32 = coord(8/25)
    
  2. Boßmeyer, C.: OSI-Anwendungen in Bibliotheken oder Was ein Bibliothekar von OSI wissen sollte (1995) 0.18
    0.17586534 = sum of:
      0.17586534 = product of:
        0.73277223 = sum of:
          0.031689156 = weight(abstract_txt:data in 5082) [ClassicSimilarity], result of:
            0.031689156 = score(doc=5082,freq=1.0), product of:
              0.07598526 = queryWeight, product of:
                1.3928479 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016351378 = queryNorm
              0.41704348 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
          0.047928333 = weight(abstract_txt:systems in 5082) [ClassicSimilarity], result of:
            0.047928333 = score(doc=5082,freq=2.0), product of:
              0.079464614 = queryWeight, product of:
                1.4243802 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016351378 = queryNorm
              0.6031406 = fieldWeight in 5082, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
          0.102358945 = weight(abstract_txt:processing in 5082) [ClassicSimilarity], result of:
            0.102358945 = score(doc=5082,freq=1.0), product of:
              0.16603747 = queryWeight, product of:
                2.0589323 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.016351378 = queryNorm
              0.616481 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
          0.11895391 = weight(abstract_txt:sets in 5082) [ClassicSimilarity], result of:
            0.11895391 = score(doc=5082,freq=1.0), product of:
              0.18353042 = queryWeight, product of:
                2.1646767 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.016351378 = queryNorm
              0.64814276 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
          0.122221746 = weight(abstract_txt:query in 5082) [ClassicSimilarity], result of:
            0.122221746 = score(doc=5082,freq=1.0), product of:
              0.20568414 = queryWeight, product of:
                2.6461155 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016351378 = queryNorm
              0.5942206 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
          0.3096201 = weight(abstract_txt:attribute in 5082) [ClassicSimilarity], result of:
            0.3096201 = score(doc=5082,freq=1.0), product of:
              0.34727618 = queryWeight, product of:
                2.9776697 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.016351378 = queryNorm
              0.89156735 = fieldWeight in 5082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.125 = fieldNorm(doc=5082)
        0.24 = coord(6/25)
    
  3. Quast, D.: Rationaliy, neural sets and deterministic chaos : knowledge organisation for the human mind; the user of libraries and information centres (1996) 0.15
    0.15414159 = sum of:
      0.15414159 = product of:
        0.5505057 = sum of:
          0.124849044 = weight(abstract_txt:imprecise in 2) [ClassicSimilarity], result of:
            0.124849044 = score(doc=2,freq=1.0), product of:
              0.15920784 = queryWeight, product of:
                1.1640204 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.016351378 = queryNorm
              0.78418905 = fieldWeight in 2, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.047302593 = weight(abstract_txt:database in 2) [ClassicSimilarity], result of:
            0.047302593 = score(doc=2,freq=2.0), product of:
              0.08336126 = queryWeight, product of:
                1.1911749 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.016351378 = queryNorm
              0.5674409 = fieldWeight in 2, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.033611428 = weight(abstract_txt:data in 2) [ClassicSimilarity], result of:
            0.033611428 = score(doc=2,freq=2.0), product of:
              0.07598526 = queryWeight, product of:
                1.3928479 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016351378 = queryNorm
              0.44234142 = fieldWeight in 2, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.03594625 = weight(abstract_txt:systems in 2) [ClassicSimilarity], result of:
            0.03594625 = score(doc=2,freq=2.0), product of:
              0.079464614 = queryWeight, product of:
                1.4243802 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016351378 = queryNorm
              0.45235544 = fieldWeight in 2, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.14281176 = weight(abstract_txt:uncertainty in 2) [ClassicSimilarity], result of:
            0.14281176 = score(doc=2,freq=1.0), product of:
              0.21939512 = queryWeight, product of:
                1.9324437 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016351378 = queryNorm
              0.6509341 = fieldWeight in 2, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.07676921 = weight(abstract_txt:processing in 2) [ClassicSimilarity], result of:
            0.07676921 = score(doc=2,freq=1.0), product of:
              0.16603747 = queryWeight, product of:
                2.0589323 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.016351378 = queryNorm
              0.46236074 = fieldWeight in 2, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
          0.08921543 = weight(abstract_txt:sets in 2) [ClassicSimilarity], result of:
            0.08921543 = score(doc=2,freq=1.0), product of:
              0.18353042 = queryWeight, product of:
                2.1646767 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.016351378 = queryNorm
              0.48610705 = fieldWeight in 2, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.09375 = fieldNorm(doc=2)
        0.28 = coord(7/25)
    
  4. Euzenat, J.; Shvaiko, P.: Ontology matching (2010) 0.14
    0.14303958 = sum of:
      0.14303958 = product of:
        0.5959983 = sum of:
          0.052971236 = weight(abstract_txt:equivalence in 168) [ClassicSimilarity], result of:
            0.052971236 = score(doc=168,freq=1.0), product of:
              0.12876263 = queryWeight, product of:
                1.0468231 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.016351378 = queryNorm
              0.41138673 = fieldWeight in 168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
          0.033794604 = weight(abstract_txt:database in 168) [ClassicSimilarity], result of:
            0.033794604 = score(doc=168,freq=3.0), product of:
              0.08336126 = queryWeight, product of:
                1.1911749 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.016351378 = queryNorm
              0.40539938 = fieldWeight in 168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
          0.04458565 = weight(abstract_txt:semantic in 168) [ClassicSimilarity], result of:
            0.04458565 = score(doc=168,freq=4.0), product of:
              0.0911066 = queryWeight, product of:
                1.2452837 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.016351378 = queryNorm
              0.4893789 = fieldWeight in 168, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
          0.036318764 = weight(abstract_txt:systems in 168) [ClassicSimilarity], result of:
            0.036318764 = score(doc=168,freq=6.0), product of:
              0.079464614 = queryWeight, product of:
                1.4243802 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016351378 = queryNorm
              0.45704323 = fieldWeight in 168, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
          0.2359865 = weight(abstract_txt:correspondences in 168) [ClassicSimilarity], result of:
            0.2359865 = score(doc=168,freq=2.0), product of:
              0.34862182 = queryWeight, product of:
                2.435963 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.016351378 = queryNorm
              0.6769126 = fieldWeight in 168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
          0.19234157 = weight(abstract_txt:integration in 168) [ClassicSimilarity], result of:
            0.19234157 = score(doc=168,freq=3.0), product of:
              0.38325563 = queryWeight, product of:
                4.4238286 = boost
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.016351378 = queryNorm
              0.50186235 = fieldWeight in 168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.0546875 = fieldNorm(doc=168)
        0.24 = coord(6/25)
    
  5. Cheung, W.; Hsu, C.: ¬The model-assisted global query system for multiple databases in distributed enterprises (1996) 0.13
    0.13453038 = sum of:
      0.13453038 = product of:
        0.6726519 = sum of:
          0.029955208 = weight(abstract_txt:systems in 7279) [ClassicSimilarity], result of:
            0.029955208 = score(doc=7279,freq=2.0), product of:
              0.079464614 = queryWeight, product of:
                1.4243802 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016351378 = queryNorm
              0.37696287 = fieldWeight in 7279, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=7279)
          0.06397434 = weight(abstract_txt:processing in 7279) [ClassicSimilarity], result of:
            0.06397434 = score(doc=7279,freq=1.0), product of:
              0.16603747 = queryWeight, product of:
                2.0589323 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.016351378 = queryNorm
              0.38530064 = fieldWeight in 7279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=7279)
          0.1527772 = weight(abstract_txt:query in 7279) [ClassicSimilarity], result of:
            0.1527772 = score(doc=7279,freq=4.0), product of:
              0.20568414 = queryWeight, product of:
                2.6461155 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016351378 = queryNorm
              0.74277574 = fieldWeight in 7279, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=7279)
          0.26730448 = weight(abstract_txt:global in 7279) [ClassicSimilarity], result of:
            0.26730448 = score(doc=7279,freq=3.0), product of:
              0.35409153 = queryWeight, product of:
                3.881693 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.016351378 = queryNorm
              0.75490224 = fieldWeight in 7279, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.078125 = fieldNorm(doc=7279)
          0.15864065 = weight(abstract_txt:integration in 7279) [ClassicSimilarity], result of:
            0.15864065 = score(doc=7279,freq=1.0), product of:
              0.38325563 = queryWeight, product of:
                4.4238286 = boost
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.016351378 = queryNorm
              0.41392908 = fieldWeight in 7279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.078125 = fieldNorm(doc=7279)
        0.2 = coord(5/25)