Document (#28393)

Author
Abiteboul, S.
Cluet, S.
Christophides, V.
Milo, T.
Moerkotte, G.
Siméon, J.
Title
Querying documents in object databases
Source
International journal of digital libraries. 1(1997) no.1, pp. 5-19
Year
1997
Abstract
We consider the problem of storing and accessing documents (SGML and HTML, in particular) using database technology. To specify the database image of documents, we use structuring schemas that consist of grammars annotated with database programs. To query documents, we introduce an extension of OQL, the ODMG standard query language for object databases. Our extension (named OQL-doc) allows us to query documents without precise knowledge of their structure, using in particular generalized path expressions and pattern matching. This allows us to introduce, in a declarative language (in the style of SQL or OQL), navigational and information retrieval styles of accessing data. We also consider the interaction of full-text indexes (e.g., inverted files) with standard database collection indexes (e.g., B-trees) that provide important speed-up.
Object
ODMG
OQL

Similar documents (content)

  1. Falquet, G.; Guyot, J.; Nerima, L.: Languages and tools to specify hypertext views on databases (1999) 0.14
    0.1439429 = sum of:
      0.1439429 = product of:
        0.5997621 = sum of:
          0.103412755 = weight(abstract_txt:specify in 3968) [ClassicSimilarity], result of:
            0.103412755 = score(doc=3968,freq=1.0), product of:
              0.1756013 = queryWeight, product of:
                1.0717366 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.021736186 = queryNorm
              0.5889065 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.15905076 = weight(abstract_txt:declarative in 3968) [ClassicSimilarity], result of:
            0.15905076 = score(doc=3968,freq=1.0), product of:
              0.23397467 = queryWeight, product of:
                1.2371109 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021736186 = queryNorm
              0.67977774 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.04122278 = weight(abstract_txt:particular in 3968) [ClassicSimilarity], result of:
            0.04122278 = score(doc=3968,freq=1.0), product of:
              0.11983394 = queryWeight, product of:
                1.2520714 = boost
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.021736186 = queryNorm
              0.3439992 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.041491944 = weight(abstract_txt:databases in 3968) [ClassicSimilarity], result of:
            0.041491944 = score(doc=3968,freq=1.0), product of:
              0.12035502 = queryWeight, product of:
                1.2547907 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.021736186 = queryNorm
              0.3447463 = fieldWeight in 3968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.12344575 = weight(abstract_txt:object in 3968) [ClassicSimilarity], result of:
            0.12344575 = score(doc=3968,freq=2.0), product of:
              0.19760402 = queryWeight, product of:
                1.6078186 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021736186 = queryNorm
              0.62471277 = fieldWeight in 3968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
          0.1311381 = weight(abstract_txt:database in 3968) [ClassicSimilarity], result of:
            0.1311381 = score(doc=3968,freq=3.0), product of:
              0.22643515 = queryWeight, product of:
                2.434031 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.021736186 = queryNorm
              0.579142 = fieldWeight in 3968, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.078125 = fieldNorm(doc=3968)
        0.24 = coord(6/25)
    
  2. Castelli, V.: Progressive search and retrieval from image databases (2002) 0.12
    0.12467609 = sum of:
      0.12467609 = product of:
        0.3896128 = sum of:
          0.08774863 = weight(abstract_txt:specify in 4253) [ClassicSimilarity], result of:
            0.08774863 = score(doc=4253,freq=2.0), product of:
              0.1756013 = queryWeight, product of:
                1.0717366 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.021736186 = queryNorm
              0.49970376 = fieldWeight in 4253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.024733666 = weight(abstract_txt:particular in 4253) [ClassicSimilarity], result of:
            0.024733666 = score(doc=4253,freq=1.0), product of:
              0.11983394 = queryWeight, product of:
                1.2520714 = boost
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.021736186 = queryNorm
              0.20639952 = fieldWeight in 4253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.035207085 = weight(abstract_txt:databases in 4253) [ClassicSimilarity], result of:
            0.035207085 = score(doc=4253,freq=2.0), product of:
              0.12035502 = queryWeight, product of:
                1.2547907 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.021736186 = queryNorm
              0.29252693 = fieldWeight in 4253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.030566948 = weight(abstract_txt:standard in 4253) [ClassicSimilarity], result of:
            0.030566948 = score(doc=4253,freq=1.0), product of:
              0.13800311 = queryWeight, product of:
                1.3436421 = boost
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.021736186 = queryNorm
              0.22149463 = fieldWeight in 4253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.047530748 = weight(abstract_txt:allows in 4253) [ClassicSimilarity], result of:
            0.047530748 = score(doc=4253,freq=1.0), product of:
              0.18522684 = queryWeight, product of:
                1.5566506 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.021736186 = queryNorm
              0.2566083 = fieldWeight in 4253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.052373596 = weight(abstract_txt:object in 4253) [ClassicSimilarity], result of:
            0.052373596 = score(doc=4253,freq=1.0), product of:
              0.19760402 = queryWeight, product of:
                1.6078186 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021736186 = queryNorm
              0.26504317 = fieldWeight in 4253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.066024564 = weight(abstract_txt:query in 4253) [ClassicSimilarity], result of:
            0.066024564 = score(doc=4253,freq=2.0), product of:
              0.2095133 = queryWeight, product of:
                2.027639 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.021736186 = queryNorm
              0.31513304 = fieldWeight in 4253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
          0.045427576 = weight(abstract_txt:database in 4253) [ClassicSimilarity], result of:
            0.045427576 = score(doc=4253,freq=1.0), product of:
              0.22643515 = queryWeight, product of:
                2.434031 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.021736186 = queryNorm
              0.20062068 = fieldWeight in 4253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.046875 = fieldNorm(doc=4253)
        0.32 = coord(8/25)
    
  3. Ozkarahan, E.: Multimedia document retrieval (1995) 0.12
    0.124124005 = sum of:
      0.124124005 = product of:
        0.62062 = sum of:
          0.20698059 = weight(abstract_txt:langugae in 1492) [ClassicSimilarity], result of:
            0.20698059 = score(doc=1492,freq=1.0), product of:
              0.278889 = queryWeight, product of:
                1.3506409 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.021736186 = queryNorm
              0.74216115 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=1492)
          0.087289326 = weight(abstract_txt:object in 1492) [ClassicSimilarity], result of:
            0.087289326 = score(doc=1492,freq=1.0), product of:
              0.19760402 = queryWeight, product of:
                1.6078186 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021736186 = queryNorm
              0.44173864 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=1492)
          0.13477208 = weight(abstract_txt:query in 1492) [ClassicSimilarity], result of:
            0.13477208 = score(doc=1492,freq=3.0), product of:
              0.2095133 = queryWeight, product of:
                2.027639 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.021736186 = queryNorm
              0.6432626 = fieldWeight in 1492, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=1492)
          0.10707381 = weight(abstract_txt:database in 1492) [ClassicSimilarity], result of:
            0.10707381 = score(doc=1492,freq=2.0), product of:
              0.22643515 = queryWeight, product of:
                2.434031 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.021736186 = queryNorm
              0.47286746 = fieldWeight in 1492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.078125 = fieldNorm(doc=1492)
          0.08450427 = weight(abstract_txt:documents in 1492) [ClassicSimilarity], result of:
            0.08450427 = score(doc=1492,freq=1.0), product of:
              0.26245454 = queryWeight, product of:
                2.9297884 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.021736186 = queryNorm
              0.32197678 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=1492)
        0.2 = coord(5/25)
    
  4. Niemi, T.; Jämsen, J.: A query language for discovering semantic associations, part I: approach and formal definition of query primitives (2007) 0.12
    0.120446034 = sum of:
      0.120446034 = product of:
        0.5018585 = sum of:
          0.12724061 = weight(abstract_txt:declarative in 591) [ClassicSimilarity], result of:
            0.12724061 = score(doc=591,freq=1.0), product of:
              0.23397467 = queryWeight, product of:
                1.2371109 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021736186 = queryNorm
              0.54382217 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.03319356 = weight(abstract_txt:databases in 591) [ClassicSimilarity], result of:
            0.03319356 = score(doc=591,freq=1.0), product of:
              0.12035502 = queryWeight, product of:
                1.2547907 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.021736186 = queryNorm
              0.27579704 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.088753715 = weight(abstract_txt:introduce in 591) [ClassicSimilarity], result of:
            0.088753715 = score(doc=591,freq=1.0), product of:
              0.23185655 = queryWeight, product of:
                1.7416018 = boost
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.021736186 = queryNorm
              0.3827958 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.124733 = idf(docFreq=262, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.124497116 = weight(abstract_txt:query in 591) [ClassicSimilarity], result of:
            0.124497116 = score(doc=591,freq=4.0), product of:
              0.2095133 = queryWeight, product of:
                2.027639 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.021736186 = queryNorm
              0.5942206 = fieldWeight in 591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.0605701 = weight(abstract_txt:database in 591) [ClassicSimilarity], result of:
            0.0605701 = score(doc=591,freq=1.0), product of:
              0.22643515 = queryWeight, product of:
                2.434031 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.021736186 = queryNorm
              0.26749423 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.06760341 = weight(abstract_txt:documents in 591) [ClassicSimilarity], result of:
            0.06760341 = score(doc=591,freq=1.0), product of:
              0.26245454 = queryWeight, product of:
                2.9297884 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.021736186 = queryNorm
              0.2575814 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
        0.24 = coord(6/25)
    
  5. Aldana, J.F.; Gómez, A.C.; Moreno, N.; Nebro, A.J.; Roldán, M.M.: Metadata functionality for semantic Web integration (2003) 0.12
    0.11514127 = sum of:
      0.11514127 = product of:
        0.41121882 = sum of:
          0.062047657 = weight(abstract_txt:specify in 2731) [ClassicSimilarity], result of:
            0.062047657 = score(doc=2731,freq=1.0), product of:
              0.1756013 = queryWeight, product of:
                1.0717366 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.021736186 = queryNorm
              0.35334393 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.024895169 = weight(abstract_txt:databases in 2731) [ClassicSimilarity], result of:
            0.024895169 = score(doc=2731,freq=1.0), product of:
              0.12035502 = queryWeight, product of:
                1.2547907 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.021736186 = queryNorm
              0.20684779 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.030566948 = weight(abstract_txt:standard in 2731) [ClassicSimilarity], result of:
            0.030566948 = score(doc=2731,freq=1.0), product of:
              0.13800311 = queryWeight, product of:
                1.3436421 = boost
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.021736186 = queryNorm
              0.22149463 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.725219 = idf(docFreq=1065, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.11763724 = weight(abstract_txt:extension in 2731) [ClassicSimilarity], result of:
            0.11763724 = score(doc=2731,freq=2.0), product of:
              0.26899284 = queryWeight, product of:
                1.8758994 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.021736186 = queryNorm
              0.4373248 = fieldWeight in 2731, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.04668642 = weight(abstract_txt:query in 2731) [ClassicSimilarity], result of:
            0.04668642 = score(doc=2731,freq=1.0), product of:
              0.2095133 = queryWeight, product of:
                2.027639 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.021736186 = queryNorm
              0.22283271 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.07868286 = weight(abstract_txt:database in 2731) [ClassicSimilarity], result of:
            0.07868286 = score(doc=2731,freq=3.0), product of:
              0.22643515 = queryWeight, product of:
                2.434031 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.021736186 = queryNorm
              0.34748518 = fieldWeight in 2731, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.050702557 = weight(abstract_txt:documents in 2731) [ClassicSimilarity], result of:
            0.050702557 = score(doc=2731,freq=1.0), product of:
              0.26245454 = queryWeight, product of:
                2.9297884 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.021736186 = queryNorm
              0.19318606 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
        0.28 = coord(7/25)
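
The score trees above follow the classic Lucene TF-IDF pattern: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms / query terms). The sketch below recomputes the score of entry 1 (doc 3968) from the numbers printed in its tree. It assumes that tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the values shown; the function and variable names are illustrative and are not part of the catalog software.

```python
import math

# Constants copied from the explanation tree of entry 1 (doc 3968).
MAX_DOCS = 44218
QUERY_NORM = 0.021736186
FIELD_NORM = 0.078125

# (term, termFreq, docFreq, boost) for each matching clause of entry 1.
TERMS = [
    ("specify", 1.0, 63, 1.0717366),
    ("declarative", 1.0, 19, 1.2371109),
    ("particular", 1.0, 1470, 1.2520714),
    ("databases", 1.0, 1456, 1.2547907),
    ("object", 2.0, 420, 1.6078186),
    ("database", 3.0, 1663, 2.434031),
]


def idf(doc_freq, max_docs):
    """Inverse document frequency, assumed to be 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, max_docs, query_norm, field_norm):
    """Contribution of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total, max_docs, query_norm, field_norm):
    """Sum of the term contributions, scaled by coord = matched / total."""
    raw = sum(
        term_score(freq, doc_freq, boost, max_docs, query_norm, field_norm)
        for _term, freq, doc_freq, boost in terms
    )
    return raw * (matched / total)


# Entry 1 matches 6 of the 25 query terms, hence coord(6/25).
score = document_score(TERMS, 6, 25, MAX_DOCS, QUERY_NORM, FIELD_NORM)
print(f"{score:.7f}")
```

Run as is, the result should land close to the 0.1439429 listed for entry 1, up to small differences caused by the single-precision arithmetic used in the original computation. The other entries can be checked the same way by swapping in their own term rows, fieldNorm, and coord values.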