Document (#34291)

Author
Ghiselli, C.
Padula, M.
Title
¬A unified access to extract knowledge from heterogeneous Web archives
Source
Online information review. 25(2001) no.5, S.299-310
Year
2001
Abstract
This paper proposes the integration of tools to provide unified access to remote and heterogeneous archives, the contents of which can be grouped under the same subject, and which have been integrated to allow the user to navigate and conduct thematic searches. The information sources are locally frequently modified, added to, and removed, therefore attention has been paid to the permanence of their references. Source interoperability is supported at language, protocol and schema levels. The architecture is based on a new common schema of the archives which is defined in new representation and query languages on the basis of an ontology to avoid misunderstanding and ambiguity.
Theme
Verteilte bibliographische Datenbanken
Internet

Similar documents (content)

  1. Mannocci, A.; Casarosa, V.; Manghi, P.; Zoppi, F.: ¬The Europeana network of ancient Greek and Latin epigraphy data infrastructure (2014) 0.12
    0.11565525 = sum of:
      0.11565525 = product of:
        0.48189688 = sum of:
          0.049498215 = weight(abstract_txt:interoperability in 1591) [ClassicSimilarity], result of:
            0.049498215 = score(doc=1591,freq=1.0), product of:
              0.12890312 = queryWeight, product of:
                1.014095 = boost
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.020688964 = queryNorm
              0.38399547 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
          0.02020856 = weight(abstract_txt:been in 1591) [ClassicSimilarity], result of:
            0.02020856 = score(doc=1591,freq=1.0), product of:
              0.08937938 = queryWeight, product of:
                1.1942096 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.020688964 = queryNorm
              0.22609869 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
          0.035980713 = weight(abstract_txt:access in 1591) [ClassicSimilarity], result of:
            0.035980713 = score(doc=1591,freq=3.0), product of:
              0.0910374 = queryWeight, product of:
                1.2052352 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.020688964 = queryNorm
              0.39523003 = fieldWeight in 1591, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
          0.015887456 = weight(abstract_txt:which in 1591) [ClassicSimilarity], result of:
            0.015887456 = score(doc=1591,freq=1.0), product of:
              0.087152615 = queryWeight, product of:
                1.4442679 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.020688964 = queryNorm
              0.18229467 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
          0.11156143 = weight(abstract_txt:heterogeneous in 1591) [ClassicSimilarity], result of:
            0.11156143 = score(doc=1591,freq=1.0), product of:
              0.27918354 = queryWeight, product of:
                2.1106043 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.020688964 = queryNorm
              0.3995989 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
          0.24876052 = weight(abstract_txt:archives in 1591) [ClassicSimilarity], result of:
            0.24876052 = score(doc=1591,freq=4.0), product of:
              0.34362003 = queryWeight, product of:
                2.8677862 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.020688964 = queryNorm
              0.7239407 = fieldWeight in 1591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=1591)
        0.24 = coord(6/25)
    
  2. Haslhofer, B.: ¬A Web-based mapping technique for establishing metadata interoperability (2008) 0.09
    0.093498096 = sum of:
      0.093498096 = product of:
        0.38957542 = sum of:
          0.05358339 = weight(abstract_txt:interoperability in 3173) [ClassicSimilarity], result of:
            0.05358339 = score(doc=3173,freq=3.0), product of:
              0.12890312 = queryWeight, product of:
                1.014095 = boost
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.020688964 = queryNorm
              0.4156873 = fieldWeight in 3173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.039289303 = weight(abstract_txt:protocol in 3173) [ClassicSimilarity], result of:
            0.039289303 = score(doc=3173,freq=1.0), product of:
              0.15117034 = queryWeight, product of:
                1.0981969 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.020688964 = queryNorm
              0.25990087 = fieldWeight in 3173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.019859321 = weight(abstract_txt:which in 3173) [ClassicSimilarity], result of:
            0.019859321 = score(doc=3173,freq=4.0), product of:
              0.087152615 = queryWeight, product of:
                1.4442679 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.020688964 = queryNorm
              0.22786833 = fieldWeight in 3173, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.09860732 = weight(abstract_txt:heterogeneous in 3173) [ClassicSimilarity], result of:
            0.09860732 = score(doc=3173,freq=2.0), product of:
              0.27918354 = queryWeight, product of:
                2.1106043 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.020688964 = queryNorm
              0.3531989 = fieldWeight in 3173, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.10049844 = weight(abstract_txt:schema in 3173) [ClassicSimilarity], result of:
            0.10049844 = score(doc=3173,freq=2.0), product of:
              0.28274176 = queryWeight, product of:
                2.1240115 = boost
                6.434197 = idf(docFreq=192, maxDocs=44218)
                0.020688964 = queryNorm
              0.3554425 = fieldWeight in 3173, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.434197 = idf(docFreq=192, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
          0.07773766 = weight(abstract_txt:archives in 3173) [ClassicSimilarity], result of:
            0.07773766 = score(doc=3173,freq=1.0), product of:
              0.34362003 = queryWeight, product of:
                2.8677862 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.020688964 = queryNorm
              0.22623146 = fieldWeight in 3173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3173)
        0.24 = coord(6/25)
    
  3. Dolin, R.; Agrawal, D.; El Abbadi, A.; Pearlman, J.: Using automated classification for summarizing and selecting heterogeneous information sources (1998) 0.09
    0.08555658 = sum of:
      0.08555658 = product of:
        0.30555922 = sum of:
          0.030638311 = weight(abstract_txt:extract in 1253) [ClassicSimilarity], result of:
            0.030638311 = score(doc=1253,freq=1.0), product of:
              0.14861648 = queryWeight, product of:
                1.0888809 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.020688964 = queryNorm
              0.2061569 = fieldWeight in 1253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.031071737 = weight(abstract_txt:remote in 1253) [ClassicSimilarity], result of:
            0.031071737 = score(doc=1253,freq=1.0), product of:
              0.1500148 = queryWeight, product of:
                1.0939915 = boost
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.020688964 = queryNorm
              0.20712447 = fieldWeight in 1253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.038892433 = weight(abstract_txt:navigate in 1253) [ClassicSimilarity], result of:
            0.038892433 = score(doc=1253,freq=1.0), product of:
              0.17423436 = queryWeight, product of:
                1.179 = boost
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.020688964 = queryNorm
              0.22321907 = fieldWeight in 1253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.14301 = idf(docFreq=94, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.01428961 = weight(abstract_txt:been in 1253) [ClassicSimilarity], result of:
            0.01428961 = score(doc=1253,freq=2.0), product of:
              0.08937938 = queryWeight, product of:
                1.1942096 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.020688964 = queryNorm
              0.15987591 = fieldWeight in 1253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.023831185 = weight(abstract_txt:which in 1253) [ClassicSimilarity], result of:
            0.023831185 = score(doc=1253,freq=9.0), product of:
              0.087152615 = queryWeight, product of:
                1.4442679 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.020688964 = queryNorm
              0.273442 = fieldWeight in 1253, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.078885846 = weight(abstract_txt:heterogeneous in 1253) [ClassicSimilarity], result of:
            0.078885846 = score(doc=1253,freq=2.0), product of:
              0.27918354 = queryWeight, product of:
                2.1106043 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.020688964 = queryNorm
              0.2825591 = fieldWeight in 1253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
          0.087950125 = weight(abstract_txt:archives in 1253) [ClassicSimilarity], result of:
            0.087950125 = score(doc=1253,freq=2.0), product of:
              0.34362003 = queryWeight, product of:
                2.8677862 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.020688964 = queryNorm
              0.25595167 = fieldWeight in 1253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.03125 = fieldNorm(doc=1253)
        0.28 = coord(7/25)
    
  4. Intner, S.S.: Enhancing OPACs (1993) 0.08
    0.08421834 = sum of:
      0.08421834 = product of:
        0.5263646 = sum of:
          0.11865709 = weight(abstract_txt:added in 6583) [ClassicSimilarity], result of:
            0.11865709 = score(doc=6583,freq=1.0), product of:
              0.12534477 = queryWeight, product of:
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.020688964 = queryNorm
              0.94664574 = fieldWeight in 6583, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.15625 = fieldNorm(doc=6583)
          0.0505214 = weight(abstract_txt:been in 6583) [ClassicSimilarity], result of:
            0.0505214 = score(doc=6583,freq=1.0), product of:
              0.08937938 = queryWeight, product of:
                1.1942096 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.020688964 = queryNorm
              0.5652467 = fieldWeight in 6583, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.15625 = fieldNorm(doc=6583)
          0.3010155 = weight(abstract_txt:removed in 6583) [ClassicSimilarity], result of:
            0.3010155 = score(doc=6583,freq=1.0), product of:
              0.23315048 = queryWeight, product of:
                1.3638451 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.020688964 = queryNorm
              1.2910782 = fieldWeight in 6583, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.15625 = fieldNorm(doc=6583)
          0.056170642 = weight(abstract_txt:which in 6583) [ClassicSimilarity], result of:
            0.056170642 = score(doc=6583,freq=2.0), product of:
              0.087152615 = queryWeight, product of:
                1.4442679 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.020688964 = queryNorm
              0.64450896 = fieldWeight in 6583, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.15625 = fieldNorm(doc=6583)
        0.16 = coord(4/25)
    
  5. Jacobs, J.-H.; Mengel, T.; Müller, K.: Insights and Outlooks : a retrospective view on the CrissCross project (2011) 0.08
    0.08307246 = sum of:
      0.08307246 = product of:
        0.4153623 = sum of:
          0.12060815 = weight(abstract_txt:thematic in 4785) [ClassicSimilarity], result of:
            0.12060815 = score(doc=4785,freq=1.0), product of:
              0.1607297 = queryWeight, product of:
                1.1323873 = boost
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.020688964 = queryNorm
              0.7503787 = fieldWeight in 4785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.109375 = fieldNorm(doc=4785)
          0.03536498 = weight(abstract_txt:been in 4785) [ClassicSimilarity], result of:
            0.03536498 = score(doc=4785,freq=1.0), product of:
              0.08937938 = queryWeight, product of:
                1.1942096 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.020688964 = queryNorm
              0.3956727 = fieldWeight in 4785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.109375 = fieldNorm(doc=4785)
          0.03635358 = weight(abstract_txt:access in 4785) [ClassicSimilarity], result of:
            0.03635358 = score(doc=4785,freq=1.0), product of:
              0.0910374 = queryWeight, product of:
                1.2052352 = boost
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.020688964 = queryNorm
              0.3993258 = fieldWeight in 4785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.109375 = fieldNorm(doc=4785)
          0.02780305 = weight(abstract_txt:which in 4785) [ClassicSimilarity], result of:
            0.02780305 = score(doc=4785,freq=1.0), product of:
              0.087152615 = queryWeight, product of:
                1.4442679 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.020688964 = queryNorm
              0.31901568 = fieldWeight in 4785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.109375 = fieldNorm(doc=4785)
          0.19523251 = weight(abstract_txt:heterogeneous in 4785) [ClassicSimilarity], result of:
            0.19523251 = score(doc=4785,freq=1.0), product of:
              0.27918354 = queryWeight, product of:
                2.1106043 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.020688964 = queryNorm
              0.6992981 = fieldWeight in 4785, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.109375 = fieldNorm(doc=4785)
        0.2 = coord(5/25)