Document (#21970)

Author
Beeri, C.
Elber, G.
Milo, T.
Sagiv, Y.
Shmueli, O.
Tishby, N.
Kogan, Y.
Konopnicki, D.
Mogilevski, P.
Slonim, N.
Title
WebSuite - a tool suite for harnessing Web data
Source
The World Wide Web and Databases: International Workshop WebDB'98, Valencia, Spain, March 27-28, 1998, Selected papers. Eds.: P. Atzeni et al.
Imprint
Berlin : Springer
Year
1999
Pages
pp. 136-151
Series
Lecture notes in computer science; vol.1590
Abstract
We present a system for searching, collecting, and integrating Web-resident data. The system consists of five tools, where each tool provides a specific functionality aimed at solving one aspect of the complex task of using and managing Web data. Each tool can be used in a stand-alone mode, in combination with the other tools, or even in conjunction with other systems. Together, the tools offer a wider range of capabilities that overcome many of the limitations in existing systems for harnessing Web data. The paper describes each tool, possible ways of combining the tools, and the architecture of the combined system.
Theme
Internet
Object
WWW

Similar documents (content)

  1. Oinas-Kukkonen, H.: Towards greater flexibility in software design systems through hypermedia functionality (1997) 0.12
    0.12482826 = sum of:
      0.12482826 = product of:
        0.52011776 = sum of:
          0.06777916 = weight(abstract_txt:capabilities in 782) [ClassicSimilarity], result of:
            0.06777916 = score(doc=782,freq=1.0), product of:
              0.12090212 = queryWeight, product of:
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.020218221 = queryNorm
              0.56061184 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
          0.08025316 = weight(abstract_txt:integrating in 782) [ClassicSimilarity], result of:
            0.08025316 = score(doc=782,freq=1.0), product of:
              0.13531457 = queryWeight, product of:
                1.057926 = boost
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.020218221 = queryNorm
              0.5930859 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
          0.081152 = weight(abstract_txt:functionality in 782) [ClassicSimilarity], result of:
            0.081152 = score(doc=782,freq=1.0), product of:
              0.13632305 = queryWeight, product of:
                1.061861 = boost
                6.3497796 = idf(docFreq=209, maxDocs=44218)
                0.020218221 = queryNorm
              0.59529185 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3497796 = idf(docFreq=209, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
          0.025178732 = weight(abstract_txt:systems in 782) [ClassicSimilarity], result of:
            0.025178732 = score(doc=782,freq=1.0), product of:
              0.0787171 = queryWeight, product of:
                1.1411233 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.020218221 = queryNorm
              0.3198636 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
          0.11182115 = weight(abstract_txt:tools in 782) [ClassicSimilarity], result of:
            0.11182115 = score(doc=782,freq=1.0), product of:
              0.26796153 = queryWeight, product of:
                2.9774828 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.020218221 = queryNorm
              0.417303 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
          0.15393358 = weight(abstract_txt:tool in 782) [ClassicSimilarity], result of:
            0.15393358 = score(doc=782,freq=1.0), product of:
              0.3315981 = queryWeight, product of:
                3.3122191 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.020218221 = queryNorm
              0.4642173 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.09375 = fieldNorm(doc=782)
        0.24 = coord(6/25)
    
  2. Harlow, C.: Data munging tools in Preparation for RDF : Catmandu and LODRefine (2015) 0.11
    0.10622327 = sum of:
      0.10622327 = product of:
        0.44259697 = sum of:
          0.016785823 = weight(abstract_txt:systems in 2277) [ClassicSimilarity], result of:
            0.016785823 = score(doc=2277,freq=1.0), product of:
              0.0787171 = queryWeight, product of:
                1.1411233 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.020218221 = queryNorm
              0.2132424 = fieldWeight in 2277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
          0.018440533 = weight(abstract_txt:other in 2277) [ClassicSimilarity], result of:
            0.018440533 = score(doc=2277,freq=1.0), product of:
              0.0838088 = queryWeight, product of:
                1.177451 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.020218221 = queryNorm
              0.22003098 = fieldWeight in 2277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
          0.06264148 = weight(abstract_txt:each in 2277) [ClassicSimilarity], result of:
            0.06264148 = score(doc=2277,freq=2.0), product of:
              0.17206891 = queryWeight, product of:
                2.066307 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.020218221 = queryNorm
              0.36404878 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
          0.09417317 = weight(abstract_txt:data in 2277) [ClassicSimilarity], result of:
            0.09417317 = score(doc=2277,freq=9.0), product of:
              0.15054093 = queryWeight, product of:
                2.2317233 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020218221 = queryNorm
              0.62556523 = fieldWeight in 2277, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
          0.10542599 = weight(abstract_txt:tools in 2277) [ClassicSimilarity], result of:
            0.10542599 = score(doc=2277,freq=2.0), product of:
              0.26796153 = queryWeight, product of:
                2.9774828 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.020218221 = queryNorm
              0.39343703 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
          0.14512996 = weight(abstract_txt:tool in 2277) [ClassicSimilarity], result of:
            0.14512996 = score(doc=2277,freq=2.0), product of:
              0.3315981 = queryWeight, product of:
                3.3122191 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.020218221 = queryNorm
              0.43766826 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.0625 = fieldNorm(doc=2277)
        0.24 = coord(6/25)
    
  3. Fowler, W.A.; Fowler, R.H.: A hypertext-based approach to computer science education unifying programming principles (1993) 0.11
    0.10546651 = sum of:
      0.10546651 = product of:
        0.52733254 = sum of:
          0.10700421 = weight(abstract_txt:integrating in 8017) [ClassicSimilarity], result of:
            0.10700421 = score(doc=8017,freq=1.0), product of:
              0.13531457 = queryWeight, product of:
                1.057926 = boost
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.020218221 = queryNorm
              0.79078114 = fieldWeight in 8017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.326249 = idf(docFreq=214, maxDocs=44218)
                0.125 = fieldNorm(doc=8017)
          0.032417078 = weight(abstract_txt:system in 8017) [ClassicSimilarity], result of:
            0.032417078 = score(doc=8017,freq=1.0), product of:
              0.07690181 = queryWeight, product of:
                1.1278889 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.020218221 = queryNorm
              0.42153856 = fieldWeight in 8017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.125 = fieldNorm(doc=8017)
          0.033571646 = weight(abstract_txt:systems in 8017) [ClassicSimilarity], result of:
            0.033571646 = score(doc=8017,freq=1.0), product of:
              0.0787171 = queryWeight, product of:
                1.1411233 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.020218221 = queryNorm
              0.4264848 = fieldWeight in 8017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.125 = fieldNorm(doc=8017)
          0.14909486 = weight(abstract_txt:tools in 8017) [ClassicSimilarity], result of:
            0.14909486 = score(doc=8017,freq=1.0), product of:
              0.26796153 = queryWeight, product of:
                2.9774828 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.020218221 = queryNorm
              0.556404 = fieldWeight in 8017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.125 = fieldNorm(doc=8017)
          0.20524476 = weight(abstract_txt:tool in 8017) [ClassicSimilarity], result of:
            0.20524476 = score(doc=8017,freq=1.0), product of:
              0.3315981 = queryWeight, product of:
                3.3122191 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.020218221 = queryNorm
              0.6189564 = fieldWeight in 8017, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.125 = fieldNorm(doc=8017)
        0.2 = coord(5/25)
    
  4. Nichols, D.M.; Paynter, G.W.; Chan, C.-H.; Bainbridge, D.; McKay, D.; Twidale, M.B.; Blandford, A.: Experiences in deploying metadata analysis tools for institutional repositories (2009) 0.10
    0.10361117 = sum of:
      0.10361117 = product of:
        0.51805586 = sum of:
          0.052246105 = weight(abstract_txt:wider in 2986) [ClassicSimilarity], result of:
            0.052246105 = score(doc=2986,freq=1.0), product of:
              0.13318846 = queryWeight, product of:
                1.0495819 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.020218221 = queryNorm
              0.39227203 = fieldWeight in 2986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=2986)
          0.03193994 = weight(abstract_txt:other in 2986) [ClassicSimilarity], result of:
            0.03193994 = score(doc=2986,freq=3.0), product of:
              0.0838088 = queryWeight, product of:
                1.177451 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.020218221 = queryNorm
              0.38110483 = fieldWeight in 2986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0625 = fieldNorm(doc=2986)
          0.031391058 = weight(abstract_txt:data in 2986) [ClassicSimilarity], result of:
            0.031391058 = score(doc=2986,freq=1.0), product of:
              0.15054093 = queryWeight, product of:
                2.2317233 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.020218221 = queryNorm
              0.20852174 = fieldWeight in 2986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2986)
          0.19723396 = weight(abstract_txt:tools in 2986) [ClassicSimilarity], result of:
            0.19723396 = score(doc=2986,freq=7.0), product of:
              0.26796153 = queryWeight, product of:
                2.9774828 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.020218221 = queryNorm
              0.7360533 = fieldWeight in 2986, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.0625 = fieldNorm(doc=2986)
          0.20524476 = weight(abstract_txt:tool in 2986) [ClassicSimilarity], result of:
            0.20524476 = score(doc=2986,freq=4.0), product of:
              0.3315981 = queryWeight, product of:
                3.3122191 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.020218221 = queryNorm
              0.6189564 = fieldWeight in 2986, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.0625 = fieldNorm(doc=2986)
        0.2 = coord(5/25)
    
  5. How classifications work : problems and challenges in an electronic age (1998) 0.10
    0.0998119 = sum of:
      0.0998119 = product of:
        0.35647106 = sum of:
          0.020260673 = weight(abstract_txt:system in 849) [ClassicSimilarity], result of:
            0.020260673 = score(doc=849,freq=4.0), product of:
              0.07690181 = queryWeight, product of:
                1.1278889 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.020218221 = queryNorm
              0.2634616 = fieldWeight in 849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.020982277 = weight(abstract_txt:systems in 849) [ClassicSimilarity], result of:
            0.020982277 = score(doc=849,freq=4.0), product of:
              0.0787171 = queryWeight, product of:
                1.1411233 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.020218221 = queryNorm
              0.26655298 = fieldWeight in 849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.023050666 = weight(abstract_txt:other in 849) [ClassicSimilarity], result of:
            0.023050666 = score(doc=849,freq=4.0), product of:
              0.0838088 = queryWeight, product of:
                1.177451 = boost
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.020218221 = queryNorm
              0.27503872 = fieldWeight in 849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.069136 = weight(abstract_txt:suite in 849) [ClassicSimilarity], result of:
            0.069136 = score(doc=849,freq=1.0), product of:
              0.21960731 = queryWeight, product of:
                1.347741 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.020218221 = queryNorm
              0.31481647 = fieldWeight in 849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.03915092 = weight(abstract_txt:each in 849) [ClassicSimilarity], result of:
            0.03915092 = score(doc=849,freq=2.0), product of:
              0.17206891 = queryWeight, product of:
                2.066307 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.020218221 = queryNorm
              0.22753048 = fieldWeight in 849, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.09318429 = weight(abstract_txt:tools in 849) [ClassicSimilarity], result of:
            0.09318429 = score(doc=849,freq=4.0), product of:
              0.26796153 = queryWeight, product of:
                2.9774828 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.020218221 = queryNorm
              0.3477525 = fieldWeight in 849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
          0.09070623 = weight(abstract_txt:tool in 849) [ClassicSimilarity], result of:
            0.09070623 = score(doc=849,freq=2.0), product of:
              0.3315981 = queryWeight, product of:
                3.3122191 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.020218221 = queryNorm
              0.27354267 = fieldWeight in 849, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.0390625 = fieldNorm(doc=849)
        0.28 = coord(7/25)
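
The score breakdowns above all follow the same arithmetic: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor. As a rough illustration, the sketch below recomputes the score of the first entry (doc 782) from the values in its listing. The function and variable names are hypothetical, the idf formula is the one that reproduces the listed idf values (an assumption inferred from the numbers, not taken from the catalog software), and a boost of 1.0 is assumed where the listing shows no boost line.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.020218221  # queryNorm from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, chosen to reproduce the listed idf values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """queryWeight * fieldWeight for one matching term."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # tf is the square root of the term frequency, as in the listing
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (boost, docFreq, freq) for the six terms matched in doc 782;
# a boost of 1.0 is assumed where the listing shows no boost line.
terms_782 = [
    (1.0, 303, 1.0),         # capabilities
    (1.057926, 214, 1.0),    # integrating
    (1.061861, 209, 1.0),    # functionality
    (1.1411233, 3963, 1.0),  # systems
    (2.9774828, 1401, 1.0),  # tools
    (3.3122191, 849, 1.0),   # tool
]

coord = 6 / 25  # 6 of the 25 query terms matched
score_782 = coord * sum(term_score(b, df, f, 0.09375) for b, df, f in terms_782)
print(f"{score_782:.4f}")
```

Run as written, this prints a value of about 0.1248, in line with the 0.12482826 shown for the first entry. Small differences in the last digits are expected because the listing uses single-precision arithmetic.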