Document (#24017)

Author
Hakala, J.
Title
¬The NEDLIB harvester
Source
Zeitschrift für Bibliothekswesen und Bibliographie. 48(2001) H.3/4, S.211-216
Year
2001
Abstract
The importance of the Internet as publishing channel is constantly growing. As a result, the national libraries are developing new means for extending their deposit activities into the Web documents. The NEDLIB harvester is a tool with which Web documents can be collected and archived. This article outlines the design principles and functionality of this public domain tool, which is also suitable for small scale document archival
Form
Elektronische Dokumente

Similar documents (author)

  1. Hakala, J.: Z39.50-1995: information retrieval protocol : an introduction to the standard and it's usage (1996) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:hakala in 3340) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 3340, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=3340)
    
  2. Hakala, J.: Dublin core in 1997 : a report from Dublin Core metadata workshops 4 & 5 (1998) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:hakala in 2220) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 2220, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=2220)
    
  3. Hakala, J.: Internet metadata and library cataloguing (1999) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:hakala in 4565) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 4565, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=4565)
    
  4. Hakala, J.; Husby, O.; Koch, T.: Warwick framework and Dublin core set provide a comprehensive infrastructure for network resource description (1996) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:hakala in 6921) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 6921, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=6921)
    

Similar documents (content)

  1. Boeri, R.J.; Hensel, M.: Corporate online/CD-ROM publishing : the desing and tactical issues (1996) 0.15
    0.14609352 = sum of:
      0.14609352 = product of:
        0.52176255 = sum of:
          0.013574182 = weight(abstract_txt:this in 4553) [ClassicSimilarity], result of:
            0.013574182 = score(doc=4553,freq=1.0), product of:
              0.060004234 = queryWeight, product of:
                1.0190986 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.024400864 = queryNorm
              0.2262204 = fieldWeight in 4553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.06401745 = weight(abstract_txt:importance in 4553) [ClassicSimilarity], result of:
            0.06401745 = score(doc=4553,freq=1.0), product of:
              0.1339353 = queryWeight, product of:
                1.0766085 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.024400864 = queryNorm
              0.47797295 = fieldWeight in 4553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.07330386 = weight(abstract_txt:small in 4553) [ClassicSimilarity], result of:
            0.07330386 = score(doc=4553,freq=1.0), product of:
              0.14659327 = queryWeight, product of:
                1.1263343 = boost
                5.333859 = idf(docFreq=579, maxDocs=44218)
                0.024400864 = queryNorm
              0.5000493 = fieldWeight in 4553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.333859 = idf(docFreq=579, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.15042219 = weight(abstract_txt:publishing in 4553) [ClassicSimilarity], result of:
            0.15042219 = score(doc=4553,freq=4.0), product of:
              0.1491251 = queryWeight, product of:
                1.1360191 = boost
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.024400864 = queryNorm
              1.008698 = fieldWeight in 4553, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.0793347 = weight(abstract_txt:scale in 4553) [ClassicSimilarity], result of:
            0.0793347 = score(doc=4553,freq=1.0), product of:
              0.1545272 = queryWeight, product of:
                1.1564125 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.024400864 = queryNorm
              0.5134028 = fieldWeight in 4553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.023972541 = weight(abstract_txt:which in 4553) [ClassicSimilarity], result of:
            0.023972541 = score(doc=4553,freq=1.0), product of:
              0.087669566 = queryWeight, product of:
                1.231827 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.024400864 = queryNorm
              0.273442 = fieldWeight in 4553, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
          0.1171376 = weight(abstract_txt:documents in 4553) [ClassicSimilarity], result of:
            0.1171376 = score(doc=4553,freq=3.0), product of:
              0.17503704 = queryWeight, product of:
                1.7405651 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.024400864 = queryNorm
              0.6692161 = fieldWeight in 4553, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=4553)
        0.28 = coord(7/25)
    
  2. Jens-Erik Mai, J.-E.: ¬The role of documents, domains and decisions in indexing (2004) 0.12
    0.12057231 = sum of:
      0.12057231 = product of:
        0.50238466 = sum of:
          0.072550416 = weight(abstract_txt:domain in 2653) [ClassicSimilarity], result of:
            0.072550416 = score(doc=2653,freq=2.0), product of:
              0.11555252 = queryWeight, product of:
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.024400864 = queryNorm
              0.6278566 = fieldWeight in 2653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.08615342 = weight(abstract_txt:means in 2653) [ClassicSimilarity], result of:
            0.08615342 = score(doc=2653,freq=2.0), product of:
              0.12957896 = queryWeight, product of:
                1.0589551 = boost
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.024400864 = queryNorm
              0.66487193 = fieldWeight in 2653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.088417046 = weight(abstract_txt:activities in 2653) [ClassicSimilarity], result of:
            0.088417046 = score(doc=2653,freq=2.0), product of:
              0.13183887 = queryWeight, product of:
                1.0681494 = boost
                5.0583196 = idf(docFreq=763, maxDocs=44218)
                0.024400864 = queryNorm
              0.67064476 = fieldWeight in 2653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0583196 = idf(docFreq=763, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.06401745 = weight(abstract_txt:importance in 2653) [ClassicSimilarity], result of:
            0.06401745 = score(doc=2653,freq=1.0), product of:
              0.1339353 = queryWeight, product of:
                1.0766085 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.024400864 = queryNorm
              0.47797295 = fieldWeight in 2653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.07395052 = weight(abstract_txt:outlines in 2653) [ClassicSimilarity], result of:
            0.07395052 = score(doc=2653,freq=1.0), product of:
              0.14745414 = queryWeight, product of:
                1.1296366 = boost
                5.349498 = idf(docFreq=570, maxDocs=44218)
                0.024400864 = queryNorm
              0.5015154 = fieldWeight in 2653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.349498 = idf(docFreq=570, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
          0.11729577 = weight(abstract_txt:tool in 2653) [ClassicSimilarity], result of:
            0.11729577 = score(doc=2653,freq=1.0), product of:
              0.25267428 = queryWeight, product of:
                2.0912492 = boost
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.024400864 = queryNorm
              0.4642173 = fieldWeight in 2653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.951651 = idf(docFreq=849, maxDocs=44218)
                0.09375 = fieldNorm(doc=2653)
        0.24 = coord(6/25)
    
  3. Björk, B.-C.; Laakso, M.; Welling, P.; Paetau, P.: Anatomy of green open access (2014) 0.12
    0.11717828 = sum of:
      0.11717828 = product of:
        0.48824286 = sum of:
          0.009049455 = weight(abstract_txt:this in 1194) [ClassicSimilarity], result of:
            0.009049455 = score(doc=1194,freq=1.0), product of:
              0.060004234 = queryWeight, product of:
                1.0190986 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.024400864 = queryNorm
              0.1508136 = fieldWeight in 1194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
          0.050140727 = weight(abstract_txt:publishing in 1194) [ClassicSimilarity], result of:
            0.050140727 = score(doc=1194,freq=1.0), product of:
              0.1491251 = queryWeight, product of:
                1.1360191 = boost
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.024400864 = queryNorm
              0.33623266 = fieldWeight in 1194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
          0.022601528 = weight(abstract_txt:which in 1194) [ClassicSimilarity], result of:
            0.022601528 = score(doc=1194,freq=2.0), product of:
              0.087669566 = queryWeight, product of:
                1.231827 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.024400864 = queryNorm
              0.2578036 = fieldWeight in 1194, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
          0.08208062 = weight(abstract_txt:archival in 1194) [ClassicSimilarity], result of:
            0.08208062 = score(doc=1194,freq=1.0), product of:
              0.20713368 = queryWeight, product of:
                1.3388615 = boost
                6.340301 = idf(docFreq=211, maxDocs=44218)
                0.024400864 = queryNorm
              0.3962688 = fieldWeight in 1194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.340301 = idf(docFreq=211, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
          0.13083905 = weight(abstract_txt:extending in 1194) [ClassicSimilarity], result of:
            0.13083905 = score(doc=1194,freq=1.0), product of:
              0.28264973 = queryWeight, product of:
                1.5639921 = boost
                7.406428 = idf(docFreq=72, maxDocs=44218)
                0.024400864 = queryNorm
              0.46290174 = fieldWeight in 1194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.406428 = idf(docFreq=72, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
          0.19353147 = weight(abstract_txt:archived in 1194) [ClassicSimilarity], result of:
            0.19353147 = score(doc=1194,freq=1.0), product of:
              0.3669369 = queryWeight, product of:
                1.7819929 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.024400864 = queryNorm
              0.5274244 = fieldWeight in 1194, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=1194)
        0.24 = coord(6/25)
    
  4. Dzhingo, A.: Russian national bibliography : its present situation (1996) 0.12
    0.116881266 = sum of:
      0.116881266 = product of:
        0.5844063 = sum of:
          0.055358667 = weight(abstract_txt:principles in 7903) [ClassicSimilarity], result of:
            0.055358667 = score(doc=7903,freq=1.0), product of:
              0.13728003 = queryWeight, product of:
                1.0899686 = boost
                5.161646 = idf(docFreq=688, maxDocs=44218)
                0.024400864 = queryNorm
              0.4032536 = fieldWeight in 7903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.161646 = idf(docFreq=688, maxDocs=44218)
                0.078125 = fieldNorm(doc=7903)
          0.06267591 = weight(abstract_txt:publishing in 7903) [ClassicSimilarity], result of:
            0.06267591 = score(doc=7903,freq=1.0), product of:
              0.1491251 = queryWeight, product of:
                1.1360191 = boost
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.024400864 = queryNorm
              0.42029083 = fieldWeight in 7903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3797226 = idf(docFreq=553, maxDocs=44218)
                0.078125 = fieldNorm(doc=7903)
          0.034601383 = weight(abstract_txt:which in 7903) [ClassicSimilarity], result of:
            0.034601383 = score(doc=7903,freq=3.0), product of:
              0.087669566 = queryWeight, product of:
                1.231827 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.024400864 = queryNorm
              0.39467955 = fieldWeight in 7903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.078125 = fieldNorm(doc=7903)
          0.3341557 = weight(abstract_txt:deposit in 7903) [ClassicSimilarity], result of:
            0.3341557 = score(doc=7903,freq=3.0), product of:
              0.31555554 = queryWeight, product of:
                1.6525255 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.024400864 = queryNorm
              1.0589442 = fieldWeight in 7903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.078125 = fieldNorm(doc=7903)
          0.09761467 = weight(abstract_txt:documents in 7903) [ClassicSimilarity], result of:
            0.09761467 = score(doc=7903,freq=3.0), product of:
              0.17503704 = queryWeight, product of:
                1.7405651 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.024400864 = queryNorm
              0.5576801 = fieldWeight in 7903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=7903)
        0.2 = coord(5/25)
    
  5. Urs, S.R.; Angrosh, M.A.: Ontology-based knowledge organization systems in digital libraries : a comparison of experiments in OWL and KAON ontologies (2006 (?)) 0.12
    0.116847515 = sum of:
      0.116847515 = product of:
        0.41731256 = sum of:
          0.051832523 = weight(abstract_txt:domain in 2799) [ClassicSimilarity], result of:
            0.051832523 = score(doc=2799,freq=3.0), product of:
              0.11555252 = queryWeight, product of:
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.024400864 = queryNorm
              0.44856244 = fieldWeight in 2799, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.01371485 = weight(abstract_txt:this in 2799) [ClassicSimilarity], result of:
            0.01371485 = score(doc=2799,freq=3.0), product of:
              0.060004234 = queryWeight, product of:
                1.0190986 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.024400864 = queryNorm
              0.2285647 = fieldWeight in 2799, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.037343513 = weight(abstract_txt:importance in 2799) [ClassicSimilarity], result of:
            0.037343513 = score(doc=2799,freq=1.0), product of:
              0.1339353 = queryWeight, product of:
                1.0766085 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.024400864 = queryNorm
              0.27881756 = fieldWeight in 2799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.08585478 = weight(abstract_txt:developing in 2799) [ClassicSimilarity], result of:
            0.08585478 = score(doc=2799,freq=5.0), product of:
              0.13643882 = queryWeight, product of:
                1.0866239 = boost
                5.145807 = idf(docFreq=699, maxDocs=44218)
                0.024400864 = queryNorm
              0.6292548 = fieldWeight in 2799, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.145807 = idf(docFreq=699, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.019776339 = weight(abstract_txt:which in 2799) [ClassicSimilarity], result of:
            0.019776339 = score(doc=2799,freq=2.0), product of:
              0.087669566 = queryWeight, product of:
                1.231827 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.024400864 = queryNorm
              0.22557814 = fieldWeight in 2799, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.0394505 = weight(abstract_txt:documents in 2799) [ClassicSimilarity], result of:
            0.0394505 = score(doc=2799,freq=1.0), product of:
              0.17503704 = queryWeight, product of:
                1.7405651 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.024400864 = queryNorm
              0.22538373 = fieldWeight in 2799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
          0.16934004 = weight(abstract_txt:archived in 2799) [ClassicSimilarity], result of:
            0.16934004 = score(doc=2799,freq=1.0), product of:
              0.3669369 = queryWeight, product of:
                1.7819929 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.024400864 = queryNorm
              0.46149635 = fieldWeight in 2799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2799)
        0.28 = coord(7/25)