Document (#29777)

Author
Hagedorn, K.
Title
OAIster: a "no dead ends" OAI service provider
Source
Library hi tech. 21(2003) no.2, S.170-181
Year
2003
Abstract
OAIster, at the University of Michigan, University Libraries, Digital Library Production Service (DLPS), is an Andrew W. Mellon Foundation grant-funded project designed to test the feasibility of using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to harvest digital object metadata from multiple and varied digital object repositories and develop a service to allow end-users to access that metadata. This article describes in-depth the development of our system to harvest, store, transform the metadata into Digital Library eXtension Service (DLXS) Bibliographic Class format, build indexes and make the metadata searchable through an interface using the XPAT search engine. Results of the testing of our service and statistics on usage are reported, as well as the issues that we have encountered during our harvesting and transformation operations. The article closes by discussing the future improvements and potential of OAIster and the OAI-PMH protocol.
Content
Vgl. auch unter: http://www.emeraldinsight.com/10.1108/07378830310479811.
Theme
Metadaten
Object
OAI-PMH

Similar documents (content)

  1. Shreeves, S.L.; Kaczmarek, J.S.; Cole, T.W.: Harvesting cultural heritage metadata using OAI Protocol (2003) 0.41
    0.41258144 = sum of:
      0.41258144 = product of:
        1.0314536 = sum of:
          0.014076748 = weight(abstract_txt:using in 4775) [ClassicSimilarity], result of:
            0.014076748 = score(doc=4775,freq=1.0), product of:
              0.05202893 = queryWeight, product of:
                1.0399858 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014446084 = queryNorm
              0.27055615 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.018657044 = weight(abstract_txt:article in 4775) [ClassicSimilarity], result of:
            0.018657044 = score(doc=4775,freq=1.0), product of:
              0.06277768 = queryWeight, product of:
                1.1423721 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.014446084 = queryNorm
              0.2971923 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.02629395 = weight(abstract_txt:university in 4775) [ClassicSimilarity], result of:
            0.02629395 = score(doc=4775,freq=1.0), product of:
              0.078912765 = queryWeight, product of:
                1.2807919 = boost
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.014446084 = queryNorm
              0.33320275 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.10976773 = weight(abstract_txt:mellon in 4775) [ClassicSimilarity], result of:
            0.10976773 = score(doc=4775,freq=1.0), product of:
              0.16238646 = queryWeight, product of:
                1.2991667 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.014446084 = queryNorm
              0.675966 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.11163514 = weight(abstract_txt:andrew in 4775) [ClassicSimilarity], result of:
            0.11163514 = score(doc=4775,freq=1.0), product of:
              0.164223 = queryWeight, product of:
                1.3064926 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.014446084 = queryNorm
              0.67977774 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.099825904 = weight(abstract_txt:protocol in 4775) [ClassicSimilarity], result of:
            0.099825904 = score(doc=4775,freq=1.0), product of:
              0.19204612 = queryWeight, product of:
                1.9980563 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.014446084 = queryNorm
              0.51980174 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.21344633 = weight(abstract_txt:harvesting in 4775) [ClassicSimilarity], result of:
            0.21344633 = score(doc=4775,freq=2.0), product of:
              0.2529837 = queryWeight, product of:
                2.293249 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.014446084 = queryNorm
              0.8437158 = fieldWeight in 4775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.07798363 = weight(abstract_txt:digital in 4775) [ClassicSimilarity], result of:
            0.07798363 = score(doc=4775,freq=2.0), product of:
              0.16289672 = queryWeight, product of:
                2.6024125 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.014446084 = queryNorm
              0.4787305 = fieldWeight in 4775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.08104104 = weight(abstract_txt:service in 4775) [ClassicSimilarity], result of:
            0.08104104 = score(doc=4775,freq=1.0), product of:
              0.22682628 = queryWeight, product of:
                3.4333782 = boost
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.014446084 = queryNorm
              0.3572824 = fieldWeight in 4775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
          0.27872607 = weight(abstract_txt:metadata in 4775) [ClassicSimilarity], result of:
            0.27872607 = score(doc=4775,freq=8.0), product of:
              0.25841147 = queryWeight, product of:
                3.664636 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.014446084 = queryNorm
              1.0786134 = fieldWeight in 4775, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=4775)
        0.4 = coord(10/25)
    
  2. Halbert, M.: ¬The Metascholar Initiative : AmericanSouth.Org and MetaArchive.Org (2003) 0.23
    0.22627272 = sum of:
      0.22627272 = product of:
        0.80811685 = sum of:
          0.050058756 = weight(abstract_txt:feasibility in 4777) [ClassicSimilarity], result of:
            0.050058756 = score(doc=4777,freq=1.0), product of:
              0.09620997 = queryWeight, product of:
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.014446084 = queryNorm
              0.52030736 = fieldWeight in 4777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.02629395 = weight(abstract_txt:university in 4777) [ClassicSimilarity], result of:
            0.02629395 = score(doc=4777,freq=1.0), product of:
              0.078912765 = queryWeight, product of:
                1.2807919 = boost
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.014446084 = queryNorm
              0.33320275 = fieldWeight in 4777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.10976773 = weight(abstract_txt:mellon in 4777) [ClassicSimilarity], result of:
            0.10976773 = score(doc=4777,freq=1.0), product of:
              0.16238646 = queryWeight, product of:
                1.2991667 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.014446084 = queryNorm
              0.675966 = fieldWeight in 4777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.11163514 = weight(abstract_txt:andrew in 4777) [ClassicSimilarity], result of:
            0.11163514 = score(doc=4777,freq=1.0), product of:
              0.164223 = queryWeight, product of:
                1.3064926 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.014446084 = queryNorm
              0.67977774 = fieldWeight in 4777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.099825904 = weight(abstract_txt:protocol in 4777) [ClassicSimilarity], result of:
            0.099825904 = score(doc=4777,freq=1.0), product of:
              0.19204612 = queryWeight, product of:
                1.9980563 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.014446084 = queryNorm
              0.51980174 = fieldWeight in 4777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.21344633 = weight(abstract_txt:harvesting in 4777) [ClassicSimilarity], result of:
            0.21344633 = score(doc=4777,freq=2.0), product of:
              0.2529837 = queryWeight, product of:
                2.293249 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.014446084 = queryNorm
              0.8437158 = fieldWeight in 4777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
          0.19708909 = weight(abstract_txt:metadata in 4777) [ClassicSimilarity], result of:
            0.19708909 = score(doc=4777,freq=4.0), product of:
              0.25841147 = queryWeight, product of:
                3.664636 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.014446084 = queryNorm
              0.76269484 = fieldWeight in 4777, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=4777)
        0.28 = coord(7/25)
    
  3. Van de Sompel, H.; Nelson, M.L.; Lagoze, C.; Warner, S.: Resource harvesting within the OAI-PMH framework (2004) 0.21
    0.2064665 = sum of:
      0.2064665 = product of:
        0.8602771 = sum of:
          0.02925797 = weight(abstract_txt:using in 4110) [ClassicSimilarity], result of:
            0.02925797 = score(doc=4110,freq=3.0), product of:
              0.05202893 = queryWeight, product of:
                1.0399858 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014446084 = queryNorm
              0.5623404 = fieldWeight in 4110, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
          0.103973456 = weight(abstract_txt:object in 4110) [ClassicSimilarity], result of:
            0.103973456 = score(doc=4110,freq=2.0), product of:
              0.13869502 = queryWeight, product of:
                1.697991 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.014446084 = queryNorm
              0.7496553 = fieldWeight in 4110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
          0.11979108 = weight(abstract_txt:protocol in 4110) [ClassicSimilarity], result of:
            0.11979108 = score(doc=4110,freq=1.0), product of:
              0.19204612 = queryWeight, product of:
                1.9980563 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.014446084 = queryNorm
              0.6237621 = fieldWeight in 4110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
          0.2561356 = weight(abstract_txt:harvesting in 4110) [ClassicSimilarity], result of:
            0.2561356 = score(doc=4110,freq=2.0), product of:
              0.2529837 = queryWeight, product of:
                2.293249 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.014446084 = queryNorm
              1.012459 = fieldWeight in 4110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
          0.114612065 = weight(abstract_txt:digital in 4110) [ClassicSimilarity], result of:
            0.114612065 = score(doc=4110,freq=3.0), product of:
              0.16289672 = queryWeight, product of:
                2.6024125 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.014446084 = queryNorm
              0.7035873 = fieldWeight in 4110, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
          0.23650692 = weight(abstract_txt:metadata in 4110) [ClassicSimilarity], result of:
            0.23650692 = score(doc=4110,freq=4.0), product of:
              0.25841147 = queryWeight, product of:
                3.664636 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.014446084 = queryNorm
              0.91523385 = fieldWeight in 4110, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.09375 = fieldNorm(doc=4110)
        0.24 = coord(6/25)
    
  4. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.19
    0.19325651 = sum of:
      0.19325651 = product of:
        0.8052355 = sum of:
          0.051748503 = weight(abstract_txt:varied in 1168) [ClassicSimilarity], result of:
            0.051748503 = score(doc=1168,freq=1.0), product of:
              0.098363034 = queryWeight, product of:
                1.0111275 = boost
                6.7340426 = idf(docFreq=142, maxDocs=44218)
                0.014446084 = queryNorm
              0.52609706 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7340426 = idf(docFreq=142, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
          0.02438164 = weight(abstract_txt:using in 1168) [ClassicSimilarity], result of:
            0.02438164 = score(doc=1168,freq=3.0), product of:
              0.05202893 = queryWeight, product of:
                1.0399858 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014446084 = queryNorm
              0.46861696 = fieldWeight in 1168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
          0.08121523 = weight(abstract_txt:michigan in 1168) [ClassicSimilarity], result of:
            0.08121523 = score(doc=1168,freq=1.0), product of:
              0.13283883 = queryWeight, product of:
                1.1750395 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.014446084 = queryNorm
              0.6113817 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
          0.03718526 = weight(abstract_txt:university in 1168) [ClassicSimilarity], result of:
            0.03718526 = score(doc=1168,freq=2.0), product of:
              0.078912765 = queryWeight, product of:
                1.2807919 = boost
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.014446084 = queryNorm
              0.47121984 = fieldWeight in 1168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.264995 = idf(docFreq=1688, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
          0.13936304 = weight(abstract_txt:metadata in 1168) [ClassicSimilarity], result of:
            0.13936304 = score(doc=1168,freq=2.0), product of:
              0.25841147 = queryWeight, product of:
                3.664636 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.014446084 = queryNorm
              0.5393067 = fieldWeight in 1168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
          0.47134182 = weight(abstract_txt:oaister in 1168) [ClassicSimilarity], result of:
            0.47134182 = score(doc=1168,freq=1.0), product of:
              0.6187252 = queryWeight, product of:
                4.3923755 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.014446084 = queryNorm
              0.7617951 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=1168)
        0.24 = coord(6/25)
    
  5. Arms, C.R.: Available and useful : OAI at the Library of Congress (2003) 0.19
    0.18870018 = sum of:
      0.18870018 = product of:
        0.78625077 = sum of:
          0.018657044 = weight(abstract_txt:article in 4773) [ClassicSimilarity], result of:
            0.018657044 = score(doc=4773,freq=1.0), product of:
              0.06277768 = queryWeight, product of:
                1.1423721 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.014446084 = queryNorm
              0.2971923 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
          0.17290352 = weight(abstract_txt:protocol in 4773) [ClassicSimilarity], result of:
            0.17290352 = score(doc=4773,freq=3.0), product of:
              0.19204612 = queryWeight, product of:
                1.9980563 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.014446084 = queryNorm
              0.9003229 = fieldWeight in 4773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
          0.26141733 = weight(abstract_txt:harvesting in 4773) [ClassicSimilarity], result of:
            0.26141733 = score(doc=4773,freq=3.0), product of:
              0.2529837 = queryWeight, product of:
                2.293249 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.014446084 = queryNorm
              1.0333366 = fieldWeight in 4773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
          0.055142753 = weight(abstract_txt:digital in 4773) [ClassicSimilarity], result of:
            0.055142753 = score(doc=4773,freq=1.0), product of:
              0.16289672 = queryWeight, product of:
                2.6024125 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.014446084 = queryNorm
              0.33851358 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
          0.08104104 = weight(abstract_txt:service in 4773) [ClassicSimilarity], result of:
            0.08104104 = score(doc=4773,freq=1.0), product of:
              0.22682628 = queryWeight, product of:
                3.4333782 = boost
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.014446084 = queryNorm
              0.3572824 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
          0.19708909 = weight(abstract_txt:metadata in 4773) [ClassicSimilarity], result of:
            0.19708909 = score(doc=4773,freq=4.0), product of:
              0.25841147 = queryWeight, product of:
                3.664636 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.014446084 = queryNorm
              0.76269484 = fieldWeight in 4773, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=4773)
        0.24 = coord(6/25)