Document (#29775)

Author
Hagedorn, K.
Title
OAIster: a "no dead ends" OAI service provider
Source
Library hi tech. 21(2003) no.2, S.170-181
Year
2003
Abstract
OAIster, at the University of Michigan, University Libraries, Digital Library Production Service (DLPS), is an Andrew W. Mellon Foundation grant-funded project designed to test the feasibility of using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to harvest digital object metadata from multiple and varied digital object repositories and develop a service to allow end-users to access that metadata. This article describes in-depth the development of our system to harvest, store, transform the metadata into Digital Library eXtension Service (DLXS) Bibliographic Class format, build indexes and make the metadata searchable through an interface using the XPAT search engine. Results of the testing of our service and statistics on usage are reported, as well as the issues that we have encountered during our harvesting and transformation operations. The article closes by discussing the future improvements and potential of OAIster and the OAI-PMH protocol.
Content
Vgl. auch unter: http://www.emeraldinsight.com/10.1108/07378830310479811.
Theme
Metadaten
Object
OAI-PMH

Similar documents (content)

  1. Shreeves, S.L.; Kaczmarek, J.S.; Cole, T.W.: Harvesting cultural heritage metadata using OAI Protocol (2003) 0.41
    0.41384363 = sum of:
      0.41384363 = product of:
        1.0346091 = sum of:
          0.0141952885 = weight(abstract_txt:using in 773) [ClassicSimilarity], result of:
            0.0141952885 = score(doc=773,freq=1.0), product of:
              0.05234591 = queryWeight, product of:
                1.0437361 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.014448427 = queryNorm
              0.2711824 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.018921072 = weight(abstract_txt:article in 773) [ClassicSimilarity], result of:
            0.018921072 = score(doc=773,freq=1.0), product of:
              0.063399196 = queryWeight, product of:
                1.1486592 = boost
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.014448427 = queryNorm
              0.2984434 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.026306668 = weight(abstract_txt:university in 773) [ClassicSimilarity], result of:
            0.026306668 = score(doc=773,freq=1.0), product of:
              0.07897636 = queryWeight, product of:
                1.2820292 = boost
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.014448427 = queryNorm
              0.33309546 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.10935304 = weight(abstract_txt:mellon in 773) [ClassicSimilarity], result of:
            0.10935304 = score(doc=773,freq=1.0), product of:
              0.16205552 = queryWeight, product of:
                1.2985727 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.014448427 = queryNorm
              0.6747875 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.11121666 = weight(abstract_txt:andrew in 773) [ClassicSimilarity], result of:
            0.11121666 = score(doc=773,freq=1.0), product of:
              0.16389151 = queryWeight, product of:
                1.305908 = boost
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.014448427 = queryNorm
              0.67859924 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.10017191 = weight(abstract_txt:protocol in 773) [ClassicSimilarity], result of:
            0.10017191 = score(doc=773,freq=1.0), product of:
              0.19258267 = queryWeight, product of:
                2.0019717 = boost
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.014448427 = queryNorm
              0.5201502 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.21249181 = weight(abstract_txt:harvesting in 773) [ClassicSimilarity], result of:
            0.21249181 = score(doc=773,freq=2.0), product of:
              0.2523508 = queryWeight, product of:
                2.291668 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.014448427 = queryNorm
              0.84204924 = fieldWeight in 773, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.07902579 = weight(abstract_txt:digital in 773) [ClassicSimilarity], result of:
            0.07902579 = score(doc=773,freq=2.0), product of:
              0.16442423 = queryWeight, product of:
                2.6160572 = boost
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.014448427 = queryNorm
              0.4806213 = fieldWeight in 773, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.08082996 = weight(abstract_txt:service in 773) [ClassicSimilarity], result of:
            0.08082996 = score(doc=773,freq=1.0), product of:
              0.2265417 = queryWeight, product of:
                3.433155 = boost
                4.5670333 = idf(docFreq=1229, maxDocs=43556)
                0.014448427 = queryNorm
              0.35679948 = fieldWeight in 773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5670333 = idf(docFreq=1229, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
          0.28209683 = weight(abstract_txt:metadata in 773) [ClassicSimilarity], result of:
            0.28209683 = score(doc=773,freq=8.0), product of:
              0.2606166 = queryWeight, product of:
                3.68231 = boost
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.014448427 = queryNorm
              1.0824208 = fieldWeight in 773, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.078125 = fieldNorm(doc=773)
        0.4 = coord(10/25)
    
  2. Halbert, M.: ¬The Metascholar Initiative : AmericanSouth.Org and MetaArchive.Org (2003) 0.23
    0.22650622 = sum of:
      0.22650622 = product of:
        0.8089508 = sum of:
          0.049938112 = weight(abstract_txt:feasibility in 775) [ClassicSimilarity], result of:
            0.049938112 = score(doc=775,freq=1.0), product of:
              0.09610175 = queryWeight, product of:
                6.651365 = idf(docFreq=152, maxDocs=43556)
                0.014448427 = queryNorm
              0.5196379 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.651365 = idf(docFreq=152, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.026306668 = weight(abstract_txt:university in 775) [ClassicSimilarity], result of:
            0.026306668 = score(doc=775,freq=1.0), product of:
              0.07897636 = queryWeight, product of:
                1.2820292 = boost
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.014448427 = queryNorm
              0.33309546 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.10935304 = weight(abstract_txt:mellon in 775) [ClassicSimilarity], result of:
            0.10935304 = score(doc=775,freq=1.0), product of:
              0.16205552 = queryWeight, product of:
                1.2985727 = boost
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.014448427 = queryNorm
              0.6747875 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.63728 = idf(docFreq=20, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.11121666 = weight(abstract_txt:andrew in 775) [ClassicSimilarity], result of:
            0.11121666 = score(doc=775,freq=1.0), product of:
              0.16389151 = queryWeight, product of:
                1.305908 = boost
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.014448427 = queryNorm
              0.67859924 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.68607 = idf(docFreq=19, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.10017191 = weight(abstract_txt:protocol in 775) [ClassicSimilarity], result of:
            0.10017191 = score(doc=775,freq=1.0), product of:
              0.19258267 = queryWeight, product of:
                2.0019717 = boost
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.014448427 = queryNorm
              0.5201502 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.21249181 = weight(abstract_txt:harvesting in 775) [ClassicSimilarity], result of:
            0.21249181 = score(doc=775,freq=2.0), product of:
              0.2523508 = queryWeight, product of:
                2.291668 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.014448427 = queryNorm
              0.84204924 = fieldWeight in 775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
          0.19947259 = weight(abstract_txt:metadata in 775) [ClassicSimilarity], result of:
            0.19947259 = score(doc=775,freq=4.0), product of:
              0.2606166 = queryWeight, product of:
                3.68231 = boost
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.014448427 = queryNorm
              0.7653871 = fieldWeight in 775, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.078125 = fieldNorm(doc=775)
        0.28 = coord(7/25)
    
  3. Van de Sompel, H.; Nelson, M.L.; Lagoze, C.; Warner, S.: Resource harvesting within the OAI-PMH framework (2004) 0.21
    0.20736727 = sum of:
      0.20736727 = product of:
        0.8640303 = sum of:
          0.029504355 = weight(abstract_txt:using in 108) [ClassicSimilarity], result of:
            0.029504355 = score(doc=108,freq=3.0), product of:
              0.05234591 = queryWeight, product of:
                1.0437361 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.014448427 = queryNorm
              0.563642 = fieldWeight in 108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
          0.10381865 = weight(abstract_txt:object in 108) [ClassicSimilarity], result of:
            0.10381865 = score(doc=108,freq=2.0), product of:
              0.1386243 = queryWeight, product of:
                1.6985135 = boost
                5.6487164 = idf(docFreq=416, maxDocs=43556)
                0.014448427 = queryNorm
              0.74892104 = fieldWeight in 108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6487164 = idf(docFreq=416, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
          0.12020629 = weight(abstract_txt:protocol in 108) [ClassicSimilarity], result of:
            0.12020629 = score(doc=108,freq=1.0), product of:
              0.19258267 = queryWeight, product of:
                2.0019717 = boost
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.014448427 = queryNorm
              0.6241802 = fieldWeight in 108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
          0.2549902 = weight(abstract_txt:harvesting in 108) [ClassicSimilarity], result of:
            0.2549902 = score(doc=108,freq=2.0), product of:
              0.2523508 = queryWeight, product of:
                2.291668 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.014448427 = queryNorm
              1.0104592 = fieldWeight in 108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
          0.11614371 = weight(abstract_txt:digital in 108) [ClassicSimilarity], result of:
            0.11614371 = score(doc=108,freq=3.0), product of:
              0.16442423 = queryWeight, product of:
                2.6160572 = boost
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.014448427 = queryNorm
              0.7063662 = fieldWeight in 108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
          0.23936711 = weight(abstract_txt:metadata in 108) [ClassicSimilarity], result of:
            0.23936711 = score(doc=108,freq=4.0), product of:
              0.2606166 = queryWeight, product of:
                3.68231 = boost
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.014448427 = queryNorm
              0.91846454 = fieldWeight in 108, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.09375 = fieldNorm(doc=108)
        0.24 = coord(6/25)
    
  4. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.19
    0.1934001 = sum of:
      0.1934001 = product of:
        0.80583376 = sum of:
          0.051637556 = weight(abstract_txt:varied in 3166) [ClassicSimilarity], result of:
            0.051637556 = score(doc=3166,freq=1.0), product of:
              0.098269865 = queryWeight, product of:
                1.0112174 = boost
                6.7259755 = idf(docFreq=141, maxDocs=43556)
                0.014448427 = queryNorm
              0.52546686 = fieldWeight in 3166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7259755 = idf(docFreq=141, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
          0.02458696 = weight(abstract_txt:using in 3166) [ClassicSimilarity], result of:
            0.02458696 = score(doc=3166,freq=3.0), product of:
              0.05234591 = queryWeight, product of:
                1.0437361 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.014448427 = queryNorm
              0.46970165 = fieldWeight in 3166, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
          0.0815193 = weight(abstract_txt:michigan in 3166) [ClassicSimilarity], result of:
            0.0815193 = score(doc=3166,freq=1.0), product of:
              0.13323455 = queryWeight, product of:
                1.1774508 = boost
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.014448427 = queryNorm
              0.61184806 = fieldWeight in 3166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.831655 = idf(docFreq=46, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
          0.037203245 = weight(abstract_txt:university in 3166) [ClassicSimilarity], result of:
            0.037203245 = score(doc=3166,freq=2.0), product of:
              0.07897636 = queryWeight, product of:
                1.2820292 = boost
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.014448427 = queryNorm
              0.47106808 = fieldWeight in 3166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.263622 = idf(docFreq=1665, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
          0.14104842 = weight(abstract_txt:metadata in 3166) [ClassicSimilarity], result of:
            0.14104842 = score(doc=3166,freq=2.0), product of:
              0.2606166 = queryWeight, product of:
                3.68231 = boost
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.014448427 = queryNorm
              0.5412104 = fieldWeight in 3166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
          0.46983826 = weight(abstract_txt:oaister in 3166) [ClassicSimilarity], result of:
            0.46983826 = score(doc=3166,freq=1.0), product of:
              0.6177071 = queryWeight, product of:
                4.3912306 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.014448427 = queryNorm
              0.7606166 = fieldWeight in 3166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.078125 = fieldNorm(doc=3166)
        0.24 = coord(6/25)
    
  5. Arms, C.R.: Available and useful : OAI at the Library of Congress (2003) 0.19
    0.18932505 = sum of:
      0.18932505 = product of:
        0.7888544 = sum of:
          0.018921072 = weight(abstract_txt:article in 771) [ClassicSimilarity], result of:
            0.018921072 = score(doc=771,freq=1.0), product of:
              0.063399196 = queryWeight, product of:
                1.1486592 = boost
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.014448427 = queryNorm
              0.2984434 = fieldWeight in 771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8200758 = idf(docFreq=2595, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
          0.17350283 = weight(abstract_txt:protocol in 771) [ClassicSimilarity], result of:
            0.17350283 = score(doc=771,freq=3.0), product of:
              0.19258267 = queryWeight, product of:
                2.0019717 = boost
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.014448427 = queryNorm
              0.90092653 = fieldWeight in 771, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6579223 = idf(docFreq=151, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
          0.26024827 = weight(abstract_txt:harvesting in 771) [ClassicSimilarity], result of:
            0.26024827 = score(doc=771,freq=3.0), product of:
              0.2523508 = queryWeight, product of:
                2.291668 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.014448427 = queryNorm
              1.0312955 = fieldWeight in 771, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
          0.05587967 = weight(abstract_txt:digital in 771) [ClassicSimilarity], result of:
            0.05587967 = score(doc=771,freq=1.0), product of:
              0.16442423 = queryWeight, product of:
                2.6160572 = boost
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.014448427 = queryNorm
              0.3398506 = fieldWeight in 771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3500876 = idf(docFreq=1527, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
          0.08082996 = weight(abstract_txt:service in 771) [ClassicSimilarity], result of:
            0.08082996 = score(doc=771,freq=1.0), product of:
              0.2265417 = queryWeight, product of:
                3.433155 = boost
                4.5670333 = idf(docFreq=1229, maxDocs=43556)
                0.014448427 = queryNorm
              0.35679948 = fieldWeight in 771, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5670333 = idf(docFreq=1229, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
          0.19947259 = weight(abstract_txt:metadata in 771) [ClassicSimilarity], result of:
            0.19947259 = score(doc=771,freq=4.0), product of:
              0.2606166 = queryWeight, product of:
                3.68231 = boost
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.014448427 = queryNorm
              0.7653871 = fieldWeight in 771, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8984776 = idf(docFreq=882, maxDocs=43556)
                0.078125 = fieldNorm(doc=771)
        0.24 = coord(6/25)