Document (#29778)

Author
Hagedorn, K.
Title
OAIster: a "no dead ends" OAI service provider
Source
Library hi tech. 21(2003) no.2, pp.170-181
Year
2003
Abstract
OAIster, at the University of Michigan, University Libraries, Digital Library Production Service (DLPS), is an Andrew W. Mellon Foundation grant-funded project designed to test the feasibility of using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to harvest digital object metadata from multiple and varied digital object repositories and develop a service to allow end-users to access that metadata. This article describes in depth the development of our system to harvest, store, transform the metadata into Digital Library eXtension Service (DLXS) Bibliographic Class format, build indexes and make the metadata searchable through an interface using the XPAT search engine. Results of the testing of our service and statistics on usage are reported, as well as the issues that we have encountered during our harvesting and transformation operations. The article closes by discussing the future improvements and potential of OAIster and the OAI-PMH protocol.
Content
See also: http://www.emeraldinsight.com/10.1108/07378830310479811.
Theme
Metadata
Object
OAI-PMH

Similar documents (content)

  1. Shreeves, S.L.; Kaczmarek, J.S.; Cole, T.W.: Harvesting cultural heritage metadata using OAI Protocol (2003) 0.42
    0.41562793 = sum of:
      0.41562793 = product of:
        1.0390698 = sum of:
          0.014285618 = weight(abstract_txt:using in 776) [ClassicSimilarity], result of:
            0.014285618 = score(doc=776,freq=1.0), product of:
              0.05255246 = queryWeight, product of:
                1.0450585 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.014452284 = queryNorm
              0.2718354 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.019166624 = weight(abstract_txt:article in 776) [ClassicSimilarity], result of:
            0.019166624 = score(doc=776,freq=1.0), product of:
              0.06392795 = queryWeight, product of:
                1.1526288 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.014452284 = queryNorm
              0.29981604 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.02607939 = weight(abstract_txt:university in 776) [ClassicSimilarity], result of:
            0.02607939 = score(doc=776,freq=1.0), product of:
              0.078498006 = queryWeight, product of:
                1.2772425 = boost
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.014452284 = queryNorm
              0.33222997 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.108541615 = weight(abstract_txt:mellon in 776) [ClassicSimilarity], result of:
            0.108541615 = score(doc=776,freq=1.0), product of:
              0.16120599 = queryWeight, product of:
                1.2942544 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.014452284 = queryNorm
              0.67331004 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.1103955 = weight(abstract_txt:andrew in 776) [ClassicSimilarity], result of:
            0.1103955 = score(doc=776,freq=1.0), product of:
              0.1630364 = queryWeight, product of:
                1.3015815 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.014452284 = queryNorm
              0.67712176 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.09953042 = weight(abstract_txt:protocol in 776) [ClassicSimilarity], result of:
            0.09953042 = score(doc=776,freq=1.0), product of:
              0.19170389 = queryWeight, product of:
                1.9959954 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.014452284 = queryNorm
              0.51918834 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.21672828 = weight(abstract_txt:harvesting in 776) [ClassicSimilarity], result of:
            0.21672828 = score(doc=776,freq=2.0), product of:
              0.25561956 = queryWeight, product of:
                2.3048418 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014452284 = queryNorm
              0.84785485 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.080528244 = weight(abstract_txt:digital in 776) [ClassicSimilarity], result of:
            0.080528244 = score(doc=776,freq=2.0), product of:
              0.16645335 = queryWeight, product of:
                2.6303003 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.014452284 = queryNorm
              0.4837887 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.0806698 = weight(abstract_txt:service in 776) [ClassicSimilarity], result of:
            0.0806698 = score(doc=776,freq=1.0), product of:
              0.2261766 = queryWeight, product of:
                3.427977 = boost
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.014452284 = queryNorm
              0.3566673 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
          0.2831442 = weight(abstract_txt:metadata in 776) [ClassicSimilarity], result of:
            0.2831442 = score(doc=776,freq=8.0), product of:
              0.26118535 = queryWeight, product of:
                3.6837358 = boost
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.014452284 = queryNorm
              1.0840739 = fieldWeight in 776, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.078125 = fieldNorm(doc=776)
        0.4 = coord(10/25)
    
  2. Halbert, M.: The Metascholar Initiative : AmericanSouth.Org and MetaArchive.Org (2003) 0.23
    0.22723506 = sum of:
      0.22723506 = product of:
        0.8115538 = sum of:
          0.05006535 = weight(abstract_txt:feasibility in 778) [ClassicSimilarity], result of:
            0.05006535 = score(doc=778,freq=1.0), product of:
              0.09623695 = queryWeight, product of:
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.014452284 = queryNorm
              0.52023 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.658944 = idf(docFreq=148, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.02607939 = weight(abstract_txt:university in 778) [ClassicSimilarity], result of:
            0.02607939 = score(doc=778,freq=1.0), product of:
              0.078498006 = queryWeight, product of:
                1.2772425 = boost
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.014452284 = queryNorm
              0.33222997 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.108541615 = weight(abstract_txt:mellon in 778) [ClassicSimilarity], result of:
            0.108541615 = score(doc=778,freq=1.0), product of:
              0.16120599 = queryWeight, product of:
                1.2942544 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.014452284 = queryNorm
              0.67331004 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.1103955 = weight(abstract_txt:andrew in 778) [ClassicSimilarity], result of:
            0.1103955 = score(doc=778,freq=1.0), product of:
              0.1630364 = queryWeight, product of:
                1.3015815 = boost
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.014452284 = queryNorm
              0.67712176 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.667158 = idf(docFreq=19, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.09953042 = weight(abstract_txt:protocol in 778) [ClassicSimilarity], result of:
            0.09953042 = score(doc=778,freq=1.0), product of:
              0.19170389 = queryWeight, product of:
                1.9959954 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.014452284 = queryNorm
              0.51918834 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.21672828 = weight(abstract_txt:harvesting in 778) [ClassicSimilarity], result of:
            0.21672828 = score(doc=778,freq=2.0), product of:
              0.25561956 = queryWeight, product of:
                2.3048418 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014452284 = queryNorm
              0.84785485 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
          0.2002132 = weight(abstract_txt:metadata in 778) [ClassicSimilarity], result of:
            0.2002132 = score(doc=778,freq=4.0), product of:
              0.26118535 = queryWeight, product of:
                3.6837358 = boost
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.014452284 = queryNorm
              0.76655596 = fieldWeight in 778, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.078125 = fieldNorm(doc=778)
        0.28 = coord(7/25)
    
  3. Van de Sompel, H.; Nelson, M.L.; Lagoze, C.; Warner, S.: Resource harvesting within the OAI-PMH framework (2004) 0.21
    0.20930547 = sum of:
      0.20930547 = product of:
        0.87210613 = sum of:
          0.029692102 = weight(abstract_txt:using in 111) [ClassicSimilarity], result of:
            0.029692102 = score(doc=111,freq=3.0), product of:
              0.05255246 = queryWeight, product of:
                1.0450585 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.014452284 = queryNorm
              0.5649993 = fieldWeight in 111, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
          0.104295895 = weight(abstract_txt:object in 111) [ClassicSimilarity], result of:
            0.104295895 = score(doc=111,freq=2.0), product of:
              0.13900839 = queryWeight, product of:
                1.6996698 = boost
                5.6590033 = idf(docFreq=404, maxDocs=42740)
                0.014452284 = queryNorm
              0.7502849 = fieldWeight in 111, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6590033 = idf(docFreq=404, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
          0.11943651 = weight(abstract_txt:protocol in 111) [ClassicSimilarity], result of:
            0.11943651 = score(doc=111,freq=1.0), product of:
              0.19170389 = queryWeight, product of:
                1.9959954 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.014452284 = queryNorm
              0.623026 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
          0.26007393 = weight(abstract_txt:harvesting in 111) [ClassicSimilarity], result of:
            0.26007393 = score(doc=111,freq=2.0), product of:
              0.25561956 = queryWeight, product of:
                2.3048418 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014452284 = queryNorm
              1.0174258 = fieldWeight in 111, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
          0.11835188 = weight(abstract_txt:digital in 111) [ClassicSimilarity], result of:
            0.11835188 = score(doc=111,freq=3.0), product of:
              0.16645335 = queryWeight, product of:
                2.6303003 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.014452284 = queryNorm
              0.7110213 = fieldWeight in 111, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
          0.24025582 = weight(abstract_txt:metadata in 111) [ClassicSimilarity], result of:
            0.24025582 = score(doc=111,freq=4.0), product of:
              0.26118535 = queryWeight, product of:
                3.6837358 = boost
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.014452284 = queryNorm
              0.91986716 = fieldWeight in 111, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.09375 = fieldNorm(doc=111)
        0.24 = coord(6/25)
    
  4. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.19
    0.19265728 = sum of:
      0.19265728 = product of:
        0.80273867 = sum of:
          0.05198321 = weight(abstract_txt:varied in 3169) [ClassicSimilarity], result of:
            0.05198321 = score(doc=3169,freq=1.0), product of:
              0.09867923 = queryWeight, product of:
                1.0126094 = boost
                6.7429094 = idf(docFreq=136, maxDocs=42740)
                0.014452284 = queryNorm
              0.5267898 = fieldWeight in 3169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7429094 = idf(docFreq=136, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
          0.02474342 = weight(abstract_txt:using in 3169) [ClassicSimilarity], result of:
            0.02474342 = score(doc=3169,freq=3.0), product of:
              0.05255246 = queryWeight, product of:
                1.0450585 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.014452284 = queryNorm
              0.47083274 = fieldWeight in 3169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
          0.08085962 = weight(abstract_txt:michigan in 3169) [ClassicSimilarity], result of:
            0.08085962 = score(doc=3169,freq=1.0), product of:
              0.13247629 = queryWeight, product of:
                1.1732705 = boost
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.014452284 = queryNorm
              0.6103705 = fieldWeight in 3169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
          0.036881827 = weight(abstract_txt:university in 3169) [ClassicSimilarity], result of:
            0.036881827 = score(doc=3169,freq=2.0), product of:
              0.078498006 = queryWeight, product of:
                1.2772425 = boost
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.014452284 = queryNorm
              0.4698441 = fieldWeight in 3169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2525434 = idf(docFreq=1652, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
          0.1415721 = weight(abstract_txt:metadata in 3169) [ClassicSimilarity], result of:
            0.1415721 = score(doc=3169,freq=2.0), product of:
              0.26118535 = queryWeight, product of:
                3.6837358 = boost
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.014452284 = queryNorm
              0.54203695 = fieldWeight in 3169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
          0.46669844 = weight(abstract_txt:oaister in 3169) [ClassicSimilarity], result of:
            0.46669844 = score(doc=3169,freq=1.0), product of:
              0.61477333 = queryWeight, product of:
                4.3777122 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.014452284 = queryNorm
              0.75913906 = fieldWeight in 3169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.078125 = fieldNorm(doc=3169)
        0.24 = coord(6/25)
    
  5. Arms, C.R.: Available and useful : OAI at the Library of Congress (2003) 0.19
    0.19075687 = sum of:
      0.19075687 = product of:
        0.7948203 = sum of:
          0.019166624 = weight(abstract_txt:article in 774) [ClassicSimilarity], result of:
            0.019166624 = score(doc=774,freq=1.0), product of:
              0.06392795 = queryWeight, product of:
                1.1526288 = boost
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.014452284 = queryNorm
              0.29981604 = fieldWeight in 774, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8376453 = idf(docFreq=2502, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
          0.17239174 = weight(abstract_txt:protocol in 774) [ClassicSimilarity], result of:
            0.17239174 = score(doc=774,freq=3.0), product of:
              0.19170389 = queryWeight, product of:
                1.9959954 = boost
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.014452284 = queryNorm
              0.8992606 = fieldWeight in 774, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.645611 = idf(docFreq=150, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
          0.26543686 = weight(abstract_txt:harvesting in 774) [ClassicSimilarity], result of:
            0.26543686 = score(doc=774,freq=3.0), product of:
              0.25561956 = queryWeight, product of:
                2.3048418 = boost
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.014452284 = queryNorm
              1.0384059 = fieldWeight in 774, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.6739063 = idf(docFreq=53, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
          0.05694207 = weight(abstract_txt:digital in 774) [ClassicSimilarity], result of:
            0.05694207 = score(doc=774,freq=1.0), product of:
              0.16645335 = queryWeight, product of:
                2.6303003 = boost
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.014452284 = queryNorm
              0.34209028 = fieldWeight in 774, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3787556 = idf(docFreq=1456, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
          0.0806698 = weight(abstract_txt:service in 774) [ClassicSimilarity], result of:
            0.0806698 = score(doc=774,freq=1.0), product of:
              0.2261766 = queryWeight, product of:
                3.427977 = boost
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.014452284 = queryNorm
              0.3566673 = fieldWeight in 774, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
          0.2002132 = weight(abstract_txt:metadata in 774) [ClassicSimilarity], result of:
            0.2002132 = score(doc=774,freq=4.0), product of:
              0.26118535 = queryWeight, product of:
                3.6837358 = boost
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.014452284 = queryNorm
              0.76655596 = fieldWeight in 774, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.905958 = idf(docFreq=859, maxDocs=42740)
                0.078125 = fieldNorm(doc=774)
        0.24 = coord(6/25)
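
The score explanations above all follow the same arithmetic: each term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the document score is the sum of the term scores times coord (matched terms / query terms). The following is a minimal sketch of that arithmetic, not the search engine's own code. The function names are hypothetical, and the tf and idf formulas (square root of the frequency, and 1 + ln(maxDocs / (docFreq + 1))) are inferred from the numbers printed above.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency, inferred from the values shown above,
    # e.g. idf(docFreq=3580, maxDocs=42740) = 3.4794931.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # tf is the square root of the term frequency (freq=2.0 gives 1.4142135).
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    # coord rewards documents that match more of the query terms.
    return sum(term_scores) * (matched_terms / query_terms)


# Term "using" in doc 776, with the inputs copied from the first explanation.
using_776 = term_score(freq=1.0, doc_freq=3580, max_docs=42740,
                       boost=1.0450585, query_norm=0.014452284,
                       field_norm=0.078125)

# The ten term scores listed for doc 776, combined with coord(10/25).
scores_776 = [0.014285618, 0.019166624, 0.02607939, 0.108541615, 0.1103955,
              0.09953042, 0.21672828, 0.080528244, 0.0806698, 0.2831442]
total_776 = document_score(scores_776, matched_terms=10, query_terms=25)

print(f"{using_776:.6f} {total_776:.6f}")
```

The engine computes in single precision, so a recomputation in double precision may differ from the listed values in the last digits.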