Document (#39468)

Author
Boiger, W.
Title
Entwicklung und Implementierung eines MARC21-MARCXML-Konverters in der Programmiersprache Perl
Source
Perspektive Bibliothek. 4(2015) H.2, S.33-59
Year
2015
Abstract
Aktuell befinden sich im Datenbestand des gemeinsamen Katalogs des Bibliotheksverbundes Bayern und des Kooperativen Bibliotheksverbundes Berlin-Brandenburg (B3Kat) etwa 25,6 Millionen Titeldatensätze. Die Bayerische Verbundzentrale veröffentlicht diese Daten seit 2011 im Zuge der bayerischen Open-Data-Initiative auf ihrer Webpräsenz. Zu den Nachnutzern dieser Daten gehören die Deutsche Digitale Bibliothek und das Projekt Culturegraph der DNB. Die Daten werden im weitverbreiteten Katalogdatenformat MARCXML publiziert. Zur Erzeugung der XML-Dateien verwendete die Verbundzentrale bis 2014 die Windows-Software MarcEdit. Anfang 2015 entwickelte der Verfasser im Rahmen der bayerischen Referendarsausbildung einen einfachen MARC-21-MARCXML-Konverter in Perl, der die Konvertierung wesentlich erleichert und den Einsatz von MarcEdit in der Verbundzentrale überflüssig macht. In der vorliegenden Arbeit, die zusammen mit dem Konverter verfasst wurde, wird zunächst die Notwendigkeit einer Perl-Implementierung motiviert. Im Anschluss werden die bibliographischen Datenformate MARC 21 und MARCXML beleuchtet und für die Konvertierung wesentliche Eigenschaften erläutert. Zum Schluss wird der Aufbau des Konverters im Detail beschrieben. Die Perl-Implementierung selbst ist Teil der Arbeit. Verwendung, Verbreitung und Veränderung der Software sind unter den Bedingungen der GNU Affero General Public License gestattet, entweder gemäß Version 3 der Lizenz oder (nach Ihrer Option) jeder späteren Version.[Sie finden die Datei mit der Perl-Implementierung in der rechten Spalte in der Kategorie Artikelwerkzeuge unter dem Punkt Zusatzdateien.]
Content
Vgl.: http://journals.ub.uni-heidelberg.de/index.php/bibliothek/article/view/26271.
Theme
Datenformate
Object
MARCXML
Location
D

Similar documents (content)

  1. McCallum, S.H.: MARCXML sampler (2005) 0.25
    0.24975206 = sum of:
      0.24975206 = product of:
        3.1219008 = sum of:
          0.058701985 = weight(abstract_txt:marc in 362) [ClassicSimilarity], result of:
            0.058701985 = score(doc=362,freq=2.0), product of:
              0.08059995 = queryWeight, product of:
                1.2948464 = boost
                5.49328 = idf(docFreq=477, maxDocs=42740)
                0.011331432 = queryNorm
              0.7283129 = fieldWeight in 362, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.49328 = idf(docFreq=477, maxDocs=42740)
                0.09375 = fieldNorm(doc=362)
          3.0631988 = weight(title_txt:marcxml in 362) [ClassicSimilarity], result of:
            3.0631988 = score(doc=362,freq=1.0), product of:
              0.50438696 = queryWeight, product of:
                4.5808682 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.011331432 = queryNorm
              6.0731125 = fieldWeight in 362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.625 = fieldNorm(doc=362)
        0.08 = coord(2/25)
    
  2. Anderson, R.; Birbeck, M.; Kay, M.; Livingstone, S.; Loesgen, B.; Martin, D.; Mohr, S.; Ozu, N.; Peat, B.; Pinnock, J.; Stark, P.; Williams, K.: XML professionell : behandelt W3C DOM, SAX, CSS, XSLT, DTDs, XML Schemas, XLink, XPointer, XPath, E-Commerce, BizTalk, B2B, SOAP, WAP, WML (2000) 0.08
    0.077927 = sum of:
      0.077927 = product of:
        0.27831072 = sum of:
          0.006862638 = weight(abstract_txt:software in 1730) [ClassicSimilarity], result of:
            0.006862638 = score(doc=1730,freq=1.0), product of:
              0.050503075 = queryWeight, product of:
                1.0249667 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.011331432 = queryNorm
              0.13588555 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.01673207 = weight(abstract_txt:ihrer in 1730) [ClassicSimilarity], result of:
            0.01673207 = score(doc=1730,freq=2.0), product of:
              0.07261286 = queryWeight, product of:
                1.2290163 = boost
                5.214001 = idf(docFreq=631, maxDocs=42740)
                0.011331432 = queryNorm
              0.23042847 = fieldWeight in 1730, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.214001 = idf(docFreq=631, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.012377258 = weight(abstract_txt:version in 1730) [ClassicSimilarity], result of:
            0.012377258 = score(doc=1730,freq=1.0), product of:
              0.07482959 = queryWeight, product of:
                1.247635 = boost
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.011331432 = queryNorm
              0.16540593 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.012377258 = weight(abstract_txt:arbeit in 1730) [ClassicSimilarity], result of:
            0.012377258 = score(doc=1730,freq=1.0), product of:
              0.07482959 = queryWeight, product of:
                1.247635 = boost
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.011331432 = queryNorm
              0.16540593 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.01847632 = weight(abstract_txt:daten in 1730) [ClassicSimilarity], result of:
            0.01847632 = score(doc=1730,freq=1.0), product of:
              0.111883104 = queryWeight, product of:
                1.8684382 = boost
                5.2844644 = idf(docFreq=588, maxDocs=42740)
                0.011331432 = queryNorm
              0.16513951 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2844644 = idf(docFreq=588, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.061642688 = weight(abstract_txt:implementierung in 1730) [ClassicSimilarity], result of:
            0.061642688 = score(doc=1730,freq=1.0), product of:
              0.27495068 = queryWeight, product of:
                3.382153 = boost
                7.174254 = idf(docFreq=88, maxDocs=42740)
                0.011331432 = queryNorm
              0.22419544 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.174254 = idf(docFreq=88, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
          0.14984247 = weight(abstract_txt:perl in 1730) [ClassicSimilarity], result of:
            0.14984247 = score(doc=1730,freq=1.0), product of:
              0.53546 = queryWeight, product of:
                5.276967 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.011331432 = queryNorm
              0.27983877 = fieldWeight in 1730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.03125 = fieldNorm(doc=1730)
        0.28 = coord(7/25)
    
  3. Weinheimer, J.: ¬A visual explanation of the areas defined by AACR2, RDA, ISBD, LC NAF, LC Classification, LC Subject Headings, Dewey Classification, MARC21 : plus a quick look at ISO2709, MARCXML and a version of BIBFRAME (2015) 0.07
    0.06605697 = sum of:
      0.06605697 = product of:
        0.82571214 = sum of:
          0.05991246 = weight(abstract_txt:marc in 4883) [ClassicSimilarity], result of:
            0.05991246 = score(doc=4883,freq=3.0), product of:
              0.08059995 = queryWeight, product of:
                1.2948464 = boost
                5.49328 = idf(docFreq=477, maxDocs=42740)
                0.011331432 = queryNorm
              0.74333125 = fieldWeight in 4883, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.49328 = idf(docFreq=477, maxDocs=42740)
                0.078125 = fieldNorm(doc=4883)
          0.7657997 = weight(title_txt:marcxml in 4883) [ClassicSimilarity], result of:
            0.7657997 = score(doc=4883,freq=1.0), product of:
              0.50438696 = queryWeight, product of:
                4.5808682 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.011331432 = queryNorm
              1.5182781 = fieldWeight in 4883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.15625 = fieldNorm(doc=4883)
        0.08 = coord(2/25)
    
  4. Junger, U.; Hapke, T.: Erschließung 2013: Visionen und mögliche Entwicklungen : Bericht über einen Workshop der Facharbeitsgruppe Erschließung und Informationsvermittlung auf der 12. Verbundkonferenz des Gemeinsamen Bibliotheksverbundes am 11. September 2008 in der Staatsbibliothek zu Berlin - Preußischer Kulturbesitz (2008) 0.06
    0.06351751 = sum of:
      0.06351751 = product of:
        0.31758755 = sum of:
          0.020105572 = weight(abstract_txt:unter in 4485) [ClassicSimilarity], result of:
            0.020105572 = score(doc=4485,freq=3.0), product of:
              0.06178565 = queryWeight, product of:
                1.133691 = boost
                4.8095913 = idf(docFreq=946, maxDocs=42740)
                0.011331432 = queryNorm
              0.32540846 = fieldWeight in 4485, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8095913 = idf(docFreq=946, maxDocs=42740)
                0.0390625 = fieldNorm(doc=4485)
          0.021880109 = weight(abstract_txt:arbeit in 4485) [ClassicSimilarity], result of:
            0.021880109 = score(doc=4485,freq=2.0), product of:
              0.07482959 = queryWeight, product of:
                1.247635 = boost
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.011331432 = queryNorm
              0.29239914 = fieldWeight in 4485, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2929897 = idf(docFreq=583, maxDocs=42740)
                0.0390625 = fieldNorm(doc=4485)
          0.04000241 = weight(abstract_txt:daten in 4485) [ClassicSimilarity], result of:
            0.04000241 = score(doc=4485,freq=3.0), product of:
              0.111883104 = queryWeight, product of:
                1.8684382 = boost
                5.2844644 = idf(docFreq=588, maxDocs=42740)
                0.011331432 = queryNorm
              0.35753754 = fieldWeight in 4485, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2844644 = idf(docFreq=588, maxDocs=42740)
                0.0390625 = fieldNorm(doc=4485)
          0.10595462 = weight(abstract_txt:bibliotheksverbundes in 4485) [ClassicSimilarity], result of:
            0.10595462 = score(doc=4485,freq=2.0), product of:
              0.214184 = queryWeight, product of:
                2.110787 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.011331432 = queryNorm
              0.4946897 = fieldWeight in 4485, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.0390625 = fieldNorm(doc=4485)
          0.12964486 = weight(abstract_txt:verbundzentrale in 4485) [ClassicSimilarity], result of:
            0.12964486 = score(doc=4485,freq=2.0), product of:
              0.28048375 = queryWeight, product of:
                2.9583554 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.011331432 = queryNorm
              0.4622188 = fieldWeight in 4485, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.0390625 = fieldNorm(doc=4485)
        0.2 = coord(5/25)
    
  5. Standage, T.: Perl : the glue of the Internet (1995) 0.06
    0.06268204 = sum of:
      0.06268204 = product of:
        0.7835255 = sum of:
          0.03431319 = weight(abstract_txt:software in 3458) [ClassicSimilarity], result of:
            0.03431319 = score(doc=3458,freq=1.0), product of:
              0.050503075 = queryWeight, product of:
                1.0249667 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.011331432 = queryNorm
              0.67942774 = fieldWeight in 3458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.15625 = fieldNorm(doc=3458)
          0.7492123 = weight(abstract_txt:perl in 3458) [ClassicSimilarity], result of:
            0.7492123 = score(doc=3458,freq=1.0), product of:
              0.53546 = queryWeight, product of:
                5.276967 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.011331432 = queryNorm
              1.3991939 = fieldWeight in 3458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.15625 = fieldNorm(doc=3458)
        0.08 = coord(2/25)