Document (#39467)

Author
Boiger, W.
Title
Entwicklung und Implementierung eines MARC21-MARCXML-Konverters in der Programmiersprache Perl
Source
Perspektive Bibliothek. 4(2015) H.2, S.33-59
Year
2015
Abstract
Aktuell befinden sich im Datenbestand des gemeinsamen Katalogs des Bibliotheksverbundes Bayern und des Kooperativen Bibliotheksverbundes Berlin-Brandenburg (B3Kat) etwa 25,6 Millionen Titeldatensätze. Die Bayerische Verbundzentrale veröffentlicht diese Daten seit 2011 im Zuge der bayerischen Open-Data-Initiative auf ihrer Webpräsenz. Zu den Nachnutzern dieser Daten gehören die Deutsche Digitale Bibliothek und das Projekt Culturegraph der DNB. Die Daten werden im weitverbreiteten Katalogdatenformat MARCXML publiziert. Zur Erzeugung der XML-Dateien verwendete die Verbundzentrale bis 2014 die Windows-Software MarcEdit. Anfang 2015 entwickelte der Verfasser im Rahmen der bayerischen Referendarsausbildung einen einfachen MARC-21-MARCXML-Konverter in Perl, der die Konvertierung wesentlich erleichert und den Einsatz von MarcEdit in der Verbundzentrale überflüssig macht. In der vorliegenden Arbeit, die zusammen mit dem Konverter verfasst wurde, wird zunächst die Notwendigkeit einer Perl-Implementierung motiviert. Im Anschluss werden die bibliographischen Datenformate MARC 21 und MARCXML beleuchtet und für die Konvertierung wesentliche Eigenschaften erläutert. Zum Schluss wird der Aufbau des Konverters im Detail beschrieben. Die Perl-Implementierung selbst ist Teil der Arbeit. Verwendung, Verbreitung und Veränderung der Software sind unter den Bedingungen der GNU Affero General Public License gestattet, entweder gemäß Version 3 der Lizenz oder (nach Ihrer Option) jeder späteren Version.[Sie finden die Datei mit der Perl-Implementierung in der rechten Spalte in der Kategorie Artikelwerkzeuge unter dem Punkt Zusatzdateien.]
Content
Vgl.: http://journals.ub.uni-heidelberg.de/index.php/bibliothek/article/view/26271.
Theme
Datenformate
Object
MARCXML
Location
D

Similar documents (content)

  1. McCallum, S.H.: MARCXML sampler (2005) 0.25
    0.2513976 = sum of:
      0.2513976 = product of:
        3.1424701 = sum of:
          0.05870911 = weight(abstract_txt:marc in 4361) [ClassicSimilarity], result of:
            0.05870911 = score(doc=4361,freq=2.0), product of:
              0.08050476 = queryWeight, product of:
                1.2975786 = boost
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.011279509 = queryNorm
              0.7292626 = fieldWeight in 4361, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.09375 = fieldNorm(doc=4361)
          3.083761 = weight(title_txt:marcxml in 4361) [ClassicSimilarity], result of:
            3.083761 = score(doc=4361,freq=1.0), product of:
              0.50600237 = queryWeight, product of:
                4.600597 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011279509 = queryNorm
              6.094361 = fieldWeight in 4361, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.625 = fieldNorm(doc=4361)
        0.08 = coord(2/25)
    
  2. Anderson, R.; Birbeck, M.; Kay, M.; Livingstone, S.; Loesgen, B.; Martin, D.; Mohr, S.; Ozu, N.; Peat, B.; Pinnock, J.; Stark, P.; Williams, K.: XML professionell : behandelt W3C DOM, SAX, CSS, XSLT, DTDs, XML Schemas, XLink, XPointer, XPath, E-Commerce, BizTalk, B2B, SOAP, WAP, WML (2000) 0.08
    0.07809369 = sum of:
      0.07809369 = product of:
        0.27890605 = sum of:
          0.0068731406 = weight(abstract_txt:software in 729) [ClassicSimilarity], result of:
            0.0068731406 = score(doc=729,freq=1.0), product of:
              0.050490793 = queryWeight, product of:
                1.0276116 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.011279509 = queryNorm
              0.13612661 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.016323451 = weight(abstract_txt:ihrer in 729) [ClassicSimilarity], result of:
            0.016323451 = score(doc=729,freq=2.0), product of:
              0.071335666 = queryWeight, product of:
                1.2214516 = boost
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.011279509 = queryNorm
              0.22882594 = fieldWeight in 729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.012298123 = weight(abstract_txt:version in 729) [ClassicSimilarity], result of:
            0.012298123 = score(doc=729,freq=1.0), product of:
              0.07441628 = queryWeight, product of:
                1.2475469 = boost
                5.288358 = idf(docFreq=606, maxDocs=44218)
                0.011279509 = queryNorm
              0.1652612 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.288358 = idf(docFreq=606, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.012321165 = weight(abstract_txt:arbeit in 729) [ClassicSimilarity], result of:
            0.012321165 = score(doc=729,freq=1.0), product of:
              0.0745092 = queryWeight, product of:
                1.2483256 = boost
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.011279509 = queryNorm
              0.16536434 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.018110018 = weight(abstract_txt:daten in 729) [ClassicSimilarity], result of:
            0.018110018 = score(doc=729,freq=1.0), product of:
              0.11026011 = queryWeight, product of:
                1.8598492 = boost
                5.255941 = idf(docFreq=626, maxDocs=44218)
                0.011279509 = queryNorm
              0.16424815 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.255941 = idf(docFreq=626, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.061997592 = weight(abstract_txt:implementierung in 729) [ClassicSimilarity], result of:
            0.061997592 = score(doc=729,freq=1.0), product of:
              0.27565673 = queryWeight, product of:
                3.3956444 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.011279509 = queryNorm
              0.22490868 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
          0.15098256 = weight(abstract_txt:perl in 729) [ClassicSimilarity], result of:
            0.15098256 = score(doc=729,freq=1.0), product of:
              0.5374935 = queryWeight, product of:
                5.3012652 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.011279509 = queryNorm
              0.28090116 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.03125 = fieldNorm(doc=729)
        0.28 = coord(7/25)
    
  3. Weinheimer, J.: ¬A visual explanation of the areas defined by AACR2, RDA, ISBD, LC NAF, LC Classification, LC Subject Headings, Dewey Classification, MARC21 : plus a quick look at ISO2709, MARCXML and a version of BIBFRAME (2015) 0.07
    0.0664688 = sum of:
      0.0664688 = product of:
        0.83085996 = sum of:
          0.059919734 = weight(abstract_txt:marc in 2882) [ClassicSimilarity], result of:
            0.059919734 = score(doc=2882,freq=3.0), product of:
              0.08050476 = queryWeight, product of:
                1.2975786 = boost
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.011279509 = queryNorm
              0.7443005 = fieldWeight in 2882, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.500443 = idf(docFreq=490, maxDocs=44218)
                0.078125 = fieldNorm(doc=2882)
          0.77094024 = weight(title_txt:marcxml in 2882) [ClassicSimilarity], result of:
            0.77094024 = score(doc=2882,freq=1.0), product of:
              0.50600237 = queryWeight, product of:
                4.600597 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011279509 = queryNorm
              1.5235902 = fieldWeight in 2882, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.15625 = fieldNorm(doc=2882)
        0.08 = coord(2/25)
    
  4. Standage, T.: Perl : the glue of the Internet (1995) 0.06
    0.06314228 = sum of:
      0.06314228 = product of:
        0.7892785 = sum of:
          0.034365702 = weight(abstract_txt:software in 3389) [ClassicSimilarity], result of:
            0.034365702 = score(doc=3389,freq=1.0), product of:
              0.050490793 = queryWeight, product of:
                1.0276116 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.011279509 = queryNorm
              0.68063307 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.15625 = fieldNorm(doc=3389)
          0.7549128 = weight(abstract_txt:perl in 3389) [ClassicSimilarity], result of:
            0.7549128 = score(doc=3389,freq=1.0), product of:
              0.5374935 = queryWeight, product of:
                5.3012652 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.011279509 = queryNorm
              1.4045058 = fieldWeight in 3389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.15625 = fieldNorm(doc=3389)
        0.08 = coord(2/25)
    
  5. Junger, U.; Hapke, T.: Erschließung 2013: Visionen und mögliche Entwicklungen : Bericht über einen Workshop der Facharbeitsgruppe Erschließung und Informationsvermittlung auf der 12. Verbundkonferenz des Gemeinsamen Bibliotheksverbundes am 11. September 2008 in der Staatsbibliothek zu Berlin - Preußischer Kulturbesitz (2008) 0.06
    0.06284525 = sum of:
      0.06284525 = product of:
        0.31422624 = sum of:
          0.019713316 = weight(abstract_txt:unter in 2484) [ClassicSimilarity], result of:
            0.019713316 = score(doc=2484,freq=3.0), product of:
              0.06090245 = queryWeight, product of:
                1.1286006 = boost
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.011279509 = queryNorm
              0.32368675 = fieldWeight in 2484, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7841444 = idf(docFreq=1004, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2484)
          0.02178095 = weight(abstract_txt:arbeit in 2484) [ClassicSimilarity], result of:
            0.02178095 = score(doc=2484,freq=2.0), product of:
              0.0745092 = queryWeight, product of:
                1.2483256 = boost
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.011279509 = queryNorm
              0.29232562 = fieldWeight in 2484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.291659 = idf(docFreq=604, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2484)
          0.039209343 = weight(abstract_txt:daten in 2484) [ClassicSimilarity], result of:
            0.039209343 = score(doc=2484,freq=3.0), product of:
              0.11026011 = queryWeight, product of:
                1.8598492 = boost
                5.255941 = idf(docFreq=626, maxDocs=44218)
                0.011279509 = queryNorm
              0.3556077 = fieldWeight in 2484, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.255941 = idf(docFreq=626, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2484)
          0.104477674 = weight(abstract_txt:bibliotheksverbundes in 2484) [ClassicSimilarity], result of:
            0.104477674 = score(doc=2484,freq=2.0), product of:
              0.21192119 = queryWeight, product of:
                2.105281 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.011279509 = queryNorm
              0.4930025 = fieldWeight in 2484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2484)
          0.12904495 = weight(abstract_txt:verbundzentrale in 2484) [ClassicSimilarity], result of:
            0.12904495 = score(doc=2484,freq=2.0), product of:
              0.279265 = queryWeight, product of:
                2.9598982 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.011279509 = queryNorm
              0.4620878 = fieldWeight in 2484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2484)
        0.2 = coord(5/25)