Document (#42349)

Author
Teal, W.
Title
Alma enumerator : automating repetitive cataloging tasks with Python
Source
Code4Lib journal. Issue 42(2018), [http://journal.code4lib.org]
Year
2018
Abstract
In June 2016, the Warburg College library migrated to a new integrated library system, Alma. In the process, we lost the enumeration and chronology data for roughly 79,000 print serial item records. Re-entering all this data by hand seemed an unthinkable task. Fortunately, the information was recorded as free text in each item's description field. By using Python, Alma's API and much trial and error, the Wartburg College library was able to parse the serial item descriptions into enumeration and chronology data that was uploaded back into Alma. This paper discusses the design and feasibility considerations addressed in trying to solve this problem, the complications encountered during development, and the highlights and shortcomings of the collection of Python scripts that became Alma Enumerator.
Content
Vgl.: https://journal.code4lib.org/articles/13947.
Theme
Formalerschließung
Object
Alma

Similar documents (content)

  1. Hodges, D.W.; Schlottmann, K.: better archival migration outcomes with Python and the Google Sheets API : Reporting from the archives (2019) 0.07
    0.07011579 = sum of:
      0.07011579 = product of:
        0.4382237 = sum of:
          0.047809303 = weight(abstract_txt:scripts in 5444) [ClassicSimilarity], result of:
            0.047809303 = score(doc=5444,freq=1.0), product of:
              0.10017081 = queryWeight, product of:
                1.0272036 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.012770077 = queryNorm
              0.47727776 = fieldWeight in 5444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.0625 = fieldNorm(doc=5444)
          0.0104229655 = weight(abstract_txt:library in 5444) [ClassicSimilarity], result of:
            0.0104229655 = score(doc=5444,freq=1.0), product of:
              0.052332025 = queryWeight, product of:
                1.2859684 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.012770077 = queryNorm
              0.19916992 = fieldWeight in 5444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=5444)
          0.023922365 = weight(abstract_txt:data in 5444) [ClassicSimilarity], result of:
            0.023922365 = score(doc=5444,freq=4.0), product of:
              0.0573618 = queryWeight, product of:
                1.3463498 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012770077 = queryNorm
              0.41704348 = fieldWeight in 5444, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5444)
          0.35606906 = weight(abstract_txt:python in 5444) [ClassicSimilarity], result of:
            0.35606906 = score(doc=5444,freq=2.0), product of:
              0.43730676 = queryWeight, product of:
                3.7174027 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012770077 = queryNorm
              0.81423175 = fieldWeight in 5444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=5444)
        0.16 = coord(4/25)
    
  2. Balster, K.; Rendall, R.; Shrader, T.: Linked serial data : mapping the CONSER standard record to BIBFRAME (2018) 0.06
    0.06266343 = sum of:
      0.06266343 = product of:
        0.5221953 = sum of:
          0.017941775 = weight(abstract_txt:data in 5174) [ClassicSimilarity], result of:
            0.017941775 = score(doc=5174,freq=1.0), product of:
              0.0573618 = queryWeight, product of:
                1.3463498 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012770077 = queryNorm
              0.31278262 = fieldWeight in 5174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=5174)
          0.24527258 = weight(abstract_txt:chronology in 5174) [ClassicSimilarity], result of:
            0.24527258 = score(doc=5174,freq=1.0), product of:
              0.2864935 = queryWeight, product of:
                2.456735 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012770077 = queryNorm
              0.85611916 = fieldWeight in 5174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.09375 = fieldNorm(doc=5174)
          0.25898093 = weight(abstract_txt:enumeration in 5174) [ClassicSimilarity], result of:
            0.25898093 = score(doc=5174,freq=1.0), product of:
              0.29707125 = queryWeight, product of:
                2.5016768 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.012770077 = queryNorm
              0.8717805 = fieldWeight in 5174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.09375 = fieldNorm(doc=5174)
        0.12 = coord(3/25)
    
  3. Erlinger, C.: Spatial planning and its need for national and regional bibliographies of grey literature (2019) 0.06
    0.06191268 = sum of:
      0.06191268 = product of:
        0.515939 = sum of:
          0.12263629 = weight(abstract_txt:parse in 5274) [ClassicSimilarity], result of:
            0.12263629 = score(doc=5274,freq=1.0), product of:
              0.14324676 = queryWeight, product of:
                1.2283674 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.012770077 = queryNorm
              0.85611916 = fieldWeight in 5274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.09375 = fieldNorm(doc=5274)
          0.015634447 = weight(abstract_txt:library in 5274) [ClassicSimilarity], result of:
            0.015634447 = score(doc=5274,freq=1.0), product of:
              0.052332025 = queryWeight, product of:
                1.2859684 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.012770077 = queryNorm
              0.29875487 = fieldWeight in 5274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.09375 = fieldNorm(doc=5274)
          0.37766826 = weight(abstract_txt:python in 5274) [ClassicSimilarity], result of:
            0.37766826 = score(doc=5274,freq=1.0), product of:
              0.43730676 = queryWeight, product of:
                3.7174027 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.012770077 = queryNorm
              0.8636232 = fieldWeight in 5274, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.09375 = fieldNorm(doc=5274)
        0.12 = coord(3/25)
    
  4. Saltzman, A.B.: Art slide sets : online access (1998) 0.06
    0.057927698 = sum of:
      0.057927698 = product of:
        0.2896385 = sum of:
          0.08348648 = weight(abstract_txt:entering in 5188) [ClassicSimilarity], result of:
            0.08348648 = score(doc=5188,freq=1.0), product of:
              0.11085353 = queryWeight, product of:
                1.0805893 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.012770077 = queryNorm
              0.75312424 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.09375 = fieldNorm(doc=5188)
          0.022110447 = weight(abstract_txt:library in 5188) [ClassicSimilarity], result of:
            0.022110447 = score(doc=5188,freq=2.0), product of:
              0.052332025 = queryWeight, product of:
                1.2859684 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.012770077 = queryNorm
              0.42250317 = fieldWeight in 5188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.09375 = fieldNorm(doc=5188)
          0.017941775 = weight(abstract_txt:data in 5188) [ClassicSimilarity], result of:
            0.017941775 = score(doc=5188,freq=1.0), product of:
              0.0573618 = queryWeight, product of:
                1.3463498 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012770077 = queryNorm
              0.31278262 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=5188)
          0.08172573 = weight(abstract_txt:college in 5188) [ClassicSimilarity], result of:
            0.08172573 = score(doc=5188,freq=1.0), product of:
              0.137696 = queryWeight, product of:
                1.7031839 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.012770077 = queryNorm
              0.5935229 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.09375 = fieldNorm(doc=5188)
          0.08437405 = weight(abstract_txt:item in 5188) [ClassicSimilarity], result of:
            0.08437405 = score(doc=5188,freq=1.0), product of:
              0.14065485 = queryWeight, product of:
                1.721386 = boost
                6.39857 = idf(docFreq=199, maxDocs=44218)
                0.012770077 = queryNorm
              0.5998659 = fieldWeight in 5188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.39857 = idf(docFreq=199, maxDocs=44218)
                0.09375 = fieldNorm(doc=5188)
        0.2 = coord(5/25)
    
  5. Serial item contribution identifier : new SISAC code (1993) 0.05
    0.046313792 = sum of:
      0.046313792 = product of:
        0.38594827 = sum of:
          0.023922365 = weight(abstract_txt:data in 4563) [ClassicSimilarity], result of:
            0.023922365 = score(doc=4563,freq=1.0), product of:
              0.0573618 = queryWeight, product of:
                1.3463498 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012770077 = queryNorm
              0.41704348 = fieldWeight in 4563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.125 = fieldNorm(doc=4563)
          0.11249874 = weight(abstract_txt:item in 4563) [ClassicSimilarity], result of:
            0.11249874 = score(doc=4563,freq=1.0), product of:
              0.14065485 = queryWeight, product of:
                1.721386 = boost
                6.39857 = idf(docFreq=199, maxDocs=44218)
                0.012770077 = queryNorm
              0.79982126 = fieldWeight in 4563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.39857 = idf(docFreq=199, maxDocs=44218)
                0.125 = fieldNorm(doc=4563)
          0.24952717 = weight(abstract_txt:serial in 4563) [ClassicSimilarity], result of:
            0.24952717 = score(doc=4563,freq=2.0), product of:
              0.1898708 = queryWeight, product of:
                2.0 = boost
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.012770077 = queryNorm
              1.3141946 = fieldWeight in 4563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.125 = fieldNorm(doc=4563)
        0.12 = coord(3/25)