Document (#42144)

Author
Carlson, S.
Seely, A.
Title
Using OpenRefine's reconciliation to validate local authority headings
Source
Cataloging and classification quarterly. 55(2017) no.1, S.1-11
Year
2017
Abstract
In 2015, the Cataloging and Metadata Services department of Rice University's Fondren Library developed a process to reconcile four years of authority headings against an internally developed thesaurus. With a goal of immediate cleanup as well as an ongoing maintenance procedure, staff developed a "hack" of OpenRefine's normal Reconciliation function that ultimately yielded 99.6% authority reconciliation and a stable process for monthly data verification.
Content
Vgl.: https://doi.org/10.1080/01639374.2016.1245693.
Theme
Metadaten

Similar documents (author)

  1. Carlson, P.A.: ¬The rhetoric of hypertext (1990) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:carlson in 4914) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 4914, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=4914)
    
  2. Carlson, C.: Perspectives of a hypermedia film sequence database (1993) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:carlson in 5321) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 5321, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=5321)
    
  3. Carlson, C.N.; Süllow, K.: AMPHORE, ein standardbasiertes Werkzeug zur Sequenzerschließung (1996) 4.79
    4.7917247 = sum of:
      4.7917247 = weight(author_txt:carlson in 5320) [ClassicSimilarity], result of:
        4.7917247 = fieldWeight in 5320, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.5 = fieldNorm(doc=5320)
    
  4. Carlson, J.R.; Kacmar, C.J.: an examination of end-user preferences : Increasing link marker effectiveness for WWW and other hypermedia interfaces (1999) 4.79
    4.7917247 = sum of:
      4.7917247 = weight(author_txt:carlson in 5302) [ClassicSimilarity], result of:
        4.7917247 = fieldWeight in 5302, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.5 = fieldNorm(doc=5302)
    
  5. Banach, P.; Carlson Jr., M.: Cataloging at the University of Massachusetts Amherst Library (2000) 4.19
    4.192759 = sum of:
      4.192759 = weight(author_txt:carlson in 382) [ClassicSimilarity], result of:
        4.192759 = fieldWeight in 382, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.4375 = fieldNorm(doc=382)
    

Similar documents (content)

  1. Vellucci, S.L.: Commercial services for providing authority control : outsourcing the process (2004) 0.15
    0.15049405 = sum of:
      0.15049405 = product of:
        0.75247025 = sum of:
          0.077237144 = weight(abstract_txt:ongoing in 682) [ClassicSimilarity], result of:
            0.077237144 = score(doc=682,freq=2.0), product of:
              0.108870156 = queryWeight, product of:
                1.147292 = boost
                6.4211435 = idf(docFreq=188, maxDocs=42740)
                0.014778233 = queryNorm
              0.7094428 = fieldWeight in 682, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4211435 = idf(docFreq=188, maxDocs=42740)
                0.078125 = fieldNorm(doc=682)
          0.027930098 = weight(abstract_txt:process in 682) [ClassicSimilarity], result of:
            0.027930098 = score(doc=682,freq=1.0), product of:
              0.08771886 = queryWeight, product of:
                1.4564012 = boost
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.014778233 = queryNorm
              0.3184047 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.078125 = fieldNorm(doc=682)
          0.26765972 = weight(abstract_txt:cleanup in 682) [ClassicSimilarity], result of:
            0.26765972 = score(doc=682,freq=2.0), product of:
              0.24931403 = queryWeight, product of:
                1.7361726 = boost
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.014778233 = queryNorm
              1.0735847 = fieldWeight in 682, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.71698 = idf(docFreq=6, maxDocs=42740)
                0.078125 = fieldNorm(doc=682)
          0.05639174 = weight(abstract_txt:headings in 682) [ClassicSimilarity], result of:
            0.05639174 = score(doc=682,freq=1.0), product of:
              0.14012694 = queryWeight, product of:
                1.8407524 = boost
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.014778233 = queryNorm
              0.40243322 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.078125 = fieldNorm(doc=682)
          0.32325158 = weight(abstract_txt:authority in 682) [ClassicSimilarity], result of:
            0.32325158 = score(doc=682,freq=12.0), product of:
              0.22440979 = queryWeight, product of:
                2.8529954 = boost
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.014778233 = queryNorm
              1.4404522 = fieldWeight in 682, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.078125 = fieldNorm(doc=682)
        0.2 = coord(5/25)
    
  2. Lougheed, B.; Moran, R.; Callison, C.: Reconciliation through description : using metadata to realize the vision of the National Research Centre for Truth and Reconciliation (2015) 0.13
    0.13448192 = sum of:
      0.13448192 = product of:
        1.1206827 = sum of:
          0.092725225 = weight(abstract_txt:ultimately in 4182) [ClassicSimilarity], result of:
            0.092725225 = score(doc=4182,freq=1.0), product of:
              0.1372079 = queryWeight, product of:
                1.28798 = boost
                7.2085433 = idf(docFreq=85, maxDocs=42740)
                0.014778233 = queryNorm
              0.6758009 = fieldWeight in 4182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2085433 = idf(docFreq=85, maxDocs=42740)
                0.09375 = fieldNorm(doc=4182)
          0.033516116 = weight(abstract_txt:process in 4182) [ClassicSimilarity], result of:
            0.033516116 = score(doc=4182,freq=1.0), product of:
              0.08771886 = queryWeight, product of:
                1.4564012 = boost
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.014778233 = queryNorm
              0.38208562 = fieldWeight in 4182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.09375 = fieldNorm(doc=4182)
          0.9944414 = weight(abstract_txt:reconciliation in 4182) [ClassicSimilarity], result of:
            0.9944414 = score(doc=4182,freq=3.0), product of:
              0.6672675 = queryWeight, product of:
                4.9196043 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.014778233 = queryNorm
              1.4903189 = fieldWeight in 4182, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.09375 = fieldNorm(doc=4182)
        0.12 = coord(3/25)
    
  3. Hooland, S. van; Verborgh, R.; Wilde, M. De; Hercher, J.; Mannens, E.; Wa, R.Van de: Evaluating the success of vocabulary reconciliation for cultural heritage collections (2013) 0.13
    0.13094018 = sum of:
      0.13094018 = product of:
        0.8183762 = sum of:
          0.03949912 = weight(abstract_txt:process in 2663) [ClassicSimilarity], result of:
            0.03949912 = score(doc=2663,freq=2.0), product of:
              0.08771886 = queryWeight, product of:
                1.4564012 = boost
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.014778233 = queryNorm
              0.45029223 = fieldWeight in 2663, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.078125 = fieldNorm(doc=2663)
          0.05639174 = weight(abstract_txt:headings in 2663) [ClassicSimilarity], result of:
            0.05639174 = score(doc=2663,freq=1.0), product of:
              0.14012694 = queryWeight, product of:
                1.8407524 = boost
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.014778233 = queryNorm
              0.40243322 = fieldWeight in 2663, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.078125 = fieldNorm(doc=2663)
          0.04585373 = weight(abstract_txt:developed in 2663) [ClassicSimilarity], result of:
            0.04585373 = score(doc=2663,freq=1.0), product of:
              0.13974133 = queryWeight, product of:
                2.2513478 = boost
                4.2001014 = idf(docFreq=1741, maxDocs=42740)
                0.014778233 = queryNorm
              0.32813293 = fieldWeight in 2663, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2001014 = idf(docFreq=1741, maxDocs=42740)
                0.078125 = fieldNorm(doc=2663)
          0.6766316 = weight(abstract_txt:reconciliation in 2663) [ClassicSimilarity], result of:
            0.6766316 = score(doc=2663,freq=2.0), product of:
              0.6672675 = queryWeight, product of:
                4.9196043 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.014778233 = queryNorm
              1.0140336 = fieldWeight in 2663, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.078125 = fieldNorm(doc=2663)
        0.16 = coord(4/25)
    
  4. Ross, J.: Geographic headings online (1984) 0.06
    0.06258781 = sum of:
      0.06258781 = product of:
        0.39117384 = sum of:
          0.14248273 = weight(abstract_txt:verification in 1468) [ClassicSimilarity], result of:
            0.14248273 = score(doc=1468,freq=1.0), product of:
              0.16486336 = queryWeight, product of:
                1.4118274 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.014778233 = queryNorm
              0.8642474 = fieldWeight in 1468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.109375 = fieldNorm(doc=1468)
          0.039102133 = weight(abstract_txt:process in 1468) [ClassicSimilarity], result of:
            0.039102133 = score(doc=1468,freq=1.0), product of:
              0.08771886 = queryWeight, product of:
                1.4564012 = boost
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.014778233 = queryNorm
              0.44576657 = fieldWeight in 1468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.109375 = fieldNorm(doc=1468)
          0.07894842 = weight(abstract_txt:headings in 1468) [ClassicSimilarity], result of:
            0.07894842 = score(doc=1468,freq=1.0), product of:
              0.14012694 = queryWeight, product of:
                1.8407524 = boost
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.014778233 = queryNorm
              0.56340647 = fieldWeight in 1468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.151145 = idf(docFreq=672, maxDocs=42740)
                0.109375 = fieldNorm(doc=1468)
          0.13064057 = weight(abstract_txt:authority in 1468) [ClassicSimilarity], result of:
            0.13064057 = score(doc=1468,freq=1.0), product of:
              0.22440979 = queryWeight, product of:
                2.8529954 = boost
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.014778233 = queryNorm
              0.58215183 = fieldWeight in 1468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.109375 = fieldNorm(doc=1468)
        0.16 = coord(4/25)
    
  5. Mugridge, R.L.; Furniss, K.A.: Education for authority control : whose responsibility is it? (2002) 0.06
    0.06167361 = sum of:
      0.06167361 = product of:
        0.38546008 = sum of:
          0.074341685 = weight(abstract_txt:maintenance in 460) [ClassicSimilarity], result of:
            0.074341685 = score(doc=460,freq=2.0), product of:
              0.10613198 = queryWeight, product of:
                1.1327724 = boost
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.014778233 = queryNorm
              0.7004645 = fieldWeight in 460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3398805 = idf(docFreq=204, maxDocs=42740)
                0.078125 = fieldNorm(doc=460)
          0.054614913 = weight(abstract_txt:ongoing in 460) [ClassicSimilarity], result of:
            0.054614913 = score(doc=460,freq=1.0), product of:
              0.108870156 = queryWeight, product of:
                1.147292 = boost
                6.4211435 = idf(docFreq=188, maxDocs=42740)
                0.014778233 = queryNorm
              0.5016518 = fieldWeight in 460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4211435 = idf(docFreq=188, maxDocs=42740)
                0.078125 = fieldNorm(doc=460)
          0.027930098 = weight(abstract_txt:process in 460) [ClassicSimilarity], result of:
            0.027930098 = score(doc=460,freq=1.0), product of:
              0.08771886 = queryWeight, product of:
                1.4564012 = boost
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.014778233 = queryNorm
              0.3184047 = fieldWeight in 460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.07558 = idf(docFreq=1972, maxDocs=42740)
                0.078125 = fieldNorm(doc=460)
          0.2285734 = weight(abstract_txt:authority in 460) [ClassicSimilarity], result of:
            0.2285734 = score(doc=460,freq=6.0), product of:
              0.22440979 = queryWeight, product of:
                2.8529954 = boost
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.014778233 = queryNorm
              1.0185536 = fieldWeight in 460, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.322531 = idf(docFreq=566, maxDocs=42740)
                0.078125 = fieldNorm(doc=460)
        0.16 = coord(4/25)