Document (#44180)

Author
McElfresh, L.K.
Title
Creator name standardization using faceted vocabularies in the BTAA geoportal : Michigan State University libraries digital repository case study
Source
Cataloging and classification quarterly. 61(2023) no.5-6, pp.605-625
Year
2023
Abstract
Digital libraries incorporate metadata from varied sources, ranging from traditional catalog data to author-supplied descriptions. The Big Ten Academic Alliance (BTAA) Geoportal unites geospatial resources from the libraries of the BTAA, compounding the variability of metadata. The BTAA Geospatial Information Network's (BTAA GIN) Metadata Committee works to ensure completeness and consistency of metadata in the Geoportal, including a project to standardize the contents of the Creator field. The project comprises an OpenRefine data cleaning phase; evaluation of controlled vocabularies for semiautomated matching via OpenRefine reconciliation; and development and testing of a best practices guide for application of a controlled vocabulary.
Content
Cf.: https://www.tandfonline.com/doi/full/10.1080/01639374.2023.2200430.
Footnote
Contribution to a special issue: Implementation of Faceted Vocabularies.
Field
Geosciences
Location
USA

Similar documents (content)

  1. Hooland, S. van; Verborgh, R.; Wilde, M. De; Hercher, J.; Mannens, E.; Walle, R. Van de: Evaluating the success of vocabulary reconciliation for cultural heritage collections (2013) 0.21
    0.21427792 = sum of:
      0.21427792 = product of:
        0.8928247 = sum of:
          0.012153444 = weight(abstract_txt:from in 662) [ClassicSimilarity], result of:
            0.012153444 = score(doc=662,freq=1.0), product of:
              0.056284618 = queryWeight, product of:
                1.215483 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.016754106 = queryNorm
              0.21592833 = fieldWeight in 662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
          0.19708103 = weight(abstract_txt:reconciliation in 662) [ClassicSimilarity], result of:
            0.19708103 = score(doc=662,freq=2.0), product of:
              0.19844323 = queryWeight, product of:
                1.3176848 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016754106 = queryNorm
              0.9931356 = fieldWeight in 662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
          0.08779657 = weight(abstract_txt:controlled in 662) [ClassicSimilarity], result of:
            0.08779657 = score(doc=662,freq=2.0), product of:
              0.14583713 = queryWeight, product of:
                1.5975057 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.016754106 = queryNorm
              0.60201794 = fieldWeight in 662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
          0.11220226 = weight(abstract_txt:vocabularies in 662) [ClassicSimilarity], result of:
            0.11220226 = score(doc=662,freq=2.0), product of:
              0.17174502 = queryWeight, product of:
                1.7336062 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.016754106 = queryNorm
              0.6533072 = fieldWeight in 662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
          0.32898328 = weight(abstract_txt:openrefine in 662) [ClassicSimilarity], result of:
            0.32898328 = score(doc=662,freq=1.0), product of:
              0.44327742 = queryWeight, product of:
                2.7851346 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.016754106 = queryNorm
              0.74216115 = fieldWeight in 662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
          0.15460812 = weight(abstract_txt:metadata in 662) [ClassicSimilarity], result of:
            0.15460812 = score(doc=662,freq=3.0), product of:
              0.23407274 = queryWeight, product of:
                2.8621922 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.016754106 = queryNorm
              0.6605131 = fieldWeight in 662, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=662)
        0.24 = coord(6/25)
    
  2. Lynch, J.D.; Gibson, J.; Han, M.-J.: Analyzing and normalizing type metadata for a large aggregated digital library (2020) 0.16
    0.15587652 = sum of:
      0.15587652 = product of:
        0.7793826 = sum of:
          0.0145841325 = weight(abstract_txt:from in 5720) [ClassicSimilarity], result of:
            0.0145841325 = score(doc=5720,freq=1.0), product of:
              0.056284618 = queryWeight, product of:
                1.215483 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.016754106 = queryNorm
              0.259114 = fieldWeight in 5720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=5720)
          0.052979 = weight(abstract_txt:digital in 5720) [ClassicSimilarity], result of:
            0.052979 = score(doc=5720,freq=2.0), product of:
              0.092221335 = queryWeight, product of:
                1.270352 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.016754106 = queryNorm
              0.5744766 = fieldWeight in 5720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.09375 = fieldNorm(doc=5720)
          0.05466084 = weight(abstract_txt:project in 5720) [ClassicSimilarity], result of:
            0.05466084 = score(doc=5720,freq=2.0), product of:
              0.0941629 = queryWeight, product of:
                1.2836549 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.016754106 = queryNorm
              0.5804924 = fieldWeight in 5720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.09375 = fieldNorm(doc=5720)
          0.39477992 = weight(abstract_txt:openrefine in 5720) [ClassicSimilarity], result of:
            0.39477992 = score(doc=5720,freq=1.0), product of:
              0.44327742 = queryWeight, product of:
                2.7851346 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.016754106 = queryNorm
              0.89059335 = fieldWeight in 5720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=5720)
          0.26237866 = weight(abstract_txt:metadata in 5720) [ClassicSimilarity], result of:
            0.26237866 = score(doc=5720,freq=6.0), product of:
              0.23407274 = queryWeight, product of:
                2.8621922 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.016754106 = queryNorm
              1.1209279 = fieldWeight in 5720, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.09375 = fieldNorm(doc=5720)
        0.2 = coord(5/25)
    
  3. Integrating multiple overlapping metadata standards (1999) 0.14
    0.13627228 = sum of:
      0.13627228 = product of:
        0.85170174 = sum of:
          0.06243635 = weight(abstract_txt:digital in 4052) [ClassicSimilarity], result of:
            0.06243635 = score(doc=4052,freq=1.0), product of:
              0.092221335 = queryWeight, product of:
                1.270352 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.016754106 = queryNorm
              0.67702717 = fieldWeight in 4052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.15625 = fieldNorm(doc=4052)
          0.062309444 = weight(abstract_txt:libraries in 4052) [ClassicSimilarity], result of:
            0.062309444 = score(doc=4052,freq=1.0), product of:
              0.10542398 = queryWeight, product of:
                1.6635035 = boost
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.016754106 = queryNorm
              0.59103674 = fieldWeight in 4052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.15625 = fieldNorm(doc=4052)
          0.4744819 = weight(abstract_txt:geospatial in 4052) [ClassicSimilarity], result of:
            0.4744819 = score(doc=4052,freq=1.0), product of:
              0.35646716 = queryWeight, product of:
                2.4975727 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016754106 = queryNorm
              1.3310677 = fieldWeight in 4052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.15625 = fieldNorm(doc=4052)
          0.252474 = weight(abstract_txt:metadata in 4052) [ClassicSimilarity], result of:
            0.252474 = score(doc=4052,freq=2.0), product of:
              0.23407274 = queryWeight, product of:
                2.8621922 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.016754106 = queryNorm
              1.0786134 = fieldWeight in 4052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.15625 = fieldNorm(doc=4052)
        0.16 = coord(4/25)
    
  4. Hooland, S. van; Verborgh, R.: Linked data for libraries, archives and museums : how to clean, link, and publish your metadata (2014) 0.13
    0.13021138 = sum of:
      0.13021138 = product of:
        0.54254746 = sum of:
          0.010312539 = weight(abstract_txt:from in 5153) [ClassicSimilarity], result of:
            0.010312539 = score(doc=5153,freq=2.0), product of:
              0.056284618 = queryWeight, product of:
                1.215483 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.016754106 = queryNorm
              0.18322127 = fieldWeight in 5153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.12916224 = weight(abstract_txt:cleaning in 5153) [ClassicSimilarity], result of:
            0.12916224 = score(doc=5153,freq=3.0), product of:
              0.18386492 = queryWeight, product of:
                1.2683609 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016754106 = queryNorm
              0.7024844 = fieldWeight in 5153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.018730905 = weight(abstract_txt:digital in 5153) [ClassicSimilarity], result of:
            0.018730905 = score(doc=5153,freq=1.0), product of:
              0.092221335 = queryWeight, product of:
                1.270352 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.016754106 = queryNorm
              0.20310816 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.14482439 = weight(abstract_txt:reconciliation in 5153) [ClassicSimilarity], result of:
            0.14482439 = score(doc=5153,freq=3.0), product of:
              0.19844323 = queryWeight, product of:
                1.3176848 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016754106 = queryNorm
              0.7298026 = fieldWeight in 5153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.018692832 = weight(abstract_txt:libraries in 5153) [ClassicSimilarity], result of:
            0.018692832 = score(doc=5153,freq=1.0), product of:
              0.10542398 = queryWeight, product of:
                1.6635035 = boost
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.016754106 = queryNorm
              0.17731102 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.782635 = idf(docFreq=2735, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.22082455 = weight(abstract_txt:metadata in 5153) [ClassicSimilarity], result of:
            0.22082455 = score(doc=5153,freq=17.0), product of:
              0.23407274 = queryWeight, product of:
                2.8621922 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.016754106 = queryNorm
              0.9434014 = fieldWeight in 5153, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
        0.24 = coord(6/25)
    
  5. Gilliland, A.J.: Contemplating co-creator rights in archival description (2012) 0.11
    0.11105432 = sum of:
      0.11105432 = product of:
        0.5552716 = sum of:
          0.009722755 = weight(abstract_txt:from in 415) [ClassicSimilarity], result of:
            0.009722755 = score(doc=415,freq=1.0), product of:
              0.056284618 = queryWeight, product of:
                1.215483 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.016754106 = queryNorm
              0.17274266 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=415)
          0.02576737 = weight(abstract_txt:project in 415) [ClassicSimilarity], result of:
            0.02576737 = score(doc=415,freq=1.0), product of:
              0.0941629 = queryWeight, product of:
                1.2836549 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.016754106 = queryNorm
              0.27364674 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.0625 = fieldNorm(doc=415)
          0.11148587 = weight(abstract_txt:reconciliation in 415) [ClassicSimilarity], result of:
            0.11148587 = score(doc=415,freq=1.0), product of:
              0.19844323 = queryWeight, product of:
                1.3176848 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.016754106 = queryNorm
              0.5618023 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=415)
          0.30730602 = weight(abstract_txt:creator in 415) [ClassicSimilarity], result of:
            0.30730602 = score(doc=415,freq=3.0), product of:
              0.3408056 = queryWeight, product of:
                2.4420903 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.016754106 = queryNorm
              0.9017047 = fieldWeight in 415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=415)
          0.1009896 = weight(abstract_txt:metadata in 415) [ClassicSimilarity], result of:
            0.1009896 = score(doc=415,freq=2.0), product of:
              0.23407274 = queryWeight, product of:
                2.8621922 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.016754106 = queryNorm
              0.43144536 = fieldWeight in 415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=415)
        0.2 = coord(5/25)
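
The score trees above all follow the same arithmetic, which the short Python sketch below reproduces for the first hit (doc 662). It is a reconstruction from the numbers in the listing, not the search engine's own code: the function names are hypothetical, tf is taken to be the square root of the term frequency (as the listed tf values suggest), and idf is assumed to be 1 + ln(maxDocs / (docFreq + 1)), which agrees with the listed idf values. The engine appears to work in single precision, so results should match the listing only to within rounding.

```python
import math

# Constants read directly from the listing above.
MAX_DOCS = 44218
QUERY_NORM = 0.016754106


def idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf formula; it reproduces the idf values shown in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Matching terms for doc 662 as (freq, docFreq, boost), copied from the listing.
TERMS_662 = [
    (1.0, 7577, 1.215483),   # from
    (2.0, 14, 1.3176848),    # reconciliation
    (2.0, 516, 1.5975057),   # controlled
    (2.0, 324, 1.7336062),   # vocabularies
    (1.0, 8, 2.7851346),     # openrefine
    (3.0, 911, 2.8621922),   # metadata
]
FIELD_NORM_662 = 0.078125

# Sum of the per-term scores, then scaled by coord = matched terms / query terms.
raw_sum_662 = sum(term_score(f, df, b, FIELD_NORM_662) for f, df, b in TERMS_662)
coord_662 = 6 / 25
final_score_662 = raw_sum_662 * coord_662

print(f"sum of term scores: {raw_sum_662:.6f}")
print(f"final score:        {final_score_662:.6f}")
```

The coord factor explains why a hit with a few very rare matching terms (for example "openrefine" or "geospatial") can still rank below one that matches more of the 25 query terms: the summed term scores are multiplied by the fraction of query terms found in the document.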