Document (#43723)

Author
Vorndran, A.
Grund, S.
Title
Metadata sharing : how to transfer metadata information among work cluster members
Source
Cataloging and classification quarterly. 59(2021) no.8, p.757-774
Year
2021
Abstract
The German National Library (DNB) is using a clustering technique to aggregate works from the database Culturegraph. Culturegraph collects bibliographic metadata records from all German Regional Library Networks, the Austrian Library Network, and DNB. This stock of about 180 million records serves as the basis for work clustering, the attempt to assemble all manifestations of a work in one cluster. The results of this work clustering are not employed in the display of search results, as other similar approaches successfully do, but for transferring metadata elements among the cluster members. In this paper, the transfer of content-descriptive metadata elements, such as controlled and uncontrolled index terms, classifications, and links to name records in the German Integrated Authority File (GND), is described. In this way, standardization and cross-linking can be improved and the richness of metadata description can be enhanced.
Content
Cf.: https://doi.org/10.1080/01639374.2021.1989101.
Footnote
Part of a special issue: Artificial intelligence (AI) and automated processes for subject access
Theme
Metadata
Object
Culturegraph
GND

Similar documents (content)

  1. Carlyle, A.; Summerlin, J.: Transforming catalog displays : records clustering for works of fiction (2000) 0.16
    0.16399698 = sum of:
      0.16399698 = product of:
        0.68332076 = sum of:
          0.02535302 = weight(abstract_txt:results in 100) [ClassicSimilarity], result of:
            0.02535302 = score(doc=100,freq=2.0), product of:
              0.065893605 = queryWeight, product of:
                1.0266223 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.018431097 = queryNorm
              0.38475692 = fieldWeight in 100, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
          0.016869295 = weight(abstract_txt:this in 100) [ClassicSimilarity], result of:
            0.016869295 = score(doc=100,freq=2.0), product of:
              0.0632749 = queryWeight, product of:
                1.422721 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.018431097 = queryNorm
              0.2666033 = fieldWeight in 100, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
          0.10993913 = weight(abstract_txt:records in 100) [ClassicSimilarity], result of:
            0.10993913 = score(doc=100,freq=4.0), product of:
              0.15920086 = queryWeight, product of:
                1.9543728 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.018431097 = queryNorm
              0.6905687 = fieldWeight in 100, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
          0.04637989 = weight(abstract_txt:work in 100) [ClassicSimilarity], result of:
            0.04637989 = score(doc=100,freq=1.0), product of:
              0.15645759 = queryWeight, product of:
                2.2371874 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.018431097 = queryNorm
              0.29643747 = fieldWeight in 100, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
          0.30589816 = weight(abstract_txt:clustering in 100) [ClassicSimilarity], result of:
            0.30589816 = score(doc=100,freq=4.0), product of:
              0.31494048 = queryWeight, product of:
                2.7488368 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.018431097 = queryNorm
              0.9712888 = fieldWeight in 100, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
          0.17888126 = weight(abstract_txt:cluster in 100) [ClassicSimilarity], result of:
            0.17888126 = score(doc=100,freq=1.0), product of:
              0.3496019 = queryWeight, product of:
                2.8961535 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.018431097 = queryNorm
              0.5116713 = fieldWeight in 100, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=100)
        0.24 = coord(6/25)
    
  2. Jacob, E.K.; Albrechtsen, H.; George, N.: Empirical analysis and evaluation of a metadata scheme for representing pedagogical resources in a digital library for educators (2006) 0.15
    0.15348676 = sum of:
      0.15348676 = product of:
        0.548167 = sum of:
          0.03933622 = weight(abstract_txt:among in 2518) [ClassicSimilarity], result of:
            0.03933622 = score(doc=2518,freq=1.0), product of:
              0.111265846 = queryWeight, product of:
                1.3340435 = boost
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.018431097 = queryNorm
              0.35353363 = fieldWeight in 2518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.04607634 = weight(abstract_txt:library in 2518) [ClassicSimilarity], result of:
            0.04607634 = score(doc=2518,freq=5.0), product of:
              0.08276738 = queryWeight, product of:
                1.4091729 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.018431097 = queryNorm
              0.55669683 = fieldWeight in 2518, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.023856787 = weight(abstract_txt:this in 2518) [ClassicSimilarity], result of:
            0.023856787 = score(doc=2518,freq=4.0), product of:
              0.0632749 = queryWeight, product of:
                1.422721 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.018431097 = queryNorm
              0.37703398 = fieldWeight in 2518, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.061803106 = weight(abstract_txt:elements in 2518) [ClassicSimilarity], result of:
            0.061803106 = score(doc=2518,freq=1.0), product of:
              0.15037432 = queryWeight, product of:
                1.5508717 = boost
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.018431097 = queryNorm
              0.41099507 = fieldWeight in 2518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.054969564 = weight(abstract_txt:records in 2518) [ClassicSimilarity], result of:
            0.054969564 = score(doc=2518,freq=1.0), product of:
              0.15920086 = queryWeight, product of:
                1.9543728 = boost
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.018431097 = queryNorm
              0.34528434 = fieldWeight in 2518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4196396 = idf(docFreq=1446, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.06559107 = weight(abstract_txt:work in 2518) [ClassicSimilarity], result of:
            0.06559107 = score(doc=2518,freq=2.0), product of:
              0.15645759 = queryWeight, product of:
                2.2371874 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.018431097 = queryNorm
              0.41922587 = fieldWeight in 2518, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
          0.25653392 = weight(abstract_txt:metadata in 2518) [ClassicSimilarity], result of:
            0.25653392 = score(doc=2518,freq=3.0), product of:
              0.38838583 = queryWeight, product of:
                4.316993 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018431097 = queryNorm
              0.6605131 = fieldWeight in 2518, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=2518)
        0.28 = coord(7/25)
    
  3. Pfeffer, M.: Using clustering across union catalogues to enrich entries with indexing information (2014) 0.15
    0.15335964 = sum of:
      0.15335964 = product of:
        0.47924888 = sum of:
          0.06627392 = weight(abstract_txt:regional in 3301) [ClassicSimilarity], result of:
            0.06627392 = score(doc=3301,freq=1.0), product of:
              0.12504084 = queryWeight, product of:
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.018431097 = queryNorm
              0.53001815 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.784232 = idf(docFreq=135, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.02535302 = weight(abstract_txt:results in 3301) [ClassicSimilarity], result of:
            0.02535302 = score(doc=3301,freq=2.0), product of:
              0.065893605 = queryWeight, product of:
                1.0266223 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.018431097 = queryNorm
              0.38475692 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.03933622 = weight(abstract_txt:among in 3301) [ClassicSimilarity], result of:
            0.03933622 = score(doc=3301,freq=1.0), product of:
              0.111265846 = queryWeight, product of:
                1.3340435 = boost
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.018431097 = queryNorm
              0.35353363 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.041211933 = weight(abstract_txt:library in 3301) [ClassicSimilarity], result of:
            0.041211933 = score(doc=3301,freq=4.0), product of:
              0.08276738 = queryWeight, product of:
                1.4091729 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.018431097 = queryNorm
              0.4979248 = fieldWeight in 3301, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.020660581 = weight(abstract_txt:this in 3301) [ClassicSimilarity], result of:
            0.020660581 = score(doc=3301,freq=3.0), product of:
              0.0632749 = queryWeight, product of:
                1.422721 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.018431097 = queryNorm
              0.32652098 = fieldWeight in 3301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.08708423 = weight(abstract_txt:members in 3301) [ClassicSimilarity], result of:
            0.08708423 = score(doc=3301,freq=1.0), product of:
              0.18899913 = queryWeight, product of:
                1.7386771 = boost
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.018431097 = queryNorm
              0.4607652 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.04637989 = weight(abstract_txt:work in 3301) [ClassicSimilarity], result of:
            0.04637989 = score(doc=3301,freq=1.0), product of:
              0.15645759 = queryWeight, product of:
                2.2371874 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.018431097 = queryNorm
              0.29643747 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
          0.15294908 = weight(abstract_txt:clustering in 3301) [ClassicSimilarity], result of:
            0.15294908 = score(doc=3301,freq=1.0), product of:
              0.31494048 = queryWeight, product of:
                2.7488368 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.018431097 = queryNorm
              0.4856444 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=3301)
        0.32 = coord(8/25)
    
  4. Paling, S.: Developing a metadata element set for organizing literary works : a survey of the American literary community (2011) 0.15
    0.15322284 = sum of:
      0.15322284 = product of:
        0.6384285 = sum of:
          0.017927293 = weight(abstract_txt:results in 4554) [ClassicSimilarity], result of:
            0.017927293 = score(doc=4554,freq=1.0), product of:
              0.065893605 = queryWeight, product of:
                1.0266223 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.018431097 = queryNorm
              0.27206424 = fieldWeight in 4554, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
          0.0119283935 = weight(abstract_txt:this in 4554) [ClassicSimilarity], result of:
            0.0119283935 = score(doc=4554,freq=1.0), product of:
              0.0632749 = queryWeight, product of:
                1.422721 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.018431097 = queryNorm
              0.18851699 = fieldWeight in 4554, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
          0.12360621 = weight(abstract_txt:elements in 4554) [ClassicSimilarity], result of:
            0.12360621 = score(doc=4554,freq=4.0), product of:
              0.15037432 = queryWeight, product of:
                1.5508717 = boost
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.018431097 = queryNorm
              0.82199013 = fieldWeight in 4554, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
          0.12315569 = weight(abstract_txt:members in 4554) [ClassicSimilarity], result of:
            0.12315569 = score(doc=4554,freq=2.0), product of:
              0.18899913 = queryWeight, product of:
                1.7386771 = boost
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.018431097 = queryNorm
              0.6516204 = fieldWeight in 4554, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8977947 = idf(docFreq=329, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
          0.06559107 = weight(abstract_txt:work in 4554) [ClassicSimilarity], result of:
            0.06559107 = score(doc=4554,freq=2.0), product of:
              0.15645759 = queryWeight, product of:
                2.2371874 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.018431097 = queryNorm
              0.41922587 = fieldWeight in 4554, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
          0.29621986 = weight(abstract_txt:metadata in 4554) [ClassicSimilarity], result of:
            0.29621986 = score(doc=4554,freq=4.0), product of:
              0.38838583 = queryWeight, product of:
                4.316993 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018431097 = queryNorm
              0.76269484 = fieldWeight in 4554, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=4554)
        0.24 = coord(6/25)
    
  5. Aman, V.: Internationally mobile scientists as knowledge transmitters : a lexical-based approach to detect knowledge transfer (2022) 0.14
    0.1397834 = sum of:
      0.1397834 = product of:
        0.58243084 = sum of:
          0.09579174 = weight(abstract_txt:transferring in 665) [ClassicSimilarity], result of:
            0.09579174 = score(doc=665,freq=1.0), product of:
              0.18548788 = queryWeight, product of:
                1.2179567 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.018431097 = queryNorm
              0.5164313 = fieldWeight in 665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
          0.031468973 = weight(abstract_txt:among in 665) [ClassicSimilarity], result of:
            0.031468973 = score(doc=665,freq=1.0), product of:
              0.111265846 = queryWeight, product of:
                1.3340435 = boost
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.018431097 = queryNorm
              0.2828269 = fieldWeight in 665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5252304 = idf(docFreq=1301, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
          0.013495437 = weight(abstract_txt:this in 665) [ClassicSimilarity], result of:
            0.013495437 = score(doc=665,freq=2.0), product of:
              0.0632749 = queryWeight, product of:
                1.422721 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.018431097 = queryNorm
              0.21328263 = fieldWeight in 665, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
          0.18734868 = weight(abstract_txt:transfer in 665) [ClassicSimilarity], result of:
            0.18734868 = score(doc=665,freq=5.0), product of:
              0.21373905 = queryWeight, product of:
                1.8489748 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.018431097 = queryNorm
              0.87652993 = fieldWeight in 665, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
          0.037103914 = weight(abstract_txt:work in 665) [ClassicSimilarity], result of:
            0.037103914 = score(doc=665,freq=1.0), product of:
              0.15645759 = queryWeight, product of:
                2.2371874 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.018431097 = queryNorm
              0.23714998 = fieldWeight in 665, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
          0.21722208 = weight(abstract_txt:german in 665) [ClassicSimilarity], result of:
            0.21722208 = score(doc=665,freq=3.0), product of:
              0.32015932 = queryWeight, product of:
                2.7715182 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.018431097 = queryNorm
              0.6784812 = fieldWeight in 665, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.0625 = fieldNorm(doc=665)
        0.24 = coord(6/25)
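
The score trees above all follow the same arithmetic, which can be retraced by hand. The sketch below recomputes the total for the first entry (doc=100) from the raw values printed in its tree. It assumes the usual ClassicSimilarity definitions, which are consistent with the numbers shown: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function and variable names are illustrative only and are not part of any library API.

```python
import math

# Constants copied from the explain output for doc=100.
MAX_DOCS = 44218
QUERY_NORM = 0.018431097
FIELD_NORM = 0.078125


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost,
               field_norm=FIELD_NORM, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# term -> (freq, docFreq, boost), copied from the tree of the first entry.
TERMS_DOC_100 = {
    "results": (2.0, 3693, 1.0266223),
    "this": (2.0, 10762, 1.422721),
    "records": (4.0, 1446, 1.9543728),
    "work": (1.0, 2703, 2.2371874),
    "clustering": (4.0, 239, 2.7488368),
    "cluster": (1.0, 171, 2.8961535),
}


def document_score(terms, matched, query_terms):
    """Sum of the term scores, scaled by coord = matched / query_terms."""
    coord = matched / query_terms
    return coord * sum(term_score(*values) for values in terms.values())


total = document_score(TERMS_DOC_100, matched=6, query_terms=25)
print(f"recomputed score for doc=100: {total:.8f}")
```

The result should agree with the listed total of 0.16399698 up to small rounding differences, since the listing was produced with 32-bit floats while Python uses 64-bit floats. The same functions apply to the other entries once their own freq, docFreq, boost, fieldNorm, and coord values are substituted.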