Document (#43727)

Author
Hahn, J.
Title
Semi-automated methods for BIBFRAME work entity description
Source
Cataloging and classification quarterly. 59(2021) no.8, p.853-867
Year
2021
Abstract
This paper reports an investigation of machine learning methods for the semi-automated creation of a BIBFRAME Work entity description within the RDF linked data editor Sinopia (https://sinopia.io). The automated subject indexing software Annif was configured with the Library of Congress Subject Headings (LCSH) vocabulary from the Linked Data Service at https://id.loc.gov/. The training corpus comprised 9.3 million titles and LCSH linked data references from the IvyPlus POD project (https://pod.stanford.edu/) and from Share-VDE (https://wiki.share-vde.org). Semi-automated processes were explored to support and extend, not replace, professional expertise.
Content
Cf.: https://doi.org/10.1080/01639374.2021.2014011.
Footnote
Part of a special issue: Artificial intelligence (AI) and automated processes for subject access
Theme
Descriptive cataloging
Object
BIBFRAME

Similar documents (author)

  1. Hahn, G.: ¬Die Bibliothek des Wissenschaftlichen Dienstes des US-Kongresses : Eine Bibliothek in der Library of Congress (1985) 4.88
    4.878167 = sum of:
      4.878167 = weight(author_txt:hahn in 1311) [ClassicSimilarity], result of:
        4.878167 = fieldWeight in 1311, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.805067 = idf(docFreq=48, maxDocs=44218)
          0.625 = fieldNorm(doc=1311)
    
  2. Hahn, G.: ¬Die Entwicklung der Wirtschaftswissenschaften im Spiegel von Klassifikationssystemen : ein Beitrag zur Wissenschafts- und Klassifikationskunde der Nationalökonomie (1978) 4.88
    4.878167 = sum of:
      4.878167 = weight(author_txt:hahn in 1698) [ClassicSimilarity], result of:
        4.878167 = fieldWeight in 1698, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.805067 = idf(docFreq=48, maxDocs=44218)
          0.625 = fieldNorm(doc=1698)
    
  3. Hahn, G.: Sacherschließung durch Schlagwortkataloge : theoretische und praktische Fragen, dargestellt am Beispiel der Bibliotheken der Industrie- und Handelskammern (1983) 4.88
    4.878167 = sum of:
      4.878167 = weight(author_txt:hahn in 1699) [ClassicSimilarity], result of:
        4.878167 = fieldWeight in 1699, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.805067 = idf(docFreq=48, maxDocs=44218)
          0.625 = fieldNorm(doc=1699)
    
  4. Hahn, G.: ¬Die Bibliothek des Deutschen Bundestages : Informationsbasis für die parlamentarische Arbeit (1983) 4.88
    4.878167 = sum of:
      4.878167 = weight(author_txt:hahn in 1700) [ClassicSimilarity], result of:
        4.878167 = fieldWeight in 1700, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.805067 = idf(docFreq=48, maxDocs=44218)
          0.625 = fieldNorm(doc=1700)
    
  5. Hahn, G.: Information und Dokumentation in der Bibliothek des Deutschen Bundestages : ein Beispiel der Praxis für die Einheit bibliothekarischer und dokumentarischer Prinzipien (1978-79) 4.88
    4.878167 = sum of:
      4.878167 = weight(author_txt:hahn in 1701) [ClassicSimilarity], result of:
        4.878167 = fieldWeight in 1701, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.805067 = idf(docFreq=48, maxDocs=44218)
          0.625 = fieldNorm(doc=1701)
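
The author-match scores above all follow the same arithmetic: fieldWeight = tf × idf × fieldNorm. A minimal sketch of that calculation is given here, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of Lucene's API; the input values are copied from the explain output.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the explain output for author_txt:hahn.
idf = classic_idf(doc_freq=48, max_docs=44218)
score = field_weight(freq=1.0, doc_freq=48, max_docs=44218, field_norm=0.625)

# Compare with idf = 7.805067 and score = 4.878167 in the listing.
print(f"idf={idf:.6f} score={score:.6f}")
```

Because the query has a single term and every listed document matches it once with the same fieldNorm, all five entries receive the identical score.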
    

Similar documents (content)

  1. Ahmed, M.; Mukhopadhyay, M.; Mukhopadhyay, P.: Automated knowledge organization : AI ML based subject indexing system for libraries (2023) 0.21
    0.21355167 = sum of:
      0.21355167 = product of:
        0.7626845 = sum of:
          0.025430303 = weight(abstract_txt:subject in 977) [ClassicSimilarity], result of:
            0.025430303 = score(doc=977,freq=2.0), product of:
              0.073639534 = queryWeight, product of:
                1.3180315 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014300123 = queryNorm
              0.34533492 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.25375283 = weight(abstract_txt:annif in 977) [ClassicSimilarity], result of:
            0.25375283 = score(doc=977,freq=3.0), product of:
              0.23665202 = queryWeight, product of:
                1.6707458 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.014300123 = queryNorm
              1.0722615 = fieldWeight in 977, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.023753101 = weight(abstract_txt:data in 977) [ClassicSimilarity], result of:
            0.023753101 = score(doc=977,freq=2.0), product of:
              0.080547854 = queryWeight, product of:
                1.6882738 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014300123 = queryNorm
              0.29489428 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.10632582 = weight(abstract_txt:lcsh in 977) [ClassicSimilarity], result of:
            0.10632582 = score(doc=977,freq=2.0), product of:
              0.1911184 = queryWeight, product of:
                2.1233497 = boost
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.014300123 = queryNorm
              0.5563348 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.29421 = idf(docFreq=221, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.077339485 = weight(abstract_txt:linked in 977) [ClassicSimilarity], result of:
            0.077339485 = score(doc=977,freq=1.0), product of:
              0.22293827 = queryWeight, product of:
                2.8087184 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.014300123 = queryNorm
              0.34690988 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.1260521 = weight(abstract_txt:semi in 977) [ClassicSimilarity], result of:
            0.1260521 = score(doc=977,freq=1.0), product of:
              0.30875725 = queryWeight, product of:
                3.3054035 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.014300123 = queryNorm
              0.40825632 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.1500309 = weight(abstract_txt:automated in 977) [ClassicSimilarity], result of:
            0.1500309 = score(doc=977,freq=2.0), product of:
              0.30292875 = queryWeight, product of:
                3.7805545 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.014300123 = queryNorm
              0.49526796 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
        0.28 = coord(7/25)
    
  2. Samples, J.; Bigelow, I.: MARC to BIBFRAME : converting the PCC to Linked Data (2020) 0.18
    0.17836301 = sum of:
      0.17836301 = product of:
        0.89181507 = sum of:
          0.07062126 = weight(abstract_txt:share in 119) [ClassicSimilarity], result of:
            0.07062126 = score(doc=119,freq=2.0), product of:
              0.088124394 = queryWeight, product of:
                1.0195378 = boost
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.014300123 = queryNorm
              0.80138147 = fieldWeight in 119, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.044398 = idf(docFreq=284, maxDocs=44218)
                0.09375 = fieldNorm(doc=119)
          0.03494102 = weight(abstract_txt:work in 119) [ClassicSimilarity], result of:
            0.03494102 = score(doc=119,freq=2.0), product of:
              0.06945544 = queryWeight, product of:
                1.2800395 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.014300123 = queryNorm
              0.50307107 = fieldWeight in 119, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.09375 = fieldNorm(doc=119)
          0.035629652 = weight(abstract_txt:data in 119) [ClassicSimilarity], result of:
            0.035629652 = score(doc=119,freq=2.0), product of:
              0.080547854 = queryWeight, product of:
                1.6882738 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014300123 = queryNorm
              0.44234142 = fieldWeight in 119, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=119)
          0.116009235 = weight(abstract_txt:linked in 119) [ClassicSimilarity], result of:
            0.116009235 = score(doc=119,freq=1.0), product of:
              0.22293827 = queryWeight, product of:
                2.8087184 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.014300123 = queryNorm
              0.5203648 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=119)
          0.63461393 = weight(abstract_txt:bibframe in 119) [ClassicSimilarity], result of:
            0.63461393 = score(doc=119,freq=5.0), product of:
              0.35359728 = queryWeight, product of:
                2.888183 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.014300123 = queryNorm
              1.7947365 = fieldWeight in 119, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.09375 = fieldNorm(doc=119)
        0.2 = coord(5/25)
    
  3. Zhu, L.; Xu, A.; Deng, S.; Heng, G.; Li, X.: Entity management using Wikidata for cultural heritage information (2024) 0.17
    0.17180514 = sum of:
      0.17180514 = product of:
        0.71585476 = sum of:
          0.014323244 = weight(abstract_txt:from in 975) [ClassicSimilarity], result of:
            0.014323244 = score(doc=975,freq=1.0), product of:
              0.05527777 = queryWeight, product of:
                1.3985924 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014300123 = queryNorm
              0.259114 = fieldWeight in 975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
          0.043637227 = weight(abstract_txt:data in 975) [ClassicSimilarity], result of:
            0.043637227 = score(doc=975,freq=3.0), product of:
              0.080547854 = queryWeight, product of:
                1.6882738 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014300123 = queryNorm
              0.5417553 = fieldWeight in 975, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
          0.19367518 = weight(abstract_txt:entity in 975) [ClassicSimilarity], result of:
            0.19367518 = score(doc=975,freq=3.0), product of:
              0.19003549 = queryWeight, product of:
                2.1173255 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.014300123 = queryNorm
              1.0191526 = fieldWeight in 975, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
          0.116009235 = weight(abstract_txt:linked in 975) [ClassicSimilarity], result of:
            0.116009235 = score(doc=975,freq=1.0), product of:
              0.22293827 = queryWeight, product of:
                2.8087184 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.014300123 = queryNorm
              0.5203648 = fieldWeight in 975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
          0.18907815 = weight(abstract_txt:semi in 975) [ClassicSimilarity], result of:
            0.18907815 = score(doc=975,freq=1.0), product of:
              0.30875725 = queryWeight, product of:
                3.3054035 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.014300123 = queryNorm
              0.6123845 = fieldWeight in 975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
          0.1591318 = weight(abstract_txt:automated in 975) [ClassicSimilarity], result of:
            0.1591318 = score(doc=975,freq=1.0), product of:
              0.30292875 = queryWeight, product of:
                3.7805545 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.014300123 = queryNorm
              0.525311 = fieldWeight in 975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.09375 = fieldNorm(doc=975)
        0.24 = coord(6/25)
    
  4. Heng, G.; Cole, T.W.; Tian, T.(C.); Han, M.-J.: Rethinking authority reconciliation process (2022) 0.13
    0.12950967 = sum of:
      0.12950967 = product of:
        0.6475483 = sum of:
          0.025193969 = weight(abstract_txt:data in 727) [ClassicSimilarity], result of:
            0.025193969 = score(doc=727,freq=1.0), product of:
              0.080547854 = queryWeight, product of:
                1.6882738 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014300123 = queryNorm
              0.31278262 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=727)
          0.15813512 = weight(abstract_txt:entity in 727) [ClassicSimilarity], result of:
            0.15813512 = score(doc=727,freq=2.0), product of:
              0.19003549 = queryWeight, product of:
                2.1173255 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.014300123 = queryNorm
              0.8321346 = fieldWeight in 727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.09375 = fieldNorm(doc=727)
          0.116009235 = weight(abstract_txt:linked in 727) [ClassicSimilarity], result of:
            0.116009235 = score(doc=727,freq=1.0), product of:
              0.22293827 = queryWeight, product of:
                2.8087184 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.014300123 = queryNorm
              0.5203648 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=727)
          0.18907815 = weight(abstract_txt:semi in 727) [ClassicSimilarity], result of:
            0.18907815 = score(doc=727,freq=1.0), product of:
              0.30875725 = queryWeight, product of:
                3.3054035 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.014300123 = queryNorm
              0.6123845 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.09375 = fieldNorm(doc=727)
          0.1591318 = weight(abstract_txt:automated in 727) [ClassicSimilarity], result of:
            0.1591318 = score(doc=727,freq=1.0), product of:
              0.30292875 = queryWeight, product of:
                3.7805545 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.014300123 = queryNorm
              0.525311 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.09375 = fieldNorm(doc=727)
        0.2 = coord(5/25)
    
  5. Willer, M.; Dunsire, G.: ISBD, the UNIMARC bibliographic format, and RDA : interoperability issues in namespaces and the linked data environment (2014) 0.12
    0.118160754 = sum of:
      0.118160754 = product of:
        0.49233648 = sum of:
          0.024707034 = weight(abstract_txt:work in 1999) [ClassicSimilarity], result of:
            0.024707034 = score(doc=1999,freq=1.0), product of:
              0.06945544 = queryWeight, product of:
                1.2800395 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.014300123 = queryNorm
              0.355725 = fieldWeight in 1999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
          0.020256126 = weight(abstract_txt:from in 1999) [ClassicSimilarity], result of:
            0.020256126 = score(doc=1999,freq=2.0), product of:
              0.05527777 = queryWeight, product of:
                1.3985924 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.014300123 = queryNorm
              0.36644253 = fieldWeight in 1999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
          0.088550046 = weight(abstract_txt:description in 1999) [ClassicSimilarity], result of:
            0.088550046 = score(doc=1999,freq=3.0), product of:
              0.11278326 = queryWeight, product of:
                1.6311451 = boost
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.014300123 = queryNorm
              0.7851347 = fieldWeight in 1999, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.835176 = idf(docFreq=954, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
          0.035629652 = weight(abstract_txt:data in 1999) [ClassicSimilarity], result of:
            0.035629652 = score(doc=1999,freq=2.0), product of:
              0.080547854 = queryWeight, product of:
                1.6882738 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.014300123 = queryNorm
              0.44234142 = fieldWeight in 1999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
          0.16406183 = weight(abstract_txt:linked in 1999) [ClassicSimilarity], result of:
            0.16406183 = score(doc=1999,freq=2.0), product of:
              0.22293827 = queryWeight, product of:
                2.8087184 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.014300123 = queryNorm
              0.73590696 = fieldWeight in 1999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
          0.1591318 = weight(abstract_txt:automated in 1999) [ClassicSimilarity], result of:
            0.1591318 = score(doc=1999,freq=1.0), product of:
              0.30292875 = queryWeight, product of:
                3.7805545 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.014300123 = queryNorm
              0.525311 = fieldWeight in 1999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.09375 = fieldNorm(doc=1999)
        0.24 = coord(6/25)
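
The content-match scores combine several per-term weights. As a rough illustration, the following sketch recomputes the total for the first entry (doc 977), assuming the usual ClassicSimilarity structure: each term contributes queryWeight × fieldWeight, where queryWeight = boost × idf × queryNorm and fieldWeight = sqrt(freq) × idf × fieldNorm, and the sum is scaled by coord (matched terms / query terms). The variable and function names are illustrative; boosts, document frequencies, and term counts are copied from the explain output.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.014300123
FIELD_NORM = 0.0625      # fieldNorm reported for doc 977
QUERY_TERMS = 25         # denominator of coord(7/25)

# term: (boost, docFreq, freq) for doc 977, as listed in the explain tree
TERMS = {
    "subject":   (1.3180315, 2415, 2.0),
    "annif":     (1.6707458, 5, 3.0),
    "data":      (1.6882738, 4274, 2.0),
    "lcsh":      (2.1233497, 221, 2.0),
    "linked":    (2.8087184, 466, 1.0),
    "semi":      (3.3054035, 174, 1.0),
    "automated": (3.7805545, 442, 2.0),
}


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Per-term contribution: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(*values) for values in TERMS.values())
coord = len(TERMS) / QUERY_TERMS   # 7 of 25 query terms matched
total = raw_sum * coord

# Compare with 0.7626845 (sum) and 0.21355167 (final score) in the listing.
print(f"sum={raw_sum:.6f} coord={coord:.2f} total={total:.6f}")
```

The coord factor explains why a document matching a few rare terms strongly (such as "annif" or "bibframe") can still end up with a modest final score: only a fraction of the 25 query terms matched.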