Search (14 results, page 1 of 1)

  • × theme_ss:"Formalerschließung"
  • × year_i:[2020 TO 2030}
  1. Zhang, L.; Lu, W.; Yang, J.: LAGOS-AND : a large gold standard dataset for scholarly author name disambiguation (2023) 0.03
    0.026603904 = product of:
      0.053207807 = sum of:
        0.053207807 = sum of:
          0.02216725 = weight(_text_:m in 883) [ClassicSimilarity], result of:
            0.02216725 = score(doc=883,freq=4.0), product of:
              0.114023164 = queryWeight, product of:
                2.4884486 = idf(docFreq=9980, maxDocs=44218)
                0.045820985 = queryNorm
              0.19441006 = fieldWeight in 883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4884486 = idf(docFreq=9980, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
          0.031040555 = weight(_text_:22 in 883) [ClassicSimilarity], result of:
            0.031040555 = score(doc=883,freq=2.0), product of:
              0.16045728 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.045820985 = queryNorm
              0.19345059 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
      0.5 = coord(1/2)
    
    Abstract
    In this article, we present a method to automatically build large labeled datasets for the author ambiguity problem in the academic world by leveraging the authoritative academic resources, ORCID and DOI. Using the method, we built LAGOS-AND, two large, gold-standard sub-datasets for author name disambiguation (AND), of which LAGOS-AND-BLOCK is created for clustering-based AND research and LAGOS-AND-PAIRWISE is created for classification-based AND research. Our LAGOS-AND datasets are substantially different from the existing ones. The initial versions of the datasets (v1.0, released in February 2021) include 7.5 M citations authored by 798 K unique authors (LAGOS-AND-BLOCK) and close to 1 M instances (LAGOS-AND-PAIRWISE). And both datasets show close similarities to the whole Microsoft Academic Graph (MAG) across validations of six facets. In building the datasets, we reveal the variation degrees of last names in three literature databases, PubMed, MAG, and Semantic Scholar, by comparing author names hosted to the authors' official last names shown on the ORCID pages. Furthermore, we evaluate several baseline disambiguation methods as well as the MAG's author IDs system on our datasets, and the evaluation helps identify several interesting findings. We hope the datasets and findings will bring new insights for future studies. The code and datasets are publicly available.
    Date
    22. 1.2023 18:40:36
  2. Morris, V.: Automated language identification of bibliographic resources (2020) 0.01
    0.012416222 = product of:
      0.024832444 = sum of:
        0.024832444 = product of:
          0.04966489 = sum of:
            0.04966489 = weight(_text_:22 in 5749) [ClassicSimilarity], result of:
              0.04966489 = score(doc=5749,freq=2.0), product of:
                0.16045728 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045820985 = queryNorm
                0.30952093 = fieldWeight in 5749, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5749)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    2. 3.2020 19:04:22
  3. Das, S.; Paik, J.H.: Gender tagging of named entities using retrieval-assisted multi-context aggregation : an unsupervised approach (2023) 0.01
    0.009312166 = product of:
      0.018624332 = sum of:
        0.018624332 = product of:
          0.037248664 = sum of:
            0.037248664 = weight(_text_:22 in 941) [ClassicSimilarity], result of:
              0.037248664 = score(doc=941,freq=2.0), product of:
                0.16045728 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045820985 = queryNorm
                0.23214069 = fieldWeight in 941, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=941)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2023 12:00:14
  4. Kim, J.(im); Kim, J.(enna): Effect of forename string on author name disambiguation (2020) 0.01
    0.0077601387 = product of:
      0.015520277 = sum of:
        0.015520277 = product of:
          0.031040555 = sum of:
            0.031040555 = weight(_text_:22 in 5930) [ClassicSimilarity], result of:
              0.031040555 = score(doc=5930,freq=2.0), product of:
                0.16045728 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045820985 = queryNorm
                0.19345059 = fieldWeight in 5930, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5930)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    11. 7.2020 13:22:58
  5. Pauman Budanovic, M.; Zumer, M.: Prototype cataloging interface based on the IFLA Library Reference Model (LRM). Part 1 : conceptual design (2021) 0.01
    0.007758537 = product of:
      0.015517074 = sum of:
        0.015517074 = product of:
          0.031034147 = sum of:
            0.031034147 = weight(_text_:m in 700) [ClassicSimilarity], result of:
              0.031034147 = score(doc=700,freq=4.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.27217406 = fieldWeight in 700, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=700)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  6. Pauman Budanovic, M.; Zumer, M.: Prototype cataloging interface based on the IFLA Library Reference Model (LRM). Part 2 : usability evaluation (2021) 0.01
    0.007758537 = product of:
      0.015517074 = sum of:
        0.015517074 = product of:
          0.031034147 = sum of:
            0.031034147 = weight(_text_:m in 714) [ClassicSimilarity], result of:
              0.031034147 = score(doc=714,freq=4.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.27217406 = fieldWeight in 714, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=714)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  7. Preminger, M.; Rype, I.; Ådland, M.K.; Massey, D.; Tallerås, K.: ¬The public library metadata landscape : the case of Norway 2017-2018 (2020) 0.01
    0.005486114 = product of:
      0.010972228 = sum of:
        0.010972228 = product of:
          0.021944456 = sum of:
            0.021944456 = weight(_text_:m in 5802) [ClassicSimilarity], result of:
              0.021944456 = score(doc=5802,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.19245613 = fieldWeight in 5802, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5802)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  8. Sfakakis, M.; Zapounidou, S.; Papatheodorou, C.: Mapping derivative relationships from BIBFRAME 2.0 to RDA (2020) 0.01
    0.005486114 = product of:
      0.010972228 = sum of:
        0.010972228 = product of:
          0.021944456 = sum of:
            0.021944456 = weight(_text_:m in 294) [ClassicSimilarity], result of:
              0.021944456 = score(doc=294,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.19245613 = fieldWeight in 294, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=294)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  9. Aalberg, T.; O'Neill, E.; Zumer, M.: Extending the LRM Model to integrating resources (2021) 0.01
    0.005486114 = product of:
      0.010972228 = sum of:
        0.010972228 = product of:
          0.021944456 = sum of:
            0.021944456 = weight(_text_:m in 295) [ClassicSimilarity], result of:
              0.021944456 = score(doc=295,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.19245613 = fieldWeight in 295, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=295)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  10. Fisher, M.; Rafferty, P.: Current issues with cataloging printed music : challenges facing staff and systems (2024) 0.01
    0.005486114 = product of:
      0.010972228 = sum of:
        0.010972228 = product of:
          0.021944456 = sum of:
            0.021944456 = weight(_text_:m in 1151) [ClassicSimilarity], result of:
              0.021944456 = score(doc=1151,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.19245613 = fieldWeight in 1151, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1151)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  11. Alker-Windbichler, S.; Bauer, K.-H.; Bruckner, W.; Cerny, N.; Kiegler-Griensteidl, M.; Labner, J.: Exemplarspezifische Erschließung im Österreichischen Bibliothekenverbund : Ergebnisse einer Arbeitsgruppe der Zentralen Redaktion (2022) 0.00
    0.0047023837 = product of:
      0.009404767 = sum of:
        0.009404767 = product of:
          0.018809535 = sum of:
            0.018809535 = weight(_text_:m in 485) [ClassicSimilarity], result of:
              0.018809535 = score(doc=485,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.1649624 = fieldWeight in 485, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=485)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  12. Menzel, S.; Schnaitter, H.; Zinck, J.; Petras, V.; Neudecker, C.; Labusch, K.; Leitner, E.; Rehm, G.: Named Entity Linking mit Wikidata und GND : das Potenzial handkuratierter und strukturierter Datenquellen für die semantische Anreicherung von Volltexten (2021) 0.00
    0.003918653 = product of:
      0.007837306 = sum of:
        0.007837306 = product of:
          0.015674612 = sum of:
            0.015674612 = weight(_text_:m in 373) [ClassicSimilarity], result of:
              0.015674612 = score(doc=373,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.13746867 = fieldWeight in 373, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=373)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Qualität in der Inhaltserschließung. Hrsg.: M. Franke-Maier, u.a
  13. Oliver, C: Introducing RDA : a guide to the basics after 3R (2021) 0.00
    0.003918653 = product of:
      0.007837306 = sum of:
        0.007837306 = product of:
          0.015674612 = sum of:
            0.015674612 = weight(_text_:m in 716) [ClassicSimilarity], result of:
              0.015674612 = score(doc=716,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.13746867 = fieldWeight in 716, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=716)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    m
  14. ¬The library's guide to graphic novels (2020) 0.00
    0.002743057 = product of:
      0.005486114 = sum of:
        0.005486114 = product of:
          0.010972228 = sum of:
            0.010972228 = weight(_text_:m in 717) [ClassicSimilarity], result of:
              0.010972228 = score(doc=717,freq=2.0), product of:
                0.114023164 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.045820985 = queryNorm
                0.09622806 = fieldWeight in 717, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=717)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    m