Search (440 results, page 1 of 22)

  • Filter: year_i:[2020 TO 2030}
  1. Gabler, S.: Vergabe von DDC-Sachgruppen mittels eines Schlagwort-Thesaurus (2021) 0.15
    Abstract
     This thesis presents the construction of a thematically ordered thesaurus based on the subject headings of the Integrated Authority File (Gemeinsame Normdatei, GND), using the DDC notations contained in it. The DDC subject groups of the German National Library (DNB) form the top level of order in this thesaurus. The thesaurus is constructed in a rule-based manner, using Linked Data principles in a SPARQL processor. It serves the automated extraction of metadata from scholarly publications by means of a computational-linguistic extractor that processes digital full texts. The extractor identifies keywords by comparing character strings against the labels in the thesaurus, ranks the matches by their relevance in the text, and returns the assigned subject groups in ranked order. The underlying assumption is that the sought subject group is returned among the top ranks. The performance of the approach is validated in a three-stage procedure. First, on the basis of metadata and the findings of a brief physical inspection, a gold standard is compiled from documents retrievable in the DNB online catalog. The documents are distributed over 14 of the subject groups, with a batch size of 50 documents each. All documents are indexed with the extractor and the categorization results are documented. Finally, the resulting retrieval performance is assessed both for a hard (binary) categorization and for a ranked return of the subject groups.
    Content
     Master thesis, Master of Science (Library and Information Studies) (MSc), Universität Wien. Advisor: Christoph Steiner. Cf.: https://www.researchgate.net/publication/371680244_Vergabe_von_DDC-Sachgruppen_mittels_eines_Schlagwort-Thesaurus. DOI: 10.25365/thesis.70030. See also the accompanying presentation: https://wiki.dnb.de/download/attachments/252121510/DA3%20Workshop-Gabler.pdf?version=1&modificationDate=1671093170000&api=v2.
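     The rule-based construction described in the abstract can be pictured with a minimal, self-contained sketch: a SPARQL query over a toy SKOS graph pulls each subject term's label and DDC notation, and a simple rule files the term under a top-level subject group. The ex:ddc property and the grouping rule are illustrative stand-ins, not the actual GND vocabulary or the thesis's rule set.
     ```python
     from rdflib import Graph

     # Toy data standing in for GND subject terms; ex:ddc is an
     # illustrative property, not the actual GND vocabulary.
     TTL = """
     @prefix skos: <http://www.w3.org/2004/02/skos/core#> .
     @prefix ex:   <http://example.org/> .

     ex:t1 a skos:Concept ; skos:prefLabel "Bibliothekswesen"@de ; ex:ddc "020" .
     ex:t2 a skos:Concept ; skos:prefLabel "Thesaurus"@de ; ex:ddc "025.49" .
     """

     g = Graph()
     g.parse(data=TTL, format="turtle")

     # Rule-based grouping: the leading digits of the DDC notation decide
     # which top-level subject group a term is filed under (a crude
     # stand-in for the DNB subject-group mapping).
     q = """
     PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
     PREFIX ex:   <http://example.org/>
     SELECT ?label ?ddc WHERE {
       ?t a skos:Concept ; skos:prefLabel ?label ; ex:ddc ?ddc .
     }
     """
     for label, ddc in g.query(q):
         group = str(ddc)[:2] + "0"   # e.g. 025.49 -> group 020
         print(f"{label} -> DDC {ddc} -> group {group}")
     ```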
  2. Daquino, M.; Peroni, S.; Shotton, D.; Colavizza, G.; Ghavimi, B.; Lauscher, A.; Mayr, P.; Romanello, M.; Zumstein, P.: ¬The OpenCitations Data Model (2020) 0.13
    Abstract
     A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data supplier or application context. In this paper we present the OpenCitations Data Model (OCDM), a generic data model for describing bibliographic entities and citations, developed using Semantic Web technologies. We also evaluate the effective reusability of OCDM according to ontology evaluation practices, mention existing users of OCDM, and discuss the use and impact of OCDM in the wider open science community.
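     As a rough illustration of what describing citations as first-class entities looks like in OCDM's Semantic Web stack, the rdflib sketch below reifies a single citation with CiTO and FaBiO terms. The IRIs are invented placeholders and the term selection is a simplified reading of the model, not a complete or validated OCDM instance.
     ```python
     from rdflib import Graph, Literal, Namespace, URIRef
     from rdflib.namespace import RDF

     # SPAR namespaces used by OCDM; the mapping here is a sketch.
     CITO = Namespace("http://purl.org/spar/cito/")
     FABIO = Namespace("http://purl.org/spar/fabio/")
     DCT = Namespace("http://purl.org/dc/terms/")

     g = Graph()
     citing = URIRef("https://example.org/br/1")  # placeholder IRIs, not
     cited = URIRef("https://example.org/br/2")   # real OpenCitations ones
     ci = URIRef("https://example.org/ci/1-2")

     g.add((citing, RDF.type, FABIO.JournalArticle))
     g.add((citing, DCT.title, Literal("A citing article")))
     g.add((cited, RDF.type, FABIO.JournalArticle))

     # The citation itself is modeled as a first-class entity, as in OCDM.
     g.add((ci, RDF.type, CITO.Citation))
     g.add((ci, CITO.hasCitingEntity, citing))
     g.add((ci, CITO.hasCitedEntity, cited))

     print(g.serialize(format="turtle"))
     ```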
  3. Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.11
    Source
     https://arxiv.org/abs/2212.06721
  4. Zhu, Y.; Quan, L.; Chen, P.-Y.; Kim, M.C.; Che, C.: Predicting coauthorship using bibliographic network embedding (2023) 0.05
    Abstract
     Coauthorship prediction applies predictive analytics to bibliographic data to predict authors who are highly likely to be coauthors. In this study, we propose an approach for coauthorship prediction based on bibliographic network embedding through a graph-based bibliographic data model that can be used to model common bibliographic data, including papers, terms, sources, authors, departments, research interests, universities, and countries. A real-world dataset released by AMiner that includes more than 2 million papers, 8 million citations, and 1.7 million authors was integrated into a large bibliographic network using the proposed bibliographic data model. Translation-based methods were applied to the entities and relationships to generate their low-dimensional embeddings while preserving their connectivity information in the original bibliographic network. We applied machine learning algorithms to embeddings that represent the coauthorship relationships of the two authors and achieved high prediction results. The reference model, which combines a network embedding size of 100, the most basic translation-based method, and a gradient boosting method, achieved an F1 score of 0.9, and even higher scores are obtainable with different embedding sizes and more advanced embedding methods. Thus, the strengths of the proposed approach lie in its customizable components under a unified framework.
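     A minimal sketch of the prediction step: each candidate author pair is turned into one feature vector from the two authors' embeddings and scored with gradient boosting. The embeddings and labels below are synthetic stand-ins (the paper derives embeddings with translation-based methods over the AMiner network), and the Hadamard pair operator is one common choice, not necessarily the paper's.
     ```python
     import numpy as np
     from sklearn.ensemble import GradientBoostingClassifier
     from sklearn.metrics import f1_score
     from sklearn.model_selection import train_test_split

     rng = np.random.default_rng(0)

     # Stand-in author embeddings (dimension 100, as in the reference model).
     n_authors, dim = 500, 100
     emb = rng.normal(size=(n_authors, dim))

     # Candidate author pairs with synthetic coauthorship labels; real labels
     # come from observed coauthorships in the bibliographic network.
     pairs = rng.integers(0, n_authors, size=(2000, 2))
     y = rng.integers(0, 2, size=len(pairs))

     # One feature vector per pair: the Hadamard product of the embeddings.
     X = emb[pairs[:, 0]] * emb[pairs[:, 1]]

     X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
     clf = GradientBoostingClassifier().fit(X_tr, y_tr)
     print("F1:", f1_score(y_te, clf.predict(X_te)))
     ```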
  5. Fernanda de Jesus, A.; Ferreira de Castro, F.: Proposal for the publication of linked open bibliographic data (2024) 0.05
    Abstract
     Linked Open Data (LOD) are a set of principles for publishing structured, connected data available for reuse under an open license. The objective of this paper is to analyze the publication of bibliographic data as LOD, with the product being theoretical-methodological recommendations for publishing these data, in an approach based on the ten best practices for publishing LOD from the World Wide Web Consortium. The starting point was a Systematic Review of Literature, in which initiatives to publish bibliographic data as LOD were identified. An empirical study of these institutions was also conducted. As a result, theoretical-methodological recommendations were obtained for the process of publishing bibliographic data as LOD.
  6. Wu, S.: Implementing bibliographic enhancement data in academic library catalogs : an empirical study (2024) 0.04
    Abstract
    This study examines users' needs for bibliographic enhancement data (BIBED) in academic library catalogs. Qualitative data were collected through 30 academic users' activity logs and follow-up interviews. These 30 participants were recruited from a public university in the United States that has over 19,000 students enrolled and over 600 full-time faculty members. This study identified 19 types of BIBED useful for supporting the five user tasks proposed in the IFLA Library Reference Model and in seven other contexts, such as enhancing one's understanding, offering search instructions, and providing readers' advisory. Findings suggest that adopting BIBFRAME and Semantic Web technologies may enable academic library catalogs to provide BIBED to better meet user needs in various contexts.
  7. Zhao, D.; Strotmann, A.: Mapping knowledge domains on Wikipedia : an author bibliographic coupling analysis of traditional Chinese medicine (2022) 0.04
    Abstract
    Purpose Wikipedia has the lofty goal of compiling all human knowledge. The purpose of the present study is to map the structure of the Traditional Chinese Medicine (TCM) knowledge domain on Wikipedia, to identify patterns of knowledge representation on Wikipedia and to test the applicability of author bibliographic coupling analysis, an effective method for mapping knowledge domains represented in published scholarly documents, for Wikipedia data. Design/methodology/approach We adapted and followed the well-established procedures and techniques for author bibliographic coupling analysis (ABCA). Instead of bibliographic data from a citation database, we used all articles on TCM downloaded from the English version of Wikipedia as our dataset. An author bibliographic coupling network was calculated and then factor analyzed using SPSS. Factor analysis results were visualized. Factors were labeled upon manual examination of articles that authors who load primarily in each factor have significantly contributed references to. Clear factors were interpreted as topics. Findings Seven TCM topic areas are represented on Wikipedia, among which Acupuncture-related practices, Falun Gong and Herbal Medicine attracted the most significant contributors to TCM. Acupuncture and Qi Gong have the most connections to the TCM knowledge domain and also serve as bridges for other topics to connect to the domain. Herbal medicine is weakly linked to and non-herbal medicine is isolated from the rest of the TCM knowledge domain. It appears that specific topics are represented well on Wikipedia but their conceptual connections are not. ABCA is effective for mapping knowledge domains on Wikipedia but document-based bibliographic coupling analysis is not. Originality/value Given the prominent position of Wikipedia for both information users and for researchers on knowledge organization and information retrieval, it is important to study how well knowledge is represented and structured on Wikipedia. Such studies appear largely missing although studies from different perspectives both about Wikipedia and using Wikipedia as data are abundant. Author bibliographic coupling analysis is effective for mapping knowledge domains represented in published scholarly documents but has never been applied to mapping knowledge domains represented on Wikipedia.
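     The core ABCA computation is easy to state: two authors are coupled to the degree that they contributed the same references. A toy sketch, assuming the author-to-references mapping has already been extracted from the article histories:
     ```python
     from itertools import combinations

     # Invented author -> contributed-references mapping; in the study this
     # is extracted from English Wikipedia TCM articles.
     refs = {
         "author_a": {"r1", "r2", "r3"},
         "author_b": {"r2", "r3", "r4"},
         "author_c": {"r5"},
     }

     # Author bibliographic coupling strength = number of shared references.
     coupling = {
         (a, b): len(refs[a] & refs[b])
         for a, b in combinations(sorted(refs), 2)
     }
     for pair, strength in coupling.items():
         print(pair, strength)

     # The resulting coupling matrix is then factor-analyzed (SPSS in the
     # paper) to surface topic areas.
     ```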
  8. Dobreski, B.: Common usage as warrant in bibliographic description (2020) 0.04
    Abstract
    Purpose Within standards for bibliographic description, common usage has served as a prominent design principle, guiding the choice and form of certain names and titles. In practice, however, the determination of common usage is difficult and lends itself to varying interpretations. The purpose of this paper is to explore the presence and role of common usage in bibliographic description through an examination of previously unexplored connections between common usage and the concept of warrant. Design/methodology/approach A brief historical review of the concept of common usage was conducted, followed by a case study of the current bibliographic standard Resource Description and Access (RDA) employing qualitative content analysis to examine the appearances, delineations and functions of common usage. Findings were then compared to the existing literature on warrant in knowledge organization. Findings Multiple interpretations of common usage coexist within RDA and its predecessors, and the current prioritization of these interpretations tends to render user perspectives secondary to those of creators, scholars and publishers. These varying common usages and their overall reliance on concrete sources of evidence reveal a mixture of underlying warrants, with literary warrant playing a more prominent role in comparison to the also present scientific/philosophical, use and autonomous warrants. Originality/value This paper offers new understanding of the concept of common usage, and adds to the body of work examining warrant in knowledge organization practices beyond classification. It sheds light on the design of the influential standard RDA while revealing the implications of naming and labeling in widely shared bibliographic data.
  9. Morris, V.: Automated language identification of bibliographic resources (2020) 0.03
    Date
    2. 3.2020 19:04:22
  10. Dunsire, G.; Fritz, D.; Fritz, R.: Instructions, interfaces, and interoperable data : the RIMMF experience with RDA revisited (2020) 0.03
    Abstract
    This article presents a case study of RIMMF, a software tool developed to improve the orientation and training of catalogers who use Resource Description and Access (RDA) to maintain bibliographic data. The cataloging guidance and instructions of RDA are based on the Functional Requirements conceptual models that are now consolidated in the IFLA Library Reference Model, but many catalogers are applying RDA in systems that have evolved from inventory and text-processing applications developed from older metadata paradigms. The article describes how RIMMF interacts with the RDA Toolkit and RDA Registry to offer cataloger-friendly multilingual data input and editing interfaces.
  11. Samples, J.; Bigelow, I.: MARC to BIBFRAME : converting the PCC to Linked Data (2020) 0.03
    Abstract
    The Program for Cooperative Cataloging (PCC) has formal relationships with the Library of Congress (LC), Share-VDE, and Linked Data for Production Phase 2 (LD4P2) for work on Bibliographic Framework (BIBFRAME), and PCC institutions have been very active in the exploration of MARC to BIBFRAME conversion processes. This article will review the involvement of PCC in the development of BIBFRAME and examine the work of LC, Share-VDE, and LD4P2 on MARC to BIBFRAME conversion. It will conclude with a discussion of areas for further exploration by the PCC leading up to the creation of PCC conversion specifications and PCC BIBFRAME data.
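     A heavily simplified sketch of one conversion step, using pymarc and rdflib: read MARC records, mint a BIBFRAME Work IRI, and map field 245 to bf:title. The file path and IRI minting are illustrative assumptions; real PCC conversion specifications are far richer (bf:Title nodes, Instances, Items, relationships).
     ```python
     from pymarc import MARCReader
     from rdflib import Graph, Literal, Namespace, URIRef
     from rdflib.namespace import RDF

     BF = Namespace("http://id.loc.gov/ontologies/bibframe/")

     g = Graph()
     with open("records.mrc", "rb") as fh:  # illustrative input file
         for i, record in enumerate(MARCReader(fh)):
             work = URIRef(f"https://example.org/work/{i}")  # minted, not a PCC rule
             g.add((work, RDF.type, BF.Work))
             for f245 in record.get_fields("245"):
                 # Flattened title string; real converters build bf:Title nodes.
                 g.add((work, BF.title, Literal(f245.value())))

     print(g.serialize(format="turtle"))
     ```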
  12. Ahmed, M.; Mukhopadhyay, M.; Mukhopadhyay, P.: Automated knowledge organization : AI ML based subject indexing system for libraries (2023) 0.03
    Abstract
    The research study as reported here is an attempt to explore the possibilities of an AI/ML-based semi-automated indexing system in a library setup to handle large volumes of documents. It uses the Python virtual environment to install and configure an open source AI environment (named Annif) to feed the LOD (Linked Open Data) dataset of Library of Congress Subject Headings (LCSH) as a standard KOS (Knowledge Organisation System). The framework deployed the Turtle format of LCSH after cleaning the file with Skosify, applied an array of backend algorithms (namely TF-IDF, Omikuji, and NN-Ensemble) to measure relative performance, and selected Snowball as an analyser. The training of Annif was conducted with a large set of bibliographic records populated with subject descriptors (MARC tag 650$a) and indexed by trained LIS professionals. The training dataset is first treated with MarcEdit to export it in a format suitable for OpenRefine, and then in OpenRefine it undergoes many steps to produce a bibliographic record set suitable to train Annif. The framework, after training, has been tested with a bibliographic dataset to measure indexing efficiencies, and finally, the automated indexing framework is integrated with data wrangling software (OpenRefine) to produce suggested headings on a mass scale. The entire framework is based on open-source software, open datasets, and open standards.
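     The TF-IDF backend idea can be demonstrated outside Annif. The toy below (scikit-learn, not Annif's actual implementation) associates each subject heading with training text and ranks headings for a new document by cosine similarity; the headings and texts are invented examples.
     ```python
     from sklearn.feature_extraction.text import TfidfVectorizer
     from sklearn.metrics.pairwise import cosine_similarity

     # Invented stand-ins for LCSH headings and the training text that
     # Annif would learn from MARC 650$a-indexed records.
     subjects = ["Machine learning", "Library catalogs"]
     training_texts = [
         "neural networks training data models prediction features",
         "catalog records cataloging library bibliographic access",
     ]

     vec = TfidfVectorizer()
     subject_vectors = vec.fit_transform(training_texts)

     def suggest(text, top_n=2):
         """Rank subject headings by TF-IDF cosine similarity to the text."""
         sims = cosine_similarity(vec.transform([text]), subject_vectors)[0]
         return sorted(zip(subjects, sims), key=lambda p: -p[1])[:top_n]

     print(suggest("a study of bibliographic records in library catalogs"))
     ```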
  13. Yu, L.; Fan, Z.; Li, A.: ¬A hierarchical typology of scholarly information units : based on a deduction-verification study (2020) 0.03
    Abstract
     Purpose The purpose of this paper is to lay a theoretical foundation for identifying operational information units for library and information professional activities in the context of scholarly communication. Design/methodology/approach The study adopts a deduction-verification approach to formulate a typology of units for scholarly information. It first deduces possible units from an existing conceptualization of information, which defines information as the combined product of data and meaning, and then tests the usefulness of these units via two empirical investigations, one with a group of scholarly papers and the other with a sample of scholarly information users. Findings The results show that, on defining an information unit as a piece of information that is complete in both data and meaning, to such an extent that it remains meaningful to its target audience when retrieved and displayed independently in a database, it is then possible to formulate a hierarchical typology of units for scholarly information. The typology proposed in this study consists of three levels, which, in turn, consist of 1, 5, and 44 units, respectively. Research limitations/implications The result of this study has theoretical implications on both the philosophical and conceptual levels: on the philosophical level, it hinges on, and reinforces, the objective view of information; on the conceptual level, it challenges the conceptualization of work by IFLA's Functional Requirements for Bibliographic Records and Library Reference Model but endorses that by Library of Congress's BIBFRAME 2.0 model. Practical implications It calls for reconsideration of existing operational units in a variety of library and information activities. Originality/value The study strengthens the conceptual foundation of operational information units and brings to light the primacy of "one work" as an information unit and the possibility for it to be supplemented by smaller units.
    Date
    14. 1.2020 11:15:22
  14. Dattolo, A.; Corbatto, M.: Assisting researchers in bibliographic tasks : a new usable, real-time tool for analyzing bibliographies (2022) 0.03
    Abstract
     The amount of scientific papers is growing together with the development of science itself; but, although there is an unprecedented availability of large citation indexes, some daily activities of researchers remain time-consuming and poorly supported. In this paper, we present Visual Bibliographies (VisualBib), a real-time visual platform, designed using a zz-structure-based model for linking metadata and a narrative, visual approach for showing bibliographies. VisualBib represents a usable, advanced, and visual tool, which simplifies the management of bibliographies, supports a core set of bibliographic tasks, and helps researchers during complex analyses on scientific bibliographies. We present the variety of metadata formats and visualization methods, proposing two use case scenarios. The maturity of the system implementation allowed us to conduct two studies, evaluating both the effectiveness of VisualBib in providing answers to specific data analysis tasks and its ability to support experienced users during real-life use. The results of the evaluation are positive and describe an effective and usable platform.
  15. Serra, L.G.; Schneider, J.A.; Santarém Segundo, J.E.: Person identifiers in MARC 21 records in a semantic environment (2020) 0.03
    Abstract
     This article discusses how libraries can include person identifiers in the MARC format. It suggests using URIs in fields and subfields to help transition the data to an RDF model and to help prepare the catalog for a Linked Data environment. It analyzes the selection of URIs and Real-World Objects, and the use of tag 024 to describe person identifiers in authority records. When a creator or collaborator is identified in a work, the identifiers are transferred from the authority record to the bibliographic record. The article concludes that URI-based descriptions can provide a better experience for users, offering other methods of discovery.
  16. Organisciak, P.; Schmidt, B.M.; Downie, J.S.: Giving shape to large digital libraries through exploratory data analysis (2022) 0.03
    Abstract
    The emergence of large multi-institutional digital libraries has opened the door to aggregate-level examinations of the published word. Such large-scale analysis offers a new way to pursue traditional problems in the humanities and social sciences, using digital methods to ask routine questions of large corpora. However, inquiry into multiple centuries of books is constrained by the burdens of scale, where statistical inference is technically complex and limited by hurdles to access and flexibility. This work examines the role that exploratory data analysis and visualization tools may play in understanding large bibliographic datasets. We present one such tool, HathiTrust+Bookworm, which allows multifaceted exploration of the multimillion work HathiTrust Digital Library, and center it in the broader space of scholarly tools for exploratory data analysis.
    Theme
    Data Mining
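     The aggregate "shape" the abstract refers to can be conveyed with a few lines of exploratory pandas code; the rows below are invented stand-ins for HathiTrust volume metadata.
     ```python
     import pandas as pd

     # Toy volume metadata; HathiTrust+Bookworm performs this kind of
     # faceted aggregation over millions of volumes.
     df = pd.DataFrame({
         "year": [1901, 1925, 1925, 1987, 2003],
         "language": ["eng", "ger", "eng", "eng", "por"],
     })

     # Volumes per decade and language: a basic shape of the corpus.
     df["decade"] = (df["year"] // 10) * 10
     print(df.groupby(["decade", "language"]).size().unstack(fill_value=0))
     ```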
  17. Frey, J.; Streitmatter, D.; Götz, F.; Hellmann, S.; Arndt, N.: DBpedia Archivo (2020) 0.03
    Content
     # Community action on individual ontologies We would like to call on all ontology maintainers and consumers to help us increase the average star rating of the web of ontologies by fixing and improving its ontologies. You can easily check an ontology at https://archivo.dbpedia.org/info. If you are an ontology maintainer, just release a patched version - Archivo will automatically pick it up 8 hours later. If you are a user of an ontology and want your consumed data to become FAIRer, please inform the ontology maintainer about the issues found with Archivo. The star rating is very basic and only requires fixing small things. However, the impact on technical and legal usability can be immense.
     # How does Archivo work? Each week Archivo runs several discovery algorithms to scan for new ontologies. Once discovered, Archivo checks them every 8 hours. When changes are detected, Archivo downloads, rates, and archives the latest snapshot persistently on the DBpedia Databus. # Archivo's mission Archivo's mission is to improve the FAIRness (findability, accessibility, interoperability, and reusability) of all available ontologies on the Semantic Web. Archivo is not a guideline; it is fully automated, machine-readable, and enforces interoperability with its star rating. - Ontology developers can implement against Archivo until they reach more stars. The stars and tests are designed to guarantee the interoperability and fitness of the ontology. - Ontology users can better find, access and re-use ontologies. Snapshots are persisted in case the original is not reachable anymore, adding a layer of reliability to the decentral web of ontologies.
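     A minimal way to script the ontology check mentioned above. Passing the ontology IRI via an `o` query parameter is an assumption inferred from the info page's URL pattern, so verify it against the live service before relying on it:
     ```python
     import requests

     # The "o" parameter is assumed, not confirmed here; check the live API.
     resp = requests.get(
         "https://archivo.dbpedia.org/info",
         params={"o": "http://datashapes.org/dash"},  # illustrative ontology IRI
         timeout=30,
     )
     print(resp.status_code, resp.url)
     ```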
  18. Guerrini, M.: Metadata: the dimension of cataloging in the digital age (2022) 0.03
    Abstract
     Metadata creation is the process of recording metadata, that is, data essential to the identification and retrieval of any type of resource, including bibliographic resources. Metadata capable of identifying characteristics of an entity have always existed. However, the triggering event that has rewritten and enhanced their value is the digital revolution. Cataloging is configured as an action of creating metadata. While cataloging produces a catalog, that is, a list of records relating to various types of resources, ordered and searchable according to a defined criterion, the metadata process produces the metadata of the resources.
  19. Candela, G.: ¬An automatic data quality approach to assess semantic data from cultural heritage institutions (2023) 0.03
    Abstract
     In recent years, cultural heritage institutions have been exploring the benefits of applying Linked Open Data to their catalogs and digital materials. Innovative and creative methods have emerged to publish and reuse digital contents to promote computational access, such as the concepts of Labs and Collections as Data. Data quality has become a requirement for researchers and for training methods based on artificial intelligence and machine learning. This article explores how the quality of Linked Open Data made available by cultural heritage institutions can be automatically assessed. The results obtained can be useful for other institutions that wish to publish and assess their collections.
    Date
    22. 6.2023 18:23:31
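     One automatable check of the kind the abstract describes is property completeness. A minimal rdflib sketch over an invented two-record sample; a real assessment would run such metrics against an institution's published Linked Open Data:
     ```python
     from rdflib import Graph
     from rdflib.namespace import DCTERMS, RDF

     # Invented catalog sample with one incomplete record.
     TTL = """
     @prefix dct: <http://purl.org/dc/terms/> .
     @prefix ex:  <http://example.org/> .
     ex:b1 a ex:Book ; dct:title "Title 1" ; dct:creator "Author A" .
     ex:b2 a ex:Book ; dct:title "Title 2" .
     """
     g = Graph()
     g.parse(data=TTL, format="turtle")

     books = set(g.subjects(RDF.type, None))
     with_creator = {s for s in books if (s, DCTERMS.creator, None) in g}

     # Completeness of dct:creator across the sample.
     print(f"creator completeness: {len(with_creator) / len(books):.0%}")
     ```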
  20. Rockelle Strader, C.: Cataloging to support information literacy : the IFLA Library Reference Model's user tasks in the context of the Framework for Information Literacy for Higher Education (2021) 0.02
    Abstract
    Cataloging practices, as exemplified by the five user tasks of the IFLA Library Reference Model, can support information literacy practices. The six frames of the Framework for Information Literacy for Higher Education are used as lenses to examine the user tasks. Two themes emerge from this examination: context matters, and catalogers must tailor bibliographic descriptions to meet users' expectations and information needs. Catalogers need to solicit feedback from various user communities to reform cataloging practices to remain current and viable. Such conversations will enrich the catalog and enhance (reclaim?) its position as a primary tool for research and learning. Supplemental data for this article is available online at https://doi.org/10.1080/01639374.2021.1939828.

Languages

  • e 356
  • d 83
  • pt 2
  • m 1

Types

  • a 402
  • el 79
  • m 11
  • p 5
  • s 3
  • x 2