Literature on Information Indexing (Literatur zur Informationserschließung)
This database contains more than 40,000 documents on topics from the fields of descriptive cataloguing, subject indexing, and information retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft
Powered by litecat, BIS Oldenburg
(As of: 28 April 2022)
Search results
Results 1–4 of 4
-
1. Martín-Moncunill, D. ; García-Barriocanal, E. ; Sicilia, M.-A. ; Sánchez-Alonso, S.: Evaluating the practical applicability of thesaurus-based keyphrase extraction in the agricultural domain : insights from the VOA3R project.
In: Knowledge organization. 42(2015) no.2, pp.76-89.
Abstract: The use of Knowledge Organization Systems (KOSs) in aggregated metadata collections facilitates the implementation of search mechanisms operating on the same term or keyphrase space, thus preparing the ground for improved browsing, more accurate retrieval and better user profiling. Automatic thesaurus-based keyphrase extraction appears to be an inexpensive tool to obtain this information, but the studies on its effectiveness are scattered and do not consider the practical applicability of these techniques compared to the quality obtained by involving human experts. This paper presents an evaluation of keyphrase extraction using the KEA software and the AGROVOC vocabulary on a sample of a large collection of metadata in the field of agriculture from the AGRIS database. This effort includes a double evaluation, the classical automatic evaluation based on precision and recall measures, plus a blind evaluation aimed to contrast the quality of the keyphrases extracted against expert-provided samples and against the keyphrases originally recorded in the metadata. Results show not only that KEA outperforms humans in matching the original keyphrases, but also that the quality of the keyphrases extracted was similar to those provided by humans.
Content: Cf. http://www.ergon-verlag.de/isko_ko/downloads/ko_42_2015_2_b.pdf.
Subject area: Design and application of the thesaurus principle
Discipline: Agricultural science
-
2. Rousidis, D. ; Garoufallou, E. ; Balatsoukas, P. ; Sicilia, M.-A.: Evaluation of metadata in research data repositories : the case of the DC.Subject Element.
In: Metadata and semantics research: 9th Research Conference, MTSR 2015, Manchester, UK, September 9-11, 2015, Proceedings. Eds.: E. Garoufallou et al. Cham : Springer, 2015. pp.203-213.
(Communications in computer and information science; 544)
Abstract: Research Data repositories are growing in terms of volume rapidly and exponentially. Their main goal is to provide scientists the essential mechanism to store, share, and re-use datasets generated at various stages of the research process. Despite the fact that metadata play an important role for research data management in the context of these repositories, several factors - such as the big volume of data and its complex lifecycles, as well as operational constraints related to financial resources and human factors - may impede the effectiveness of several metadata elements. The aim of the research reported in this paper was to perform a descriptive analysis of the DC.Subject metadata element and to identify its data quality problems in the context of the Dryad research data repository. In order to address this aim a total of 4,557 packages and 13,638 data files were analysed following a data-preprocessing method. The findings showed emerging trends about the subject coverage of the repository (e.g. the most popular subjects and the authors that contributed the most for these subjects). Also, quality problems related to the lack of controlled vocabulary and standardisation were very common. This study has implications for the evaluation of metadata and the improvement of the quality of the research data annotation process.
Subject area: Metadata
Object: Dublin Core
-
3. Rajabi, E. ; Sanchez-Alonso, S. ; Sicilia, M.-A.: Analyzing broken links on the web of data : An experiment with DBpedia.
In: Journal of the Association for Information Science and Technology. 65(2014) no.8, pp.1721-1727.
(Brief communication)
Abstract: Linked open data allow interlinking and integrating any kind of data on the web. Links between various data sources play a key role insofar as they allow software applications (e.g., browsers, search engines) to operate over the aggregated data space as if it was a unique local database. In this new data space, where DBpedia, a data set including structured information from Wikipedia, seems to be the central hub, we analyzed and highlighted outgoing links from this hub in an effort to discover broken links. The paper reports on an experiment to examine the causes of broken links and proposes some treatments for solving this problem.
Subject area: Semantic Web
Object: DBpedia
-
4. Rajabi, E. ; Sicilia, M.-A. ; Sanchez-Alonso, S.: Research objects interlinking : the case of Dryad repository.
In: Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al. Cham : Springer, 2014. pp.14-21.
(Communications in computer and information science; 478)
Abstract: Interlinking research objects using the RDF links facilitates sharing and data discovery on the Web of Data. This works toward enriching the research repositories by linking their research artifacts to various scientific or even general data on the Web. In this paper, we experiment on an interlinking approach over Dryad, a research object repository, to a digital library dataset in the Linked Open Data cloud. We fetch data from both targets in different steps, run an interlinking tool and report as well as analyze the results. The generated outputs and assessed matched links show that interlinking a research dataset like Dryad to Web of Data brings an added value to the repository, as it connects its research artefacts to scientific objects of other datasets.