Literatur zur Informationserschließung (Literature on Information Indexing)
This database contains more than 40,000 documents on topics in descriptive cataloguing, subject indexing, and information retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft
Powered by litecat, BIS Oldenburg
(As of 28 April 2022)
Search results
Results 1–3 of 3
-
1. Iorio, A.D. ; Peroni, S. ; Poggi, F. ; Vitali, F.: Dealing with structural patterns of XML documents.
In: Journal of the Association for Information Science and Technology. 65(2014) no.9, pp. 1884-1900.
Abstract: Evaluating collections of XML documents without paying attention to the schema they were written in may give interesting insights into the expected characteristics of a markup language, as well as any regularity that may span vocabularies and languages, and that are more fundamental and frequent than plain content models. In this paper we explore the idea of structural patterns in XML vocabularies, by examining the characteristics of elements as they are used, rather than as they are defined. We introduce from the ground up a formal theory of 8 plus 3 structural patterns for XML elements, and verify their identifiability in a number of different XML vocabularies. The results allowed the creation of visualization and content extraction tools that are completely independent of the schema and without any previous knowledge of the semantics and organization of the XML vocabulary of the documents.
Object: XML
-
2. Iorio, A. di ; Peroni, S. ; Vitali, F.: A Semantic Web approach to everyday overlapping markup.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.9, pp. 1696-1716.
Abstract: Overlapping structures in XML are not symptoms of a misunderstanding of the intrinsic characteristics of a text document nor evidence of extreme scholarly requirements far beyond those needed by the most common XML-based applications. On the contrary, overlaps have started to appear in a large number of incredibly popular applications hidden under the guise of syntactical tricks to the basic hierarchy of the XML data format. Unfortunately, syntactical tricks have the drawback that the affected structures require complicated workarounds to support even the simplest query or usage. In this article, we present Extremely Annotational Resource Description Framework (RDF) Markup (EARMARK), an approach to overlapping markup that simplifies and streamlines the management of multiple hierarchies on the same content, and provides an approach to sophisticated queries and usages over such structures without the need of ad-hoc applications, simply by using Semantic Web tools and languages. We compare how relevant tasks (e.g., the identification of the contribution of an author in a word processor document) are of some substantial complexity when using the original data format and become more or less trivial when using EARMARK. We finally evaluate positively the memory and disk requirements of EARMARK documents in comparison to Open Office and Microsoft Word XML-based formats.
Subject area: Semantic Web ; Knowledge representation
Object: RDF ; EARMARK
-
3. Oliveira Lima, J.A. de ; Palmirani, M. ; Vitali, F.: A time-aware ontology for legal resources.
In: Culture and identity in knowledge organization: Proceedings of the Tenth International ISKO Conference 5-8 August 2008, Montreal, Canada. Ed. by Clément Arsenault and Joseph T. Tennis. Würzburg : Ergon Verlag, 2008. pp. 56-62.
(Advances in knowledge organization; vol.11)
Content: This paper presents a new approach to associating metadata with legal documents by exploiting a fully developed information ontology of legal resources that takes time into account. Our information ontology is technically an application of the FRBR model to the organization of legal documents. Our contribution not only clarifies the diachronic evolution of legal resources over time, but also lays the theoretical groundwork for modeling the relationships between the different entities participating in the legislative process workflow (e.g. bills, amendments). Our model is also applicable to all artefacts of the publishing process. Moreover, the time dimension can be used to support successful interconnections between different legal resources (e.g. between normative acts and case-law) that need precise point-in-time referencing.
Note: See http://www.ergon-verlag.de/isko_ko/tocs/0497f79b0c0b3ed06/0497f79b0c0b5550a/index.php.
Discipline: Law
Object: FRBR