This database contains more than 40,000 documents on topics in descriptive cataloguing, subject indexing, and information retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (as of 11 November 2018)
1. Lu, K. ; Mao, J. ; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, pp. 121-133.
Abstract: Subject indexing plays an important role in supporting subject access to information resources. Current subject indexing systems do not make adequate distinctions on the importance of assigned subject descriptors. Assigning numeric weights to subject descriptors to distinguish their importance to the documents can strengthen the role of subject metadata. Automated methods are more cost-effective. This study compares different automated weighting methods in different environments. Two evaluation methods were used to assess the performance. Experiments on three datasets in the biomedical domain suggest the performance of different weighting methods depends on whether it is an abstract or full text environment. Mutual information with bag-of-words representation shows the best average performance in the full text environment, while cosine with bag-of-words representation is the best in an abstract environment. The cosine measure has relatively consistent and robust performance. A direct weighting method, IDF (Inverse Document Frequency), can produce quick and reasonable estimates of the weights. Bag-of-words representation generally outperforms the concept-based representation. Further improvement in performance can be obtained by using the learning-to-rank method to integrate different weighting methods. This study follows up Lu and Mao (Journal of the Association for Information Science and Technology, 66, 1776-1784, 2015), in which an automated weighted subject indexing method was proposed and validated. The findings from this study contribute to more effective weighted subject indexing.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23912/full.
Note: Cf. the erratum in JASIST 69(2018) no.7, p. 956.
Subject area: Automatic indexing ; Indexing studies
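The abstract of record 1 names IDF (Inverse Document Frequency) as a direct way to estimate descriptor weights. The following is a minimal sketch of that idea, assuming the common formulation log(N / df); the abstract does not state the exact formula the authors used, and the function name `idf_weights` and the toy collection are hypothetical.

```python
import math


def idf_weights(doc_descriptors):
    """Return log(N / df) per descriptor, where df is the number of
    documents the descriptor is assigned to (assumed textbook formula)."""
    n_docs = len(doc_descriptors)
    df = {}
    for descriptors in doc_descriptors:
        # Count each descriptor once per document.
        for descriptor in set(descriptors):
            df[descriptor] = df.get(descriptor, 0) + 1
    return {d: math.log(n_docs / count) for d, count in df.items()}


# Hypothetical toy collection: one list of assigned descriptors per document.
docs = [
    ["Humans", "Neoplasms"],
    ["Humans", "Algorithms"],
    ["Humans", "Neoplasms", "Mutation"],
]
weights = idf_weights(docs)
```

Under this formula, a descriptor assigned to every document gets weight 0, and rarer descriptors get larger weights, which is one way to distinguish how specific a descriptor is within a collection.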
2. Xu, Y. ; Li, G. ; Mou, L. ; Lu, Y.: Learning non-taxonomic relations on demand for ontology extension.
In: International journal of software engineering and knowledge engineering. 24(2014) no.8, pp. 1159-1175.
Abstract: Learning non-taxonomic relations becomes an important research topic in ontology extension. Most of the existing learning approaches are mainly based on expert crafted corpora. These approaches are normally domain-specific and the corpora acquisition is laborious and costly. On the other hand, based on the static corpora, it is not able to meet personalized needs of semantic relations discovery for various taxonomies. In this paper, we propose a novel approach for learning non-taxonomic relations on demand. For any supplied taxonomy, it can focus on the segment of the taxonomy and collect information dynamically about the taxonomic concepts by using Wikipedia as a learning source. Based on the newly generated corpus, non-taxonomic relations are acquired through three steps: a) semantic relatedness detection; b) relations extraction between concepts; and c) relations generalization within a hierarchy. The proposed approach is evaluated on three different predefined taxonomies and the experimental results show that it is effective in capturing non-taxonomic relations as needed and has good potential for the ontology extension on demand.
Content: Cf.: http://www.worldscientific.com/doi/abs/10.1142/S0218194014400099.
3. Mazzocchi, F. ; Fedeli, G.C.: Introduction to the special issue: 'Paradigms of Knowledge and its Organization: The Tree, the Net and Beyond'.
In: Knowledge organization. 40(2013) no.6, pp. 363-365.
Content: Contribution to the special issue 'Paradigms of Knowledge and its Organization: The Tree, the Net and Beyond,' edited by Fulvio Mazzocchi and Gian Carlo Fedeli. - Cf.: http://www.ergon-verlag.de/isko_ko/downloads/ko_40_2013_6_a.pdf.
4. Fiorelli, G.: Hummingbird unleashed.
Abstract: Sometimes I think that us SEOs could be wonderful characters for a Woody Allen movie: We are stressed, nervous, paranoid, we have a tendency for sudden changes of mood...okay, maybe I am exaggerating a little bit, but that's how we tend to (over)react whenever Google announces something. One thing that doesn't help is the lack of clarity coming from Google, which not only never mentions Hummingbird in any official document (for example, in the post of its 15th anniversary), but has also shied away from details of this epochal update in the "off-the-record" declarations of Amit Singhal. In fact, in some ways those statements partly contributed to the confusion. When Google announces an update, especially one like Hummingbird, the best thing to do is to avoid trying to immediately understand what it really is based on intuition alone. It is better to wait until the dust falls to the ground, recover the original documents, examine those related to them (and any variants), take the time to see the update in action, calmly investigate, and then after all that try to find the most plausible answers.
Object: Google Hummingbird
5. Gnoli, C. ; Pullman, T. ; Cousson, P. ; Merli, G. ; Szostak, R.: Representing the structural elements of a freely faceted classification.
In: Classification and ontology: formal approaches and access to knowledge: proceedings of the International UDC Seminar, 19-20 September 2011, The Hague, The Netherlands. Eds.: A. Slavic and E. Civallero. Würzburg : Ergon Verlag, 2011. pp. 193-205.
Abstract: Freely faceted classifications allow for free combination of concepts across all knowledge domains, and for sorting of the resulting compound classmarks. Starting from work by the Classification Research Group, the Integrative Levels Classification (ILC) project has produced a first edition of a general freely faceted scheme. The system is managed as a MySQL database, and can be browsed through a Web interface. The ILC database structure provides a case for identifying and representing the structural elements of any freely faceted classification. These belong to both the notational and the verbal planes. Notational elements include: arrays, chains, deictics, facets, foci, place of definition of foci, examples of combinations, subclasses of a faceted class, groupings, related classes; verbal elements include: main caption, synonyms, descriptions, included terms, related terms, notes. Encoding of some of these elements in an international mark-up format like SKOS can be problematic, especially as this does not provide for faceted structures, although approximate SKOS equivalents are identified for most of them.
Subject area: Universal faceted classifications ; Knowledge representation
6. Gnoli, C. ; Merli, G. ; Pavan, G. ; Bernuzzi, E. ; Priano, M.: Freely faceted classification for a Web-based bibliographic archive : the BioAcoustic Reference Database.
In: Wissensspeicher in digitalen Räumen: Nachhaltigkeit - Verfügbarkeit - semantische Interoperabilität. Proceedings der 11. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Konstanz, 20. bis 22. Februar 2008. Eds.: J. Sieglerschmidt and H.P. Ohly. Würzburg : Ergon Verlag, 2010. pp. 124-134.
(Fortschritte in der Wissensorganisation; vol. 11)
Abstract: The Integrative Level Classification (ILC) research project is experimenting with a knowledge organization system based on phenomena rather than disciplines. Each phenomenon has a constant notation, which can be combined with that of any other phenomenon in a freely faceted structure. Citation order can express differential focality of the facets. Very specific subjects can have long classmarks, although their complexity is reduced by various devices. Freely faceted classification is being tested by indexing a corpus of about 3300 papers in the interdisciplinary domain of bioacoustics. The subjects of these papers often include phenomena from a wide variety of integrative levels (mechanical waves, animals, behaviour, vessels, fishing, law, ...) as well as information about the methods of study, as predicted in the León Manifesto. The archive is recorded in a MySQL database, and can be fed and searched through PHP Web interfaces. Indexer's work is made easier by mechanisms that suggest possible classes on the basis of matching title words with terms in the ILC schedules, and synthesize automatically the verbal caption corresponding to the classmark being edited. Users can search the archive by selecting and combining values in each facet. Search refinement should be improved, especially for the cases where no record, or too many records, match the faceted query. However, experience is being gained progressively, showing that freely faceted classification by phenomena, theories, and methods is feasible and successfully working.
Subject area: Universal faceted classifications
Object: BioAcoustic Reference Database
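The abstract of record 6 mentions a mechanism that suggests possible classes by matching title words with terms in the ILC schedules. A rough sketch of such matching follows, assuming simple case-insensitive comparison of single words against single-word captions; the function name, the notations (`x1`, `x2`, `x3`), and the tiny schedule are invented for illustration and are not actual ILC data or the archive's PHP implementation.

```python
def suggest_classes(title, schedule):
    """Return notations whose caption appears as a word in the title.

    `schedule` maps a notation to a single-word caption (a simplification;
    real schedules also carry multi-word captions and synonyms).
    """
    # Normalise title words: strip common punctuation and lowercase.
    words = {w.strip(".,:;()").lower() for w in title.split()}
    return sorted(
        notation
        for notation, caption in schedule.items()
        if caption.lower() in words
    )


# Hypothetical schedule excerpt with made-up notations.
schedule = {"x1": "waves", "x2": "animals", "x3": "fishing"}

suggestions = suggest_classes("Effects of fishing noise on animals", schedule)
# -> ['x2', 'x3']
```

An indexer would still confirm or reject each suggestion; the sketch only narrows the candidate classes.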
7. Park, J.-r. ; Li, G. ; Burger, A.: Opening and closing rituals of the virtual reference service of the Internet Public Library.
In: Journal of documentation. 66(2010) no.6, pp. 807-823.
Abstract: Purpose - The purpose of this paper is to explore the communicative rituals of opening and closing manifested in e-mail-based Internet Public Library's (IPL's) online reference interaction. Design/methodology/approach - A total of 400 transcripts comprising user queries and responses by IPL librarians are examined. The opening and closing elements are identified to examine the way in which IPL librarians and users construct social space; that is, communicate their interpersonal and affective stances during the course of seeking and offering information. Findings - The results of data analysis show regular patterns of verbal and structural politeness indicators of opening and closing e-mail discourse. Linguistic elements such as greetings and acknowledgement are included in all the sampled transcripts; i.e. a 100 percent occurrence. Closing rituals have a 95 percent occurrence of linguistic features such as acknowledgement and invitation for follow-up. In contrast, there is a low occurrence of personalized openings through greeting by user name (26 percent). This lack of personalization also occurs in closings: personalized farewell through use of librarian name appears in only 8 percent of closings. Research limitations/implications - The employment of the various politeness tactics in opening and closing reflects the librarian's attention and concern to user's information needs, interests and wants. Such communicative competence narrows social distance and brings forth close socio-interpersonal space for interaction; this may, in turn, improve the overall quality of reference service. Research findings also indicate that more use of personal names may decrease the social distance between the librarian and user, resulting in increased solidarity and proximity. 
Originality/value - The study provides new insights into linguistic politeness and the functions of address forms such as personal names with a view toward developing effective opening and closing rituals that contribute to the enhancement of virtual reference services.
Subject area: Information gateway
Object: Internet Public Library
8. Colazzo, D. ; Sartiani, C. ; Albano, A. ; Manghi, P. ; Ghelli, G. ; Lini, L. ; Paoli, M.: A typed text retrieval query language for XML documents.
In: Journal of the American Society for Information Science and Technology. 53(2002) no.6, pp. 467-488.
Abstract: XML is nowadays considered the standard meta-language for document markup and data representation. XML is widely employed in Web-related applications as well as in database applications, and there is also a growing interest for it by the literary community to develop tools for supporting document-oriented retrieval operations. The purpose of this article is to show the basic new requirements of this kind of applications and to present the main features of a typed query language, called Tequyla-TX, designed to support them.
Object: XML ; Tequyla-TX
9. Tomasselli, G.: Erfahrungen beim Einsatz eines PROLOG-Programms auf Mikrorechnern : zur Erfassung und Prüfung bibliographischer Daten; PROLOG als Mittel zur Beschreibung bibliographischen Wissens.
In: Zentralblatt für Bibliothekswesen. 103(1989) no.2, pp. 65-69.
Abstract: Outlines the advantages of knowledge-based systems, developed as a form of artificial intelligence and transformed into partially effective expert systems, including the concept of logical programming or defining all relevant knowledge to satisfy logical conditions or IF-THEN rules, instead of a traditional, algorithmic programming language. Links the features of PROLOG to these concepts, along with its capacity as a machine language for future 5th generation computers, including microcomputers. Examines how logical programming allows bibliographical data and processes to be described, and the development of the inter-library bibliographic data base DALIS for producing booklists for state libraries.