Literatur zur Informationserschließung
Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft
/
Powered by litecat, BIS Oldenburg
(Stand: 28. April 2022)
Suche
Suchergebnisse
Treffer 1–13 von 13
sortiert nach:
-
1Mache, B. ; Klaffki, L.: ¬Das DARIAH-DE Repository : Elementarer Teil einer modularen Infrastruktur für geistes- und kulturwissenschaftliche Forschungsdaten.
In: o-bib: Das offene Bibliotheksjournal. 5(2018) Nr.3, S.92-103.
Abstract: DARIAH-DE unterstützt mit digitalen Ressourcen und Methoden arbeitende Geistes- und Kulturwissenschaftler/innen in Forschung und Lehre. Forschungsdaten und Wissenschaftliche Sammlungen haben dabei eine zentrale Bedeutung. Im Rahmen einer Forschungsdaten-Föderationsarchitektur stehen eine Reihe von Werkzeugen zur Verfügung, mit denen Daten gefunden, vernetzt, publiziert und archiviert werden können. Hierzu zählt auch das DARIAH-DE Repository, das den Forschenden eine sichere, langfristige und nachhaltige Speicherung sowie die Veröffentlichung der Forschungsdaten ermöglicht.
Inhalt: https://www.o-bib.de/article/view/5334. https://doi.org/10.5282/o-bib/2018H3S92-103.
Anmerkung: Beitrag im Rahmen eines Themenschwerpunkts Forschungsdaten.
Wissenschaftsfach: Kulturwissenschaften ; Geisteswissenschaften
Objekt: DARIAH-DE
Land/Ort: D
-
2Ye, Z. ; He, B. ; Wang, L. ; Luo, T.: Utilizing term proximity for blog post retrieval.
In: Journal of the American Society for Information Science and Technology. 64(2013) no.11, S.2278-2298.
Abstract: Term proximity is effective for many information retrieval (IR) research fields yet remains unexplored in blogosphere IR. The blogosphere is characterized by large amounts of noise, including incohesive, off-topic content and spam. Consequently, the classical bag-of-words unigram IR models are not reliable enough to provide robust and effective retrieval performance. In this article, we propose to boost the blog postretrieval performance by employing term proximity information. We investigate a variety of popular and state-of-the-art proximity-based statistical IR models, including a proximity-based counting model, the Markov random field (MRF) model, and the divergence from randomness (DFR) multinomial model. Extensive experimentation on the standard TREC Blog06 test dataset demonstrates that the introduction of term proximity information is indeed beneficial to retrieval from the blogosphere. Results also indicate the superiority of the unordered bi-gram model with the sequential-dependence phrases over other variants of the proximity-based models. Finally, inspired by the effectiveness of proximity models, we extend our study by exploring the proximity evidence between query terms and opinionated terms. The consequent opinionated proximity model shows promising performance in the experiments.
Themenfeld: Computerlinguistik
-
3Lin, N. ; Li, D. ; Ding, Y. ; He, B. ; Qin, Z. ; Tang, J. ; Li, J. ; Dong, T.: ¬The dynamic features of Delicious, Flickr, and YouTube.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.1, S.139-162.
Abstract: This article investigates the dynamic features of social tagging vocabularies in Delicious, Flickr, and YouTube from 2003 to 2008. Three algorithms are designed to study the macro- and micro-tag growth as well as the dynamics of taggers' activities, respectively. Moreover, we propose a Tagger Tag Resource Latent Dirichlet Allocation (TTR-LDA) model to explore the evolution of topics emerging from those social vocabularies. Our results show that (a) at the macro level, tag growth in all the three tagging systems obeys power law distribution with exponents lower than 1; at the micro level, the tag growth of popular resources in all three tagging systems follows a similar power law distribution; (b) the exponents of tag growth vary in different evolving stages of resources; (c) the growth of number of taggers associated with different popular resources presents a feature of convergence over time; (d) the active level of taggers has a positive correlation with the macro-tag growth of different tagging systems; and (e) some topics evolve into several subtopics over time while others experience relatively stable stages in which their contents do not change much, and certain groups of taggers continue their interests in them.
Themenfeld: Social tagging
Objekt: Delicious ; Flickr ; YouTube
-
4Ye, Z. ; Huang, J.X. ; He, B. ; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2474-2487.
Abstract: Wikipedia is characterized by its dense link structure and a large number of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graph-based approach to constructing a cross-language association dictionary (CLAD) from Wikipedia, which can be used in a variety of cross-language accessing and processing applications. In order to evaluate the quality of the mined CLAD, and to demonstrate how the mined CLAD can be used in practice, we explore two different applications of the mined CLAD to cross-language information retrieval (CLIR). First, we use the mined CLAD to conduct cross-language query expansion; and, second, we use it to filter out translation candidates with low translation probabilities. Experimental results on a variety of standard CLIR test collections show that the CLIR retrieval performance can be substantially improved with the above two applications of CLAD, which indicates that the mined CLAD is of sound quality.
Themenfeld: Multilinguale Probleme
Objekt: Wikipedia
-
5He, B. ; Ding, Y. ; Ni, C.: Mining enriched contextual information of scientific collaboration : a meso perspective.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.5, S.831-845.
Abstract: Studying scientific collaboration using coauthorship networks has attracted much attention in recent years. How and in what context two authors collaborate remain among the major questions. Previous studies, however, have focused on either exploring the global topology of coauthorship networks (macro perspective) or ranking the impact of individual authors (micro perspective). Neither of them has provided information on the context of the collaboration between two specific authors, which may potentially imply rich socioeconomic, disciplinary, and institutional information on collaboration. Different from the macro perspective and micro perspective, this article proposes a novel method (meso perspective) to analyze scientific collaboration, in which a contextual subgraph is extracted as the unit of analysis. A contextual subgraph is defined as a small subgraph of a large-scale coauthorship network that captures relationship and context between two coauthors. This method is applied to the field of library and information science. Topological properties of all the subgraphs in four time spans are investigated, including size, average degree, clustering coefficient, and network centralization. Results show that contextual subgprahs capture useful contextual information on two authors' collaboration.
Themenfeld: Informetrie
-
6Li, D. ; Ding, Y. ; Sugimoto, C. ; He, B. ; Tang, J. ; Yan, E. ; Lin, N. ; Qin, Z. ; Dong, T.: Modeling topic and community structure in social tagging : the TTR-LDA-Community model.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.9, S.1849-1866.
Abstract: The presence of social networks in complex systems has made networks and community structure a focal point of study in many domains. Previous studies have focused on the structural emergence and growth of communities and on the topics displayed within the network. However, few scholars have closely examined the relationship between the thematic and structural properties of networks. Therefore, this article proposes the Tagger Tag Resource-Latent Dirichlet Allocation-Community model (TTR-LDA-Community model), which combines the Latent Dirichlet Allocation (LDA) model with the Girvan-Newman community detection algorithm through an inference mechanism. Using social tagging data from Delicious, this article demonstrates the clustering of active taggers into communities, the topic distributions within communities, and the ranking of taggers, tags, and resources within these communities. The data analysis evaluates patterns in community structure and topical affiliations diachronically. The article evaluates the effectiveness of community detection and the inference mechanism embedded in the model and finds that the TTR-LDA-Community model outperforms other traditional models in tag prediction. This has implications for scholars in domains interested in community detection, profiling, and recommender systems.
Themenfeld: Social tagging
-
7Schmid-Ruhe, B.: Mit Leichtigkeit zum Plagiat : Herausforderungen an Bibliotheken im Zeitalter der digitalen Wissenschaftskommunikation.
In: BuB. 60(2008) H.3, S.231-233.
(Lesesaal Schwerpunkt: Streitfall Bildschirmmedien)
Abstract: Digitale Medien, vor allem das Internet, sind in der Wissenschaftskommunikation zugleich Fluch und Segen. Sie ermöglichen durch simples »Kopieren und Einfügen« einerseits die rasante Ausbreitung von »unethischen Autorschaften« durch Plagiarismus. Gleichzeitig bringen sie die Lösung dieser Problematik selber mit - indem sie nämlich das Auffinden von Plagiaten erheblich erleichtern. Was die digitalen Medien den analogen in Sachen Plagiarismusproblematik voraus haben, ist die neue Leichtigkeit, mit der man sich Texte, Bilder, Quellcodes und Töne zugänglich macht, sie verarbeitet und letztlich auch aneignet. Ganz klar: Das Internet an sich ist dabei nicht weniger ethisch als das gedruckte Wort. Es kommt hier wie dort einfach darauf an, dass man seine Quellen benennt. Bernd Schmid-Ruhe, an der Universitätsbibliothek Konstanz im Bereich Informationskompetenz tätig, beschreibt den täglichen Kampf gegen Plagiate in der Wissenschaft und zeigt auf, was Bibliothekare dagegen tun können.
Wissenschaftsfach: Kommunikationswissenschaften
Behandelte Form: Elektronische Dokumente
-
8He, B. ; Ounis, I.: Combining fields for query expansion and adaptive query expansion.
In: Information processing and management. 43(2007) no.5, S.1294-1307.
Abstract: In this paper, we aim to improve query expansion for ad-hoc retrieval, by proposing a more fine-grained term reweighting process. This fine-grained process uses statistics from the representation of documents in various fields, such as their titles, the anchor text of their incoming links, and their body content. The contribution of this paper is twofold: First, we propose a novel query expansion mechanism on fields by combining field evidence available in a corpora. Second, we propose an adaptive query expansion mechanism that selects an appropriate collection resource, either the local collection, or a high-quality external resource, for query expansion on a per-query basis. The two proposed query expansion approaches are thoroughly evaluated using two standard Text Retrieval Conference (TREC) Web collections, namely the WT10G collection and the large-scale .GOV2 collection. From the experimental results, we observe a statistically significant improvement compared with the baselines. Moreover, we conclude that the adaptive query expansion mechanism is very effective when the external collection used is much larger than the local collection.
-
9Mathe, B.: Kaleidoscopic classifications : redefining information in a world cultural context.
In: Inspel. 33(1999) no.1, S.54-60.
-
10Rusche, B.: Arbeitsgruppe Fremddatenstrategie.Stand: August 1997.
In: http://elib.zib.de/kobv/ag/fs.
(ZIB TR 97-15)
Objekt: KOBV
-
11Stache, B.: Erschließung personengebundener Literatur in Schlagwortkatalogen : vergleichende Untersuchung der Richtlinien der StB Bochum und der "Regeln für den Schlagwortkatalog" (RSWK).
Köln : FHBD, 1989.
Anmerkung: [Diplomarbeit]
Themenfeld: Schlagwortkatalog
Objekt: RSWK ; BOKLA
-
12Ruch, E. ; Lesche, B.: Information extent and information distance.
In: Journal of chemical physics. 69(1978), S.393-401.
Themenfeld: Information
-
13Krahe, B.: Auskunftsdienst in den USA : dargestellt am Beispiel eines Lehrbuches.
In: BuB. 23(1971), S.398-403.
Themenfeld: Informationsdienstleistungen