Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 04. Juni 2021)
1Min, C. ; Chen, Q. ; Yan, E. ; Bu, Y. ; Sun, J.: Citation cascade and the evolution of topic relevance.
In: Journal of the Association for Information Science and Technology. 72(2021) no.1, S.110-127.
Abstract: Citation analysis, as a tool for quantitative studies of science, has long emphasized direct citation relations, leaving indirect or high-order citations overlooked. However, a series of early and recent studies demonstrate the existence of indirect and continuous citation impact across generations. Adding to the literature on high-order citations, we introduce the concept of a citation cascade: the constitution of a series of subsequent citing events initiated by a certain publication. We investigate this citation structure by analyzing more than 450,000 articles and over 6 million citation relations. We show that citation impact exists not only within the three generations documented in prior research but also in much further generations. Still, our experimental results indicate that two to four generations are generally adequate to trace a work's scientific impact. We also explore specific structural properties-such as depth, width, structural virality, and size-which account for differences among individual citation cascades. Finally, we find evidence that it is more important for a scientific work to inspire trans-domain (or indirectly related domain) works than to receive only intradomain recognition in order to achieve high impact. Our methods and findings can serve as a new tool for scientific evaluation and the modeling of scientific history.
Inhalt: Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24370.
Themenfeld: Citation indexing
2Min, C. ; Ding, Y. ; Li, J. ; Bu, Y. ; Pei, L. ; Sun, J.: Innovation or imitation : the diffusion of citations.
In: Journal of the Association for Information Science and Technology. 69(2018) no.10, S.1271-1282.
Abstract: Citations in scientific literature are important both for tracking the historical development of scientific ideas and for forecasting research trends. However, the diffusion mechanisms underlying the citation process remain poorly understood, despite the frequent and longstanding use of citation counts for assessment purposes within the scientific community. Here, we extend the study of citation dynamics to a more general diffusion process to understand how citation growth associates with different diffusion patterns. Using a classic diffusion model, we quantify and illustrate specific diffusion mechanisms which have been proven to exert a significant impact on the growth and decay of citation counts. Experiments reveal a positive relation between the "low p and low q" pattern and high scientific impact. A sharp citation peak produced by rapid change of citation counts, however, has a negative effect on future impact. In addition, we have suggested a simple indicator, saturation level, to roughly estimate an individual article's current stage in the life cycle and its potential to attract future attention. The proposed approach can also be extended to higher levels of aggregation (e.g., individual scientists, journals, institutions), providing further insights into the practice of scientific evaluation.
Inhalt: Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24047.
3Ferret, O. ; Grau, B. ; Hurault-Plantet, M. ; Illouz, G. ; Jacquemin, C. ; Monceaux, L. ; Robba, I. ; Vilnat, A.: How NLP can improve question answering.
In: Knowledge organization. 29(2002) nos.3/4, S.135-155.
Abstract: Answering open-domain factual questions requires Natural Language processing for refining document selection and answer identification. With our system QALC, we have participated in the Question Answering track of the TREC8, TREC9 and TREC10 evaluations. QALC performs an analysis of documents relying an multiword term searches and their linguistic variation both to minimize the number of documents selected and to provide additional clues when comparing question and sentence representations. This comparison process also makes use of the results of a syntactic parsing of the questions and Named Entity recognition functionalities. Answer extraction relies an the application of syntactic patterns chosen according to the kind of information that is sought, and categorized depending an the syntactic form of the question. These patterns allow QALC to handle nicely linguistic variations at the answer level.
Themenfeld: Computerlinguistik ; Retrievalstudien ; Sprachretrieval
4Jacquemin, C.: Spotting and discovering terms through natural language processing.
Cambridge, MA : MIT Press, 2001. VIII, 378 S.
Abstract: In this book Christian Jacquemin shows how the power of natural language processing (NLP) can be used to advance text indexing and information retrieval (IR). Jacquemin's novel tool is FASTR, a parser that normalizes terms and recognizes term variants. Since there are more meanings in a language than there are words, FASTR uses a metagrammar composed of shallow linguistic transformations that describe the morphological, syntactic, semantic, and pragmatic variations of words and terms. The acquired parsed terms can then be applied for precise retrieval and assembly of information. The use of a corpus-based unification grammar to define, recognize, and combine term variants from their base forms allows for intelligent information access to, or "linguistic data tuning" of, heterogeneous texts. FASTR can be used to do automatic controlled indexing, to carry out content-based Web searches through conceptually related alternative query formulations, to abstract scientific and technical extracts, and even to translate and collect terms from multilingual material. Jacquemin provides a comprehensive account of the method and implementation of this innovative retrieval technique for text processing.
Anmerkung: Rez. in: KO 28(2001) no.3, S.152-154 (L. Da Sylva)
LCSH: Language and languages / Variation / Data processing ; Terms and phrases / Data processing
RSWK: Automatische Indexierung / Computerlinguistik / Information Retrieval ; Syntaktische Analyse (GBV) ; Textverstehendes System (HBZ) ; Computerlinguistik / Sprachvariante (HBZ)
BK: 54.75 ; 18.04 ; 17.52 ; 17.46
GHBS: BFP (FH K) ; BFP (DU) ; TZF (DU) ; TVV (DU)
LCC: P305.18.D38J33 2001
RVK: ES 965
5Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation.
In: Information processing and management. 32(1996) no.4, S.445-458.
Abstract: Provides a linguistic approach to text windowing through an extraction of term variants with the help of a partial parser. The syntactic grounding of the method ensures ehat words observed within restricted spans are lexically related and that spurious word cooccurrences are rules out with a good level of confidence. The system is computationally tractable on large corpora and large lists of terms. Gives illustrative examples of term variation from a large medical corpus. An experimental evaluation of the method shows that only a small proportion of co-occuring words are lexically related and motivates the call for natural language parsing techniques in text windowing