Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 23. Dezember 2017)
1Zhang, C. ; Bu, Y. ; Ding, Y. ; Xu, J.: Understanding scientific collaboration : homophily, transitivity, and preferential attachment.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.72-86.
Abstract: Scientific collaboration is essential in solving problems and breeding innovation. Coauthor network analysis has been utilized to study scholars' collaborations for a long time, but these studies have not simultaneously taken different collaboration features into consideration. In this paper, we present a systematic approach to analyze the differences in possibilities that two authors will cooperate as seen from the effects of homophily, transitivity, and preferential attachment. Exponential random graph models (ERGMs) are applied in this research. We find that different types of publications one author has written play diverse roles in his/her collaborations. An author's tendency to form new collaborations with her/his coauthors' collaborators is strong, where the more coauthors one author had before, the more new collaborators he/she will attract. We demonstrate that considering the authors' attributes and homophily effects as well as the transitivity and preferential attachment effects of the coauthorship network in which they are embedded helps us gain a comprehensive understanding of scientific collaboration.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23916/full.
2Bu, Y. ; Ding, Y. ; Xu, J. ; Liang, X. ; Gao, G. ; Zhao, Y.: Understanding success through the diversity of collaborators and the milestone of career.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.87-97.
Abstract: Scientific collaboration is vital to many fields, and it is common to see scholars seek out experienced researchers or experts in a domain with whom they can share knowledge, experience, and resources. To explore the diversity of research collaborations, this article performs a temporal analysis on the scientific careers of researchers in the field of computer science. Specifically, we analyze collaborators using 2 indicators: the research topic diversity, measured by the Author-Conference-Topic model and cosine, and the impact diversity, measured by the normalized standard deviation of h-indices. We find that the collaborators of high-impact researchers tend to study diverse research topics and have diverse h-indices. Moreover, by setting PhD graduation as an important milestone in researchers' careers, we examine several indicators related to scientific collaboration and their effects on a career. The results show that collaborating with authoritative authors plays an important role prior to a researcher's PhD graduation, but working with non-authoritative authors carries more weight after PhD graduation.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23911/full.
3Ding, Y. ; Song, M. ; Chambers, T. (Hrsg.): Xu, J.: Author credit-assignment schemas : a comparison and analysis.
In: Journal of the Association for Information Science and Technology. 67(2016) no.8, S.1973-1989.
Abstract: Credit assignment to multiple authors of a publication is a challenging task owing to the conventions followed within different areas of research. In this study, we present a review of different author credit-assignment schemas, which are designed mainly based on author position and the total number of coauthors on the publication. We implemented, tested, and classified 15 author credit-assignment schemas into 3 types: linear, curve, and "other" assignment schemas. Further investigation and analysis revealed that most of the methods provide reasonable credit-assignment results, even though the credit-assignment distribution approaches are quite different among different types. The evaluation of each schema based on PubMed articles published in 2013 shows that there exist positive correlations among different schemas and that the similarity of credit-assignment distributions can be derived from the similar design principles that stress the number of coauthors or the author position, or consider both. We provide a summary about the features of each credit-assignment schema to facilitate the selection of the appropriate one, depending on the different conditions required to meet diverse needs.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23495/abstract.
4Schroeder, J. ; Xu, J. ; Chen, H. ; Chau, M.: Automated criminal link analysis based on domain knowledge.
In: Journal of the American Society for Information Science and Technology. 58(2007) no.6, S.842-855.
Abstract: Link (association) analysis has been used in the criminal justice domain to search large datasets for associations between crime entities in order to facilitate crime investigations. However, link analysis still faces many challenging problems, such as information overload, high search complexity, and heavy reliance on domain knowledge. To address these challenges, this article proposes several techniques for automated, effective, and efficient link analysis. These techniques include the co-occurrence analysis, the shortest path algorithm, and a heuristic approach to identifying associations and determining their importance. We developed a prototype system called CrimeLink Explorer based on the proposed techniques. Results of a user study with 10 crime investigators from the Tucson Police Department showed that our system could help subjects conduct link analysis more efficiently than traditional single-level link analysis tools. Moreover, subjects believed that association paths found based on the heuristic approach were more accurate than those found based solely on the co-occurrence analysis and that the automated link analysis system would be of great help in crime investigations.
5Xu, J. ; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance.
In: Information processing and management. 41(2005) no.3, S.475-488.
Abstract: In this paper, we compile and review several experiments measuring cross-lingual information retrieval (CLIR) performance as a function of the following resources: bilingual term lists, parallel corpora, machine translation (MT), and stemmers. Our CLIR system uses a simple probabilistic language model; the studies used TREC test corpora over Chinese, Spanish and Arabic. Our findings include: � One can achieve an acceptable CLIR performance using only a bilingual term list (70-80% on Chinese and Arabic corpora). � However, if a bilingual term list and parallel corpora are available, CLIR performance can rival monolingual performance. � If no parallel corpus is available, pseudo-parallel texts produced by an MT system can partially overcome the lack of parallel text. � While stemming is useful normally, with a very large parallel corpus for Arabic-English, stemming hurt performance in our empirical studies with Arabic, a highly inflected language.
Themenfeld: Multilinguale Probleme
6Xu, J. ; Weischedel, R. ; Licuanan, A.: Evaluation of an extraction-based approach to answering definitional questions.
In: SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a. New York, NY : ACM Press, 2004. S.418-424.
7Xu, J. ; Croft, W.B.: Topic-based language models for distributed retrieval.
In: Advances in information retrieval: Recent research from the Center for Intelligent Information Retrieval. Ed.: W.B. Croft. Boston, MA : Kluwer Academic Publ., 2000. S.151-172.
(The Kluwer international series on information retrieval; 7)
Abstract: Effective retrieval in a distributed environment is an important but difficult problem. Lack of effectiveness appears to have two major causes. First, existing collection selection algorithms do not work well on heterogeneous collections. Second, relevant documents are scattered over many collections and searching a few collections misses many relevant documents. We propose a topic-oriented approach to distributed retrieval. With this approach, we structure the document set of a distributed retrieval environment around a set of topics. Retrieval for a query involves first selecting the right topics for the query and then dispatching the search process to collections that contain such topics. The content of a topic is characterized by a language model. In environments where the labeling of documents by topics is unavailable, document clustering is employed for topic identification. Based on these ideas, three methods are proposed to suit different environments. We show that all three methods improve effectiveness of distributed retrieval
Themenfeld: Verteilte bibliographische Datenbanken
8Allan, J. ; Callan, J.P. ; Croft, W.B. ; Ballesteros, L. ; Broglio, J. ; Xu, J. ; Shu, H.: INQUERY at TREC-5.
In: The Fifth Text Retrieval Conference (TREC-5). Ed.: E.M. Voorhees u. D.K. Harman. Gaithersburgh, MD : National Institute of Standards and Technology, 1997. S.191-197.
(NIST special publication;)
Objekt: TREC ; INQUERY