Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 11. November 2018)
1Reyes Ayala, B. ; Knudson, R. ; Chen, J. ; Cao, G. ; Wang, X.: Metadata records machine translation combining multi-engine outputs with limited parallel data.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.47-59.
Abstract: One way to facilitate Multilingual Information Access (MLIA) for digital libraries is to generate multilingual metadata records by applying Machine Translation (MT) techniques. Current online MT services are available and affordable, but are not always effective for creating multilingual metadata records. In this study, we implemented 3 different MT strategies and evaluated their performance when translating English metadata records to Chinese and Spanish. These strategies included combining MT results from 3 online MT systems (Google, Bing, and Yahoo!) with and without additional linguistic resources, such as manually-generated parallel corpora, and metadata records in the two target languages obtained from international partners. The open-source statistical MT platform Moses was applied to design and implement the three translation strategies. Human evaluation of the MT results using adequacy and fluency demonstrated that two of the strategies produced higher quality translations than individual online MT systems for both languages. Especially, adding small, manually-generated parallel corpora of metadata records significantly improved translation performance. Our study suggested an effective and efficient MT approach for providing multilingual services for digital collections.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23925/full.
2Yang, B. ; Rousseau, R. ; Wang, X. ; Huang, S.: How important is scientific software in bioinformatics research? : a comparative study between international and Chinese research communities.
In: Journal of the Association for Information Science and Technology. 69(2018) no.9, S.1122-1133.
Abstract: Software programs are among the most important tools in data-driven research. The popularity of well-known packages and corresponding large numbers of citations received bear testimony of the contribution of scientific software to academic research. Yet software is not generally recognized as an academic outcome. In this study, a usage-based model is proposed with varied indicators including citations, mentions, and downloads to measure the importance of scientific software. We performed an investigation on a sample of international bioinformatics research articles, and on a sample from the Chinese community. Our analysis shows that scientists in the field of bioinformatics rely heavily on scientific software: the major differences between the international community and the Chinese example being how scientific packages are mentioned in publications and the time gap between the introduction of a package and its use. Biologists publishing in international journals tend to apply the latest tools earlier; Chinese scientists publishing in Chinese tend to follow later. Further, journals with higher impact factors tend to publish articles applying the latest tools earlier.
Inhalt: Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24031.
3Wang, X. ; Hong, Z. ; Xu, Y.(C.) ; Zhang, C. ; Ling, H.: Relevance judgments of mobile commercial information.
In: Journal of the Association for Information Science and Technology. 65(2014) no.7, S.1335-1348.
Abstract: In the age of mobile commerce, users receive floods of commercial messages. How do users judge the relevance of such information? Is their relevance judgment affected by contextual factors, such as location and time? How do message content and contextual factors affect users' privacy concerns? With a focus on mobile ads, we propose a research model based on theories of relevance judgment and mobile marketing research. We suggest topicality, reliability, and economic value as key content factors and location and time as key contextual factors. We found mobile relevance judgment is affected mainly by content factors, whereas privacy concerns are affected by both content and contextual factors. Moreover, topicality and economic value have a synergetic effect that makes a message more relevant. Higher topicality and location precision exacerbate privacy concerns, whereas message reliability alleviates privacy concerns caused by location precision. These findings reveal an interesting intricacy in user relevance judgment and privacy concerns and provide nuanced guidance for the design and delivery of mobile commercial information.
4Ding, Y. ; Zhang, G. ; Chambers, T. ; Song, M. ; Wang, X. ; Zhai, C.: Content-based citation analysis : the next generation of citation analysis.
In: Journal of the Association for Information Science and Technology. 65(2014) no.9, S.1820-1833.
Abstract: Traditional citation analysis has been widely applied to detect patterns of scientific collaboration, map the landscapes of scholarly disciplines, assess the impact of research outputs, and observe knowledge transfer across domains. It is, however, limited, as it assumes all citations are of similar value and weights each equally. Content-based citation analysis (CCA) addresses a citation's value by interpreting each one based on its context at both the syntactic and semantic levels. This paper provides a comprehensive overview of CAA research in terms of its theoretical foundations, methodical approaches, and example applications. In addition, we highlight how increased computational capabilities and publicly available full-text resources have opened this area of research to vast possibilities, which enable deeper citation analysis, more accurate citation prediction, and increased knowledge discovery.
Themenfeld: Citation indexing
5Wang, X. ; Erdelez, S. ; Allen, C. ; Anderson, B. ; Cao, H. ; Shyu, C.-R.: Role of domain knowledge in developing user-centered medical-image indexing.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.225-241.
Abstract: An efficient and robust medical-image indexing procedure should be user-oriented. It is essential to index the images at the right level of description and ensure that the indexed levels match the user's interest level. This study examines 240 medical-image descriptions produced by three different groups of medical-image users (novices, intermediates, and experts) in the area of radiography. This article reports several important findings: First, the effect of domain knowledge has a significant relationship with the use of semantic image attributes in image-users' descriptions. We found that experts employ more high-level image attributes which require high-reasoning or diagnostic knowledge to search for a medical image (Abstract Objects and Scenes) than do novices; novices are more likely to describe some basic objects which do not require much radiological knowledge to search for an image they need (Generic Objects) than are experts. Second, all image users in this study prefer to use image attributes of the semantic levels to represent the image that they desired to find, especially using those specific-level and scene-related attributes. Third, image attributes generated by medical-image users can be mapped to all levels of the pyramid model that was developed to structure visual information. Therefore, the pyramid model could be considered a robust instrument for indexing medical imagery.
6Jiang, Y. ; Zheng, H.-T. ; Wang, X. ; Lu, B. ; Wu, K.: Affiliation disambiguation for constructing semantic digital libraries.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.1029-1041.
Abstract: With increasing digital information availability, semantic web technologies have been employed to construct semantic digital libraries in order to ease information comprehension. The use of semantic web enables users to search or visualize resources in a semantic fashion. Semantic web generation is a key process in semantic digital library construction, which converts metadata of digital resources into semantic web data. Many text mining technologies, such as keyword extraction and clustering, have been proposed to generate semantic web data. However, one important type of metadata in publications, called affiliation, is hard to convert into semantic web data precisely because different authors, who have the same affiliation, often express the affiliation in different ways. To address this issue, this paper proposes a clustering method based on normalized compression distance for the purpose of affiliation disambiguation. The experimental results show that our method is able to identify different affiliations that denote the same institutes. The clustering results outperform the well-known k-means clustering method in terms of average precision, F-measure, entropy, and purity.
7Teo, H.-H. ; Wang, X. ; Wei, K.-K. ; Sia, C.-L. ; Lee, M.K.O.: Organizational learning capacity and attitude toward complex technological innovations : an empirical study.
In: Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.264-279.
Abstract: Recent studies have found organizational learning capacity to be a key factor in influencing organizational assimilation and exploitation of knowledge-intensive innovations. Despite its increasing importance, the impact of organizational learning capacity an technology assimilation is not well understood. Distilling from extant works an organizational learning and technology assimilation, this study identifies four components of organizational learning capacity, namely, systems orientation, organizational climate for learning orientation, knowledge acquisition and utilization orientation, and information sharing and dissemination orientation. The authors subject these components to structural equation modeling analyses to better understand their structure and dimensionality. The analyses strongly support the proposed four major dimensions underlying organizational learning capacity. Organizational learning capacity, as a higher-order factor, has a significant impact an attitude towards organizational adoption of knowledge-intensive innovations. Implications for practice and research are discussed.