Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 23. Dezember 2017)
1Reyes Ayala, B. ; Knudson, R. ; Chen, J. ; Cao, G. ; Wang, X.: Metadata records machine translation combining multi-engine outputs with limited parallel data.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.47-59.
Abstract: One way to facilitate Multilingual Information Access (MLIA) for digital libraries is to generate multilingual metadata records by applying Machine Translation (MT) techniques. Current online MT services are available and affordable, but are not always effective for creating multilingual metadata records. In this study, we implemented 3 different MT strategies and evaluated their performance when translating English metadata records to Chinese and Spanish. These strategies included combining MT results from 3 online MT systems (Google, Bing, and Yahoo!) with and without additional linguistic resources, such as manually-generated parallel corpora, and metadata records in the two target languages obtained from international partners. The open-source statistical MT platform Moses was applied to design and implement the three translation strategies. Human evaluation of the MT results using adequacy and fluency demonstrated that two of the strategies produced higher quality translations than individual online MT systems for both languages. Especially, adding small, manually-generated parallel corpora of metadata records significantly improved translation performance. Our study suggested an effective and efficient MT approach for providing multilingual services for digital collections.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23925/full.
2Hu, X. ; Rousseau, R. ; Chen, J.: ¬A new approach for measuring the value of patents based on structural indicators for ego patent citation networks.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.9, S.1834-1842.
Abstract: Technology sectors differ in terms of technological complexity. When studying technology and innovation through patent analysis it is well known that similar amounts of technological knowledge can produce different numbers of patented innovation as output. A new multilayered approach to measure the technological value of patents based on ego patent citation networks (PCNs) is developed in this study. The results show that the structural indicators for the ego PCN developed in this contribution can characterize groups of patents and, hence, in an indirect way, the health of companies.
3Chen, J.: Artificial intelligence.
In: Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates. London : Taylor & Francis, 2009. S.xx-xx.
Abstract: Artificial intelligence (AI) is a multidisciplinary subject, typically studied as a research area within Computer Science. AI study aims at achieving a good understanding of the nature of intelligence and building intelligent agents which are computational systems demonstrating intelligent behavior. AI has been developed over more than 50 years. The topics studied in AI are quite broad, ranging from knowledge representation and reasoning, knowledge-based systems, machine learning and data mining, natural language processing, to search, image processing, robotics, and intelligent information systems. Numerous successful AI systems have been deployed in real-life applications in engineering, finance, science, health care, education, and service sectors. AI research has also significantly impacted the subject area of Library and Information Science (LIS), helping to develop smart Web search engines, personalized news filters, and knowledge-sharing and indexing systems. This entry briefly outlines the main topics studied in AI, samples some typical successful AI applications, and discusses the cross-fertilization between AI and LIS.
Anmerkung: Vgl.: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
4Chen, J.: ¬A lexical knowledge base approach for English-Chinese cross-language information retrieval.
In: Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.233-243.
Abstract: This study proposes and explores a natural language processing- (NLP) based strategy to address out-ofdictionary and vocabulary mismatch problems in query translation based English-Chinese Cross-Language Information Retrieval (EC-CLIR). The strategy, named the LKB approach, is to construct a lexical knowledge base (LKB) and to use it for query translation. In this article, the author describes the LKB construction process, which customizes available translation resources based an the document collection of the EC-CLIR system. The evaluation shows that the LKB approach is very promising. It consistently increased the percentage of correct translations and decreased the percentage of missing translations in addition to effectively detecting the vocabulary gap between the document collection and the translation resource of the system. The comparative analysis of the top EC-CLIR results using the LKB and two other translation resources demonstrates that the LKB approach has produced significant improvement in EC-CLIR performance compared to performance using the original translation resource without customization. It has also achieved the same level of performance as a sophisticated machine translation system. The study concludes that the LKB approach has the potential to be an empirical model for developing real-world CLIR systems. Linguistic knowledge and NLP techniques, if appropriately used, can improve the effectiveness of English-Chinese crosslanguage information retrieval.
5Jang, J.-S.R. ; Lee, H.-R. ; Chen, J.-C. ; Lin, C.-Y.: Research and developments of a multi-modal MIR engine for commercial applications in East Asia.
In: Journal of the American Society for Information Science and Technology. 55(2004) no.12, S.1067-1076.
Abstract: This article describes the research and development of efficient Music Information Retrieval (MIR) engine is embedded in a karaoke software package targeted for Asian people's need of music retrieval. The engine has a multi-modal interface that allows queries by singing, humming, tapping, speaking, and writing. In particular, we discuss the design philosophy, technical barriers, and performance evaluation of such engine, as weIl as its current and potential commercial applications. Feedbacks and feature requests from users, which greatly influence our future work, are also addressed.
Anmerkung: Beitrag in einem Themenheft zur Musikerschließung und zum Musikretrieval
6Qin, J. ; Chen, J.: ¬A multi-layered, multi-dimensional representation of digital educational resources.
In: Subject retrieval in a networked environment: Proceedings of the IFLA Satellite Meeting held in Dublin, OH, 14-16 August 2001 and sponsored by the IFLA Classification and Indexing Section, the IFLA Information Technology Section and OCLC. Ed.: I.C. McIlwaine. München : Saur, 2003. S.90-96.
(UBCIM publications: new series; vol.25)
Abstract: Semantic mapping between controlled vocabulary and keywords is the first step towards knowledge-based subject access. This study reports the preliminary result of a semantic mapping experiment for the Gateway to Educational Materials (GEM). A total of 3,555 keywords were mapped with 322 concept names in the GEM controlled vocabulary. The preliminary test to 10,000 metadata records presented widely varied sets of results between the mapped and non-mapped data. The paper discussed linguistic and technical problems encountered in the mapping process and raised issues in the representation technologies and methods, which will lead to future study of knowledge-based access to networked information resources.
Themenfeld: Information Gateway