Literatur zur Informationserschließung (Literature on Information Indexing)
This database contains more than 40,000 documents on topics in descriptive cataloguing, subject indexing, and information retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft
Powered by litecat, BIS Oldenburg
(As of 28 April 2022)
Search results
Results 1–11 of 11
-
1. Wu, C. ; Yan, E. ; Zhu, Y. ; Li, K.: Gender imbalance in the productivity of funded projects : a study of the outputs of National Institutes of Health R01 grants.
In: Journal of the Association for Information Science and Technology. 72(2021) no.11, pp.1386-1399.
Abstract: This study examines the relationship between a team's gender composition and the outputs of funded projects using a large data set of National Institutes of Health (NIH) R01 grants and their associated publications between 1990 and 2017. This study finds that while women investigators' presence in NIH grants is generally low, higher women investigator presence is on average related to a slightly lower number of publications. This study finds empirically that women investigators elect to work in fields in which fewer publications per million dollars of funding is the norm. For fields where women investigators are relatively well represented, they are as productive as men. The overall lower productivity of women investigators may be attributed to the low representation of women in high-productivity fields dominated by men investigators. The findings shed light on possible reasons for gender disparity in grant productivity.
Content: Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24487.
Subject area: Informetrics
Discipline: Medicine
Country/Place: USA
-
2. Luo, P. ; Chen, K. ; Wu, C. ; Li, Y.: Exploring the social influence of multichannel access in an online health community.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, pp.98-109.
Abstract: Social influence has a great impact on human behavior, which has been widely investigated in various research fields. Even so, it has rarely been investigated in the online health community. In this paper, we focus on the multichannel access in online health communities, defining social influence as the average degree of multichannel access to a physician's colleagues. Based on the multinomial logistic regression model, we examined the direct effects of social influence and patients' rating to multichannel access. In addition, we explored the moderating effect of social influence on the relationship between patients' rating and multichannel access in online health communities. The results of the experiment and robustness testing support the propositions that social influence and patients' rating significantly and positively affect multichannel access in an online health community. The moderating effect of social influence is negative and significantly influences the accessible channels provided by the focal physician. This research contributes to the literature concerning online health communities, social influence, and multichannel access; it also has practical implications.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23928/full.
Discipline: Medicine
-
3. Ma, X. ; Carranza, E.J.M. ; Wu, C. ; Meer, F.D. van der ; Liu, G.: A SKOS-based multilingual thesaurus of geological time scale for interoperability of online geological maps.
In: Computers & Geosciences. 37(2011) no.10, pp.1602-1615.
Abstract: The usefulness of online geological maps is hindered by linguistic barriers. Multilingual geoscience thesauri alleviate linguistic barriers of geological maps. However, the benefits of multilingual geoscience thesauri for online geological maps are less studied. In this regard, we developed a multilingual thesaurus of geological time scale (GTS) to alleviate linguistic barriers of GTS records among online geological maps. We extended the Simple Knowledge Organization System (SKOS) model to represent the ordinal hierarchical structure of GTS terms. We collected GTS terms in seven languages and encoded them into a thesaurus by using the extended SKOS model. We implemented methods of characteristic-oriented term retrieval in JavaScript programs for accessing Web Map Services (WMS), recognizing GTS terms, and making translations. With the developed thesaurus and programs, we set up a pilot system to test recognitions and translations of GTS terms in online geological maps. Results of this pilot system proved the accuracy of the developed thesaurus and the functionality of the developed programs. Therefore, with proper deployments, SKOS-based multilingual geoscience thesauri can be functional for alleviating linguistic barriers among online geological maps and, thus, improving their interoperability.
Content: Article outline: 1. Introduction ; 2. SKOS-based multilingual thesaurus of geological time scale ; 2.1. Addressing the insufficiency of SKOS in the context of the Semantic Web ; 2.2. Addressing semantics and syntax/lexicon in multilingual GTS terms ; 2.3. Extending SKOS model to capture GTS structure ; 2.4. Summary of building the SKOS-based MLTGTS ; 3. Recognizing and translating GTS terms retrieved from WMS ; 4. Pilot system, results, and evaluation ; 5. Discussion ; 6. Conclusions. Cf.: http://www.sciencedirect.com/science?_ob=MiamiImageURL&_cid=271720&_user=3865853&_pii=S0098300411000744&_check=y&_origin=&_coverDate=31-Oct-2011&view=c&wchp=dGLbVlt-zSkzS&_valck=1&md5=e2c1daf53df72d034d22278212578f42&ie=/sdarticle.pdf.
Subject area: Knowledge representation ; Design and application of the thesaurus principle ; Multilingual issues
Discipline: Geosciences
Object: SKOS
-
4. Tsai, R.T.-H. ; Chiu, B. ; Wu, C.-E.: Visual webpage block importance prediction using conditional random fields.
In: Journal of the American Society for Information Science and Technology. 62(2011) no.11, pp.2225-2235.
Abstract: We have developed a system that segments web pages into blocks and predicts those blocks' importance (block importance prediction or BIP). First, we use VIPS to partition a page into a tree composed of blocks and then extract features from each block and label all leaf nodes. This paper makes two main contributions. Firstly, we are pioneering the formulation of BIP as a sequence tagging task. We employ DFS, which outputs a single sequence for the whole tree in which related sub-blocks are adjacent. Our second contribution is using the conditional random fields (CRF) model for labeling these sequences. CRF's transition features model correlations between neighboring labels well, and CRF can simultaneously label all blocks in a sequence to find the global optimal solution for the whole sequence, not only the best solution for each block. In our experiments, our CRF-based system achieves an F1-measure of 97.41%, which significantly outperforms our ME-based baseline (95.64%). Lastly, we tested the CRF-based system using sites which were not covered in the training data. On completely novel sites CRF performed slightly worse than ME. However, when given only two training pages from a given site, CRF improved almost three times as much as ME.
-
5. Yu, L.-C. ; Wu, C.-H. ; Chang, R.-Y. ; Liu, C.-H. ; Hovy, E.H.: Annotation and verification of sense pools in OntoNotes.
In: Information processing and management. 46(2010) no.4, pp.436-447.
Abstract: The paper describes OntoNotes, a multilingual (English, Chinese and Arabic) corpus with large-scale semantic annotations, including predicate-argument structure, word senses, ontology linking, and coreference. The underlying semantic model of OntoNotes involves word senses that are grouped into so-called sense pools, i.e., sets of near-synonymous senses of words. Such information is useful for many applications, including query expansion for information retrieval (IR) systems, (near-)duplicate detection for text summarization systems, and alternative word selection for writing support systems. Although a sense pool provides a set of near-synonymous senses of words, there is still no knowledge about whether two words in a pool are interchangeable in practical use. Therefore, this paper devises an unsupervised algorithm that incorporates Google n-grams and a statistical test to determine whether a word in a pool can be substituted by other words in the same pool. The n-gram features are used to measure the degree of context mismatch for a substitution. The statistical test is then applied to determine whether the substitution is adequate based on the degree of mismatch. The proposed method is compared with a supervised method, namely Linear Discriminant Analysis (LDA). Experimental results show that the proposed unsupervised method can achieve comparable performance with the supervised method.
Subject area: Knowledge representation ; Multilingual issues
Object: OntoNotes
-
6. Yeh, J.-F. ; Wu, C.-H. ; Chen, M.-J.: Ontology-based speech act identification in a bilingual dialog system using partial pattern trees.
In: Journal of the American Society for Information Science and Technology. 59(2008) no.5, pp.684-694.
Abstract: This article presents a bilingual ontology-based dialog system with multiple services. An ontology-alignment algorithm is proposed to integrate ontologies of different languages for cross-language applications. A domain-specific ontology is further extracted from the bilingual ontology using an island-driven algorithm and a domain corpus. This study extracts the semantic words/concepts using latent semantic analysis (LSA). Based on the extracted semantic words and the domain ontology, a partial pattern tree is constructed to model the speech act of a spoken utterance. The partial pattern tree is used to deal with the ill-formed sentence problem in a spoken-dialog system. Concept expansion based on domain ontology is also adopted to improve system performance. For performance evaluation, a medical dialog system with multiple services, including registration information, clinic information, and FAQ information, is implemented. Four performance measures were used separately for evaluation. The speech act identification rate was 86.2%. A task success rate of 77% was obtained. The contextual appropriateness of the system response was 78.5%. Finally, the rate for correct FAQ retrieval was 82%, an improvement of 15% over the keyword-based vector-space model. The results show the proposed ontology-based speech-act identification is effective for dialog management.
-
7. Omekwu, C.O.: Cataloguers in a global information network environment.
In: Electronic library. 26(2008) no.2, pp.188-201.
Abstract: Purpose - This paper aims to explore the traditional and emerging roles of cataloguing professionals in a global network information environment. That exploration becomes even more critical in view of migration of information resources into digital, electronic and virtual domains. Design/methodology/approach - An exploratory research design was adopted. The method was to first examine current issues in library and information practice with specific focus on digital technology, the electronic environment, automation, networking, the internet, cyberspace and virtual libraries. The next approach was to examine the challenges of operating in a globalized information environment. Findings - Cataloguers have key roles in knowledge segmentation, identification, organization and authentication. They are vital as content and system managers, software specialists and information retrieval system designers. In all, more than 23 roles are articulated for forward-looking cataloguing professionals. Originality/value - The paper's originality lies in its argument that roles are correlates of competencies and that as the practice of knowledge organization migrates to a dominantly global information network environment, cataloguers must upgrade their competencies in order to effectively operate in the emerging environment.
Subject area: Descriptive cataloguing
Country/Place: Nigeria
-
8. Wu, C.-H. ; Lee, T.-Z. ; Kao, S.-C.: Knowledge discovery applied to material acquisitions for libraries.
In: Information processing and management. 40(2004) no.4, pp.709-725.
Abstract: Since the quality of a library is not in the number of materials that are available, but in the number of materials that are actually utilized, this is what a material acquisitions operation should be concerned with. In support of this goal, the library management has been paying increased attention to the value of the usage data in support of a variety of managerial decisions. Although many approaches and research reports have been extensively used to help library material acquisitions, the knowledge contained in circulation databases has hardly ever been used to investigate in-depth how the acquired materials are being used. Thus, there may not be adequate indications on which the material acquisitions operation can rely when making decisions. This paper introduces a model based on knowledge discovery (KDBMLMA) that embeds a circulation statistics mechanism and an association rule discovery mechanism to help derive the utilization of library material categories. A practical application case is presented and managerial implications discussed in this research.
-
9. Wu, C.-J.: Experiments on using the Dublin Core to reduce the retrieval error ratio.
In: Journal of library and information science. 24(1998) no.1, pp.50-64.
Abstract: In order to test the power of metadata on information retrieval, an experiment was designed and conducted on a group of 7 graduate students using the Dublin Core as the cataloguing metadata. Results show that, on average, the retrieval error rate is only 2.9 per cent for the MES system (http://140.136.85.194), which utilizes the Dublin Core to describe the documents on the World Wide Web, in contrast to 20.7 per cent for the 7 famous search engines including HOTBOT, GAIS, LYCOS, EXCITE, INFOSEEK, YAHOO, and OCTOPUS. The very low error rate indicates that the users can use the information of the Dublin Core to decide whether to retrieve the documents or not.
Note: In Chinese
Subject area: Metadata ; Retrieval studies
Object: Dublin Core
-
10. Wu, C.-J.: Mapping the Dublin Core to the Chinese MARC.
In: Bulletin of the Library Association of China. 1998, no.60, Jun., pp.61-73.
Abstract: Discusses how to map the Dublin Core to the Chinese MARC that is used by most of the libraries in Taiwan.
Note: In Chinese
Subject area: Metadata ; Data formats
Object: Dublin Core ; CMARC
Country/Place: Taiwan
-
11. Wu, C.-J.: Metadata and future developments in cataloguing.
In: Journal of information, communication, and library science. 4(1997) no.2, pp.42-52.
Abstract: Metadata is a resource which can assist the information retrieval of digital documents on the Internet. In designing a metadata system, it is necessary to consider the characteristics of digital documents, such as the variety of file formats, frequent format transformation and the difficulty of distinguishing between the different versions. Provides a brief analysis of some existing metadata formats, and introduces several principles for the future development of cataloguing on the Internet. Briefly describes the Metadata Experimental System (MES) currently under development, and located at the author's homepage.
Content: Cf.: http://blue.lins.fju.edu.tw/~wu
Note: In Chinese
Subject area: Metadata ; Descriptive cataloguing
Form covered: Electronic documents