This database contains over 40,000 documents on topics in the fields of descriptive cataloging, subject indexing, and information retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (as of 28 April 2022)
1. Jiang, X. ; Zhu, X. ; Chen, J.: Main path analysis on cyclic citation networks.
In: Journal of the Association for Information Science and Technology. 71(2020) no.5, pp. 578-595.
Abstract: Main path analysis is a famous network-based method for understanding the evolution of a scientific domain. Most existing methods have two steps, weighting citation arcs based on search path counting and exploring main paths in a greedy fashion, with the assumption that citation networks are acyclic. The only available proposal that avoids manual cycle removal is to preprint transform a cyclic network to an acyclic counterpart. Through a detailed discussion about the issues concerning this approach, especially deriving the "de-preprinted" main paths for the original network, this article proposes an alternative solution with two-fold contributions. Based on the argument that a publication cannot influence itself through a citation cycle, the SimSPC algorithm is proposed to weight citation arcs by counting simple search paths. A set of algorithms are further proposed for main path exploration and extraction directly from cyclic networks based on a novel data structure main path tree. The experiments on two cyclic citation networks demonstrate the usefulness of the alternative solution. In the meanwhile, experiments show that publications in strongly connected components may sit on the turning points of main path networks, which signifies the necessity of a systematic way of dealing with citation cycles.
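The idea of weighting citation arcs by counting simple search paths can be illustrated with a small sketch. This is not the SimSPC algorithm from the article, whose details the abstract does not give; it is a brute-force enumeration that assumes arcs point in the direction of knowledge flow, that sources are nodes without incoming arcs, and that sinks are nodes without outgoing arcs. The function name `simple_path_arc_weights` is hypothetical.

```python
from collections import defaultdict


def simple_path_arc_weights(arcs):
    """Weight each arc by the number of simple source-to-sink paths using it.

    Brute-force illustration only (exponential in the worst case);
    not the article's SimSPC algorithm.
    """
    successors = defaultdict(list)
    nodes = set()
    has_incoming = set()
    for u, v in arcs:
        successors[u].append(v)
        nodes.update((u, v))
        has_incoming.add(v)

    # Assumption: sources have no incoming arcs, sinks have no outgoing arcs.
    sources = [n for n in nodes if n not in has_incoming]
    sinks = {n for n in nodes if not successors.get(n)}

    weights = defaultdict(int)

    def walk(node, path, visited):
        if node in sinks:
            # A complete simple path: credit every arc on it once.
            for arc in zip(path, path[1:]):
                weights[arc] += 1
            return
        for nxt in successors[node]:
            if nxt not in visited:  # never revisit a node, so paths stay simple
                visited.add(nxt)
                path.append(nxt)
                walk(nxt, path, visited)
                path.pop()
                visited.remove(nxt)

    for s in sorted(sources):
        walk(s, [s], {s})
    return dict(weights)


# Toy network with a citation cycle between B and C.
arcs = [("A", "B"), ("B", "C"), ("C", "B"), ("B", "D"), ("C", "D")]
weights = simple_path_arc_weights(arcs)
```

In this toy network the only simple paths from A to D are A-B-D and A-B-C-D, so the arc C to B, which can only be reached by re-entering B, receives no weight. That mirrors the argument that a publication cannot influence itself through a citation cycle.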
2. Qiao, C. ; Hu, X.: A joint neural network model for combining heterogeneous user data sources : an example of at-risk student prediction.
In: Journal of the Association for Information Science and Technology. 71(2020) no.10, pp. 1192-1204.
Abstract: Information service providers often require evidence from multiple, heterogeneous information sources to better characterize users and offer personalized service. In many cases, static information (for example, users' profiles) and sequentially dynamic information (for example, logs of interaction with information systems) are two prominent sources that can be combined to achieve optimized results. Previous attempts in combining these two sources mainly exploited models designed for either static or sequential information, but not both. This study aims to fill the gap by proposing a novel joint neural network model that can naturally fit both static and sequential user data. To evaluate the effectiveness of the proposed method, this study uses the problem of at-risk student prediction as an example where both static data (personal profiles) and sequential data (event logs) are involved. A thorough evaluation was conducted on an open data set, with comparisons to a range of existing approaches including both static and sequential models. The results reveal superb performances of the proposed method. Implications of the findings on further research and applications of joint models are discussed.
3. Hu, X. ; Rousseau, R.: Do citation chimeras exist? : The case of under-cited influential articles suffering delayed recognition.
In: Journal of the Association for Information Science and Technology. 70(2019) no.5, pp. 499-508.
Abstract: In this study we investigate if articles suffering delayed recognition can at the same time be under-cited influential articles. Theoretically these two types of articles are independent, in the sense that suffering delayed recognition depends on the number and time distribution of received citations, while being an under-cited influential article depends only partially on the number of received (first generation) citations, and much more on second and third citation generations. Among 49 articles suffering delayed recognition we found 13 that are also under-cited influential. Based on a thorough investigation of these special cases we found that so-called authoritative citers play an important role in uniting the two different document types into a special citation chimera. Our investigation contributes to the classification of publications.
Content: Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24115.
Subject area: Informetrics ; Citation indexing
4. Zhu, X. ; Freeman, M.A.: An evaluation of U.S. municipal open data portals : a user interaction framework.
In: Journal of the Association for Information Science and Technology. 70(2019) no.1, pp. 27-37.
Abstract: As an increasing number of open government data (OGD) portals are created, an evaluation method is needed to assess these portals. In this study, we drew from the existing principles and evaluation methods to develop a User Interaction Framework, with concrete criteria in five dimensions: Access, Trust, Understand, Engage-integrate, and Participate. The framework was then used to evaluate the current OGD sites created and maintained by 34 U.S. municipal government agencies. The results show that, overall, portals perform well in terms of providing access, but not so well in helping users understand and engage with data. These findings indicate room for improvement in multiple areas and suggest potential roles for information professionals as data mediators. The study also reveals that portals using the Socrata platform performed better, regarding user access, trust, engagement, and participation. However, the variability among portals indicates that some portals should improve their platforms to achieve greater user engagement and participation. In addition, city governments need to develop clear plans about what data should be available and how to make them available to their public.
Content: Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24081.
Subject area: Information Gateway
5. Hu, X. ; Ng, J. ; Xia, S.: User-Centered evaluation of metadata schema for nonmovable cultural heritage : murals and stone cave temples.
In: Journal of the Association for Information Science and Technology. 69(2018) no.12, pp. 1476-1487.
Abstract: Digitization provides a solution for documentation and preservation of nonmovable cultural heritages. Despite efforts for the preservation of cultural heritages around the world, no well-accepted metadata schema has been developed for murals and stone cave temples, which are often high-value heritages built in ancient times. In addition, the literature is scarce on the user-centered evaluation of metadata schemas of this kind. This study therefore aims to offer insights on developing and evaluating a metadata schema for organizing information of these historic and complex cultural heritages. In-depth interviews were conducted with a total of 30 users, including 18 professional and 12 public users, and interview transcripts were coded through a qualitative content analysis approach. Findings reveal the importance of specific metadata elements as perceived by the two groups of end users, which correlated with their cultural heritage information-seeking behaviors. In addition, the issues of standardization of cataloging of cultural heritage information and interoperability among metadata schemas have been raised by users for enhancing the user experience with digital platforms of cultural heritage information. The coding schema developed in this study can serve as a framework for follow-up evaluations of metadata schemas, contributing to the ongoing development of cultural heritage metadata.
6. Hu, X. ; Lee, J.H. ; Bainbridge, D. ; Choi, K. ; Organisciak, P. ; Downie, J.S.: The MIREX grand challenge : a framework of holistic user-experience evaluation in music information retrieval.
In: Journal of the Association for Information Science and Technology. 68(2017) no.1, pp. 97-112.
Abstract: Music Information Retrieval (MIR) evaluation has traditionally focused on system-centered approaches where components of MIR systems are evaluated against predefined data sets and golden answers (i.e., ground truth). There are two major limitations of such system-centered evaluation approaches: (a) The evaluation focuses on subtasks in music information retrieval, but not on entire systems and (b) users and their interactions with MIR systems are largely excluded. This article describes the first implementation of a holistic user-experience evaluation in MIR, the MIREX Grand Challenge, where complete MIR systems are evaluated, with user experience being the single overarching goal. It is the first time that complete MIR systems have been evaluated with end users in a realistic scenario. We present the design of the evaluation task, the evaluation criteria and a novel evaluation interface, and the data-collection platform. This is followed by an analysis of the results, reflection on the experience and lessons learned, and plans for future directions.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23618/full.
7. Hu, X. ; Choi, K. ; Downie, J.S.: A framework for evaluating multimodal music mood classification.
In: Journal of the Association for Information Science and Technology. 68(2017) no.2, pp. 273-285.
Abstract: This research proposes a framework for music mood classification that uses multiple and complementary information sources, namely, music audio, lyric text, and social tags associated with music pieces. This article presents the framework and a thorough evaluation of each of its components. Experimental results on a large data set of 18 mood categories show that combining lyrics and audio significantly outperformed systems using audio-only features. Automatic feature selection techniques were further proved to have reduced feature space. In addition, the examination of learning curves shows that the hybrid systems using lyrics and audio needed fewer training samples and shorter audio clips to achieve the same or better classification accuracies than systems using lyrics or audio singularly. Last but not least, performance comparisons reveal the relative importance of audio and lyric features across mood categories.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23649/full.
8. Hu, X. ; Kando, N.: Task complexity and difficulty in music information retrieval.
In: Journal of the Association for Information Science and Technology. 68(2017) no.7, pp. 1711-1723.
Abstract: There has been little research on task complexity and difficulty in music information retrieval (MIR), whereas many studies in the text retrieval domain have found that task complexity and difficulty have significant effects on user effectiveness. This study aimed to bridge the gap by exploring i) the relationship between task complexity and difficulty; ii) factors affecting task difficulty; and iii) the relationship between task difficulty, task complexity, and user search behaviors in MIR. An empirical user experiment was conducted with 51 participants and a novel MIR system. The participants searched for 6 topics across 3 complexity levels. The results revealed that i) perceived task difficulty in music search is influenced by task complexity, user background, system affordances, and task uncertainty and enjoyability; and ii) perceived task difficulty in MIR is significantly correlated with effectiveness metrics such as the number of songs found, number of clicks, and task completion time. The findings have implications for the design of music search tasks (in research) or use cases (in system development) as well as future MIR systems that can detect task difficulty based on user effectiveness metrics.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23803/full.
9. Hu, X. ; Yang, Y.-H.: The mood of Chinese Pop music : representation and recognition.
In: Journal of the Association for Information Science and Technology. 68(2017) no.8, pp. 1899-1910.
Abstract: Music mood recognition (MMR) has attracted much attention in music information retrieval research, yet there are few MMR studies that focus on non-Western music. In addition, little has been done on connecting the 2 most adopted music mood representation models: categorical and dimensional. To bridge these gaps, we constructed a new data set consisting of 818 Chinese Pop (C-Pop) songs, 3 complete sets of mood annotations in both representations, as well as audio features corresponding to 5 distinct categories of musical characteristics. The mood space of C-Pop songs was analyzed and compared to that of Western Pop songs. We also explored the relationship between categorical and dimensional annotations and the results revealed that one set of annotations could be reliably predicted by the other. Classification and regression experiments were conducted on the data set, providing benchmarks for future research on MMR of non-Western music. Based on these analyses, we reflect and discuss the implications of the findings to MMR research.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23813/full.
10. Pan, X. ; He, S. ; Zhu, X. ; Fu, Q.: How users employ various popular tags to annotate resources in social tagging : an empirical study.
In: Journal of the Association for Information Science and Technology. 67(2016) no.5, pp. 1121-1137.
Abstract: This paper focuses on exploring the usage patterns and regularities of co-employment of various popular tags and their relationships with the activeness of users and the interest level of resources in social tagging. A hypernetwork for social tagging is constructed in which a tagging action is expressed as a hyperedge and the user, resource, and tag are expressed as nodes. Quantitative measures for the constructed hypernetwork are defined, including the hyperdegree and its distribution, the excess average hyperdegree, and the hyperdegree conditional probability distribution. Using the data set from Delicious, an empirical study was conducted. The empirical results show that multiple individual tags and one or very few popular tags are generally employed together in one tagging action, and the usage patterns and regularities of tags with varying popularity are correlated to both user activity and resource interest. The empirical results are further discussed and explained from the perspectives of tag functions and motivations. Finally, suggestions regarding the usage of various popular tags for both tagging users and service providers of social tagging are given.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23478/abstract.
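The hypernetwork described in the abstract for entry 10, with one hyperedge per tagging action over user, resource, and tag nodes, can be sketched as follows. This is a minimal illustration that assumes a node's hyperdegree is the number of hyperedges containing it; the record layout `(user, resource, tags)` and both function names are hypothetical, and the excess average hyperdegree and conditional distribution from the paper are not reproduced.

```python
from collections import Counter


def hyperdegrees(tagging_actions):
    """Count, for every node, the hyperedges (tagging actions) containing it."""
    degree = Counter()
    for user, resource, tags in tagging_actions:
        degree[("user", user)] += 1
        degree[("resource", resource)] += 1
        for tag in set(tags):  # a tag counts once per tagging action
            degree[("tag", tag)] += 1
    return degree


def hyperdegree_distribution(degree, kind):
    """Share of nodes of one kind ('user', 'resource', 'tag') per hyperdegree."""
    values = [d for (k, _), d in degree.items() if k == kind]
    counts = Counter(values)
    return {d: c / len(values) for d, c in sorted(counts.items())}


# Three toy tagging actions; the data are invented for illustration.
actions = [
    ("u1", "r1", ["python", "code"]),
    ("u2", "r1", ["python"]),
    ("u1", "r2", ["python", "web"]),
]
degree = hyperdegrees(actions)
tag_distribution = hyperdegree_distribution(degree, "tag")
```

Here the tag "python" appears in all three hyperedges while "code" and "web" appear in one each, so one popular tag co-occurs with several individual tags.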
11. Zhu, X. ; Turney, P. ; Lemire, D. ; Vellino, A.: Measuring academic influence : not all citations are equal.
In: Journal of the Association for Information Science and Technology. 66(2015) no.2, pp. 408-427.
Abstract: The importance of a research article is routinely measured by counting how many times it has been cited. However, treating all citations with equal weight ignores the wide variety of functions that citations perform. We want to automatically identify the subset of references in a bibliography that have a central academic influence on the citing paper. For this purpose, we examine the effectiveness of a variety of features for determining the academic influence of a citation. By asking authors to identify the key references in their own work, we created a data set in which citations were labeled according to their academic influence. Using automatic feature selection with supervised machine learning, we found a model for predicting academic influence that achieves good performance on this data set using only four features. The best features, among those we evaluated, were those based on the number of times a reference is mentioned in the body of a citing paper. The performance of these features inspired us to design an influence-primed h-index (the hip-index). Unlike the conventional h-index, it weights citations by how many times a reference is mentioned. According to our experiments, the hip-index is a better indicator of researcher performance than the conventional h-index.
Content: Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23179/abstract.
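The contrast between the conventional h-index and an index that weights citations by mention counts, as described in the abstract for entry 11, can be sketched as follows. This is one plausible reading, not the paper's exact definition: it assumes each citation contributes the number of times the cited paper is mentioned in the body of the citing paper, and that the h-index rule is then applied to the weighted totals. The function names and data are hypothetical.

```python
def h_index(scores):
    """Largest h such that at least h papers have a score of at least h."""
    h = 0
    for rank, score in enumerate(sorted(scores, reverse=True), start=1):
        if score >= rank:
            h = rank
        else:
            break
    return h


def conventional_h_index(mentions_per_paper):
    """Every citing paper counts once, however often it mentions the work."""
    return h_index([len(mentions) for mentions in mentions_per_paper])


def mention_weighted_h_index(mentions_per_paper):
    """Assumed reading of the hip-index: each citation counts its mentions."""
    return h_index([sum(mentions) for mentions in mentions_per_paper])


# One inner list per paper by the researcher; each number is how often one
# citing paper mentions it. Invented data for illustration.
papers = [[3, 1, 2], [1, 2], [4], [1]]
plain = conventional_h_index(papers)
weighted = mention_weighted_h_index(papers)
```

With these toy numbers the citation counts are 3, 2, 1, 1 and the mention-weighted totals are 6, 3, 4, 1, so the weighted variant credits a third paper that the plain count does not.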
12. Hu, X. ; Rousseau, R. ; Chen, J.: A new approach for measuring the value of patents based on structural indicators for ego patent citation networks.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.9, pp. 1834-1842.
Abstract: Technology sectors differ in terms of technological complexity. When studying technology and innovation through patent analysis it is well known that similar amounts of technological knowledge can produce different numbers of patented innovation as output. A new multilayered approach to measure the technological value of patents based on ego patent citation networks (PCNs) is developed in this study. The results show that the structural indicators for the ego PCN developed in this contribution can characterize groups of patents and, hence, in an indirect way, the health of companies.
13. Hu, X.: Loads of special authorship functions : linear growth in the percentage of "equal first authors" and corresponding authors.
In: Journal of the American Society for Information Science and Technology. 60(2009) no.11, pp. 2378-2381.
Abstract: We show that between 1999 and 2008 the percentage of articles with more than one corresponding author or with several authors that contributed equally, leading to so-called equal first authors, has steadily been on the rise. Increasing numbers of corresponding authors and equally contributing authors may lead to increased stress on teamwork if not properly acknowledged in research evaluation exercises.
14. Hu, G. ; Zhou, S. ; Guan, J. ; Hu, X.: Towards effective document clustering : a constrained K-means based approach.
In: Information processing and management. 44(2008) no.4, pp. 1397-1409.
Abstract: Document clustering is an important tool for document collection organization and browsing. In real applications, some limited knowledge about cluster membership of a small number of documents is often available, such as some pairs of documents belonging to the same cluster. This kind of prior knowledge can serve as constraints for the clustering process. We integrate the constraints into the trace formulation of the sum of squared Euclidean distance function of K-means. Then, the combined criterion function is transformed into trace maximization, which is further optimized by eigen-decomposition. Our experimental evaluation shows that the proposed semi-supervised clustering method can achieve better performance, compared to three existing methods.
Subject area: Automatic classification
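For orientation, the standard trace formulation of the unconstrained K-means objective that the abstract for entry 14 builds on can be written as below. The notation is assumed rather than taken from the paper: $X$ is the $d \times n$ matrix whose columns are the document vectors $x_i$, $C_j$ is cluster $j$ with $n_j$ members and centroid $m_j$, and $Y$ is the $n \times k$ normalized indicator matrix. How the paper adds the pairwise constraints to this criterion is not stated in the abstract and is not shown here.

```latex
\begin{align}
J &= \sum_{j=1}^{k} \sum_{i \in C_j} \lVert x_i - m_j \rVert^2
   = \sum_{i=1}^{n} \lVert x_i \rVert^2 - \sum_{j=1}^{k} n_j \lVert m_j \rVert^2 \\
  &= \operatorname{tr}\!\left(X^{\top} X\right)
   - \operatorname{tr}\!\left(Y^{\top} X^{\top} X\, Y\right),
\qquad
Y_{ij} =
\begin{cases}
1/\sqrt{n_j} & \text{if } i \in C_j,\\
0 & \text{otherwise.}
\end{cases}
\end{align}
% Since tr(X^T X) is constant, minimizing J is equivalent to
\begin{equation}
\max_{Y^{\top} Y = I_k} \operatorname{tr}\!\left(Y^{\top} X^{\top} X\, Y\right),
\end{equation}
% and dropping the indicator structure of Y (keeping only orthonormality)
% gives a relaxation solved by the k leading eigenvectors of X^T X.
```

The second equality in the first line holds because column $j$ of $XY$ equals $\sqrt{n_j}\, m_j$, so the trace term sums $n_j \lVert m_j \rVert^2$ over clusters.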
15. Kim, K.-S. ; Kim, S.-C.J. ; Park, S.-J. ; Zhu, X. ; Polparsi, J.: Facet analyses of categories used in Web directories : a comparative study.
Abstract: Faceted classification is believed to be suitable for organizing digital information resources. Based on a faceted classification model suggested for Web resources (Zins, 2002), the current study analyzed popular Web directories from different Asian countries/areas and examined cultural differences reflected in their classification systems. Three popular Web directories from each of four countries/regions (China, Hong Kong, Korea, and Thailand) were selected and their classifications were analyzed and compared: a local Yahoo and two home-grown Web directories from each country/region. Based on the findings, the study suggests a model that might be more suitable to Asian culture.
Content: Paper presented at the 72nd IFLA General Conference and Council, 20-24 August 2006, Seoul, Korea