Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 21. Januar 2019)
1Luo, P. ; Chen, K. ; Wu, C. ; Li, Y.: Exploring the social influence of multichannel access in an online health community.
In: Journal of the Association for Information Science and Technology. 69(2018) no.1, S.98-109.
Abstract: Social influence has a great impact on human behavior, which has been widely investigated in various research fields. Even so, it has rarely been investigated in the online health community. In this paper, we focus on the multichannel access in online health communities, defining social influence as the average degree of multichannel access to a physician's colleagues. Based on the multinomial logistic regression model, we examined the direct effects of social influence and patients' rating to multichannel access. In addition, we explored the moderating effect of social influence on the relationship between patients' rating and multichannel access in online health communities. The results of the experiment and robustness testing support the propositions that social influence and patients' rating significantly and positively affect multichannel access in an online health community. The moderating effect of social influence is negative and significantly influences the accessible channels provided by the focal physician. This research contributes to the literature concerning online health communities, social influence, and multichannel access; it also has practical implications.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23928/full.
2Song, J. ; Huang, Y. ; Qi, X. ; Li, Y. ; Li, F. ; Fu, K. ; Huang, T.: Discovering hierarchical topic evolution in time-stamped documents.
In: Journal of the Association for Information Science and Technology. 67(2016) no.4, S.915-927.
Abstract: The objective of this paper is to propose a hierarchical topic evolution model (HTEM) that can organize time-varying topics in a hierarchy and discover their evolutions with multiple timescales. In the proposed HTEM, topics near the root of the hierarchy are more abstract and also evolve in the longer timescales than those near the leaves. To achieve this goal, the distance-dependent Chinese restaurant process (ddCRP) is extended to a new nested process that is able to simultaneously model the dependencies among data and the relationship between clusters. The HTEM is proposed based on the new process for time-stamped documents, in which the timestamp is utilized to measure the dependencies among documents. Moreover, an efficient Gibbs sampler is developed for the proposed HTEM. Our experimental results on two popular real-world data sets verify that the proposed HTEM can capture coherent topics and discover their hierarchical evolutions. It also outperforms the baseline model in terms of likelihood on held-out data.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23439/abstract.
Themenfeld: Data Mining
3Arora, S.K. ; Li, Y. ; Youtie, J. ; Shapira, P.: Using the wayback machine to mine websites in the social sciences : a methodological resource.
In: Journal of the Association for Information Science and Technology. 67(2016) no.8, S.1904-1915.
Abstract: Websites offer an unobtrusive data source for developing and analyzing information about various types of social science phenomena. In this paper, we provide a methodological resource for social scientists looking to expand their toolkit using unstructured web-based text, and in particular, with the Wayback Machine, to access historical website data. After providing a literature review of existing research that uses the Wayback Machine, we put forward a step-by-step description of how the analyst can design a research project using archived websites. We draw on the example of a project that analyzes indicators of innovation activities and strategies in 300 U.S. small- and medium-sized enterprises in green goods industries. We present six steps to access historical Wayback website data: (a) sampling, (b) organizing and defining the boundaries of the web crawl, (c) crawling, (d) website variable operationalization, (e) integration with other data sources, and (f) analysis. Although our examples draw on specific types of firms in green goods industries, the method can be generalized to other areas of research. In discussing the limitations and benefits of using the Wayback Machine, we note that both machine and human effort are essential to developing a high-quality data set from archived web information.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23503/abstract.
4Crespo, J.A. ; Herranz, N. ; Li, Y. ; Ruiz-Castillo, J.: ¬The effect on citation inequality of differences in citation practices at the web of science subject category level.
In: Journal of the Association for Information Science and Technology. 65(2014) no.6, S.1244-1256.
Abstract: This article studies the impact of differences in citation practices at the subfield, or Web of Science subject category level, using the model introduced in Crespo, Li, and Ruiz-Castillo (2013a), according to which the number of citations received by an article depends on its underlying scientific influence and the field to which it belongs. We use the same Thomson Reuters data set of about 4.4 million articles used in Crespo et al. (2013a) to analyze 22 broad fields. The main results are the following: First, when the classification system goes from 22 fields to 219 subfields the effect on citation inequality of differences in citation practices increases from ?14% at the field level to 18% at the subfield level. Second, we estimate a set of exchange rates (ERs) over a wide [660, 978] citation quantile interval to express the citation counts of articles into the equivalent counts in the all-sciences case. In the fractional case, for example, we find that in 187 of 219 subfields the ERs are reliable in the sense that the coefficient of variation is smaller than or equal to 0.10. Third, in the fractional case the normalization of the raw data using the ERs (or subfield mean citations) as normalization factors reduces the importance of the differences in citation practices from 18% to 3.8% (3.4%) of overall citation inequality. Fourth, the results in the fractional case are essentially replicated when we adopt a multiplicative approach.
Objekt: Web of Science
5Liu, J.S. ; Chen, H.-H. ; Ho, M.H.-C. ; Li, Y.-C.: Citations with different levels of relevancy : tracing the main paths of legal opinions.
In: Journal of the Association for Information Science and Technology. 65(2014) no.12, S.2479-2488.
Abstract: This study explores the effect from considering citation relevancy in the main path analysis. Traditional citation-based analyses treat all citations equally even though there can be various reasons and different levels of relevancy for one document to reference another. Taking the relevancy level into consideration is intuitively advantageous because it adopts more accurate information and will thus make the results of a citation-based analysis more trustworthy. This is nevertheless a challenging task. We are aware of no citation-based analysis that has taken the relevancy level into consideration. The difficulty lies in the fact that the existing patent or patent citation database provides no readily available relevancy level information. We overcome this issue by obtaining citation relevancy information from a legal database that has relevancy level ranked by legal experts. This paper selects trademark dilution, a legal concept that has been the subject of many lawsuit cases, as the target for exploration. We apply main path analysis, taking citation relevancy into consideration, and verify the results against a set of test cases that are mentioned in an authoritative trademark book. The findings show that relevancy information helps main path analysis uncover legal cases of higher importance. Nevertheless, in terms of the number of significant cases retrieved, relevancy information does not seem to make a noticeable difference.
6Li, Y. ; Xu, S. ; Luo, X. ; Lin, S.: ¬A new algorithm for product image search based on salient edge characterization.
In: Journal of the Association for Information Science and Technology. 65(2014) no.12, S.2534-2551.
Abstract: Visually assisted product image search has gained increasing popularity because of its capability to greatly improve end users' e-commerce shopping experiences. Different from general-purpose content-based image retrieval (CBIR) applications, the specific goal of product image search is to retrieve and rank relevant products from a large-scale product database to visually assist a user's online shopping experience. In this paper, we explore the problem of product image search through salient edge characterization and analysis, for which we propose a novel image search method coupled with an interactive user region-of-interest indication function. Given a product image, the proposed approach first extracts an edge map, based on which contour curves are further extracted. We then segment the extracted contours into fragments according to the detected contour corners. After that, a set of salient edge elements is extracted from each product image. Based on salient edge elements matching and similarity evaluation, the method derives a new pairwise image similarity estimate. Using the new image similarity, we can then retrieve product images. To evaluate the performance of our algorithm, we conducted 120 sessions of querying experiments on a data set comprised of around 13k product images collected from multiple, real-world e-commerce websites. We compared the performance of the proposed method with that of a bag-of-words method (Philbin, Chum, Isard, Sivic, & Zisserman, 2008) and a Pyramid Histogram of Orientated Gradients (PHOG) method (Bosch, Zisserman, & Munoz, 2007). Experimental results demonstrate that the proposed method improves the performance of example-based product image retrieval.
Behandelte Form: Bilder
7Cao, Q. ; Lu, Y. ; Dong, D. ; Tang, Z. ; Li, Y.: ¬The roles of bridging and bonding in social media communities.
In: Journal of the American Society for Information Science and Technology. 64(2013) no.8, S.1671-1681.
Abstract: Social media communities have emerged recently as open and free communication platforms to support real-time information sharing among members. Drawing on social capital theories, we develop a theoretical model to investigate how the two types of social capital (bonding and bridging) contribute to the individual and collective well-being of virtual communities through information exchange. Research hypotheses were tested through survey instruments and computer archive data of 475 members of a large social network site during the Wenchuan earthquake (2008) in China. We find that bonding has a positive and significant impact on bridging. Both bonding and bridging have positive and significant impacts on information quality, but not on information quantity. Results also suggest that information quality is more critical to individuals and collective well-being than information quantity after a disaster.
8Shen, J. ; Yao, L. ; Li, Y. ; Clarke, M. ; Wang, L. ; Li, D.: Visualizing the history of evidence-based medicine : a bibliometric analysis.
In: Journal of the American Society for Information Science and Technology. 64(2013) no.10, S.2157-2172.
Abstract: The aim of this paper is to visualize the history of evidence-based medicine (EBM) and to examine the characteristics of EBM development in China and the West. We searched the Web of Science and the Chinese National Knowledge Infrastructure database for papers related to EBM. We applied information visualization techniques, citation analysis, cocitation analysis, cocitation cluster analysis, and network analysis to construct historiographies, themes networks, and chronological theme maps regarding EBM in China and the West. EBM appeared to develop in 4 stages: incubation (1972-1992 in the West vs. 1982-1999 in China), initiation (1992-1993 vs. 1999-2000), rapid development (1993-2000 vs. 2000-2004), and stable distribution (2000 onwards vs. 2004 onwards). Although there was a lag in EBM initiation in China compared with the West, the pace of development appeared similar. Our study shows that important differences exist in research themes, domain structures, and development depth, and in the speed of adoption between China and the West. In the West, efforts in EBM have shifted from education to practice, and from the quality of evidence to its translation. In China, there was a similar shift from education to practice, and from production of evidence to its translation. In addition, this concept has diffused to other healthcare areas, leading to the development of evidence-based traditional Chinese medicine, evidence-based nursing, and evidence-based policy making.
9Yang, M. ; Kiang, M. ; Chen, H. ; Li, Y.: Artificial immune system for illicit content identification in social media.
In: Journal of the American Society for Information Science and Technology. 63(2012) no.2, S.256-269.
Abstract: Social media is frequently used as a platform for the exchange of information and opinions as well as propaganda dissemination. But online content can be misused for the distribution of illicit information, such as violent postings in web forums. Illicit content is highly distributed in social media, while non-illicit content is unspecific and topically diverse. It is costly and time consuming to label a large amount of illicit content (positive examples) and non-illicit content (negative examples) to train classification systems. Nevertheless, it is relatively easy to obtain large volumes of unlabeled content in social media. In this article, an artificial immune system-based technique is presented to address the difficulties in the illicit content identification in social media. Inspired by the positive selection principle in the immune system, we designed a novel labeling heuristic based on partially supervised learning to extract high-quality positive and negative examples from unlabeled datasets. The empirical evaluation results from two large hate group web forums suggest that our proposed approach generally outperforms the benchmark techniques and exhibits more stable performance.
10Li, Y. ; Belkin, N.J.: ¬An exploration of the relationships between work task and interactive information search behavior.
In: Journal of the American Society for Information Science and Technology. 61(2010) no.9, S.1771-1789.
Abstract: This study explores the relationships between work task and interactive information search behavior. Work task was conceptualized based on a faceted classification of task. An experiment was conducted with six work-task types and simulated work-task situations assigned to 24 participants. The results indicate that users present different behavior patterns to approach useful information for different work tasks: They select information systems to search based on the work tasks at hand, different work tasks motivate different types of search tasks, and different facets controlled in the study play different roles in shaping users' interactive information search behavior. The results provide empirical evidence to support the view that work tasks and search tasks play different roles in a user's interaction with information systems and that work task should be considered as a multifaceted variable. The findings provide a possibility to make predictions of a user's information search behavior from his or her work task, and vice versa. Thus, this study sheds light on task-based information seeking and search, and has implications in adaptive information retrieval (IR) and personalization of IR.
11Li, Y.: Exploring the relationships between work task and search task in information search.
In: Journal of the American Society for Information Science and Technology. 60(2009) no.2, S.275-291.
Abstract: To provide a basis for making predictions of the characteristics of search task (ST), based on work task (WT), and to explore the nature of WT and ST, this study examines the relationships between WT and ST (inter-relationships) and the relationships between the different facets of both WT and ST (intra-relationships), respectively. A faceted classification of task was used to conceptualize work task and search task. Twenty-four pairs of work tasks and their associated search tasks were collected, by semistructured interviews, and classified based on the classification. The results indicate that work task shapes different facets or sub-facets of its associated search tasks to different degrees. Several sub-facets of search task, such as Time (Length), Objective task complexity, and Subjective task complexity, are most strongly affected by work task. The results demonstrate that it is necessary to consider difficulty and complexity as different constructs when investigating their influence on information search behavior. The exploration of intra-relationships illustrates the difference of work task and search task in their nature. The findings provide empirical evidence to support the view that work task and search task are multi-faceted variables and their different effects on users' information search behavior should be examined.
12Zhang, Y. ; Li, Y.: ¬A user-centered functional metadata evaluation of moving image collections.
In: Journal of the American Society for Information Science and Technology. 59(2008) no.8, S.1331-1346.
Abstract: In this article, the authors report a series of evaluations of two metadata schemes developed for Moving Image Collections (MIC), an integrated online catalog of moving images. Through two online surveys and one experiment spanning various stages of metadata implementation, the MIC evaluation team explored a user-centered approach in which the four generic user tasks suggested by IFLA FRBR (International Association of Library Associations Functional Requirement for Bibliographic Records) were embedded in data collection and analyses. Diverse groups of users rated usefulness of individual metadata fields for finding, identifying, selecting, and obtaining moving images. The results demonstrate a consistency across these evaluations with respect to (a) identification of a set of useful metadata fields highly rated by target users for each of the FRBR generic tasks, and (b) indication of a significant interaction between MIC metadata fields and the FRBR generic tasks. The findings provide timely feedback for the MIC implementation specifically, and valuable suggestions to other similar metadata application settings in general. They also suggest the feasibility of using the four IFLA FRBR generic tasks as a framework for user-centered functional metadata evaluations.
Behandelte Form: Videos ; Filme
13Zhang, X. ; Li, Y. ; Liu, J. ; Zhang, Y.: Effects of interaction design in digital libraries on user interactions.
In: Journal of documentation. 64(2008) no.3, S.438-463.
Abstract: Purpose - This study aims to investigate the effects of different search and browse features in digital libraries (DLs) on task interactions, and what features would lead to poor user experience. Design/methodology/approach - Three operational DLs: ACM, IEEE CS, and IEEE Xplore are used in this study. These three DLs present different features in their search and browsing designs. Two information-seeking tasks are constructed: one search task and one browsing task. An experiment was conducted in a usability laboratory. Data from 35 participants are collected on a set of measures for user interactions. Findings - The results demonstrate significant differences in many aspects of the user interactions between the three DLs. For both search and browse designs, the features that lead to poor user interactions are identified. Research limitations/implications - User interactions are affected by specific design features in DLs. Some of the design features may lead to poor user performance and should be improved. The study was limited mainly in the variety and the number of tasks used. Originality/value - The study provided empirical evidence to the effects of interaction design features in DLs on user interactions and performance. The results contribute to our knowledge about DL designs in general and about the three operational DLs in particular.
Themenfeld: Information Gateway ; Benutzerstudien ; Suchoberflächen
14Li, Y. ; Belkin, N.J.: ¬A faceted approach to conceptualizing tasks in information seeking.
In: Information processing and management. 44(2008) no.6, S.1822-1837.
Abstract: The nature of the task that leads a person to engage in information interaction, as well as of information seeking and searching tasks, have been shown to influence individuals' information behavior. Classifying tasks in a domain has been viewed as a departure point of studies on the relationship between tasks and human information behavior. However, previous task classification schemes either classify tasks with respect to the requirements of specific studies or merely classify a certain category of task. Such approaches do not lead to a holistic picture of task since a task involves different aspects. Therefore, the present study aims to develop a faceted classification of task, which can incorporate work tasks and information search tasks into the same classification scheme and characterize tasks in such a way as to help people make predictions of information behavior. For this purpose, previous task classification schemes and their underlying facets are reviewed and discussed. Analysis identifies essential facets and categorizes them into Generic facets of task and Common attributes of task. Generic facets of task include Source of task, Task doer, Time, Action, Product, and Goal. Common attributes of task includes Task characteristics and User's perception of task. Corresponding sub-facets and values are identified as well. In this fashion, a faceted classification of task is established which could be used to describe users' work tasks and information search tasks. This faceted classification provides a framework to further explore the relationships among work tasks, search tasks, and interactive information retrieval and advance adaptive IR systems design.
Anmerkung: Beitrag in einem Themenheft "Adaptive information retrieval"
15Li, Y. ; Shawe-Taylor, J.: Advanced learning algorithms for cross-language patent retrieval and classification.
In: Information processing and management. 43(2007) no.5, S.1183-1199.
Abstract: We study several machine learning algorithms for cross-language patent retrieval and classification. In comparison with most of other studies involving machine learning for cross-language information retrieval, which basically used learning techniques for monolingual sub-tasks, our learning algorithms exploit the bilingual training documents and learn a semantic representation from them. We study Japanese-English cross-language patent retrieval using Kernel Canonical Correlation Analysis (KCCA), a method of correlating linear relationships between two variables in kernel defined feature spaces. The results are quite encouraging and are significantly better than those obtained by other state of the art methods. We also investigate learning algorithms for cross-language document classification. The learning algorithm are based on KCCA and Support Vector Machines (SVM). In particular, we study two ways of combining the KCCA and SVM and found that one particular combination called SVM_2k achieved better results than other learning algorithms for either bilingual or monolingual test documents.
Anmerkung: Beitrag innerhalb eines Themenschwerpunkt "special issue on patent processing"
16Li, Y.: Consistency versus inconsistency : issues in Chinese cataloging in OCLC.
In: Cataloging and classification quarterly. 38(2004) no.2, S.17-xx.
Abstract: This article addresses some unresolved cataloging issue related to pinyin Romanization, vernacular application, field coding, and other aspects of Chinese cataloging in OCLC. These issues lead to inconsistencies in the way Chinese materials are cataloged, though cataloging standards and Romanization rules are made and the processes of the projects like Pinyin Conversion, Manual Review, and Pinyin Clean-Up have been completed. In this article, eight of the most commonly encountered issues and inconsistent practices in Chinese cataloging are discussed. Examples from Chinese records created with OCLC CJK software in WorldCat are used to demonstrate the problems they raise. With the discussion it is hoped that these inconsistent practices can be recognized and avoided in the future.
17Li, Y.-O. ; Leung, S.W.: Computer cataloging of electronic Journals in unstable Aggregator Databases the Hong Kong Baptist University Library experience.
In: Library resources and technical services. 45(2001) no.4, S.198-211.
Abstract: The growth and use of aggregator databases have presented libraries with both new opportunities for collection enhancement and new challenges of bibliographic control. How to integrate full-text electronic journal titles in unstable aggregator databases into a library's OPAC (Online Public Access Catalog) has been an especially taxing matter for libraries. This article describes the Hong Kong Baptist University Library's effort to integrate full-text electronic journal titles from three large, unstable aggregator databases into its INNOPAC-based OPAC. The library's electronic journal computer program (EJCOP) does this in a simple, direct, consistent, and accurate manner and addresses some of the issues elaborated in the January 2000 Final Report of the Task Group on Journals in Aggregator Databases of the Standing Committee on Automation of the Library of Congress Program for Cooperative Cataloging.
Themenfeld: OPAC ; Elektronische Dokumente
18Xianghao, G. ; Yixin, Z. ; Li, Y.: ¬A new method of news test understanding and abstracting based on speech acts theory.
In: Journal of the China Society for Scientific and Technical Information. 17(1998) no.4, S.257-262.
Abstract: Presents a method for the automated analysis and comprehension of foreign affairs news produced by a Chinese news agency. Notes that the development of the method was prededed by a study of the structuring rules of the news. Describes how an abstract of the news story is produced automatically from the analysis. Stresses the main aim of the work which is to use specch act theory to analyse and classify sentences
Anmerkung: [In Chinesisch]
Themenfeld: Computerlinguistik ; Automatisches Abstracting
19Galliers, R.D. ; Merali, Y. ; Spearing, L.: Coping with information technology? : How British executives perceive the key information systems management issues in the mid-1990s.
In: Journal of information technology. 9(1994) no.3, S.223-238.
Abstract: Survey research amongst senior information systems (IS) and non IS executives in UK organizations was conducted in order to identify their views as to the most important and problematic issues they face in managing information systems. The survey mirrored similar research that has been undertaken in the USA and elsewhere, thus enabling international comparisons in the future. Provides comparisons of the views expressed by IS executives vis a vis those with a non IS role
Themenfeld: Information Resources Management
20Merali, Y. ; McKiernan, P.: ¬The strategic positioning of information systems in post-acquisition management.
In: Journal of strategic information systems. 2(1993) no.2, S.105-124.
Abstract: Focuses on the integration of information systems into organizations and the impact of information technology on performance. Based on the results of case studies and a preliminary survey, it is found that managers involved in pre- and post-acquisition making often fail to consider the strategic importance of information systems and technology in contributing to the acquisition outcome. Information systems integration is often derived from immediate requirements and organizations fail to invest in information infrastructures to support longer term requirements. The reactive status accorded to information systems couples with a lack of learning within organizations prevents full exploitation of information systems