Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 28. April 2022)
1Qin, J.: ¬A relation typology in knowledge organization systems : case studies in the research data management domain.
In: Challenges and opportunities for knowledge organization in the digital age: proceedings of the Fifteenth International ISKO Conference, 9-11 July 2018, Porto, Portugal / organized by: International Society for Knowledge Organization (ISKO), ISKO Spain and Portugal Chapter, University of Porto - Faculty of Arts and Humanities, Research Centre in Communication, Information and Digital Culture (CIC.digital) - Porto. Eds.: F. Ribeiro u. M.E. Cerveira. Baden-Baden : Ergon Verlag, 2018. S.409-415.
(Advances in knowledge organization; vol.16)
2Liu, X. ; Qin, J.: ¬An interactive metadata model for structural, descriptive, and referential representation of scholarly output.
In: Journal of the Association for Information Science and Technology. 65(2014) no.5, S.964-983.
Abstract: The scientific metadata model proposed in this article encompasses both classical descriptive metadata such as those defined in the Dublin Core Metadata Element Set (DC) and the innovative structural and referential metadata properties that go beyond the classical model. Structural metadata capture the structural vocabulary in research publications; referential metadata include not only citations but also data about other types of scholarly output that is based on or related to the same publication. The article describes the structural, descriptive, and referential (SDR) elements of the metadata model and explains the underlying assumptions and justifications for each major component in the model. ScholarWiki, an experimental system developed as a proof of concept, was built over the wiki platform to allow user interaction with the metadata and the editing, deleting, and adding of metadata. By allowing and encouraging scholars (both as authors and as users) to participate in the knowledge and metadata editing and enhancing process, the larger community will benefit from more accurate and effective information retrieval. The ScholarWiki system utilizes machine-learning techniques that can automatically produce self-enhanced metadata by learning from the structural metadata that scholars contribute, which will add intelligence to enhance and update automatically the publication of metadata Wiki pages.
3Chau, M. ; Wong, C.H. ; Zhou, Y. ; Qin, J. ; Chen, H.: Evaluating the use of search engine development tools in IT education.
In: Journal of the American Society for Information Science and Technology. 61(2010) no.2, S.288-299.
Abstract: It is important for education in computer science and information systems to keep up to date with the latest development in technology. With the rapid development of the Internet and the Web, many schools have included Internet-related technologies, such as Web search engines and e-commerce, as part of their curricula. Previous research has shown that it is effective to use search engine development tools to facilitate students' learning. However, the effectiveness of these tools in the classroom has not been evaluated. In this article, we review the design of three search engine development tools, SpidersRUs, Greenstone, and Alkaline, followed by an evaluation study that compared the three tools in the classroom. In the study, 33 students were divided into 13 groups and each group used the three tools to develop three independent search engines in a class project. Our evaluation results showed that SpidersRUs performed better than the two other tools in overall satisfaction and the level of knowledge gained in their learning experience when using the tools for a class project on Internet applications development.
Themenfeld: Suchmaschinen ; Ausbildung
4Chen, H. ; Chung, W. ; Qin, J. ; Reid, E. ; Sageman, M. ; Weimann, G.: Uncovering the dark Web : a case study of Jihad on the Web.
In: Journal of the American Society for Information Science and Technology. 59(2008) no.8, S.1347-1359.
Abstract: While the Web has become a worldwide platform for communication, terrorists share their ideology and communicate with members on the Dark Web - the reverse side of the Web used by terrorists. Currently, the problems of information overload and difficulty to obtain a comprehensive picture of terrorist activities hinder effective and efficient analysis of terrorist information on the Web. To improve understanding of terrorist activities, we have developed a novel methodology for collecting and analyzing Dark Web information. The methodology incorporates information collection, analysis, and visualization techniques, and exploits various Web information sources. We applied it to collecting and analyzing information of 39 Jihad Web sites and developed visualization of their site contents, relationships, and activity levels. An expert evaluation showed that the methodology is very useful and promising, having a high potential to assist in investigation and understanding of terrorist activities by producing results that could potentially help guide both policymaking and intelligence research.
5Qin, J.: Controlled semantics versus social semantics : an epistemological analysis.
In: Culture and identity in knowledge organization: Proceedings of the Tenth International ISKO Conference 5-8 August 2008, Montreal, Canada. Ed. by Clément Arsenault and Joseph T. Tennis. Würzburg : Ergon Verlag, 2008. S.229-234.
(Advances in knowledge organization; vol.11)
Inhalt: Social semantics is more than just tags or vocabularies. It involves the users who contribute the tags, the perceptions of the world, and intentions that the tags are created for. Whilst social semantics is a valuable, massive data source for developing new knowledge systems or validating existing ones, there are also pitfalls and uncertainties. The epistemological analysis presented in this paper is an attempt to explain the differences and connections between social and controlled semantics from the perspective of knowledge theory. The epistemological connection between social and controlled semantics is particularly important: empirical knowledge can provide data source for testing the rational knowledge and rational knowledge can provide reliability and predictability. Such connection will have significant implications for future research on social and controlled semantics.
Anmerkung: Vgl. unter: http://www.ergon-verlag.de/isko_ko/tocs/0497f79b0c0b3ed06/0497f79b0c0b5550a/index.php.
Themenfeld: Social tagging
6Chen, M. ; Liu, X. ; Qin, J.: Semantic relation extraction from socially-generated tags : a methodology for metadata generation.
In: Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas. Göttingen : Univ.-Verl., 2008. S.117-127.
Abstract: The growing predominance of social semantics in the form of tagging presents the metadata community with both opportunities and challenges as for leveraging this new form of information content representation and for retrieval. One key challenge is the absence of contextual information associated with these tags. This paper presents an experiment working with Flickr tags as an example of utilizing social semantics sources for enriching subject metadata. The procedure included four steps: 1) Collecting a sample of Flickr tags, 2) Calculating cooccurrences between tags through mutual information, 3) Tracing contextual information of tag pairs via Google search results, 4) Applying natural language processing and machine learning techniques to extract semantic relations between tags. The experiment helped us to build a context sentence collection from the Google search results, which was then processed by natural language processing and machine learning algorithms. This new approach achieved a reasonably good rate of accuracy in assigning semantic relations to tag pairs. This paper also explores the implications of this approach for using social semantics to enrich subject metadata.
Inhalt: Vgl. unter: http://dcpapers.dublincore.org/ojs/pubs/article/view/924/920.
Themenfeld: Social tagging
7Qin, J. ; Hernández, N.: Building interoperable vocabulary and structures for learning objects : an empirical study.
In: Journal of the American Society for Information Science and Technology. 57(2006) no.2, S.280-292.
Abstract: The structural, functional, and production views on learning objects influence metadata structure and vocabulary. The authors drew on these views and conducted a literature review and in-depth analysis of 14 learning objects and over 500 components in these learning objects to model the knowledge framework for a learning object ontology. The learning object ontology reported in this article consists of 8 top-level classes, 28 classes at the second level, and 34 at the third level. Except class Learning object, all other classes have the three properties of preferred term, related term, and synonym. To validate the ontology, we conducted a query log analysis that focused an discovering what terms users have used at both conceptual and word levels. The findings show that the main classes in the ontology are either conceptually or linguistically similar to the top terms in the query log data. The authors built an "Exercise Editor" as an informal experiment to test its adoption ability in authoring tools. The main contribution of this project is in the framework for the learning object domain and the methodology used to develop and validate an ontology.
8Qin, J. ; Zhou, Y. ; Chau, M. ; Chen, H.: Multilingual Web retrieval : an experiment in English-Chinese business intelligence.
In: Journal of the American Society for Information Science and Technology. 57(2006) no.5, S.671-683.
Abstract: As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIP), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIP techniques for use in the business domain. A dictionary-based approach was adopted and combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-byword translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise.
Anmerkung: Beitrag einer special topic section on multilingual information systems
Themenfeld: Multilinguale Probleme
9Qin, J. ; Creticos, P. ; Hsiao, W.Y.: Adaptive modeling of workforce domain knowledge.
In: Knowledge organization for a global learning society: Proceedings of the 9th International ISKO Conference, 4-7 July 2006, Vienna, Austria. Hrsg.: G. Budin, C. Swertz u. K. Mitgutsch. Würzburg : Ergon Verlag, 2006. S.287-293.
(Advances in knowledge organization; vol.10)
Abstract: Workforce development is a multidisciplinary domain in which policy, laws and regulations, social services, training and education, and information technology and systems are heavily involved. It is essential to have a semantic base accepted by the workforce development community for knowledge sharing and exchange. This paper describes how such a semantic base-the Workforce Open Knowledge Exchange (WOKE) Ontology-was built by using the adaptive modeling approach. The focus of this paper is to address questions such as how ontology designers should extract and model concepts obtained from different sources and what methodologies are useful along the steps of ontology development. The paper proposes a methodology framework "adaptive modeling" and explains the methodology through examples and some lessons learned from the process of developing the WOKE ontology.
Inhalt: Vgl.: http://www.ergon-verlag.de/isko_ko/tocs/0497f79b0c0b3ed06/0497f79b0c0c7c33f/index.php.
10Qin, J.: Evolving paradigms of knowledge representation and organization : a comparative study of classification, XML/DTD and ontology.
In: Challenges in knowledge representation and organization for the 21st century: Integration of knowledge across boundaries. Proceedings of the 7th ISKO International Conference Granada, Spain, July 10-13, 2002. Ed.: M. López-Huertas. Würzburg : Ergon Verlag, 2003. S.465-471.
(Advances in knowledge organization; vol.8)
Abstract: The different points of views an knowledge representation and organization from various research communities reflect underlying philosophies and paradigms in these communities. This paper reviews differences and relations in knowledge representation and organization and generalizes four paradigms-integrative and disintegrative pragmatism and integrative and disintegrative epistemologism. Examples such as classification, XML schemas, and ontologies are compared based an how they specify concepts, build data models, and encode knowledge organization structures. 1. Introduction Knowledge representation (KR) is a term that several research communities use to refer to somewhat different aspects of the same research area. The artificial intelligence (AI) community considers KR as simply "something to do with writing down, in some language or communications medium, descriptions or pictures that correspond in some salient way to the world or a state of the world" (Duce & Ringland, 1988, p. 3). It emphasizes the ways in which knowledge can be encoded in a computer program (Bench-Capon, 1990). For the library and information science (LIS) community, KR is literally the synonym of knowledge organization, i.e., KR is referred to as the process of organizing knowledge into classifications, thesauri, or subject heading lists. KR has another meaning in LIS: it "encompasses every type and method of indexing, abstracting, cataloguing, classification, records management, bibliography and the creation of textual or bibliographic databases for information retrieval" (Anderson, 1996, p. 336). Adding the social dimension to knowledge organization, Hjoerland (1997) states that knowledge is a part of human activities and tied to the division of labor in society, which should be the primary organization of knowledge. Knowledge organization in LIS is secondary or derived, because knowledge is organized in learned institutions and publications. These different points of views an KR suggest that an essential difference in the understanding of KR between both AI and LIS lies in the source of representationwhether KR targets human activities or derivatives (knowledge produced) from human activities. This difference also decides their difference in purpose-in AI KR is mainly computer-application oriented or pragmatic and the result of representation is used to support decisions an human activities, while in LIS KR is conceptually oriented or abstract and the result of representation is used for access to derivatives from human activities.
Themenfeld: Klassifikationstheorie: Elemente / Struktur
11Qin, J. ; Chen, J.: ¬A multi-layered, multi-dimensional representation of digital educational resources.
In: Subject retrieval in a networked environment: Proceedings of the IFLA Satellite Meeting held in Dublin, OH, 14-16 August 2001 and sponsored by the IFLA Classification and Indexing Section, the IFLA Information Technology Section and OCLC. Ed.: I.C. McIlwaine. München : Saur, 2003. S.90-96.
(UBCIM publications: new series; vol.25)
Abstract: Semantic mapping between controlled vocabulary and keywords is the first step towards knowledge-based subject access. This study reports the preliminary result of a semantic mapping experiment for the Gateway to Educational Materials (GEM). A total of 3,555 keywords were mapped with 322 concept names in the GEM controlled vocabulary. The preliminary test to 10,000 metadata records presented widely varied sets of results between the mapped and non-mapped data. The paper discussed linguistic and technical problems encountered in the mapping process and raised issues in the representation technologies and methods, which will lead to future study of knowledge-based access to networked information resources.
Themenfeld: Information Gateway
12Qin, J.: Semantic patterns in bibliographically coupled documents.
In: Encyclopedia of library and information science. Vol.72, [=Suppl.35]. New York : Dekker, 2002. S.341-365.
Abstract: Different research fields have different definitions for semantic patterns. For knowledge discovery and representation, semantic patterns represent the distribution of occurrences of words in documents and/or citations. In the broadest sense, the term semantic patterns may also refer to the distribution of occurrences of subjects or topics as reflected in documents. The semantic pattern in a set of documents or a group of topics therefore implies quantitative indicators that describe the subject characteristics of the documents being examined. These characteristics are often described by frequencies of keyword occurrences, number of co-occurred keywords, occurrences of coword, and number of cocitations. There are many ways to analyze and derive semantic patterns in documents and citations. A typical example is text mining in full-text documents, a research topic that studies how to extract useful associations and patterns through clustering, categorizing, and summarizing words in texts. One unique way in library and information science is to discover semantic patterns through bibliographically coupled citations. The history of bibliographical coupling goes back in the early 1960s when Kassler investigated associations among technical reports and technical information flow patterns. A number of definitions may facilitate our understanding of bibliographic coupling: (1) bibliographic coupling determines meaningful relations between papers by a study of each paper's bibliography; (2) a unit of coupling is the functional bond between papers when they share a single reference item; (3) coupling strength shows the order of combinations of units of coupling into a graded scale between groups of papers; and (4) a coupling criterion is the way by which the coupling units are combined between two or more papers. Kessler's classic paper an bibliographic coupling between scientific papers proposes the following two graded criteria: Criterion A: A number of papers constitute a related group GA if each member of the group has at least one coupling unit to a given test paper P0. The coupling strength between P0 and any member of GA is measured by the number of coupling units n between them. G(subA)(supn) is that portion of GA that is linked to P0 through n coupling units; Criterion B: A number of papers constitute a related group GB if each member of the group has at least one coupling unit to every other member of the group.
14Qin, J.: Semantic similarities between a keyword database and a controlled vocabulary database : an investigation in the antibiotic resistance literature.
In: Journal of the American Society for Information Science. 51(2000) no.2, S.166-180.
Abstract: The 'KeyWords Plus' in the Science Citation Index database represents an approach to combining citation and semantic indexing in describing the document content. This paper explores the similariites or dissimilarities between citation-semantic and analytic indexing. The dataset consisted of over 400 matching records in the SCI and MEDLINE databases on antibiotic resistance in pneumonia. The degree of similarity in indexing terms was found to vary on a scale from completely different to completely identical with various levels in between. The within-document similarity in the 2 databases was measured by a variation on the Jaccard coefficient - the Inclusion Index. The average inclusion coefficient was 0,4134 for SCI and 0,3371 for Medline. The 20 terms occuring most frequently in each database were identified. The 2 groups of terms shared the same terms that consist of the 'intellectual base' for the subject. conceptual similarity was analyzed through scatterplots of matching and nonmatching terms vs. partially identical and broader/narrower terms. The study also found that both databases differed in assigning terms in various semantic categories. Implications of this research and further studies are suggested
Objekt: Science Citation Index ; Medline
16Qin, J.: Discovering semantic patterns in bibliographically coupled documents.
In: Library trends. 48(1999) no.1, S.109-132.
17Qin, J. ; Wesley, K.: Web indexing with meta fields : a survey of Web objects in polymer chemistry.
In: Information technology and libraries. 17(1998) no.3, S.149-156.
Abstract: Reports results of a study of 4 WWW search engines: AltaVista; Lycos; Excite and WebCrawler to collect data on Web objects on polymer chemistry. 1.037 Web objects were examined for data in 4 categories: document information; use of meta fields; use of images and use of chemical names. Issues raised included: whether to provide metadata elements for parts of entities or whole entities only, the use of metasyntax, problems in representation of special types of objects, and whether links should be considered when encoding metadata. Use of metafields was not widespread in the sample and knowledge of metafields in HTML varied greatly among Web object creators. The study formed part of a metadata project funded by the OCLC Library and Information Science Research Grant Program
Themenfeld: Internet ; Metadaten
Objekt: AltaVista ; Lycos ; Excite ; WebCrawler
18Qin, J. ; Lancaster, F.W. ; Allen, B.: Types and levels of collaboration in interdisciplinary research in the sciences.
In: Journal of the American Society for Information Science. 48(1997) no.10, S.893-916.
Abstract: Reports on a study which collected a sample of 846 scientific research papers published in 1992 and tests 3 hypotheses on the relationship between research collaboration and interdisciplinarity. Results showed significant differences in degrees of interdisciplinarity among different levels of collaboration and among different disciplines. Collaboration contributed significantly to the degree of interdisciplinarity in some disciplines and not in others. Uses a survey that asked authors about their form of collaboration, channels of communication and use of information. The survey provides some qualitative explanation for the bibliometrics findings. Discusses the perspective of scientist-scientist interaction, scientist-information interaction and information-information interaction