Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 28. April 2022)
1Kanan, T. ; Fox, E.A.: Automated arabic text classification with P-Stemmer, machine learning, and a tailored news article taxonomy.
In: Journal of the Association for Information Science and Technology. 67(2016) no.11, S.2667-2683.
Abstract: Arabic news articles in electronic collections are difficult to study. Browsing by category is rarely supported. Although helpful machine-learning methods have been applied successfully to similar situations for English news articles, limited research has been completed to yield suitable solutions for Arabic news. In connection with a Qatar National Research Fund (QNRF)-funded project to build digital library community and infrastructure in Qatar, we developed software for browsing a collection of about 237,000 Arabic news articles, which should be applicable to other Arabic news collections. We designed a simple taxonomy for Arabic news stories that is suitable for the needs of Qatar and other nations, is compatible with the subject codes of the International Press Telecommunications Council, and was enhanced with the aid of a librarian expert as well as five Arabic-speaking volunteers. We developed tailored stemming (i.e., a new Arabic light stemmer called P-Stemmer) and automatic classification methods (the best being binary Support Vector Machines classifiers) to work with the taxonomy. Using evaluation techniques commonly used in the information retrieval community, including 10-fold cross-validation and the Wilcoxon signed-rank test, we showed that our approach to stemming and classification is superior to state-of-the-art techniques.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23609/full.
Themenfeld: Automatisches Indexieren
2Gonçalves, M.A. ; Moreira, B.L. ; Fox, E.A. ; Watson, L.T.: "What is a good digital library?" : a quality model for digital libraries.
In: Information processing and management. 43(2007) no.5, S.1416-1437.
Abstract: In this article, we elaborate on the meaning of quality in digital libraries (DLs) by proposing a model that is deeply grounded in a formal framework for digital libraries: 5S (Streams, Structures, Spaces, Scenarios, and Societies). For each major DL concept in the framework we formally define a number of dimensions of quality and propose a set of numerical indicators for those quality dimensions. In particular, we consider key concepts of a minimal DL: catalog, collection, digital object, metadata specification, repository, and services. Regarding quality dimensions, we consider: accessibility, accuracy, completeness, composability, conformance, consistency, effectiveness, efficiency, extensibility, pertinence, preservability, relevance, reliability, reusability, significance, similarity, and timeliness. Regarding measurement, we consider characteristics like: response time (with regard to efficiency), cost of migration (with respect to preservability), and number of service failures (to assess reliability). For some key DL concepts, the (quality dimension, numerical indicator) pairs are illustrated through their application to a number of "real-world" digital libraries. We also discuss connections between the proposed dimensions of DL quality and an expanded version of a workshop's consensus view of the life cycle of information in digital libraries. Such connections can be used to determine when and where quality issues can be measured, assessed, and improved - as well as how possible quality problems can be prevented, detected, and eliminated.
Themenfeld: Information Gateway
3Fan, W. ; Fox, E.A. ; Pathak, P. ; Wu, H.: ¬The effects of fitness functions an genetic programming-based ranking discovery for Web search.
In: Journal of the American Society for Information Science and technology. 55(2004) no.7, S.628-636.
Abstract: Genetic-based evolutionary learning algorithms, such as genetic algorithms (GAs) and genetic programming (GP), have been applied to information retrieval (IR) since the 1980s. Recently, GP has been applied to a new IR taskdiscovery of ranking functions for Web search-and has achieved very promising results. However, in our prior research, only one fitness function has been used for GP-based learning. It is unclear how other fitness functions may affect ranking function discovery for Web search, especially since it is weIl known that choosing a proper fitness function is very important for the effectiveness and efficiency of evolutionary algorithms. In this article, we report our experience in contrasting different fitness function designs an GP-based learning using a very large Web corpus. Our results indicate that the design of fitness functions is instrumental in performance improvement. We also give recommendations an the design of fitness functions for genetic-based information retrieval experiments.
4Fan, W. ; Luo, M. ; Wang, L. ; Xi, W. ; Fox, E.A.: Tuning before feedback : combining ranking discovery and blind feedback for robust retrieval.
In: SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a. New York, NY : ACM Press, 2004. S.138-145.
5Suleman, H. ; Fox, E.A.: Leveraging OAI harvesting to disseminate theses.
In: Library hi tech. 21(2003) no.2, S.219-227.
Abstract: NDLTD, the Networked Digital Library of Theses and Dissertations, supports and encourages the production and archiving of electronic theses and dissertations (ETDs). While many current NDLTD member institutions and consortia have individual collections accessible online, there has until recently been no single mechanism to aggregate all ETDs to provide NDLTD-wide services (e.g. searching). With the emergence of the Open Archives Initiative (OAI), that has changed. The OAI's Protocol for Metadata Harvesting is a robust interoperability solution that defines a standard method of exchanging metadata. While working with the OAI to develop and test the metadata harvesting standard, we have set up and actively maintain a central NDLTD metadata collection and multiple user portals. We discuss in this article our experiences in building this distributed digital library based upon the work of the OAI.
Inhalt: Vgl. auch unter: http://www.emeraldinsight.com/10.1108/07378830310479857.
Behandelte Form: Elektronische Dokumente ; Dissertationen
7Fox, E.A. ; Urs, S.R.: Digital libraries.
In: Annual review of information science and technology. 36(2002), S.503-590.
Abstract: The emergence of digital libraries (DLs), at the interface of library and information science with computer and communication technologies, helped to expand significantly the literature in all of these areas during the late 1990s. The pace of development is reflected by the number of special issues of major journals in information science and computer science, and the increasing number of workshops and conferences an digital libraries. For example, starting in 1995, the Communications of the ACM has devoted three special issues to the topic (Fox, Akscyn, Furuta, & Leggett, 1995; Fox & Marchionini, 1998, 2001). The Journal of the American Society for Information Science devoted two issues to digital libraries (H. Chen, 2000; Fox & Lunin, 1993); Information Processing & Management and the Journal of Visual Communication and Image Representation each had one special issue (Chen & Fox, 1996; Marchionini & Fox, 1999). The domain of digital libraries, though still evolving, has matured over the last decade, as demonstrated by coverage through D-Lib (http://www.dlib.org), the International Journal an Digital Libraries (http://link.springer.de/link/service/journals/00799), and two overview works (W Y Arms, 2000; Lesk, 1997; both of which have also served as textbooks). Sun Microsystems published a small book to guide those planning a digital library (Noerr, 2000), and IBM has been developing commercial products for digital libraries since 1994 (IBM, 2000). A number of Web sites have extensive sets of pointers to information an DLs (D-Lib Forum, 2001; Fox, 1998a; Habing, 1998; Hein, 2000; Schwartz, 2001a, 2001b). Further, the field has attracted the attention of diverse academics, research groups, and practitionersmany of whom have attended tutorials, workshops, or conferences, e.g., the Joint Conference an Digital Libraries, which is a sequel to a separate series run by ACM and IEEE-CS. Therefore, it is timely that ARIST publishes this first review focusing specifically an digital libraries. There has been no ARIST chapter to date directly dealing with the area of DLs, though some related domains have been covered-particularly: information retrieval, user interfaces (Marchionini & Komlodi, 1998), social informatics of DLs (Bishop & Star, 1996), and scholarly communication (see Borgman and Furner's chapter in this volume). This chapter provides an overview of the diverse aspects and dimensions of DL research, practice, and literature, identifying trends and delineating research directions.
9Powell, J. ; Fox, E.A.: Multilingual federated searching across heterogeneous collections.
In: D-Lib magazine. 4(1998) no.9, xx S.
Abstract: This article describes a scalable system for searching heterogeneous multilingual collections on the World Wide Web. It details a markup language for describing the characteristics of a search engine and its interface, and a protocol for requesting word translations between languages.
Anmerkung: Vgl.: http://dlib.ukoln.ac.uk/dlib/september98/powell/09powell.html.
Themenfeld: Suchmaschinen ; Multilinguale Probleme
10Fox, E.A.: Users, user interfaces and objects : envision, a digital library.
In: Journal of the American Society for Information Science. 44(1993) no.8, S.480-491.
Abstract: Part of a special section on digital libraries. Reports on Project Envision which aims to build a user centred database from computer science literature. Sets out principles for digital libraries which cover issues of representation of architecture and interfacing. Describes interviews with users which led to design and evaluation of the Envision user interface which features innovative query formulation and search results screens. Describes how the system builds upon work to identity and represent 'objects' in computer science that will facilitate reuse and high level communication of information from author to reader
13Fox, E.A. ; France, R.K.: Architecture of an expert system for composite analysis, representation, and retrieval.
In: Journal of approximate reasoning. 1(1987), S.151-175.
Anmerkung: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.400-412.