Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 28. April 2022)
1Gill, A.J. ; Hinrichs-Krapels, S. ; Blanke, T. ; Grant, J. ; Hedges, M. ; Tanner, S.: Insight workflow : systematically combining human and computational methods to explore textual data.
In: Journal of the Association for Information Science and Technology. 68(2017) no.7, S.1671-1686.
Abstract: Analyzing large quantities of real-world textual data has the potential to provide new insights for researchers. However, such data present challenges for both human and computational methods, requiring a diverse range of specialist skills, often shared across a number of individuals. In this paper we use the analysis of a real-world data set as our case study, and use this exploration as a demonstration of our "insight workflow," which we present for use and adaptation by other researchers. The data we use are impact case study documents collected as part of the UK Research Excellence Framework (REF), consisting of 6,679 documents and 6.25 million words; the analysis was commissioned by the Higher Education Funding Council for England (published as report HEFCE 2015). In our exploration and analysis we used a variety of techniques, ranging from keyword in context and frequency information to more sophisticated methods (topic modeling), with these automated techniques providing an empirical point of entry for in-depth and intensive human analysis. We present the 60 topics to demonstrate the output of our methods, and illustrate how the variety of analysis techniques can be combined to provide insights. We note potential limitations and propose future work.
Inhalt: Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23767/full.
Themenfeld: Computerlinguistik ; Data Mining
2Deegan, M. ; Tanner, S.: Digital futures : strategies for the information age.
New York : Neal-Schuman, 2002. XII, 276 S.
Anmerkung: Rez. in: JASIST. 54(2003) no.9, S.908-909 (L. Ennis): "This is a timely and important addition to the growing body of work an libraries and digital collections. Both Deegan and Tanner bring a wide array of experience and knowledge to the work creating a valuable resource for librarians and digital collection managers. The book is the first in what the authors hope will become a series of volumes covering various issues of digital futures. Digital Futures: Strategies for the Information Age contains nine main chapters divided into sections, an introduction, a conclusion, a bibliography, a glossary, and an index. Each chapter begins with a quote or two and an introduction to help set the stage for the rest of the chapter. The first chapter, "Digital Futures in Current Context," outlines the myriad of changes in information technology from the past 50 years and the impact of those changes an libraries, library practices, and publishing. The book is written for people with little or no prior knowledge of information technology, so technologically savvy readers may find the first chapter a little elementary. For instance, the chapter includes a good bit of the history and workings of the Internet and World Wide Web. However, without the chapter included in the text, the work world lack a real starting point for the narrative and possibly alienate readers just starting their voyage into information science. The second chapter, "Why Digitize?" discusses why libraries and librarians should consider digital projects as a means of providing access. While the concentration is an the benefits of digital projects, the authors are also careful to point out various pitfalls and stumbling blocks to creating, managing, and preserving a digital collection. To help demonstrate their point, the authors include examples of a number of active projects covering newspapers, photo collections, books, and periodicals, and provide URLs so readers can visit the projects an their own. This chapter gives the reader a good overview of the various issues surrounding digitization as well as practical examples. While the first two chapters are a good introduction to the subject and examine theoretical issues, the next two chapters begin take an more practical issues. In Chapter Three, "Developing Collections in the Digital World," and four, "The Economic Factors," the authors explore how digital collections work with traditional library collecfions and how collection development for digital resources differs from collection development of non-digital resources. One of the most interesting topics of these chapters covers the issues surrounding serials using JSTOR and Project MUSE as examples. E-books and their impact an libraries is also discussed. The remaining chapters are by far the most timely and important parts of the work. Chapter Five, "Resource Discovery, Description and Use," examines the growing area of metadata and its importance for libraries and librarians. The chapter begins with a look at how the World Wide Web works and the problems with search engines and then evolves into a discussion of what metadata is, the types of metadata, and metadata creation. The authors explain that one of the biggest problems with the World Wide Web is that the construction and description of web pages is imprecise. The solution for bettering retrieval is metadata. ; The most common definition for metadata is "data about data." What metadata does is provide schemes for describing, organizing, exchanging, and receiving information over networks. The authors explain how metadata is used to describe resources by tagging item attributes like author, title, creation date, key words, file formats, compression, etc. The most well known scheme is MARC, but other schemes are developing for creating and managing digital collections, such as XML, TEI, EAD, and Dublin Core. The authors also do a good job of describing the difference between metadata and mark-up languages like HTML. The next two chapters discuss developing, designing, and providing access to a digital collection. In Chapter Six, "Developing and Designing Systems for Sharing Digital Resources," the authors examine a number of issues related to designing a shared collection. For instance, one issue the authors examine is interoperability. The authors stress that when designing a digital collection the creators should take care to ensure that their collection is "managed in such a way as to maximize opportunities for exchange and reuse of information, whether internally or externally" (p. 140). As a complement to Chapter Six, Chapter Seven, "Portals and Personalization: Mechanisms for End-user Access," focuses an the other end of the process; how the collection is used once it is made available. The majority of this chapter concentrates an the use of portals or gateways to digital collections. One example the authors use is MyLibrary@NCState, which provides the university community with a flexible user-drive customizable portal that allows user to access remote and local resources. The work logically concludes with a chapter an preservation and a chapter an the evolving role of librarians. Chapter Eight, "Preservation," is a thought-provoking discussion an preserving digital data and digitization as a preservation technique. The authors do a good job of relaying the complexity of preservation issues in a digital world in a single chapter. While the authors do not answer their questions, they definitely provide the reader wich some things to ponder. The final chapter, "Digital Librarians: New Roles for the Information Age," outlines where the authors believe librarianship is headed. Throughout the work they stress the role of the librarian in the digital world, but Chapter Nine really brings the point home. As the authors stress, librarians have always managed information and as experienced leaders in the information field, librarians are uniquely suited to take the digital bull by the horns. Also, the role of the librarian and what librarians can do is growing and evolving. The authors suggest that librarians are likely to move into rotes such as knowledge mediator, information architect, hybrid librarian-who brings resources and technologies together, and knowledge preserver. While these librarians must have the technical skills to cope with new technologies, the authors also state that management skills and subject skills will prove equally important.
LCSH: Digital libraries ; Libraries / Special collections / Electronic information resources ; Digital preservation
RSWK: Elektronische Bibliothek / Elektronische Medien / Online-Informationssystem
BK: 06.54 Bibliotheksautomatisierung