Diese Datenbank enthält über 40.000 Dokumente zu Themen aus den Bereichen Formalerschließung – Inhaltserschließung – Information Retrieval.
© 2015 W. Gödert, TH Köln, Institut für Informationswissenschaft / Powered by litecat, BIS Oldenburg (Stand: 28. April 2022)
1Belew, R.K.: Finding out about : a cognitive perspective on search engine technology and the WWW.
Cambridge : Cambridge University Press, 2001. XXVII, 356 S. + 1 CD-ROM.
Abstract: The World Wide Web is rapidly filling with more text than anyone could have imagined even a short time ago, but the task of isolating relevant parts of this vast information has become just that much more daunting. Richard Belew brings a cognitive perspective to the study of information retrieval as a discipline within computer science. He introduces the idea of Finding Out About (FDA) as the process of actively seeking out information relevant to a topic of interest and describes its many facets - ranging from creating a good characterization of what the user seeks, to what documents actually mean, to methods of inferring semantic clues about each document, to the problem of evaluating whether our search engines are performing as we have intended. Finding Out About explains how to build the tools that are useful for searching collections of text and other media. In the process it takes a close look at the properties of textual documents that do not become clear until very large collections of them are brought together and shows that the construction of effective search engines requires knowledge of the statistical and mathematical properties of linguistic phenomena, as well as an appreciation for the cognitive foundation we bring to the task as language users. The unique approach of this book is its even handling of the phenomena of both numbers and words, making it accessible to a wide audience. The textbook is usable in both undergraduate and graduate classes on information retrieval, library science, and computational linguistics. The text is accompanied by a CD-ROM that contains a hypertext version of the book, including additional topics and notes not present in the printed edition. In addition, the CD contains the full text of C.J. "Keith" van Rijsbergen's famous textbook, Information Retrieval (now out of print). Many active links from Belew's to van Rijsbergen's hypertexts help to unite the material. Several test corpora and indexing tools are provided, to support the design of your own search engine. Additional exercises using these corpora and code are available to instructors. Also supporting this book is a Web site that will include recent additions to the book, as well as links to sites of new topics and methods.
Themenfeld: Suchmaschinen ; Grundlagen u. Einführungen: Allgemeine Literatur
LCSH: Search engines / Programming ; World Wide Web / Computer programs ; Web search engines
RSWK: Suchmaschine / World Wide Web / Information Retrieval
BK: 54.51 / Programmiermethodik ; 54.32 / Rechnerkommunikation ; 06.74 / Informationssysteme
DDC: 025.04 / dc21
LCC: TK5105.884 B45 2000
RVK: ST 273
2Bartell, B.T. ; Cottrell, G.W. ; Belew, R.K.: Optimizing similarity using multi-query relevance feedback.
In: Journal of the American Society for Information Science. 49(1998) no.8, S.742-761.
Abstract: We propose a novel method for automatically adjusting paprameters in ranked-output text retrieval systems to improve retrieval performance. A renaked-output text retrieval system implements a ranking function which orders documents, placing documents estimated to be more relevant to the user's query before less relevant ones. The systems adjusts its parameters to maximize the match between the systems's document ordering and a target ordering. The target ordering is typically given by user feedback on a set of sample queries, but is more generally any document preference relation. We demonstrate the utility of the approach by using it to estimate a similarity measure (scoring the relevance of documents to queries) in a vector space model of information retrieval. Experimental results using several collections indicate that the approach automatically finds a simimilarity measure which performs equivalently to or better that all 'classic' similarity measures studied. It also performs within 1% of an estimated optimal measure (found by exhaustive sampling of the similarity measures). The method is compared to two alternative methods: a perceptron learning rule motivated by Wong and Yao's (1990) Query Formulation method, and a Least Squared learning rule, motivated by Fuhr and Buckley's (1991) Probabilisitc Learning approach. Though both alternatives have useful characteristics, we demonstrate empirically that neither can be used to estimate the parameters of the optimal similarity measure
3Bartell, B.T. ; Cottrell, G.W. ; Belew, R.K.: Representing documents using an explicit model of their similarities.
In: Journal of the American Society for Information Science. 46(1995) no.4, S.254-271.
Abstract: Proposes a method for creating vector space representations of documents based on modelling target interdocument similariyt values. The target similarity values are assumed to capture semantic relationships, or associations, between the documents. The vector representations are chosen so that the inner product similarities between document vector pairs closely match their target interdocument similarities. The method is closely related to the Latent Semantic Indexing approach
Objekt: Latent Semantic Indexing