Search (31 results, page 2 of 2)

  • × type_ss:"m"
  • × year_i:[2000 TO 2010}
  1. TREC: experiment and evaluation in information retrieval (2005) 0.07
    0.06812473 = product of:
      0.13624945 = sum of:
        0.05086134 = weight(_text_:storage in 636) [ClassicSimilarity], result of:
          0.05086134 = score(doc=636,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.21284549 = fieldWeight in 636, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.01953125 = fieldNorm(doc=636)
        0.065572895 = weight(_text_:retrieval in 636) [ClassicSimilarity], result of:
          0.065572895 = score(doc=636,freq=70.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.49430186 = fieldWeight in 636, product of:
              8.3666 = tf(freq=70.0), with freq of:
                70.0 = termFreq=70.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.01953125 = fieldNorm(doc=636)
        0.019815223 = weight(_text_:systems in 636) [ClassicSimilarity], result of:
          0.019815223 = score(doc=636,freq=6.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.14702557 = fieldWeight in 636, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.01953125 = fieldNorm(doc=636)
      0.5 = coord(3/6)
    
    Abstract
    The Text REtrieval Conference (TREC), a yearly workshop hosted by the US government's National Institute of Standards and Technology, provides the infrastructure necessary for large-scale evaluation of text retrieval methodologies. With the goal of accelerating research in this area, TREC created the first large test collections of full-text documents and standardized retrieval evaluation. The impact has been significant; since TREC's beginning in 1992, retrieval effectiveness has approximately doubled. TREC has built a variety of large test collections, including collections for such specialized retrieval tasks as cross-language retrieval and retrieval of speech. Moreover, TREC has accelerated the transfer of research ideas into commercial systems, as demonstrated in the number of retrieval techniques developed in TREC that are now used in Web search engines. This book provides a comprehensive review of TREC research, summarizing the variety of TREC results, documenting the best practices in experimental information retrieval, and suggesting areas for further research. The first part of the book describes TREC's history, test collections, and retrieval methodology. Next, the book provides "track" reports -- describing the evaluations of specific tasks, including routing and filtering, interactive retrieval, and retrieving noisy text. The final part of the book offers perspectives on TREC from such participants as Microsoft Research, University of Massachusetts, Cornell University, University of Waterloo, City University of New York, and IBM. The book will be of interest to researchers in information retrieval and related technologies, including natural language processing.
    Content
    Enthält die Beiträge: 1. The Text REtrieval Conference - Ellen M. Voorhees and Donna K. Harman 2. The TREC Test Collections - Donna K. Harman 3. Retrieval System Evaluation - Chris Buckley and Ellen M. Voorhees 4. The TREC Ad Hoc Experiments - Donna K. Harman 5. Routing and Filtering - Stephen Robertson and Jamie Callan 6. The TREC Interactive Tracks: Putting the User into Search - Susan T. Dumais and Nicholas J. Belkin 7. Beyond English - Donna K. Harman 8. Retrieving Noisy Text - Ellen M. Voorhees and John S. Garofolo 9.The Very Large Collection and Web Tracks - David Hawking and Nick Craswell 10. Question Answering in TREC - Ellen M. Voorhees 11. The University of Massachusetts and a Dozen TRECs - James Allan, W. Bruce Croft and Jamie Callan 12. How Okapi Came to TREC - Stephen Robertson 13. The SMART Project at TREC - Chris Buckley 14. Ten Years of Ad Hoc Retrieval at TREC Using PIRCS - Kui-Lam Kwok 15. MultiText Experiments for TREC - Gordon V. Cormack, Charles L. A. Clarke, Christopher R. Palmer and Thomas R. Lynam 16. A Language-Modeling Approach to TREC - Djoerd Hiemstra and Wessel Kraaij 17. BM Research Activities at TREC - Eric W. Brown, David Carmel, Martin Franz, Abraham Ittycheriah, Tapas Kanungo, Yoelle Maarek, J. Scott McCarley, Robert L. Mack, John M. Prager, John R. Smith, Aya Soffer, Jason Y. Zien and Alan D. Marwick Epilogue: Metareflections on TREC - Karen Sparck Jones
    Footnote
    Rez. in: JASIST 58(2007) no.6, S.910-911 (J.L. Vicedo u. J. Gomez): "The Text REtrieval Conference (TREC) is a yearly workshop hosted by the U.S. government's National Institute of Standards and Technology (NIST) that fosters and supports research in information retrieval as well as speeding the transfer of technology between research labs and industry. Since 1992, TREC has provided the infrastructure necessary for large-scale evaluations of different text retrieval methodologies. TREC impact has been very important and its success has been mainly supported by its continuous adaptation to the emerging information retrieval needs. Not in vain, TREC has built evaluation benchmarks for more than 20 different retrieval problems such as Web retrieval, speech retrieval, or question-answering. The large and intense trajectory of annual TREC conferences has resulted in an immense bulk of documents reflecting the different eval uation and research efforts developed. This situation makes it difficult sometimes to observe clearly how research in information retrieval (IR) has evolved over the course of TREC. TREC: Experiment and Evaluation in Information Retrieval succeeds in organizing and condensing all this research into a manageable volume that describes TREC history and summarizes the main lessons learned. The book is organized into three parts. The first part is devoted to the description of TREC's origin and history, the test collections, and the evaluation methodology developed. The second part describes a selection of the major evaluation exercises (tracks), and the third part contains contributions from research groups that had a large and remarkable participation in TREC. Finally, Karen Spark Jones, one of the main promoters of research in IR, closes the book with an epilogue that analyzes the impact of TREC on this research field.
    ... TREC: Experiment and Evaluation in Information Retrieval is a reliable and comprehensive review of the TREC program and has been adopted by NIST as the official history of TREC (see http://trec.nist.gov). We were favorably surprised by the book. Well structured and written, chapters are self-contained and the existence of references to specialized and more detailed publications is continuous, which makes it easier to expand into the different aspects analyzed in the text. This book succeeds in compiling TREC evolution from its inception in 1992 to 2003 in an adequate and manageable volume. Thanks to the impressive effort performed by the authors and their experience in the field, it can satiate the interests of a great variety of readers. While expert researchers in the IR field and IR-related industrial companies can use it as a reference manual, it seems especially useful for students and non-expert readers willing to approach this research area. Like NIST, we would recommend this reading to anyone who may be interested in textual information retrieval."
    LCSH
    Information storage and retrieval systems / Congresses
    Text REtrieval Conference
    RSWK
    Information Retrieval / Textverarbeitung / Aufsatzsammlung (BVB)
    Kongress / Information Retrieval / Kongress (GBV)
    Subject
    Information Retrieval / Textverarbeitung / Aufsatzsammlung (BVB)
    Kongress / Information Retrieval / Kongress (GBV)
    Information storage and retrieval systems / Congresses
    Text REtrieval Conference
  2. Kochtanek, T.R.; Matthews, J.R.: Library information systems : from library automation to distributed information systems (2002) 0.06
    0.060701065 = product of:
      0.12140213 = sum of:
        0.05086134 = weight(_text_:storage in 1792) [ClassicSimilarity], result of:
          0.05086134 = score(doc=1792,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.21284549 = fieldWeight in 1792, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1792)
        0.015674919 = weight(_text_:retrieval in 1792) [ClassicSimilarity], result of:
          0.015674919 = score(doc=1792,freq=4.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.11816074 = fieldWeight in 1792, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1792)
        0.05486587 = weight(_text_:systems in 1792) [ClassicSimilarity], result of:
          0.05486587 = score(doc=1792,freq=46.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.40709537 = fieldWeight in 1792, product of:
              6.78233 = tf(freq=46.0), with freq of:
                46.0 = termFreq=46.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.01953125 = fieldNorm(doc=1792)
      0.5 = coord(3/6)
    
    Abstract
    Specifically designed for core units in library automation and information systems, this long awaited new text gives students a comprehensive overview of one of the most critical areas of library operations. Produced by two internationally known scholars, Thomas Kochtanek and Joseph Matthews, this book will enable students to take the lead in managing an immense diversity of information resources and at the same time handle the complexities that information technology brings to the library. Giving important insight into library information systems-from the historical background to the latest technological trends and developments-the book is organized into 14 chapters, each presenting helpful information on such topics as systems design, types of systems, coverage of standards and standards organizations, technology axioms, system selection and implementation, usability of systems, library information systems management, technology trends, digital libraries, and more. New to the acclaimed Library and Information Science Text Series, this book will prove an indispensable resource to students preparing for a career in today's ever-evolving library environment. Complete with charts and illustrations, chapter summaries, suggested print and electronic resources, a glossary of terms, and an index, this text will be of central importance to libraries and library schools everywhere.
    Footnote
    Rez. in: JASIST 54(2003) no.12, S.1166-1167 (Brenda Chawner): "Kochtanek and Matthews have written a welcome addition to the small set of introductory texts an applications of information technology to library and information Services. The book has fourteen chapters grouped into four sections: "The Broader Context," "The Technologies," "Management Issues," and "Future Considerations." Two chapters provide the broad content, with the first giving a historical overview of the development and adoption of "library information systems." Kochtanek and Matthews define this as "a wide array of solutions that previously might have been considered separate industries with distinctly different marketplaces" (p. 3), referring specifically to integrated library systems (ILS, and offen called library management systems in this part of the world), and online databases, plus the more recent developments of Web-based resources, digital libraries, ebooks, and ejournals. They characterize technology adoption patterns in libraries as ranging from "bleeding edge" to "leading edge" to "in the wedge" to "trailing edge"-this is a catchy restatement of adopter categories from Rogers' diffusion of innovation theory, where they are more conventionally known as "early adopters," "early majority," "late majority," and "laggards." This chapter concludes with a look at more general technology trends that have affected library applications, including developments in hardware (moving from mainframes to minicomputers to personal Computers), changes in software development (from in-house to packages), and developments in communications technology (from dedicated host Computers to more open networks to the current distributed environment found with the Internet). This is followed by a chapter describing the ILS and online database industries in some detail. "The Technologies" begins with a chapter an the structure and functionality of integrated library systems, which also includes a brief discussion of precision versus recall, managing access to internal documents, indexing and searching, and catalogue maintenance. This is followed by a chapter an open systems, which concludes with a useful list of questions to consider to determine an organization's readiness to adopt open source solutions. As one world expect, this section also includes a detailed chapter an telecommunications and networking, which includes types of networks, transmission media, network topologies, switching techniques (ranging from dial up and leased lines to ISDN/DSL, frame relay, and ATM). It concludes with a chapter an the role and importance of standards, which covers the need for standards and standards organizations, and gives examples of different types of standards, such as MARC, Dublin Core, Z39.50, and markup standards such as SGML, HTML, and XML. Unicode is also covered but only briefly. This section world be strengthened by a chapter an hardware concepts-the authors assume that their reader is already familiar with these, which may not be true in all cases (for example, the phrase "client-Server" is first used an page 11, but only given a brief definition in the glossary). Burke's Library Technology Companion: A Basic Guide for Library Staff (New York: Neal-Schuman, 2001) might be useful to fill this gap at an introductory level, and Saffady's Introduction to Automation for Librarians, 4th ed. (Chicago: American Library Association, 1999) world be better for those interested in more detail. The final two sections, however, are the book's real strength, with a strong focus an management issues, and this content distinguishes it from other books an this topic such as Ferguson and Hebels Computers for Librarians: an Introduction to Systems and Applications (Waggawagga, NSW: Centre for Information Studies, Charles Sturt University, 1998). ...
    Though the book definitely meets a need for an up-to-date introduction to library information systems and associated management issues, and the emphasis an management issues means that it will not date too quickly, there is room for improvement. Some topics are described too briefly to be useful, such as customization/personalization, which is covered in a single paragraph, and does not mention recent developments such as the MyLibrary concept. Other topics seem to have only a peripheral connection to the main chapter theme-for example, it is surprising to find a discussion of information literacy at the end of the chapter an system selection and implementation, and the material an personalization/customization is at the end of the discussion of intranets. Despite these comments, 1 would consider using this as a textbook in an introductory course an library automation or information technology, and practitioners who want to upgrade their knowledge of current practices and issues will also find it useful. People who are primarily interested in a specific topic, such as information systems planning or system selection and implementation are likely to find more specialized books such as Planning for Integrated Systems and Technologies: A How-to-Do-It Manual for Librarians by John M. Cohn, Anne L. Kelsey, and Keith Michael Fiels (New York: Neal-Schuman, 2001) more useful."
    LCSH
    Integrated library systems (Computer systems)
    Information storage and retrieval systems
    Subject
    Integrated library systems (Computer systems)
    Information storage and retrieval systems
  3. Warner, J.: Humanizing information technology (2004) 0.05
    0.05179622 = product of:
      0.10359244 = sum of:
        0.05086134 = weight(_text_:storage in 438) [ClassicSimilarity], result of:
          0.05086134 = score(doc=438,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.21284549 = fieldWeight in 438, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.01953125 = fieldNorm(doc=438)
        0.027149757 = weight(_text_:retrieval in 438) [ClassicSimilarity], result of:
          0.027149757 = score(doc=438,freq=12.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.20466042 = fieldWeight in 438, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.01953125 = fieldNorm(doc=438)
        0.025581343 = weight(_text_:systems in 438) [ClassicSimilarity], result of:
          0.025581343 = score(doc=438,freq=10.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.18980919 = fieldWeight in 438, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.01953125 = fieldNorm(doc=438)
      0.5 = coord(3/6)
    
    Content
    An information view of history -- Organs of the human brain, created by the human hand : toward an understanding of information technology -- Information society or cash nexus? : a study of the United States as a copyright haven -- As sharp as a pen : direct semantic ratification in oral, written, and electronic communication -- In the catalogue ye go for men : evaluation criteria for information retrieval systems -- Meta- and object-language for information retrieval research : proposal for a distinction -- Forms of labor in information systems -- W(h)ither information science?
    Footnote
    Rez. in: JASIST. 56(2003) no.12, S.1360 (C.Tomer): "Humanizing Information Technology is a collection of essays that represent what are presumably Julian Warner's best efforts to understand the perpetually nascent discipline of information science and its relationship to information technology. It is clearly a formidable task. Warner succeeds occasionally in this endeavor; more often, he fails. Yet, it would be wrong to mark Humanizing Information Technology as a book not worth reading. On the contrary, though much fault was found and this review is far from positive, it was nevertheless a book well-worth reading. That Humanizing Information Technology succeeds at all is in some ways remarkable, because Warner's prose tends to be dense and graceless, and understanding his commentaries often relies an close readings of a wide array of sources, some of them familiar, many of them less so. The inaccessibility of Warner's prose is unfortunate; there is not a single idea in Humanizing Information Technology so complicated that it could not have been stated in a clear, straightforward manner. The failure to establish a clear, sufficiently füll context for the more obscure sources is an even more serious problem. Perhaps the most conspicuous example of this problem stems from the frequent examination of the concept of the "information society" and the related notion of information as an autonomous variable, each of them ideas drawn largely from Frank Webster's 1995 book, Theories of the Information Society. Several of Warner's essays contain passages in Humanizing Information Technology whose meaning and value are largely dependent an a familiarity with Webster's work. Yet, Warner never refers to Theories of the Information Society in more than cursory terms and never provides a context füll enough to understand the particular points of reference. Suffice it to say, Humanizing Information Technology is not a book for readers who lack patience or a thorough grounding in modern intellectual history. Warner's philosophical analyses, which frequently exhibit the meter, substance, and purpose of a carefully crafted comprehensive examination, are a large part of what is wrong with Humanizing Information Technology. Warner's successes come when he turns his attention away from Marxist scholasticism and toward historical events and trends. "Information Society or Cash Nexus?" the essay in which Warner compares the role of the United States as a "copyright haven" for most of the 19th century to modern China's similar status, is successful because it relies less an abstruse analysis and more an a sharply drawn comparison of the growth of two economies and parallel developments in the treatment of intellectual property. The essay establishes an illuminating context and cites historical precedents in the American experience suggesting that China's official positions toward intellectual property and related international conventions are likely to evolve and grow more mature as its economy expands and becomes more sophisticated. Similarly, the essay entitled "In the Catalogue Ye Go for Men" is effective because Warner comes dangerously close to pragmatism when he focuses an the possibility that aligning cataloging practice with the "paths and tracks" of discourse and its analysis may be the means by which to build more information systems that furnish a more direct basis for intellectual exploration.
    LCSH
    Information storage and retrieval systems
    RSWK
    Informationsgesellschaft / Informationstechnik / Information-Retrieval-System / Informationsspeicher
    Subject
    Informationsgesellschaft / Informationstechnik / Information-Retrieval-System / Informationsspeicher
    Information storage and retrieval systems
  4. Morville, P.: Ambient findability : what we find changes who we become (2005) 0.04
    0.042174384 = product of:
      0.08434877 = sum of:
        0.04068907 = weight(_text_:storage in 312) [ClassicSimilarity], result of:
          0.04068907 = score(doc=312,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.17027639 = fieldWeight in 312, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.015625 = fieldNorm(doc=312)
        0.030716445 = weight(_text_:retrieval in 312) [ClassicSimilarity], result of:
          0.030716445 = score(doc=312,freq=24.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.23154683 = fieldWeight in 312, product of:
              4.8989797 = tf(freq=24.0), with freq of:
                24.0 = termFreq=24.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.015625 = fieldNorm(doc=312)
        0.012943249 = weight(_text_:systems in 312) [ClassicSimilarity], result of:
          0.012943249 = score(doc=312,freq=4.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.096036695 = fieldWeight in 312, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.015625 = fieldNorm(doc=312)
      0.5 = coord(3/6)
    
    Footnote
    Das zweite Kapitel ("A Brief History of Wayfinding") beschreibt, wie Menschen sich in Umgebungen zurechtfinden. Dies ist insofern interessant, als hier nicht erst bei Informationssystemen oder dem WWW begonnen wird, sondern allgemeine Erkenntnisse beispielsweise über die Orientierung in natürlichen Umgebungen präsentiert werden. Viele typische Verhaltensweisen der Nutzer von Informationssystemen können so erklärt werden. So interessant dieses Thema allerdings ist, wirkt das Kapitel leider doch nur wie eine Zusammenstellung von Informationen aus zweiter Hand. Offensichtlich ist, dass Morville nicht selbst an diesen Themen geforscht hat, sondern die Ergebnisse (wenn auch auf ansprechende Weise) zusammengeschrieben hat. Dieser Eindruck bestätigt sich auch in weiteren Kapiteln: Ein flüssig geschriebener Text, der es jedoch an einigen Stellen an Substanz fehlen lässt. Kapitel drei, "Information Interaction" beginnt mit einem Rückgriff auf Calvin Mooers zentrale Aussage aus dem Jahre 1959: "An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have it." In der Tat sollte man sich dies bei der Erstellung von Informationssystemen immer vergegenwärtigen; die Reihe der Systeme, die gerade an dieser Hürde gescheitert sind, ist lang. Das weitere Kapitel führt in einige zentrale Konzepte der Informationswissenschaft (Definition des Begriffs Information, Erläuterung des Information Retrieval, Wissensrepräsentation, Information Seeking Behaviour) ein, allerdings ohne jeden Anspruch auf Vollständigkeit. Es wirkt vielmehr so, dass der Autor sich die gerade für sein Anliegen passenden Konzepte auswählt und konkurrierende Ansätze beiseite lässt. Nur ein Beispiel: Im Abschnitt "Information Interaction" wird relativ ausführlich das Konzept des Berrypicking nach Marcia J. Bates präsentiert, allerdings wird es geradezu als exklusiv verkauft, was es natürlich bei weitem nicht ist. Natürlich kann es nicht Aufgabe dieses Buchs sein, einen vollständigen Überblick über alle Theorien des menschlichen Suchverhaltens zu geben (dies ist an anderer Stelle vorbildlich geleistet worden'), aber doch wenigstens der Hinweis auf einige zentrale Ansätze wäre angebracht gewesen. Spätestens in diesem Kapitel wird klar, dass das Buch sich definitiv nicht an Informationswissenschaftler wendet, die auf der einen Seite mit den grundlegenden Themen vertraut sein dürften, andererseits ein wenig mehr Tiefgang erwarten würden. Also stellt sich die Frage - und diese ist zentral für die Bewertung des gesamten Werks.
    LCSH
    Information storage and retrieval systems
    RSWK
    Information Retrieval (GBV)
    Information Retrieval / Ubiquitous Computing (GBV)
    Information Retrieval / Datenbanksystem / Suchmaschine (GBV)
    Information Retrieval / Datenbanksystem (BVB)
    Subject
    Information Retrieval (GBV)
    Information Retrieval / Ubiquitous Computing (GBV)
    Information Retrieval / Datenbanksystem / Suchmaschine (GBV)
    Information Retrieval / Datenbanksystem (BVB)
    Information storage and retrieval systems
  5. Jörgensen, C.: Image retrieval : theory and research (2003) 0.04
    0.041743442 = product of:
      0.083486885 = sum of:
        0.043157276 = weight(_text_:storage in 3080) [ClassicSimilarity], result of:
          0.043157276 = score(doc=3080,freq=2.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.18060538 = fieldWeight in 3080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3080)
        0.02660122 = weight(_text_:retrieval in 3080) [ClassicSimilarity], result of:
          0.02660122 = score(doc=3080,freq=8.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.20052543 = fieldWeight in 3080, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3080)
        0.0137283895 = weight(_text_:systems in 3080) [ClassicSimilarity], result of:
          0.0137283895 = score(doc=3080,freq=2.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.1018623 = fieldWeight in 3080, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3080)
      0.5 = coord(3/6)
    
    Footnote
    Rez. in: KO 31(2004) no.2, S.114-115 (J. Turner): "Professor Corinne Jörgensen's book will be useful to researchers, practitioners, and graduate students working in the area of the management of collections of still images. The book is a fine piece of scholarship that is thoroughly researched and nicely written. It integrates information from a number of perspectives, including cognitive psychology and computer science, into an information science text. This work is timely, since images and other nontextual information are forming an ever larger part of the mass of information available to us. Indeed, in the long history of recorded information an our planet, images "were the only form of written communication for 25,000 out of the 30,000 years of human recorded experience ... we are, it appears, an the hinge of an important historical swing back towards what may be called the primacy of the image" (p. ix). The book will be valued for the richness of the information it gathers and for the intelligent discussion it offers. There are six chapters to the work: 1. Why images, and what do we know about them? 2. Cognitive foundations for image processing; 3. Organizing and providing access to images; 4. Machine processing of images; 5. Image attributes: the research framework; 6. Towards the future. In addition, there is an excellent bibliography of over forty pages, which is valuable because it provides so many good leads into the literatures of information science and of related disciplines that contribute to the discussions of image retrieval presented in the book. There are separate subject and author indexes. The author index is considerably longer than the subject index, an indication of how muck published literature is discussed in the text. Finally, a list of figures and a list of tables provide additional finding aids. The inclusion of discussions of issues from disciplines other than information science reflects the changing reality of information systems for managing picture collections. Throughout the time such collections have been built, there has never been much coordination of approaches, methods, or practices, even within the discipline of information science. Since the arrival about ten years ago of the World Wide Web, major changes have taken place in the way information is organised, stored, and retrieved. The new networked environment requires a great deal of coordination, common standards, and much more uniform practices than managers of collections of pictures have been used to in the past. Jörgensen's extensive research into the work accomplished by a number of contributing disciplines and her presentation of it in relation to the problem of managing collections of images indicates a deep understanding of the issues and a remarkable capacity to relate them to issues in information science. Accomplishing such a feat so successfully makes this work a valuable contribution to the ongoing discussion of how collections need to be managed in the networked environment. The interdisciplinary nature of the problem has never before been presented so clearly, nor so thoroughly.
    The discussion of available tools is excellent and quite comprehensive. This will prove very helpful to practitioners and students setting out to learn about the world of storage, retrieval, and indexing of images. The author's simple, straightforward writing style is praiseworthy, since it will help those just starting out in the field to grasp the material quickly. It will also contribute to understanding an the part of readers from other language communities who have English as a second language. Although this book discusses a number of complex topics, the author has succeeded in making the treatment of them eminently understandable. Chapter 6 will prove particularly useful to researchers in the area, many more of whom are needed, and especially to graduate students thinking about undertaking a doctoral programme in the area of image management. The author provides a research agenda which describes a number of areas in which research is needed, including a number of research questions to work on. She also includes her wish list, which "represents a personal perspective, and is offered ... as food for thought and future discussion" (p. 267). Jörgensen feels her book will soon be out of date, and indeed, that in writing the book she has been "pursuing a moving target" (p. 4). Since there is so muck work going an in the broad field of image retrieval (although not enough in the area of information science), new discoveries will be made, new issues we hadn't thought about before will come to light, new methods and standards for managing picture databases will be developed, and new approaches will surely come along. However, I'm not so sure this book will be out of date any time soon, since it serves as a record of arriving at a plateau, a point at which the knowledge accumulated to date has been gathered, recorded, and presented as a portrait of what has been achieved and of where we are now. At the very least, then, Jörgensen's book will remain as a solid record of the research to date. More immediately, it will serve as a guide to what we should be doing now, and to the next steps that need to be taken."
  6. Design and usability of digital libraries : case studies in the Asia-Pacific (2005) 0.04
    0.040966894 = product of:
      0.08193379 = sum of:
        0.04068907 = weight(_text_:storage in 93) [ClassicSimilarity], result of:
          0.04068907 = score(doc=93,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.17027639 = fieldWeight in 93, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.015625 = fieldNorm(doc=93)
        0.015358223 = weight(_text_:retrieval in 93) [ClassicSimilarity], result of:
          0.015358223 = score(doc=93,freq=6.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.11577342 = fieldWeight in 93, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.015625 = fieldNorm(doc=93)
        0.025886498 = weight(_text_:systems in 93) [ClassicSimilarity], result of:
          0.025886498 = score(doc=93,freq=16.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.19207339 = fieldWeight in 93, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.015625 = fieldNorm(doc=93)
      0.5 = coord(3/6)
    
    Footnote
    Rez. in: JASIST 58(2007) no.1, S.152-153 (J.P. Bolstad): "Over the past decade, digital library research and technology have evolved and progressed rapidly. The desire to create new and better digital library systems has inspired researchers and academics worldwide to join forces and work together to develop more efficient and user-friendly technologies. Primarily inspired by ideas presented at the Fourth International Conference on Asian Digital Libraries, which was held in 2002 in Singapore, this book illustrates a selection of diverse digital library systems that have been created in recent years, as researchers have continued to further their ideas about new developments and trends in digital libraries. In Design and Usability of Digital Libraries, the editors, Theng and Foo, compile a collection of 20 valuable case studies written by various researchers. These case studies address not only the successes that have been achieved in improving digital library research and technology, but also the problems and failures that have been discovered. Thus, researchers can perhaps learn from the errors that have occurred in these case studies and prevent the same mistakes from happening in the future. This book also demonstrates the large amount of collaboration that has occurred among various research groups throughout different countries in the Asia Pacific region. The representation of such diverse perspectives from different places is what makes the book interesting because it is particularly enlightening to read about what other countries have developed in terms of digital libraries. In general, the book is organized uniformly and is easy to follow. Each chapter represents one case study and the order of the chapters makes complete sense, as the text flows smoothly from beginning to end. The first chapter begins with a basic history of digital libraries, which helps to familiarize readers with the concept of what a digital library is and provides a brief introduction to how digital libraries came to be. The next few chapters touch on such topics as the design architecture and systems of digital libraries, implementation issues and challenges when designing digital libraries, use and impact of these libraries in societies, considerations that need to be taken into account regarding users and usability, as well as projections of future trends of digital libraries. The editors brilliantly piece together all of the chapters to make the entire book cohesive.
    The chapters are generally less than 20 pages, which allows for concise presentations of each case study. Each chapter contains, more or less, a brief abstract, introduction, related works section, methodology section, conclusion, and references. The chapters are further categorized into six thematic sections. Section I focuses on the history of digital libraries in the Asia Pacific. Section II, composed of four chapters, focuses on the design architecture and systems of digital libraries. The next five chapters, in section III, examine challenges in implementing digital library systems. This section is particularly interesting because issues such as multicultural and multilingual barriers are discussed. Section IV is about the use of and impact of digital libraries in a society. All four chapters in this section emphasize improvements that need to be made to digital libraries regarding different types of users. Particularly important is chapter 14, which discusses digital libraries and their effects on youth. The conclusion of this case study revealed that digital libraries need to support peer learning, as there are many social benefits for youth from interacting with peers. Section V, which focuses on users and usability, consists of five chapters. This section relates directly to the implementation challenges that are mentioned in section III, providing specific examples of cross-cultural issues among users that need to be taken into consideration. In addition, section V discusses the differences in media types and the difficulties with transforming these resources into digital formats. For example, chapter 18, which is about designing a music digital library, demonstrates the difficulties in selecting from the numerous types of technologies that can be used to digitize library collections. Finally, the chapter in section VI discusses the future trends of digital libraries. The editors successfully present diverse perspectives about digital libraries, by including case studies performed in numerous different countries throughout the Asia Pacific region. Countries represented in the case studies include Indonesia, Taiwan, India, China, Singapore, New Zealand, Hong Kong, Philippines, Japan, and Malaysia. The diversity of the users in these countries helps to illustrate the numerous differences and similarities that digital library designers need to take into consideration in the future when developing a universal digital library system. In order to create a successful digital library system that can benefit all users, there must be a sense of balance in the technology used, and the authors of the case studies in this book have definitely proved that there are distinct barriers that need to be overcome in order to achieve this harmony.
    Even though each chapter is short, the entire book covers a vast amount of information. This book is meant to provide an introductory sampling of issues discovered through various case studies, not provide an in-depth report on each of them. The references included at the end of each chapter are particularly helpful because they lead to more information about issues that the particular case study raises. By including a list of references at the end of each chapter, the authors want to encourage interested readers to pursue more about the topics presented. This book clearly offers many opportunities to explore issues on the same topics further. The appendix at the end of the book also contains additional useful information that readers might want to consult if they are interested in finding out more about digital libraries. Selected resources are provided in the form of a list that includes such topics as journal special issues, digital library conference proceedings, and online databases. A key issue that this book brings up is how to include different cultural materials in digital libraries. For example, in chapter 16, the concerns and issues surrounding Maori heritage materials are introduced. The terms and concepts used when classifying Maori resources are so delicate that the meaning behind them can completely change with even a slight variation. Preserving other cultures correctly is important, and researchers need to consider the consequences of any errors made during digitization of resources. Another example illustrating the importance of including information about different cultures is presented in chapter 9. The authors talk about the various different languages used in the world and suggest ways to integrate them into information retrieval systems. As all digital library researchers know, the ideal system would allow all users to retrieve results in their own languages. The authors go on to discuss a few approaches that can be taken to assist with overcoming this challenge.
    LCSH
    Information storage and retrieval systems / Case studies
    Subject
    Information storage and retrieval systems / Case studies
  7. Berry, M.W.; Browne, M.: Understanding search engines : mathematical modeling and text retrieval (2005) 0.04
    0.035900928 = product of:
      0.10770278 = sum of:
        0.057543036 = weight(_text_:storage in 7) [ClassicSimilarity], result of:
          0.057543036 = score(doc=7,freq=2.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.24080718 = fieldWeight in 7, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.03125 = fieldNorm(doc=7)
        0.05015974 = weight(_text_:retrieval in 7) [ClassicSimilarity], result of:
          0.05015974 = score(doc=7,freq=16.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.37811437 = fieldWeight in 7, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.03125 = fieldNorm(doc=7)
      0.33333334 = coord(2/6)
    
    Abstract
    The second edition of Understanding Search Engines: Mathematical Modeling and Text Retrieval follows the basic premise of the first edition by discussing many of the key design issues for building search engines and emphasizing the important role that applied mathematics can play in improving information retrieval. The authors discuss important data structures, algorithms, and software as well as user-centered issues such as interfaces, manual indexing, and document preparation. Significant changes bring the text up to date on current information retrieval methods: for example the addition of a new chapter on link-structure algorithms used in search engines such as Google. The chapter on user interface has been rewritten to specifically focus on search engine usability. In addition the authors have added new recommendations for further reading and expanded the bibliography, and have updated and streamlined the index to make it more reader friendly.
    Content
    Inhalt: Introduction Document File Preparation - Manual Indexing - Information Extraction - Vector Space Modeling - Matrix Decompositions - Query Representations - Ranking and Relevance Feedback - Searching by Link Structure - User Interface - Book Format Document File Preparation Document Purification and Analysis - Text Formatting - Validation - Manual Indexing - Automatic Indexing - Item Normalization - Inverted File Structures - Document File - Dictionary List - Inversion List - Other File Structures Vector Space Models Construction - Term-by-Document Matrices - Simple Query Matching - Design Issues - Term Weighting - Sparse Matrix Storage - Low-Rank Approximations Matrix Decompositions QR Factorization - Singular Value Decomposition - Low-Rank Approximations - Query Matching - Software - Semidiscrete Decomposition - Updating Techniques Query Management Query Binding - Types of Queries - Boolean Queries - Natural Language Queries - Thesaurus Queries - Fuzzy Queries - Term Searches - Probabilistic Queries Ranking and Relevance Feedback Performance Evaluation - Precision - Recall - Average Precision - Genetic Algorithms - Relevance Feedback Searching by Link Structure HITS Method - HITS Implementation - HITS Summary - PageRank Method - PageRank Adjustments - PageRank Implementation - PageRank Summary User Interface Considerations General Guidelines - Search Engine Interfaces - Form Fill-in - Display Considerations - Progress Indication - No Penalties for Error - Results - Test and Retest - Final Considerations Further Reading
    RSWK
    Suchmaschine / Information Retrieval
    Suchmaschine / Information Retrieval / Mathematisches Modell (HEBIS)
    Subject
    Suchmaschine / Information Retrieval
    Suchmaschine / Information Retrieval / Mathematisches Modell (HEBIS)
  8. Intner, S.S.; Lazinger, S.S.; Weihs, J.: Metadata and its impact on libraries (2005) 0.03
    0.03308613 = product of:
      0.06617226 = sum of:
        0.04068907 = weight(_text_:storage in 339) [ClassicSimilarity], result of:
          0.04068907 = score(doc=339,freq=4.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.17027639 = fieldWeight in 339, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.015625 = fieldNorm(doc=339)
        0.012539935 = weight(_text_:retrieval in 339) [ClassicSimilarity], result of:
          0.012539935 = score(doc=339,freq=4.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.09452859 = fieldWeight in 339, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.015625 = fieldNorm(doc=339)
        0.012943249 = weight(_text_:systems in 339) [ClassicSimilarity], result of:
          0.012943249 = score(doc=339,freq=4.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.096036695 = fieldWeight in 339, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.015625 = fieldNorm(doc=339)
      0.5 = coord(3/6)
    
    LCSH
    Information storage and retrieval systems
    Subject
    Information storage and retrieval systems
  9. Introducing information management : an information research reader (2005) 0.03
    0.03121713 = product of:
      0.06243426 = sum of:
        0.028771518 = weight(_text_:storage in 440) [ClassicSimilarity], result of:
          0.028771518 = score(doc=440,freq=2.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.12040359 = fieldWeight in 440, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.015625 = fieldNorm(doc=440)
        0.015358223 = weight(_text_:retrieval in 440) [ClassicSimilarity], result of:
          0.015358223 = score(doc=440,freq=6.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.11577342 = fieldWeight in 440, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.015625 = fieldNorm(doc=440)
        0.01830452 = weight(_text_:systems in 440) [ClassicSimilarity], result of:
          0.01830452 = score(doc=440,freq=8.0), product of:
            0.134774 = queryWeight, product of:
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.04385498 = queryNorm
            0.1358164 = fieldWeight in 440, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.0731742 = idf(docFreq=5561, maxDocs=44218)
              0.015625 = fieldNorm(doc=440)
      0.5 = coord(3/6)
    
    Abstract
    Information management (IM) has exploded in importance in recent years and yet until now there has been no Reader to introduce students to the subject. This comprehensive international collection introduces you to the core topics and methodologies used in teaching IM, namely: information behaviour; environmental scanning and decision making; knowledge management; and information strategy. These peer-reviewed papers represent an elite selection from the respected "Information Research" journal, each carefully updated to take into account recent developments. This book is an essential introduction to IM for all students on courses in library and information science, IM, information systems, business information technology, business management, computer science and information technology; as well as for practitioners working in a wide range of organizations providing information services.
    Footnote
    Rez. in: JASIST 58(2007) no.4, S.607-608 (A.D. Petrou): "One small example of a tension in the book's chapters can be expressed as: What exactly falls under information management (IM) as a domain of study? Is it content and research about a traditional life cycle of information, or is it the latter and also any other important issue in information research, such as culture, virtual reality, and online behavior, and communities of practice? In chapter 13, T.D. Wilson states, "Information management is the management of the life cycle to the point of delivery to the information user" (p. 164), yet as he also recognizes, other aspects of information are now included as IM's study matter. On p. 163 of the same chapter, Wilson offers Figure 12.2, titled "The extended life cycle of information." The life cycle in this case includes the following information stages: acquisition, organization, storage, retrieval, access and lending, and dissemination. All of these six stages Wilson labels, inside the circle, as IM. The rest of the extended information life cycle is information use, which includes use, sharing, and application. Chapter 3's author, Gunilla Widen-Wulff, quoting Davenport (1994), states "effective IM is about helping people make effective use of the information, rather than the machines" (p. 31). Widen-Wulff, however, addresses IM from an information culture perspective. To review the book's critical content, IM definitions and research methodology and methods reported in chapters are critically summarized next. This will provide basic information for anyone interested in using the book as an information research reader.
    The chapter by Wilson and Maceviciûtè should have been the first in the book, as it offers an informative, clearly laid out, research-based picture for IM. The chapter offers IM definitions, as mentioned earlier, and also covers a couple of major studies concerned with mapping diversity of content and topics studied in the IM field. RefViz, a visualization tool and an addition to EndNote, was used to map 462 articles published between 1999 and 2004 that had the term information management in their title. Figure 2.1 (Visualization of the IM literature), presents the map's 18 groups or clusters of documents. Two studies by Wilson also are presented. A study completed in 2004 covered the years 2000 to 2004 and reviewed five journals with articles about information activities. The 2004 study analyzed 190 articles from 383 authors. Wilson developed a number of categories about information activities as part of the 2000 and 2004 studies that indicate the scope of the articles analyzed and IM's diversity of subject matter. The remainder of the chapter presents comparative data between the 2000 and 2004 research studies. Joyce Kirk provides a hierarchy of five IM definitions. "IM as IT systems" and "information resource management" are two of these definitions. While it is difficult to clearly recognize any of the hierarchy statements as a definition for IM, what can be had from this hierarchy is the realization, as cviu te' and Wilson state in chapter 2, that IM "is used as an abbreviation for the management of IT, information systems management, management information systems, etc." (p. 20). Perhaps, the critical usefulness of the chapter resides not so much in that it offers any ready to apply definitions for IM but rather in that it provides an overall review about information. The latter can be helpful for a book intended as an information research reader and as an introduction to IM. WidenWulff examined 15 Finnish insurance businesses and developed scales for the measurement of open and closed organizations, and also presented learning organization attributes in different information environments. A 1999 study by Aiki Tibar about critical success factors (CSF) and information needs of successful Estonian companies is the centerpiece of the chapter. The study's findings are presented in relation to previous and more recent research on CSF. The study's methodology was qualitative in nature, involving semistructured interviews with managers and engineers from 25 of the most successful companies in Estonia; these companies were selected in a contest in 1998 as being included in the top 50 most successful companies. In terms of findings, IM was a CFS that was mentioned the most frequently.
    LCSH
    Information retrieval
    Subject
    Information retrieval
  10. Langville, A.N.; Meyer, C.D.: Google's PageRank and beyond : the science of search engine rankings (2006) 0.02
    0.024299448 = product of:
      0.07289834 = sum of:
        0.043157276 = weight(_text_:storage in 6) [ClassicSimilarity], result of:
          0.043157276 = score(doc=6,freq=2.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.18060538 = fieldWeight in 6, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.0234375 = fieldNorm(doc=6)
        0.02974107 = weight(_text_:retrieval in 6) [ClassicSimilarity], result of:
          0.02974107 = score(doc=6,freq=10.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.22419426 = fieldWeight in 6, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0234375 = fieldNorm(doc=6)
      0.33333334 = coord(2/6)
    
    Content
    Inhalt: Chapter 1. Introduction to Web Search Engines: 1.1 A Short History of Information Retrieval - 1.2 An Overview of Traditional Information Retrieval - 1.3 Web Information Retrieval Chapter 2. Crawling, Indexing, and Query Processing: 2.1 Crawling - 2.2 The Content Index - 2.3 Query Processing Chapter 3. Ranking Webpages by Popularity: 3.1 The Scene in 1998 - 3.2 Two Theses - 3.3 Query-Independence Chapter 4. The Mathematics of Google's PageRank: 4.1 The Original Summation Formula for PageRank - 4.2 Matrix Representation of the Summation Equations - 4.3 Problems with the Iterative Process - 4.4 A Little Markov Chain Theory - 4.5 Early Adjustments to the Basic Model - 4.6 Computation of the PageRank Vector - 4.7 Theorem and Proof for Spectrum of the Google Matrix Chapter 5. Parameters in the PageRank Model: 5.1 The a Factor - 5.2 The Hyperlink Matrix H - 5.3 The Teleportation Matrix E Chapter 6. The Sensitivity of PageRank; 6.1 Sensitivity with respect to alpha - 6.2 Sensitivity with respect to H - 6.3 Sensitivity with respect to vT - 6.4 Other Analyses of Sensitivity - 6.5 Sensitivity Theorems and Proofs Chapter 7. The PageRank Problem as a Linear System: 7.1 Properties of (I - alphaS) - 7.2 Properties of (I - alphaH) - 7.3 Proof of the PageRank Sparse Linear System Chapter 8. Issues in Large-Scale Implementation of PageRank: 8.1 Storage Issues - 8.2 Convergence Criterion - 8.3 Accuracy - 8.4 Dangling Nodes - 8.5 Back Button Modeling
    Chapter 9. Accelerating the Computation of PageRank: 9.1 An Adaptive Power Method - 9.2 Extrapolation - 9.3 Aggregation - 9.4 Other Numerical Methods Chapter 10. Updating the PageRank Vector: 10.1 The Two Updating Problems and their History - 10.2 Restarting the Power Method - 10.3 Approximate Updating Using Approximate Aggregation - 10.4 Exact Aggregation - 10.5 Exact vs. Approximate Aggregation - 10.6 Updating with Iterative Aggregation - 10.7 Determining the Partition - 10.8 Conclusions Chapter 11. The HITS Method for Ranking Webpages: 11.1 The HITS Algorithm - 11.2 HITS Implementation - 11.3 HITS Convergence - 11.4 HITS Example - 11.5 Strengths and Weaknesses of HITS - 11.6 HITS's Relationship to Bibliometrics - 11.7 Query-Independent HITS - 11.8 Accelerating HITS - 11.9 HITS Sensitivity Chapter 12. Other Link Methods for Ranking Webpages: 12.1 SALSA - 12.2 Hybrid Ranking Methods - 12.3 Rankings based on Traffic Flow Chapter 13. The Future of Web Information Retrieval: 13.1 Spam - 13.2 Personalization - 13.3 Clustering - 13.4 Intelligent Agents - 13.5 Trends and Time-Sensitive Search - 13.6 Privacy and Censorship - 13.7 Library Classification Schemes - 13.8 Data Fusion Chapter 14. Resources for Web Information Retrieval: 14.1 Resources for Getting Started - 14.2 Resources for Serious Study Chapter 15. The Mathematics Guide: 15.1 Linear Algebra - 15.2 Perron-Frobenius Theory - 15.3 Markov Chains - 15.4 Perron Complementation - 15.5 Stochastic Complementation - 15.6 Censoring - 15.7 Aggregation - 15.8 Disaggregation
  11. Booth, P.F.: Indexing : the manual of good practice (2001) 0.01
    0.012546197 = product of:
      0.03763859 = sum of:
        0.028771518 = weight(_text_:storage in 1968) [ClassicSimilarity], result of:
          0.028771518 = score(doc=1968,freq=2.0), product of:
            0.23895897 = queryWeight, product of:
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.04385498 = queryNorm
            0.12040359 = fieldWeight in 1968, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4488444 = idf(docFreq=516, maxDocs=44218)
              0.015625 = fieldNorm(doc=1968)
        0.008867074 = weight(_text_:retrieval in 1968) [ClassicSimilarity], result of:
          0.008867074 = score(doc=1968,freq=2.0), product of:
            0.13265759 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.04385498 = queryNorm
            0.06684181 = fieldWeight in 1968, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.015625 = fieldNorm(doc=1968)
      0.33333334 = coord(2/6)
    
    Footnote
    Rez. in: nfd - Information Wissenschaft und Praxis 54(2003) H.7, S.440-442 (R. Fugmann): "Das Buch beginnt mit dem Kapitel "Myths about Indexing" und mit der Nennung von weit verbreiteten Irrtümern über das Indexieren, und zwar vorrangig über das Registermachen. Mit einem einzigen Satz ist die Problematik treffend skizziert, welcher das Buch gewidmet ist: "With the development of electronic documents, it has become possible to store very large amounts of information; but storage is not of much use without the capability to retrieve, to convert, transfer and reuse the information". Kritisiert wird die weit verbreitet anzutreffende Ansicht, das Indexieren sei lediglich eine Sache vom "picking out words from the text or naming objects in images and using those words as index headings". Eine solche Arbeitsweise führt jedoch nicht zu Registern, sondern zu Konkordanzen (d.h. zu alphabetischen Fundstellenlisten für Textwörter) und"... is entirely dependent an the words themselves and is not concerned with the ideas behind them". Das Sammeln von Information ist einfach. Aber die (Wieder-) Auffindbarkeit herzustellen muss gelernt werden, wenn mehr ermöglicht werden soll als lediglich das Wiederfinden von Texten, die man in allen Einzelheiten noch genau in Erinnerung behalten hat (known-item searches, questions of recall), die Details der sprachlichen Ausdrucksweise für die gesuchten Begriffe eingeschlossen. Die Verfasserin beschreibt aus ihrer großen praktischen Erfahrung, welche Schritte hierzu auf der gedanklichen und technischen Ebene unternommen werden müssen. Zu den erstgenannten Schritten rechnet die Abtrennung von Details, welche nicht im Index vertreten sein sollten ("unsought terms"), weil sie mit Sicherheit kein Suchziel darstellen werden und als "false friends" zur Überflutung des Suchenden mit Nebensächlichkeiten führen würden, eine Entscheidung, welche nur mit guter Sachkenntnis gefällt werden kann. All Dasjenige hingegen, was in Gegenwart und Zukunft (!) ein sinnvolles Suchziel darstellen könnte und "sufficiently informative" ist, verdient ein Schlagwort im Register. Man lernt auch durch lehrreiche Beispiele, wodurch ein Textwort unbrauchbar für das Register wird, wenn es dort als (schlechtes) Schlagwort erscheint, herausgelöst aus dem interpretierenden Zusammenhang, in welchen es im Text eingebettet gewesen ist. Auch muss die Vieldeutigkeit bereinigt werden, die fast jedem natursprachigen Wort anhaftet. Sonst wird der Suchende beim Nachschlagen allzu oft in die Irre geführt, und zwar um so öfter, je größer ein diesbezüglich unbereinigter Speicher bereits geworden ist.
    Der Zugang zum Informationsspeicher ist auch von verwandten Begriffen her zu gewährleisten, denn der Suchende lässt sich gern mit seiner Fragestellung zu allgemeineren und vor allem zu spezifischeren Begriffen leiten. Verweisungen der Art "siehe auch" dienen diesem Zweck. Der Zugang ist auch von unterschiedlichen, aber bedeutungsgleichen Ausdrücken mithilfe einer Verweisung von der Art "siehe" zu gewährleisten, denn ein Fragesteller könnte sich mit einem von diesen Synonymen auf die Suche begeben haben und würde dann nicht fündig werden. Auch wird Vieles, wofür ein Suchender sein Schlagwort parat hat, in einem Text nur in wortreicher Umschreibung und paraphrasiert angetroffen ("Terms that may not appear in the text but are likely to be sought by index users"), d.h. praktisch unauffindbar in einer derartig mannigfaltigen Ausdrucksweise. All dies sollte lexikalisch ausgedrückt werden, und zwar in geläufiger Terminologie, denn in dieser Form erfolgt auch die Fragestellung. Hier wird die Grenze zwischen "concept indexing" gegenüber dem bloßen "word indexing" gezogen, welch letzteres sich mit der Präsentation von nicht interpretierten Textwörtern begnügt. Nicht nur ist eine solche Grenze weit verbreitet unbekannt, ihre Existenz wird zuweilen sogar bestritten, obwohl doch ein Wort meistens viele Begriffe ausdrückt und obwohl ein Begriff meistens durch viele verschiedene Wörter und Sätze ausgedrückt wird. Ein Autor kann und muss sich in seinen Texten oft mit Andeutungen begnügen, weil ein Leser oder Zuhörer das Gemeinte schon aus dem Zusammenhang erkennen kann und nicht mit übergroßer Deutlichkeit (spoon feeding) belästigt sein will, was als Unterstellung von Unkenntnis empfunden würde. Für das Retrieval hingegen muss das Gemeinte explizit ausgedrückt werden. In diesem Buch wird deutlich gemacht, was alles an außertextlichem und Hintergrund-Wissen für ein gutes Indexierungsergebnis aufgeboten werden muss, dies auf der Grundlage von sachverständiger und sorgfältiger Interpretation ("The indexer must understand the meaning of a text"). All dies lässt gutes Indexieren nicht nur als professionelle Dienstleistung erscheinen, sondern auch als Kunst. Als Grundlage für all diese Schritte wird ein Thesaurus empfohlen, mit einem gut strukturierten Netzwerk von verwandtschaftlichen Beziehungen und angepasst an den jeweiligen Buchtext. Aber nur selten wird man auf bereits andernorts vorhandene Thesauri zurückgreifen können. Hier wäre ein Hinweis auf einschlägige Literatur zur Thesaurus-Konstruktion nützlich gewesen.

Languages

  • e 28
  • d 2

Subjects

Classifications