Document (#27051)

Author
Schwartz, C.
Title
Sorting out the Web : approaches to subject access
Imprint
Westport, CT : Ablex
Year
2001
Pages
169 S
Isbn
1-56750-519-8
Footnote
Rez. in: KO 50(2003) no.1, S.45-46 (L.M. Given): "In her own preface to this work, the author notes her lifelong fascination with classification and order, as well as her more recent captivation with the Internet - a place of "chaos in need of organization" (xi). Sorting out the Web examines current efforts to organize the Web and is well-informed by the author's academic and professional expertise in information organization, information retrieval, and Web development. Although the book's level and tone are particularly relevant to a student audience (or others interested in Web-based subject access at an introductory level), it will also appeal to information professionals developing subject access systems across a range of information contexts. There are six chapters in the book, each describing and analyzing one core concept related to the organization of Web content. All topics are presented in a manner ideal for newcomers to the area, with clear definitions, examples, and visuals that illustrate the principles under discussion. The first chapter provides a brief introduction to developments in information technology, including an historical overview of information services, users' needs, and libraries' responses to the Internet. Chapter two introduces metadata, including core concepts and metadata formats. Throughout this chapter the author presents a number of figures that aptly illustrate the application of metadata in HTML, SGML, and MARC record environments, and the use of metadata tools (e.g., XML, RDF). Chapter three begins with an overview of classification theory and specific schemes, but the author devotes most of the discussion to the application of classification systems in the Web environment (e.g., Dewey, LCC, UDC). Web screen captures illustrate the use of these schemes for information sources posted to sites around the world. The chapter closes with a discussion of the future of classification; this is a particularly useful section as the author presents a listing of core journal and conference venues where new approaches to Web classification are explored. In chapter four, the author extends the discussion of classification to the use of controlled vocabularies. As in the first few chapters, the author first presents core background material, including reasons to use controlled vocabularies and the differences between preand post-coordinate indexing, and then discusses the application of specific vocabularies in the Web environment (e.g., Infomine's use of LCSH). The final section of the chapter explores failure in subject searching and the limitations of controlled vocabularies for the Web. Chapter five discusses one of the most common and fast-growing topics related to subject access an the Web: search engines. The author presents a clear definition of the term that encompasses classified search lists (e.g., Yahoo) and query-based engines (e.g., Alta Vista). In addition to historical background an the development of search engines, Schwartz also examines search service types, features, results, and system performance.
The chapter concludes with an appendix of search tips that even seasoned searchers will appreciate; these tips cover the complete search process, from preparation to the examination of results. Chapter six is appropriately entitled "Around the Corner," as it provides the reader with a glimpse of the future of subject access for the Web. Text mining, visualization, machine-aided indexing, and other topics are raised here to whet the reader's appetite for what is yet to come. As the author herself notes in these final pages, librarians will likely increase the depth of their collaboration with software engineers, knowledge managers and others outside of the traditional library community, and thereby push the boundaries of subject access for the digital world. This final chapter leaves this reviewer wanting a second volume of the book, one that might explore these additional topics, as they evolve over the coming years. One characteristic of any book that addresses trends related to the Internet is how quickly the text becomes dated. However, as the author herself asserts, there are core principles related to subject analysis that stand the test of time, leaving the reader with a text that may be generalized well beyond the publication date. In this, Schwartz's text is similar to other recent publications (e.g., Jakob Nielsen's Web Usability, also published in 2001) that acknowledge the mutability of the Web, and therefore discuss core principles and issues that may be applied as the medium itself evolves. This approach to the writing makes this a useful book for those teaching in the areas of subject analysis, information retrieval and Web development for possible consideration as a course text. Although the websites used here may need to be supplemented with more current examples in the classroom, the core content of the book will be relevant for many years to come. Although one might expect that any book taking subject access as its focus world, itself, be easy to navigate, this is not always the case. In this text, however, readers will be pleased to find that no small detail in content access has been spared. The subject Index is thorough and well-crafted, and the inclusion of an exhaustive author index is particularly useful for quick reference. In addition, the table of contents includes sub-themes for each chapter, and a complete table of figures is provided. While the use of colour figures world greatly enhance the text, all black-andwhite images are clear and sharp, a notable fact given that most of the figures are screen captures of websites or database entries. In addition, the inclusion of comprehensive reference lists at the close of each chapter makes this a highly readable text for students and instructors alike; each section of the book can stand as its own "expert review" of the topic at hand. In both content and structure this text is highly recommended. It certainly meets its intended goal of providing a timely introduction to the methods and problems of subject access in the Web environment, and does so in a way that is readable, interesting and engaging."
Theme
Internet
Grundlagen u. Einführungen: Allgemeine Literatur

Similar documents (author)

  1. Schwartz, J.: ¬A new classification and notation (1882) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:schwartz in 820) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 820, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=820)
    
  2. Schwartz, W.: Digitalisierung von Büchern : zwei amerikanische Projekte (1992) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:schwartz in 3178) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 3178, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=3178)
    
  3. Schwartz, C.: Evaluating CD-ROM products : yet another checklist (1993) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:schwartz in 3750) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 3750, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=3750)
    
  4. Schwartz, C.: Trends in interface design in the CD-ROM database environment (1989) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:schwartz in 4019) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 4019, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=4019)
    
  5. Schwartz, C.: ¬The CD-ROM journal literature : where do you look? (1992) 5.07
    5.070855 = sum of:
      5.070855 = weight(author_txt:schwartz in 4857) [ClassicSimilarity], result of:
        5.070855 = fieldWeight in 4857, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.113368 = idf(docFreq=35, maxDocs=44218)
          0.625 = fieldNorm(doc=4857)
    

Similar documents (content)

  1. Connell, T.H.: Subject cataloging (1996) 0.52
    0.5205287 = sum of:
      0.5205287 = product of:
        0.6940383 = sum of:
          0.16702889 = weight(abstract_txt:access in 4459) [ClassicSimilarity], result of:
            0.16702889 = score(doc=4459,freq=2.0), product of:
              0.20703667 = queryWeight, product of:
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.056707174 = queryNorm
              0.8067599 = fieldWeight in 4459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.15625 = fieldNorm(doc=4459)
          0.28947777 = weight(abstract_txt:subject in 4459) [ClassicSimilarity], result of:
            0.28947777 = score(doc=4459,freq=4.0), product of:
              0.23709352 = queryWeight, product of:
                1.0701292 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.056707174 = queryNorm
              1.2209433 = fieldWeight in 4459, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.15625 = fieldNorm(doc=4459)
          0.23753163 = weight(abstract_txt:approaches in 4459) [ClassicSimilarity], result of:
            0.23753163 = score(doc=4459,freq=1.0), product of:
              0.3298708 = queryWeight, product of:
                1.2622584 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.056707174 = queryNorm
              0.7200748 = fieldWeight in 4459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.15625 = fieldNorm(doc=4459)
        0.75 = coord(3/4)
    
  2. Farley, L.: Together at last : regeneration and merging of the MELVYL catalog and periodicals databases (1997) 0.42
    0.41909447 = sum of:
      0.41909447 = product of:
        0.83818895 = sum of:
          0.10131721 = weight(abstract_txt:subject in 1834) [ClassicSimilarity], result of:
            0.10131721 = score(doc=1834,freq=1.0), product of:
              0.23709352 = queryWeight, product of:
                1.0701292 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.056707174 = queryNorm
              0.42733017 = fieldWeight in 1834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.109375 = fieldNorm(doc=1834)
          0.7368717 = weight(abstract_txt:sorting in 1834) [ClassicSimilarity], result of:
            0.7368717 = score(doc=1834,freq=1.0), product of:
              0.89000434 = queryWeight, product of:
                2.073349 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.056707174 = queryNorm
              0.8279417 = fieldWeight in 1834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.109375 = fieldNorm(doc=1834)
        0.5 = coord(2/4)
    
  3. Notess, G.R.: Mega-searching from the desktop (1997) 0.41
    0.4097734 = sum of:
      0.4097734 = product of:
        0.8195468 = sum of:
          0.082675084 = weight(abstract_txt:access in 433) [ClassicSimilarity], result of:
            0.082675084 = score(doc=433,freq=1.0), product of:
              0.20703667 = queryWeight, product of:
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.056707174 = queryNorm
              0.3993258 = fieldWeight in 433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.109375 = fieldNorm(doc=433)
          0.7368717 = weight(abstract_txt:sorting in 433) [ClassicSimilarity], result of:
            0.7368717 = score(doc=433,freq=1.0), product of:
              0.89000434 = queryWeight, product of:
                2.073349 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.056707174 = queryNorm
              0.8279417 = fieldWeight in 433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.109375 = fieldNorm(doc=433)
        0.5 = coord(2/4)
    
  4. Casey, D.D.: Scouting new horizons : an annotated bibliography introducing subject access in visual image databases (1994) 0.38
    0.38407803 = sum of:
      0.38407803 = product of:
        0.51210403 = sum of:
          0.14319745 = weight(abstract_txt:access in 1428) [ClassicSimilarity], result of:
            0.14319745 = score(doc=1428,freq=3.0), product of:
              0.20703667 = queryWeight, product of:
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.056707174 = queryNorm
              0.69165254 = fieldWeight in 1428, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.109375 = fieldNorm(doc=1428)
          0.20263442 = weight(abstract_txt:subject in 1428) [ClassicSimilarity], result of:
            0.20263442 = score(doc=1428,freq=4.0), product of:
              0.23709352 = queryWeight, product of:
                1.0701292 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.056707174 = queryNorm
              0.85466033 = fieldWeight in 1428, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.109375 = fieldNorm(doc=1428)
          0.16627215 = weight(abstract_txt:approaches in 1428) [ClassicSimilarity], result of:
            0.16627215 = score(doc=1428,freq=1.0), product of:
              0.3298708 = queryWeight, product of:
                1.2622584 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.056707174 = queryNorm
              0.50405234 = fieldWeight in 1428, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.109375 = fieldNorm(doc=1428)
        0.75 = coord(3/4)
    
  5. Drabenstott, K.M.; Weller, M.S.: Testing a new design for subject searching in online catalogs (1994) 0.36
    0.3643701 = sum of:
      0.3643701 = product of:
        0.4858268 = sum of:
          0.116920225 = weight(abstract_txt:access in 7716) [ClassicSimilarity], result of:
            0.116920225 = score(doc=7716,freq=2.0), product of:
              0.20703667 = queryWeight, product of:
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.056707174 = queryNorm
              0.56473196 = fieldWeight in 7716, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6509786 = idf(docFreq=3120, maxDocs=44218)
                0.109375 = fieldNorm(doc=7716)
          0.20263442 = weight(abstract_txt:subject in 7716) [ClassicSimilarity], result of:
            0.20263442 = score(doc=7716,freq=4.0), product of:
              0.23709352 = queryWeight, product of:
                1.0701292 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.056707174 = queryNorm
              0.85466033 = fieldWeight in 7716, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.109375 = fieldNorm(doc=7716)
          0.16627215 = weight(abstract_txt:approaches in 7716) [ClassicSimilarity], result of:
            0.16627215 = score(doc=7716,freq=1.0), product of:
              0.3298708 = queryWeight, product of:
                1.2622584 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.056707174 = queryNorm
              0.50405234 = fieldWeight in 7716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.109375 = fieldNorm(doc=7716)
        0.75 = coord(3/4)