Search (9 results, page 1 of 1)

  • × author_ss:"Larson, R.R."
  1. Larson, R.R.: Cheshire 2 : design and evaluation of a next-generation online catalog system (1995) 0.04
    0.040823873 = product of:
      0.14288355 = sum of:
        0.07045048 = weight(_text_:techniques in 3820) [ClassicSimilarity], result of:
          0.07045048 = score(doc=3820,freq=2.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.3893711 = fieldWeight in 3820, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.0625 = fieldNorm(doc=3820)
        0.07243308 = product of:
          0.14486615 = sum of:
            0.14486615 = weight(_text_:mathematics in 3820) [ClassicSimilarity], result of:
              0.14486615 = score(doc=3820,freq=2.0), product of:
                0.25945482 = queryWeight, product of:
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.04107254 = queryNorm
                0.5583483 = fieldWeight in 3820, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3820)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    The Cheshire 2 project is developing a next generation online catalogue and full text information retrieval system using advanced information retrieval techniques. It is being deployed at the University of California Berkeley Astronomy-Mathematics-Statistics Library, USA, and its use and acceptance is being evaluated using transaction monitoring and questionnaires. Describes the system architecture and user evaluation tools
  2. Larson, R.R.: Information retrieval systems (2009) 0.03
    0.027844835 = product of:
      0.09745692 = sum of:
        0.04461906 = weight(_text_:processing in 3817) [ClassicSimilarity], result of:
          0.04461906 = score(doc=3817,freq=2.0), product of:
            0.1662677 = queryWeight, product of:
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.04107254 = queryNorm
            0.26835677 = fieldWeight in 3817, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.048147 = idf(docFreq=2097, maxDocs=44218)
              0.046875 = fieldNorm(doc=3817)
        0.052837856 = weight(_text_:techniques in 3817) [ClassicSimilarity], result of:
          0.052837856 = score(doc=3817,freq=2.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.2920283 = fieldWeight in 3817, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.046875 = fieldNorm(doc=3817)
      0.2857143 = coord(2/7)
    
    Abstract
    Information retrieval (IR) systems aim to select relevant material from large collections of information in response to user queries. The approaches used to accomplish this have been the focus of much research and development over the past 50 years, and have led to the algorithms underlying many commercial and Web-based search engines today. This entry describes the common components that go into the design of IR systems (from text processing to inverted file indexes). The major classes (or models) of retrieval algorithms (Boolean, vector, and probabilistic) are described along with formal definitions of the basic form of these algorithms and some of the variations in common use in IR research. In addition, the entry examines query expansion techniques, and in particular relevance feedback, and how they are used in IR systems.
  3. Larson, R.R.: Hypertext and information retrieval : towards the next generation of information systems (1988) 0.01
    0.012454004 = product of:
      0.08717802 = sum of:
        0.08717802 = weight(_text_:techniques in 3557) [ClassicSimilarity], result of:
          0.08717802 = score(doc=3557,freq=4.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.48182213 = fieldWeight in 3557, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3557)
      0.14285715 = coord(1/7)
    
    Abstract
    Hypertext is an old concept that only recently has been demonstrated in working systems. Examines the conceptual basis of hypertext, reviews some representative hypertext systems, and discusses some of the problems to be faced when hypertext techniques are applied to large-scale information systems. Techniques developed in information retrieval research are seen as useful complements to hypertext that may remedy some of these problems.
  4. Larson, R.R.: Evaluation of advanced retrieval techniques in an experimental online catalog (1992) 0.01
    0.010674859 = product of:
      0.07472401 = sum of:
        0.07472401 = weight(_text_:techniques in 481) [ClassicSimilarity], result of:
          0.07472401 = score(doc=481,freq=4.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.4129904 = fieldWeight in 481, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.046875 = fieldNorm(doc=481)
      0.14285715 = coord(1/7)
    
    Abstract
    Research on the use and users of online catalogs conducted in the early 1980s found that subject searches were the most common form of online catalog search. At the same time, many of the problems experienced by online catalog users have been traced to difficulties with the subject access mechanisms of the online catalog. Numerous proposals have been made for methods intended to improve subject access to online catalog records. These commonly involve enhancing the catalog's bibliographic records with additional terms, or incorporating subject authority files or additional thesauri in the database. Another stream of research has concentrated on applying retrieval techniques derived from information retrieval (IR) research to replace the Boolean search methods of conventional online catalog systems. This study describes the results of retrieval tests using a variety of these search methods in the CHESHIRE experimental online catalog system.
  5. Larson, R.R.; Carson, C.: Information access for a digital library : Cheshire II and the Berkeley environment digital library (1999) 0.01
    0.010482524 = product of:
      0.07337766 = sum of:
        0.07337766 = weight(_text_:digital in 6685) [ClassicSimilarity], result of:
          0.07337766 = score(doc=6685,freq=6.0), product of:
            0.16201277 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.04107254 = queryNorm
            0.4529128 = fieldWeight in 6685, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.046875 = fieldNorm(doc=6685)
      0.14285715 = coord(1/7)
    
    Abstract
    The Cheshire II system was originally developed to provide a bridge from conventional online library catalogs to full-text online resources. Recently we have begun using the system to implement full-text and fielded searching of bibliographic information for the UC Berkeley Digital Library Initiative project sponsored by NSF, NASA and ARPA. The Cheshire system is also being used to provide scalable performance for image querying using the "Blobworld" image representation. This paper will review the characteristics of the Cheshire II system and examine its performance and behavior when applied to a collection of large full-text documents in the TREC Interactive Retrieval Track and its performance in Blobworld image searching
  6. Larson, R.R.: Classification clustering, probabilistic information retrieval, and the online catalog (1991) 0.01
    0.008806311 = product of:
      0.06164417 = sum of:
        0.06164417 = weight(_text_:techniques in 1070) [ClassicSimilarity], result of:
          0.06164417 = score(doc=1070,freq=2.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.3406997 = fieldWeight in 1070, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1070)
      0.14285715 = coord(1/7)
    
    Abstract
    Research into online catalog use and users has found some pervasive problems with subject searching in these systems. Subject searches too often fail to retrieve anything, and those that do succeed often retrieve "too much" material. This article examnies these problems and how they might be remedied. The theoretical principles for the design of effective information retrieval systems are discussed, and an experimental online catalog system based on these principles is described. The system, CHESHIRE, uses a method called "classification clustering", combined with probabilistic retrieval techniques, to provide natural language searching (which helps to reduce search failure) and to provide effective control of "information overload" in subject searching
  7. Larson, R.R.: Experiments in automatic Library of Congress Classification (1992) 0.01
    0.0075482656 = product of:
      0.052837856 = sum of:
        0.052837856 = weight(_text_:techniques in 1054) [ClassicSimilarity], result of:
          0.052837856 = score(doc=1054,freq=2.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.2920283 = fieldWeight in 1054, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.046875 = fieldNorm(doc=1054)
      0.14285715 = coord(1/7)
    
    Abstract
    This article presents the results of research into the automatic selection of Library of Congress Classification numbers based on the titles and subject headings in MARC records. The method used in this study was based on partial match retrieval techniques using various elements of new recors (i.e., those to be classified) as "queries", and a test database of classification clusters generated from previously classified MARC records. Sixty individual methods for automatic classification were tested on a set of 283 new records, using all combinations of four different partial match methods, five query types, and three representations of search terms. The results indicate that if the best method for a particular case can be determined, then up to 86% of the new records may be correctly classified. The single method with the best accuracy was able to select the correct classification for about 46% of the new records.
  8. Larson, R.R.: ¬The decline of subject searching : long-term trends and patterns of index use in an online catalog (1991) 0.01
    0.0075482656 = product of:
      0.052837856 = sum of:
        0.052837856 = weight(_text_:techniques in 1104) [ClassicSimilarity], result of:
          0.052837856 = score(doc=1104,freq=2.0), product of:
            0.18093403 = queryWeight, product of:
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.04107254 = queryNorm
            0.2920283 = fieldWeight in 1104, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.405231 = idf(docFreq=1467, maxDocs=44218)
              0.046875 = fieldNorm(doc=1104)
      0.14285715 = coord(1/7)
    
    Abstract
    Search index usage in a large university online catalog system over a six-year period (representing about 15,3 million searches) was investigated using transaction monitor data. Mathematical models of trends and patterns in the data were developed and tested using regression techniques. The results of the analyses show a consistent decline in the frequency of subject index use by online catalog users, with a corresponding increase in the frequency of title keyword searching. Significant annual patterns in index usage were also identified. Analysis of the transaction data, and related previous studies of online catalog users, suggest a number of factors contributing to the decline in subject search frequency. Chief among these factors are user difficulties in formulating subject queries with LCSH, leading to search failure, and the problem of "information overload" as database size increases. This article presents the models and results of the transaction log analysis, discusses the underlying problems with subject searching contributing to the observed decline, and reviews some proposed improvements to online catalog systems to aid in overcoming these problems
  9. Larson, R.R.: Design and development of a network-based electronic library (1994) 0.01
    0.0060520875 = product of:
      0.042364612 = sum of:
        0.042364612 = weight(_text_:digital in 3032) [ClassicSimilarity], result of:
          0.042364612 = score(doc=3032,freq=2.0), product of:
            0.16201277 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.04107254 = queryNorm
            0.26148933 = fieldWeight in 3032, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.046875 = fieldNorm(doc=3032)
      0.14285715 = coord(1/7)
    
    Abstract
    Among the proposed innovations in the Clinton Administration's plans to develop a National Information Infrastructure is the creation of, and support for, digital or electronic libraries to store and provide access to the vast amounts of information expected to made available over the 'information superhighway'. Although the exact nature and future architecture of such libraries is still a matter for experimentation (and debate), there are several pioineering efforts underway to establish electronic libraries and to provide access to them. This paper describes one such effort underway at the University of California at Berkeley. In collaboration with four other universities we are developing interoperable electronic library servers containing the Computer Science technical reports for each participant and making them available over the Internet using standard protocols