Search (34 results, page 1 of 2)

  • × theme_ss:"Verbale Doksprachen im Online-Retrieval"
  1. Devadason, F.J.; Intaraksa, N.; Patamawongjariya, P.; Desai, K.: Faceted indexing based system for organizing and accessing Internet resources (2002) 0.03
    0.02571402 = product of:
      0.06857072 = sum of:
        0.022528138 = weight(_text_:wide in 97) [ClassicSimilarity], result of:
          0.022528138 = score(doc=97,freq=2.0), product of:
            0.13148437 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029675366 = queryNorm
            0.171337 = fieldWeight in 97, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.02734375 = fieldNorm(doc=97)
        0.03456879 = weight(_text_:web in 97) [ClassicSimilarity], result of:
          0.03456879 = score(doc=97,freq=16.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.35694647 = fieldWeight in 97, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.02734375 = fieldNorm(doc=97)
        0.011473786 = weight(_text_:data in 97) [ClassicSimilarity], result of:
          0.011473786 = score(doc=97,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.12227618 = fieldWeight in 97, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.02734375 = fieldNorm(doc=97)
      0.375 = coord(3/8)
    
    Abstract
    Organizing and providing access to the resources an the Internet has been a problem area in spite of the availability of sophisticated search engines and other Software tools. There have been several attempts to organize the resources an the World Wide Web. Some of them have tried to use traditional library classification schemes such as the Library of Congress Classification, the Dewey Decimal Classification and others. However there is a need to assign proper subject headings to them and present them in a logical or hierarchical sequence to cater to the need for browsing. This paper attempts to describe an experimental system designed to organize and provide access to web documents using a faceted pre-coordinate indexing system based an the Deep Structure Indexing System (DSIS) derived from POPSI (Postulate based Permuted Subject Indexing) of Bhattacharyya, and the facet analysis and chain indexing system of Ranganathan. A prototype Software System has been designed to create a database of records specifying Web documents according to the Dublin Core and to input a faceted subject heading according to DSIS. Synonymous terms are added to the Standard terms in the heading using appropriate symbols. Once the data are entered along with a description and the URL of the web document, the record is stored in the System. More than one faceted subject heading can be assigned to a record depending an the content of the original document. The System stores the Surrogates and keeps the faceted subject headings separately after establishing a link. The search is carried out an index entries derived from the faceted subject heading using the chain indexing technique. If a single term is Input, the System searches for its presence in the faceted subject headings and displays the subject headings in a sorted sequence reflecting an organizing sequence. If the number of retrieved Keadings is too large (running into more than a page) the user has the option of entering another search term to be searched in combination. The System searches subject headings already retrieved and looks for those containing the second term. The retrieved faceted subject headings can be displayed and browsed. When the relevant subject heading is selected the system displays the records with their URLs. Using the URL, the original document an the web can be accessed. The prototype system developed in a Windows NT environment using ASP and a web server is under rigorous testing. The database and Index management routines need further development.
    An interesting but somewhat confusing article telling how the writers described web pages with Dublin Core metadata, including a faceted classification, and built a system that lets users browse the collection through the facets. They seem to want to cover too much in a short article, and unnecessary space is given over to screen shots showing how Dublin Core metadata was entered. The screen shots of the resulting browsable system are, unfortunately, not as enlightening as one would hope, and there is no discussion of how the system was actually written or the technology behind it. Still, it could be worth reading as an example of such a system and how it is treated in journals.
    Footnote
    Vgl. auch: Devadason, F.J.: Facet analysis and Semantic Web: musings of a student of Ranganathan. Unter: http://www.geocities.com/devadason.geo/FASEMWEB.html#FacetedIndex.
  2. Poynder, R.: Web research engines? (1996) 0.02
    0.016629778 = product of:
      0.06651911 = sum of:
        0.046849765 = weight(_text_:web in 5698) [ClassicSimilarity], result of:
          0.046849765 = score(doc=5698,freq=10.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.48375595 = fieldWeight in 5698, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=5698)
        0.019669347 = weight(_text_:data in 5698) [ClassicSimilarity], result of:
          0.019669347 = score(doc=5698,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.2096163 = fieldWeight in 5698, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.046875 = fieldNorm(doc=5698)
      0.25 = coord(2/8)
    
    Abstract
    Describes the shortcomings of search engines for the WWW comparing their current capabilities to those of the first generation CD-ROM products. Some allow phrase searching and most are improving their Boolean searching. Few allow truncation, wild cards or nested logic. They are stateless, losing previous search criteria. Unlike the indexing and classification systems for today's CD-ROMs, those for Web pages are random, unstructured and of variable quality. Considers that at best Web search engines can only offer free text searching. Discusses whether automatic data classification systems such as Infoseek Ultra can overcome the haphazard nature of the Web with neural network technology, and whether Boolean search techniques may be redundant when replaced by technology such as the Euroferret search engine. However, artificial intelligence is rarely successful on huge, varied databases. Relevance ranking and automatic query expansion still use the same simple inverted indexes. Most Web search engines do nothing more than word counting. Further complications arise with foreign languages
  3. Schabas, A.H.: Postcoordinate retrieval : a comparison of two retrieval languages (1982) 0.02
    0.01660908 = product of:
      0.06643632 = sum of:
        0.038619664 = weight(_text_:wide in 1202) [ClassicSimilarity], result of:
          0.038619664 = score(doc=1202,freq=2.0), product of:
            0.13148437 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029675366 = queryNorm
            0.29372054 = fieldWeight in 1202, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.046875 = fieldNorm(doc=1202)
        0.027816659 = weight(_text_:data in 1202) [ClassicSimilarity], result of:
          0.027816659 = score(doc=1202,freq=4.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.29644224 = fieldWeight in 1202, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.046875 = fieldNorm(doc=1202)
      0.25 = coord(2/8)
    
    Abstract
    This article reports on a comparison of the postcoordinate retrieval effectiveness of two indexing languages: LCSH and PRECIS. The effect of augmenting each with title words was also studies. The database for the study was over 15.000 UK MARC records. Users returned 5.326 relevant judgements for citations retrieved for 61 SDI profiles, representing a wide variety of subjects. Results are reported in terms of precision and relative recall. Pure/applied sciences data and social science data were analyzed separately. Cochran's significance tests for ratios were used to interpret the findings. Recall emerged as the more important measure discriminating the behavior of the two languages. Addition of title words was found to improve recall of both indexing languages significantly. A direct relationship was observed between recall and exhaustivity. For the social sciences searches, recalls from PRECIS alone and from PRECIS with title words were significantly higher than those from LCSH alone and from LCSH with title words, respectively. Corresponding comparisons for the pure/applied sciences searches revealed no significant differences
  4. Broughton, V.: Structural, linguistic and mathematical elements in indexing languages and search engines : implications for the use of index languages in electronic and non-LIS environments (2000) 0.01
    0.012410732 = product of:
      0.049642928 = sum of:
        0.03218305 = weight(_text_:wide in 96) [ClassicSimilarity], result of:
          0.03218305 = score(doc=96,freq=2.0), product of:
            0.13148437 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029675366 = queryNorm
            0.24476713 = fieldWeight in 96, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.0390625 = fieldNorm(doc=96)
        0.017459875 = weight(_text_:web in 96) [ClassicSimilarity], result of:
          0.017459875 = score(doc=96,freq=2.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.18028519 = fieldWeight in 96, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=96)
      0.25 = coord(2/8)
    
    Abstract
    The paper looks at ways in which traditional classification and indexing tools have dealt with the relationships between constituent terms; variations in these are examined and compared with the methods used in machine searching, particularly of the Internet and World Wide Web. Apparent correspondences with features of index languages are identified, and further methods of applying classification and indexing theory to machine retrieval are proposed. There are various ways in which indexing and retrieval systems, both conventional and electronic, deal with the problem of searching for documents on a subject basis, and various approaches to the analysis and processing of a query. There appear to be three basic models; the taxonomic or structural system, in which the user is offered a map of the `universe of knowledge'; the language based system, which offers a vocabulary of the subject and a grammar for dealing with compound statements; and the mathematical model using the language of symbolic logic or the algebra of set theory
  5. Devadason, F.J.; Intaraksa, N.; Patamawongjariya, P.; Desai, K.: Faceted indexing application for organizing and accessing internet resources (2003) 0.01
    0.011086518 = product of:
      0.04434607 = sum of:
        0.031233175 = weight(_text_:web in 3966) [ClassicSimilarity], result of:
          0.031233175 = score(doc=3966,freq=10.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.32250395 = fieldWeight in 3966, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.03125 = fieldNorm(doc=3966)
        0.013112898 = weight(_text_:data in 3966) [ClassicSimilarity], result of:
          0.013112898 = score(doc=3966,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.1397442 = fieldWeight in 3966, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.03125 = fieldNorm(doc=3966)
      0.25 = coord(2/8)
    
    Abstract
    Organizing and providing access to the resources an the Internet has been a problem area in spite of the availability of sophisticated search engines and other Software tools. There have been several attempts to organize the resources an the WWW. Some of them have tried to use traditional library classification schemes such as the Library of Congress Classification, the Dewey Decimal Classification and others. However there is a need to assign proper subject headings to them and present them in a logical or hierarchical sequence to cater to the need for browsing. This paper attempts to describe an experimental system designed to organize and provide access to web documents using a faceted pre-coordinate indexing system based an the Deep Structure Indexing System (DSIS) derived from POPSI (Postulate based Permuted Subject Indexing) of Bhattacharyya, and the facet analysis and chain indexing System of Ranganathan. A prototype software system has been designed to create a database of records specifying Web documents according to the Dublin Core and input a faceted subject heading according to DSIS. Synonymous terms are added to the standard terms in the heading using appropriate symbols. Once the data are entered along with a description and URL of the Web document, the record is stored in the system. More than one faceted subject heading can be assigned to a record depending an the content of the original document. The system stores the surrogates and keeps the faceted subject headings separately after establishing a link. Search is carried out an index entries derived from the faceted subject heading using chain indexing technique. If a single term is input, the system searches for its presence in the faceted subject headings and displays the subject headings in a sorted sequence reflecting an organizing sequence. If the number of retrieved headings is too large (running into more than a page) then the user has the option of entering another search term to be searched in combination. The system searches subject headings already retrieved and look for those containing the second term. The retrieved faceted subject headings can be displayed and browsed. When the relevant subject heading is selected the system displays the records with their URLs. Using the URL the original document an the web can be accessed. The prototype system developed under Windows NT environment using ASP and web server is under rigorous testing. The database and indexes management routines need further development.
  6. Lambert, N.: Of thesauri and computers : reflections on the need for thesauri (1995) 0.01
    0.010577051 = product of:
      0.042308204 = sum of:
        0.026225796 = weight(_text_:data in 3734) [ClassicSimilarity], result of:
          0.026225796 = score(doc=3734,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.2794884 = fieldWeight in 3734, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0625 = fieldNorm(doc=3734)
        0.01608241 = product of:
          0.03216482 = sum of:
            0.03216482 = weight(_text_:22 in 3734) [ClassicSimilarity], result of:
              0.03216482 = score(doc=3734,freq=2.0), product of:
                0.103918076 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.029675366 = queryNorm
                0.30952093 = fieldWeight in 3734, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3734)
          0.5 = coord(1/2)
      0.25 = coord(2/8)
    
    Abstract
    Most indexed databases now include their thesauri and/or coding in their bibliographic files, searchable at the databases' online connect rates. Assesses the searchability of these on the different hosts. Thesauri and classifications are also available as diskette or CD-ROM products. Describes a number of these, highlighting the diskette thesaurus from IFI/Plenum Data for its flexible databases, the CLAIMS Uniterm and Comprehensive indexes to US chemical patents
    Source
    Searcher. 3(1995) no.8, S.18-22
  7. Milstead, J.L.: Thesauri in a full-text world (1998) 0.01
    0.006610657 = product of:
      0.026442628 = sum of:
        0.016391123 = weight(_text_:data in 2337) [ClassicSimilarity], result of:
          0.016391123 = score(doc=2337,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.17468026 = fieldWeight in 2337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2337)
        0.010051507 = product of:
          0.020103013 = sum of:
            0.020103013 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
              0.020103013 = score(doc=2337,freq=2.0), product of:
                0.103918076 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.029675366 = queryNorm
                0.19345059 = fieldWeight in 2337, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2337)
          0.5 = coord(1/2)
      0.25 = coord(2/8)
    
    Date
    22. 9.1997 19:16:05
    Source
    Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
  8. Bates, M.J.: How to use controlled vocabularies more effectively in online searching (1989) 0.01
    0.0064140414 = product of:
      0.05131233 = sum of:
        0.05131233 = weight(_text_:data in 2883) [ClassicSimilarity], result of:
          0.05131233 = score(doc=2883,freq=10.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.5468357 = fieldWeight in 2883, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2883)
      0.125 = coord(1/8)
    
    Abstract
    Optimal retrieval in on-line searching can be achieved through combined use of both natural language and controlled vocabularies. However, there is a large variety of types of controlled vocabulary in data bases and often more than one in a single data base. Optimal use of these vocabularies requires understanding what types of languages are involved, and taking advantage of the particular mix of vocabularies in a given data base. Examples 4 major types of indexing and classification used in data bases and puts these 4 in the context of 3 other approaches to subject access. Discusses how to evaluate a new data base for various forms of subject access.
  9. Bates, M.J.: How to use controlled vocabularies more effectively in online searching (1989) 0.01
    0.0064140414 = product of:
      0.05131233 = sum of:
        0.05131233 = weight(_text_:data in 207) [ClassicSimilarity], result of:
          0.05131233 = score(doc=207,freq=10.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.5468357 = fieldWeight in 207, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0546875 = fieldNorm(doc=207)
      0.125 = coord(1/8)
    
    Abstract
    Optimal retrieval in on-line searching can be achieved through combined use of both natural language and controlled vocabularies. However, there is a large variety of types of controlled vocabulary in data bases and often more than one in a single data base. Optimal use of these vocabularies requires understanding what types of languages are involved, and taking advantage of the particular mix of vocabularies in a given data base. Examples 4 major types of indexing and classification used in data bases and puts these 4 in the context of 3 other approaches to subject access. Discusses how to evaluate a new data base for various forms of subject access.
  10. Aluri, R.D.; Kemp, A.; Boll, J.J.: Subject analysis in online catalogs (1991) 0.01
    0.005736893 = product of:
      0.045895144 = sum of:
        0.045895144 = weight(_text_:data in 863) [ClassicSimilarity], result of:
          0.045895144 = score(doc=863,freq=8.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.48910472 = fieldWeight in 863, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0546875 = fieldNorm(doc=863)
      0.125 = coord(1/8)
    
    LCSH
    Subject cataloguing / Data processing
    Machine / readable bibliographic data
    Subject
    Subject cataloguing / Data processing
    Machine / readable bibliographic data
  11. Schabas, A.H.: ¬A comparative evaluation of the retrieval effectiveness of titles, Library of Congress Subject Headings and PRECIS strings for computer searching of UK MARC data (1979) 0.00
    0.0049173366 = product of:
      0.039338693 = sum of:
        0.039338693 = weight(_text_:data in 5277) [ClassicSimilarity], result of:
          0.039338693 = score(doc=5277,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.4192326 = fieldWeight in 5277, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.09375 = fieldNorm(doc=5277)
      0.125 = coord(1/8)
    
  12. Papadakis, I.; Stefanidakis, M.; Tzali, A.: Visualizing OPAC subject headings (2008) 0.00
    0.0043649687 = product of:
      0.03491975 = sum of:
        0.03491975 = weight(_text_:web in 2549) [ClassicSimilarity], result of:
          0.03491975 = score(doc=2549,freq=8.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.36057037 = fieldWeight in 2549, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2549)
      0.125 = coord(1/8)
    
    Abstract
    Purpose - This paper aims at providing a robust, user-friendly and efficient navigation procedure in an online library catalog that is based on semantic information encapsulated within subject headings. Design/methodology/approach - The paper describes an interactive navigation procedure inside an online library catalog based on semantic information. The proposed approach is presented through a web-based, prototype application following the most recent trends of the semantic web such as AJAX technology and the web ontology language - OWL for encoding semantics. Findings - According to the proposed method, a GUI interface exposes the hierarchy of the subject headings employed within an OPAC, as well as all stated relations between such headings, as links that the user can follow, effectively traversing the ontology and formulating at the same time the actual query to the underlying OPAC. This act of interactive navigation through the library's assets aids searchers in accurately formulating their queries, by offering broader or narrower concepts for selection or indicating alternative or related concepts they might be initially unaware of. The augmented exposition of inter-relations between concepts provides multiple paths for information retrieval and enables searchers to fulfill their information needs in a faster, more efficient and intuitive manner. Practical implications - The paper includes implications for the development of modern, semantic web applications focused on the library domain. The novel approach of visualizing subject headings could be further extended to visualize a number of other conceptualizations of the library domain. Originality/value - This paper fulfils an identified need to take advantage of the "hidden knowledge" existing within the library domain but, for a number of reasons, is never exposed to the library users.
  13. Blair, D.C.: Language and representation in information retrieval (1991) 0.00
    0.0036651667 = product of:
      0.029321333 = sum of:
        0.029321333 = weight(_text_:data in 1545) [ClassicSimilarity], result of:
          0.029321333 = score(doc=1545,freq=10.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.31247756 = fieldWeight in 1545, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.03125 = fieldNorm(doc=1545)
      0.125 = coord(1/8)
    
    Abstract
    Information or Document Retrieval is the subject of this book. It is not an introductory book, although it is self-contained in the sense that it is not necessary to have a background in the theory or practice of Information Retrieval in order to understand its arguments. The book presents, as clearly as possible, one particular perspective on Information Retrieval, and attempts to say that certain aspects of the theory or practice of the management of documents are more important than others. The majority of Information Retrieval research has been aimed at the more experimentally tractable small-scale systems, and although much of that work has added greatly to our understanding of Information Retrieval it is becoming increasingly apparent that retrieval systems with large data bases of documents are a fundamentally different genre of systems than small-scale systems. If this is so, which is the thesis of this book, then we must now study large information retrieval systems with the same rigor and intensity that we once studied small-scale systems. Hegel observed that the quantitative growth of any system caused qualitative changes to take place in its structure and processes.
    Classification
    ST 271 Informatik / Monographien / Software und -entwicklung / Datenbanken, Datenbanksysteme, Data base management, Informationssysteme / Einzelne Datenbanksprachen und Datenbanksysteme
    ST 270 Informatik / Monographien / Software und -entwicklung / Datenbanken, Datenbanksysteme, Data base management, Informationssysteme
    RVK
    ST 271 Informatik / Monographien / Software und -entwicklung / Datenbanken, Datenbanksysteme, Data base management, Informationssysteme / Einzelne Datenbanksprachen und Datenbanksysteme
    ST 270 Informatik / Monographien / Software und -entwicklung / Datenbanken, Datenbanksysteme, Data base management, Informationssysteme
  14. O'Neill, E.T.; Chan, L.M.: FAST - a new approach to controlled subject access (2008) 0.00
    0.0034770824 = product of:
      0.027816659 = sum of:
        0.027816659 = weight(_text_:data in 2181) [ClassicSimilarity], result of:
          0.027816659 = score(doc=2181,freq=4.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.29644224 = fieldWeight in 2181, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.046875 = fieldNorm(doc=2181)
      0.125 = coord(1/8)
    
    Abstract
    Recent trends, driven to a large extent by the rapid proliferation of digital resources, are forcing changes in bibliographic control to make it easier to use, understand, and apply subject data. Subject headings are no exception. The enormous volume and rapid growth of digital libraries and repositories and the emergence of numerous metadata schemes have spurred a reexamination of the way subject data are to be provided for such resources efficiently and effectively. To address this need, OCLC in cooperation with the Library of Congress, has taken a new approach, called FAST (Faceted Application of Subject Terminology). FAST headings are based on the existing vocabulary in Library of Congress Subject Headings (LCSH), but are applied with a simpler syntax than required by Library of Congress application policies. Adapting the LCSH vocabulary in a simplified faceted syntax retains the rich vocabulary of LCSH while making it easier to understand, control, apply, and use.
  15. Seeman, D.; Chan, T.; Dykes, K.: Implementation and maintenance of FAST as linked data in a digital collections platform at University of Victoria Libraries (2023) 0.00
    0.0034770824 = product of:
      0.027816659 = sum of:
        0.027816659 = weight(_text_:data in 1165) [ClassicSimilarity], result of:
          0.027816659 = score(doc=1165,freq=4.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.29644224 = fieldWeight in 1165, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.046875 = fieldNorm(doc=1165)
      0.125 = coord(1/8)
    
    Abstract
    University of Victoria Libraries has implemented faceted vocabularies, particularly FAST, in its digital collections platform (Vault). The process involved migrating a variety of standardized (pre-coordinated Library of Congress subject headings) and non-standardized metadata to conform to a URI-centric metadata application profile. The authors argue that faceted vocabularies and FAST have helped to create a robust and intuitive user navigation in the platform and allowed for an efficient and straightforward metadata creation process. Maintaining FAST as linked data within Vault has required putting in place some technical processes to keep URIs and textual labels up to date and solutions (FAST Updater) have been locally developed.
  16. Svenonius, E.: Design of controlled vocabularies in the context of emerging technologies (1988) 0.00
    0.0032782245 = product of:
      0.026225796 = sum of:
        0.026225796 = weight(_text_:data in 762) [ClassicSimilarity], result of:
          0.026225796 = score(doc=762,freq=2.0), product of:
            0.093835 = queryWeight, product of:
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.029675366 = queryNorm
            0.2794884 = fieldWeight in 762, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1620505 = idf(docFreq=5088, maxDocs=44218)
              0.0625 = fieldNorm(doc=762)
      0.125 = coord(1/8)
    
    Abstract
    Delineates on the changing role of vocabulary control devices such as classification, subject headings, and thesaurus. Identifies the basic issue in the design and development of these devices and their role in the changing information technology. The paper identifies the differentiations needed in the new roles of these devices in data base technology
  17. Walsh, J.: ¬The use of Library of Congress Subject Headings in digital collections (2011) 0.00
    0.0032183053 = product of:
      0.025746442 = sum of:
        0.025746442 = weight(_text_:wide in 4549) [ClassicSimilarity], result of:
          0.025746442 = score(doc=4549,freq=2.0), product of:
            0.13148437 = queryWeight, product of:
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.029675366 = queryNorm
            0.1958137 = fieldWeight in 4549, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4307585 = idf(docFreq=1430, maxDocs=44218)
              0.03125 = fieldNorm(doc=4549)
      0.125 = coord(1/8)
    
    Abstract
    Purpose - This paper attempts to explain the wide dissemination of Library of Congress Subject Headings (LCSH) within digital libraries and presents some of the advantages and disadvantages of using this controlled vocabulary in digital collections. The paper also presents other classifications used in digital collections for subject access and explores ways of improving search functionality in digital collections that employ LCSH. Design/methodology/approach - Unlike traditional libraries that use Library of Congress Classification for organization and retrieval, digital libraries use metadata forms for organization and retrieval. The collections exist in cyberspace of the internet which is known for containing the universe of knowledge. The use of LCSH for information retrieval has been widely criticized for its difficulty of use and its information retrieval effectiveness in online environments. The Library of Congress (LOC) has claimed the headings were not based on comprehensive principles nor ever intended to cover the universe of knowledge. Despite these claims and criticisms, LCSH is the most popular choice for subject access in digital libraries. Findings - The number of digital collections increases every year and LCSH is still the most popular choice of controlled vocabulary for subject access. Of the numerous criticisms, difficulties of use and user unfamiliarity are the greatest disadvantages of using LCSH for subject access. Average users only have a vague notion of what they are looking for when initializing a search. More work is required in automated generation of subject headings and increased usage of LCSH in faceted search retrieval systems. This will provide users with better access to the LCSH used in the back end of information retrieval. Originality/value - The Greek researchers who developed the Dissertation DSPace system believe this type of module will eventually replace the traditional keyword-based indexing back ends employed by many information retrieval modules within current digital library systems. The system offers the type of access and interactivity that will acquaint users with how LCSH looks and is used. Faceted search and automated pattern matching using an ontology based on LCSH have the best promise of overcoming the disadvantages that have always plagued the LOC-controlled vocabulary. These retrieval techniques give LCSH an opportunity to finally achieve the optimal precision and recall it has so far failed to deliver.
  18. Dean, R.J.: FAST: development of simplified headings for metadata (2004) 0.00
    0.002618981 = product of:
      0.020951848 = sum of:
        0.020951848 = weight(_text_:web in 5682) [ClassicSimilarity], result of:
          0.020951848 = score(doc=5682,freq=2.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.21634221 = fieldWeight in 5682, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=5682)
      0.125 = coord(1/8)
    
    Abstract
    The Library of Congress Subject Headings schema (LCSH) is the most commonly used and widely accepted subject vocabulary for general application. It is the de facto universal controlled vocabulary and has been a model for developing subject heading systems by many countries. However, LCSH's complex syntax and rules for constructing headings restrict its application by requiring highly skilled personnel and limit the effectiveness of automated authority control. Recent trends, driven to a large extent by the rapid growth of the Web, are forcing changes in bibliographic control systems to make them easier to use, understand, and apply, and subject headings are no exception. The purpose of adapting the LCSH with a simplified syntax to create FAST (Faceted Application of Subject Terminology) headings is to retain the very rich vocabulary of LCSH while making the schema easier to understand, control, apply, and use. The schema maintains compatibility with LCSH--any valid Library of Congress subject heading can be converted to FAST headings.
  19. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2012) 0.00
    0.002618981 = product of:
      0.020951848 = sum of:
        0.020951848 = weight(_text_:web in 310) [ClassicSimilarity], result of:
          0.020951848 = score(doc=310,freq=2.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.21634221 = fieldWeight in 310, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=310)
      0.125 = coord(1/8)
    
    Source
    Beyond libraries - subject metadata in the digital environment and semantic web. IFLA Satellite Post-Conference, 17-18 August 2012, Tallinn
  20. Lucarelli, A.; Viti, E.: Florence-Washington round trip : ways and intersections between semantic indexing tools in different languages (2015) 0.00
    0.002618981 = product of:
      0.020951848 = sum of:
        0.020951848 = weight(_text_:web in 1886) [ClassicSimilarity], result of:
          0.020951848 = score(doc=1886,freq=2.0), product of:
            0.096845865 = queryWeight, product of:
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.029675366 = queryNorm
            0.21634221 = fieldWeight in 1886, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.2635105 = idf(docFreq=4597, maxDocs=44218)
              0.046875 = fieldNorm(doc=1886)
      0.125 = coord(1/8)
    
    Abstract
    This article presents an Italian experience of developing streamlined semantic interoperability between the Italian Thesaurus of Nuovo soggettario and the Library of Congress Subject Headings (LCSH). This ongoing project must take into consideration the differences between the two indexing tools, while the criteria on which the resulting actions are based are being clarified continually. Reciprocal interoperability, thanks to the Simple Knowledge Organization System format, enables us to create links with English language subject headings. The National Central Library of Florence is studying methods of automatically catching LCSH equivalents and the question of how to take advantage of both Semantic Web outputs and the multilingual dataset of Wikidata.

Languages

  • e 30
  • d 4

Types

  • a 28
  • m 5
  • d 1
  • el 1
  • s 1
  • More… Less…

Classifications