Search (57 results, page 1 of 3)

  • × theme_ss:"Volltextretrieval"
  1. Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993) 0.03
    0.02532377 = product of:
      0.037985653 = sum of:
        0.013304878 = weight(_text_:to in 7701) [ClassicSimilarity], result of:
          0.013304878 = score(doc=7701,freq=2.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.16069452 = fieldWeight in 7701, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=7701)
        0.024680775 = product of:
          0.04936155 = sum of:
            0.04936155 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
              0.04936155 = score(doc=7701,freq=2.0), product of:
                0.15947726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045541126 = queryNorm
                0.30952093 = fieldWeight in 7701, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7701)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Describes SIFT (Search in Free Text) an information retrieval system originally developed for administering governmental documents in Norway but which is now being applied alsewhere. SIFT handles structured information well. A library system, SIFT-BIBL, is now available. SIFT's retrieval engine and search facilities are powerful. Its user interface is limited but being imporved. An application programmer interface has been released which will allow programmers to develop their own interface. A Windows-based- client-server version is now being beta tested
    Date
    23. 1.1999 19:22:09
  2. Sievert, M.E.; McKinin, E.J.: Why full-text misses some relevant documents : an analysis of documents not retrieved by CCML or MEDIS (1989) 0.02
    0.018992826 = product of:
      0.02848924 = sum of:
        0.0099786585 = weight(_text_:to in 3564) [ClassicSimilarity], result of:
          0.0099786585 = score(doc=3564,freq=2.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.12052089 = fieldWeight in 3564, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.046875 = fieldNorm(doc=3564)
        0.018510582 = product of:
          0.037021164 = sum of:
            0.037021164 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
              0.037021164 = score(doc=3564,freq=2.0), product of:
                0.15947726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045541126 = queryNorm
                0.23214069 = fieldWeight in 3564, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3564)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Searches conducted as part of the MEDLINE/Full-Text Research Project revealed that the full-text data bases of clinical medical journal articles (CCML (Comprehensive Core Medical Library) from BRS Information Technologies, and MEDIS from Mead Data Central) did not retrieve all the relevant citations. An analysis of the data indicated that 204 relevant citations were retrieved only by MEDLINE. A comparison of the strategies used on the full-text data bases with the text of the articles of these 204 citations revealed that 2 reasons contributed to these failure. The searcher often constructed a restrictive strategy which resulted in the loss of relevant documents; and as in other kinds of retrieval, the problems of natural language caused the loss of relevant documents.
    Date
    9. 1.1996 10:22:31
  3. Ashford, J.H.: Full text retrieval in document management : a review (1995) 0.01
    0.009916872 = product of:
      0.029750613 = sum of:
        0.029750613 = weight(_text_:to in 2054) [ClassicSimilarity], result of:
          0.029750613 = score(doc=2054,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.3593239 = fieldWeight in 2054, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=2054)
      0.33333334 = coord(1/3)
    
    Abstract
    Full text management which applied to document management tends to be centred on text storage and retrieval. Recent developments are concerned with integration with relational database management system products to deliver document management services offering both the flexibility of text retrieval and the ability to support process based funnctions. There has been a move towards client server architectures, more user friendly user interfaces and more flexible and easier to understand retrieval. Advocates caution in choosing tasks for full text methods. Identifies document management functions for which the combined use of database management systems or special purpose tools should be considered
  4. Pritchard-Schoch, T.: Comparing natural language retrieval : Win & Freestyle (1995) 0.01
    0.009916872 = product of:
      0.029750613 = sum of:
        0.029750613 = weight(_text_:to in 2546) [ClassicSimilarity], result of:
          0.029750613 = score(doc=2546,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.3593239 = fieldWeight in 2546, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=2546)
      0.33333334 = coord(1/3)
    
    Abstract
    Reports on a comparison of 2 natural language interfaces to full text legal databases: WIN for access to WESTLAW databases and FREESTYLE for access to the LEXIS database. 30 legal issues in natural langugae queries were presented to identical libraries in both systems. The top 20 ranked documents from each search were analyzed and reviewed for relevance to the legal issue
  5. Dubois, C.P.R.: Free text vs. controlled vocabulary; a reassessment (1987) 0.01
    0.008869919 = product of:
      0.026609756 = sum of:
        0.026609756 = weight(_text_:to in 2048) [ClassicSimilarity], result of:
          0.026609756 = score(doc=2048,freq=8.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.32138905 = fieldWeight in 2048, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=2048)
      0.33333334 = coord(1/3)
    
    Abstract
    Free text and controlled vocabulary searching can no longer be viewed as antagonistic techniques in information retrieval since they both display advantages and weaknesses dependent on a fairly wide range of context, with the option to use both increasingly favoured. An attempt is made to present a list of features associated with the two techniques and to suggest a methodology to assist in deciding on the optimal retrieval technique for a particular purpose. The relevance of the techniques in expert systems and full text contexts is also discussed. Finally, recommendations for further research are suggested, concentrating on survey techniques in real-life retrieval situations
  6. White, F.: ISYS 3.01 : a review (1993) 0.01
    0.008869919 = product of:
      0.026609756 = sum of:
        0.026609756 = weight(_text_:to in 2052) [ClassicSimilarity], result of:
          0.026609756 = score(doc=2052,freq=8.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.32138905 = fieldWeight in 2052, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=2052)
      0.33333334 = coord(1/3)
    
    Abstract
    Reviews ISYS, a full text retrieval system from Odyssey developments. ISYS supports a wide variety of wordprocessor, text and spreadsheet formats. A programming interface is included for developers to integrate ISYS into their own applications. Describes how to use ISYS to create a database index for a set of documents and then query that index to locate, retrieve and display information from the indexed documents. Discusses configuration options, precedence of operators, displaying and manipulating search results, the advanced menu, ease of learning and use, documentation and library applications
  7. Magennis, M.: Expert rule-based query expansion (1995) 0.01
    0.008677262 = product of:
      0.026031785 = sum of:
        0.026031785 = weight(_text_:to in 5181) [ClassicSimilarity], result of:
          0.026031785 = score(doc=5181,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.3144084 = fieldWeight in 5181, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5181)
      0.33333334 = coord(1/3)
    
    Abstract
    Examines how, for term based free text retrieval, Interactive Query Expansion (IQE) provides better retrieval performance tahn Automatic Query Expansion (AQE) but the performance of IQE depends on the strategy employed by the user to select expansion terms. The aim is to build an expert query expansion system using term selection rules based on expert users' strategies. It is expected that such a system will achieve better performance for novice or inexperienced users that either AQE or IQE. The procedure is to discover expert IQE users' term selection strategies through observation and interrogation, to construct a rule based query expansion (RQE) system based on these and to compare the resulting retrieval performance with that of comparable AQE and IQE systems
  8. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998) 0.01
    0.008677262 = product of:
      0.026031785 = sum of:
        0.026031785 = weight(_text_:to in 1682) [ClassicSimilarity], result of:
          0.026031785 = score(doc=1682,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.3144084 = fieldWeight in 1682, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1682)
      0.33333334 = coord(1/3)
    
    Abstract
    At Rotterdam Ports Authority in the Netherlands the Habour information database includes a press cuttings service and various online databases. To enable research staff to have direct access to information the POINT (Point Information Net) was begun in 1993. Using Verity software POINT provides simultaneously full text searching on a range of databases. The software uses current Web indexing technqiues to overcome the problems of excessive recall and low precision. A key element is the system's ability to recognise word combinations
  9. Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.01
    0.0082269255 = product of:
      0.024680775 = sum of:
        0.024680775 = product of:
          0.04936155 = sum of:
            0.04936155 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
              0.04936155 = score(doc=4919,freq=2.0), product of:
                0.15947726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045541126 = queryNorm
                0.30952093 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    22. 7.2000 17:48:06
  10. Zillmann, H.: OSIRIS und eLib : Information Retrieval und Search Engines in Full-text Databases (2001) 0.01
    0.0082269255 = product of:
      0.024680775 = sum of:
        0.024680775 = product of:
          0.04936155 = sum of:
            0.04936155 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
              0.04936155 = score(doc=5937,freq=2.0), product of:
                0.15947726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045541126 = queryNorm
                0.30952093 = fieldWeight in 5937, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5937)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    14. 6.2001 12:22:31
  11. Dambeck, H.; Engler, T.: Gesucht und gefunden : Neun Volltext-Suchprogramme für den Desktop (2002) 0.01
    0.0082269255 = product of:
      0.024680775 = sum of:
        0.024680775 = product of:
          0.04936155 = sum of:
            0.04936155 = weight(_text_:22 in 1169) [ClassicSimilarity], result of:
              0.04936155 = score(doc=1169,freq=2.0), product of:
                0.15947726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.045541126 = queryNorm
                0.30952093 = fieldWeight in 1169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1169)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    c't. 2002, H.22, S.190-197
  12. Nahl, D.; Tenopir, C.: Affective and cognitive searching behavior of novice end-users of a full-text database (1996) 0.01
    0.007839975 = product of:
      0.023519924 = sum of:
        0.023519924 = weight(_text_:to in 4213) [ClassicSimilarity], result of:
          0.023519924 = score(doc=4213,freq=16.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.28407046 = fieldWeight in 4213, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4213)
      0.33333334 = coord(1/3)
    
    Footnote
    Novice end users were given 2 hours of training in searching a full-text magazine database (Magazine ASAP(TM)) on DIALOG. Subjects searched during 3 to 4 sessions in the presence of a trained monitor who prompted them to think aloud throughout the sessions. qualitative analysis of the transcripts and transaction logs yielded empirical information on user variables (purpose, motivation, satisfaction), uses of the database, move types, and every question users asked during the searches. The spontaneous, naturalistic questions were categorized according to affective, cognitive, and sensorimotor speech acts. Results show that most of the searches were performed for the self and were work related. The most common use of the database was to retrieve full-text articles online and to download and print them out rather than read them on screen. The majority of searches were judged satisfactory. Innovative uses included browsing for background information and obtaining contextualized sentences for language teaching. Searchers made twice as many moves to limit sets as moves to expand sets. Affective questions outnumbered cognitive and sensorimotor questions by two to one. This preponderance of affective micro-information needs during searching might be addressed by new system functions
  13. Full-text retrieval (1996) 0.01
    0.0077611795 = product of:
      0.023283537 = sum of:
        0.023283537 = weight(_text_:to in 3320) [ClassicSimilarity], result of:
          0.023283537 = score(doc=3320,freq=2.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.28121543 = fieldWeight in 3320, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.109375 = fieldNorm(doc=3320)
      0.33333334 = coord(1/3)
    
    Abstract
    A special section devoted to full text information retrieval
  14. Bernard, M.: Modelling the efficient access to full-text information (1996) 0.01
    0.0077611795 = product of:
      0.023283537 = sum of:
        0.023283537 = weight(_text_:to in 5610) [ClassicSimilarity], result of:
          0.023283537 = score(doc=5610,freq=8.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.28121543 = fieldWeight in 5610, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5610)
      0.33333334 = coord(1/3)
    
    Abstract
    Describes a research goal set by many offices within the US Department of Energy. Reviews efficient full text searching techniques being developed to better understand and meet this goal. Classical computer human interaction (CHI) approaches provided by commercial information retrieval engines fail to contextualize information in ways that facilitates timely decision making. Discusses the uses of advanced CHI techniques in combination with deductive database technology to augment the weaknesses found in the presentation capabilities of information retrieval engines. Presents various techniques employed in a WWW based prototype system currently under development
  15. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.01
    0.0077611795 = product of:
      0.023283537 = sum of:
        0.023283537 = weight(_text_:to in 1150) [ClassicSimilarity], result of:
          0.023283537 = score(doc=1150,freq=8.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.28121543 = fieldWeight in 1150, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1150)
      0.33333334 = coord(1/3)
    
    Abstract
    This paper presents a probabilistic technique to retrieve passages from texts having a large size or heterogeneous semantic content. The proposed technique is independent on any supporting auxiliary data, such as text structure, topic organization, or pre-defined text segments. A Bayesian framework implements the probabilistic technique. We carried out experiments to compare the probabilistique technique to one based on a text segmentation algorithm. In particular, the probabilistique technique is more effective than, or as effective as the one based on the text segmentation to retrieve small passages. Results show that passage size affects passage retrieval performance. Results do also suggest that text organization and query generality may have an impact on the difference in effectiveness between the two techniques
  16. Basch, R.: Searching full-text : tips and techniques (1992) 0.01
    0.0076815756 = product of:
      0.023044726 = sum of:
        0.023044726 = weight(_text_:to in 5050) [ClassicSimilarity], result of:
          0.023044726 = score(doc=5050,freq=6.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.2783311 = fieldWeight in 5050, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0625 = fieldNorm(doc=5050)
      0.33333334 = coord(1/3)
    
    Abstract
    Gives tips for effective searching of full-text databases: think creatively about vocabulary and use of the OR connector to link synonyms; use truncation to pick up plurals and alternative work endings; anticipate international differences in spellimg, likie 'labor' and 'labour', and in vocabulary, like 'truck' and 'lorry'; avoid the AND operator, preferring proximity connectors like NEAR, WITHIN or PRE; use jargon instead of trying to anticipate all the abstract terminology that a writer might have used; and exploit any indexing available
  17. Blake, P.: Leading edge : Verity keeps it in the family (1997) 0.01
    0.0074376534 = product of:
      0.02231296 = sum of:
        0.02231296 = weight(_text_:to in 7398) [ClassicSimilarity], result of:
          0.02231296 = score(doc=7398,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.26949292 = fieldWeight in 7398, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.046875 = fieldNorm(doc=7398)
      0.33333334 = coord(1/3)
    
    Abstract
    Verity Search 97 software will index and search e-mail, attachments, folders and documents on local and network disk drives. The Internet may be searched via the same front end and changes to particular documents or pages may be monitored. Documents may be viewed in their native formats including ASCII, HTML, PDF and popular word processors, with highlighted search terms. Agents may be launched into the Internet to retrieve information according to a user-specified profile. The software can index about 700 MB an hour. Describes the search technology which includes fuzzy logic and natural language. The Web version of Personal Search 97 works with Netscape Navigator or Microsoft Internet Explorer, while the Exchange version will work regardless of any attachment to an Exchange server. Search 97 Personal improves online time and access time and allows searches to be refined offline
  18. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.01
    0.0074376534 = product of:
      0.02231296 = sum of:
        0.02231296 = weight(_text_:to in 27) [ClassicSimilarity], result of:
          0.02231296 = score(doc=27,freq=10.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.26949292 = fieldWeight in 27, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.046875 = fieldNorm(doc=27)
      0.33333334 = coord(1/3)
    
    Abstract
    Homonymy is known to often cause false drops in free text searching in a full text database. The problem is quite common and difficult to avoid in Finnish, but nobody has examined it before. Reports on a study that examined the frequency of, and solutions to, the homonymy problem, based on searches made in a Finnish full text database containing about 55.000 newspaper articles. The results indicate that homonymy is not a very serious problem in full text searching, with only about 1 search result set out of 4 containing false drops caused by homonymy. Several other reasons for nonrelevance were much more common. However, in some set results there were a considerable number of homonymy errors, so the number seems to be very random. A study was also made into whether homonyms can be disambiguated by syntactic analysis. The result was that 75,2% of homonyms were disambiguated by this method. Verb homonyms were considerably easier to disambiguate than substantives. Although homonymy is not a very big problem it could perhaps easily be eliminated if there was a suitable syntactic analyzer in the IR system
  19. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.01
    0.0073336246 = product of:
      0.022000873 = sum of:
        0.022000873 = weight(_text_:to in 4300) [ClassicSimilarity], result of:
          0.022000873 = score(doc=4300,freq=14.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.2657236 = fieldWeight in 4300, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
      0.33333334 = coord(1/3)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
  20. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017) 0.01
    0.006789617 = product of:
      0.02036885 = sum of:
        0.02036885 = weight(_text_:to in 3868) [ClassicSimilarity], result of:
          0.02036885 = score(doc=3868,freq=12.0), product of:
            0.08279609 = queryWeight, product of:
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.045541126 = queryNorm
            0.24601223 = fieldWeight in 3868, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.818051 = idf(docFreq=19512, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3868)
      0.33333334 = coord(1/3)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
    Content
    Beitrag bei: NASKO 2017: Visualizing Knowledge Organization: Bringing Focus to Abstract Realities. The sixth North American Symposium on Knowledge Organization (NASKO 2017), June 15-16, 2017, in Champaign, IL, USA.

Years

Languages

Types

  • a 55
  • s 2
  • el 1
  • More… Less…