Search (30 results, page 1 of 2)

  • × theme_ss:"Volltextretrieval"
  1. Sievert, M.E.; McKinin, E.J.: Why full-text misses some relevant documents : an analysis of documents not retrieved by CCML or MEDIS (1989) 0.01
    0.008804738 = product of:
      0.030816581 = sum of:
        0.014989593 = weight(_text_:with in 3564) [ClassicSimilarity], result of:
          0.014989593 = score(doc=3564,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.15974675 = fieldWeight in 3564, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.046875 = fieldNorm(doc=3564)
        0.015826989 = product of:
          0.031653978 = sum of:
            0.031653978 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
              0.031653978 = score(doc=3564,freq=2.0), product of:
                0.13635688 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.038938753 = queryNorm
                0.23214069 = fieldWeight in 3564, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3564)
          0.5 = coord(1/2)
      0.2857143 = coord(2/7)
    
    Abstract
    Searches conducted as part of the MEDLINE/Full-Text Research Project revealed that the full-text data bases of clinical medical journal articles (CCML (Comprehensive Core Medical Library) from BRS Information Technologies, and MEDIS from Mead Data Central) did not retrieve all the relevant citations. An analysis of the data indicated that 204 relevant citations were retrieved only by MEDLINE. A comparison of the strategies used on the full-text data bases with the text of the articles of these 204 citations revealed that 2 reasons contributed to these failure. The searcher often constructed a restrictive strategy which resulted in the loss of relevant documents; and as in other kinds of retrieval, the problems of natural language caused the loss of relevant documents.
    Date
    9. 1.1996 10:22:31
  2. Frederick, S.C.: SIRE: full-text retrieval with augmented search (1985) 0.01
    0.005710321 = product of:
      0.039972246 = sum of:
        0.039972246 = weight(_text_:with in 482) [ClassicSimilarity], result of:
          0.039972246 = score(doc=482,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.42599133 = fieldWeight in 482, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.125 = fieldNorm(doc=482)
      0.14285715 = coord(1/7)
    
  3. Dubois, C.P.R.: Free text vs. controlled vocabulary; a reassessment (1987) 0.00
    0.004037807 = product of:
      0.02826465 = sum of:
        0.02826465 = weight(_text_:with in 2048) [ClassicSimilarity], result of:
          0.02826465 = score(doc=2048,freq=4.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.30122137 = fieldWeight in 2048, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=2048)
      0.14285715 = coord(1/7)
    
    Abstract
    Free text and controlled vocabulary searching can no longer be viewed as antagonistic techniques in information retrieval since they both display advantages and weaknesses dependent on a fairly wide range of context, with the option to use both increasingly favoured. An attempt is made to present a list of features associated with the two techniques and to suggest a methodology to assist in deciding on the optimal retrieval technique for a particular purpose. The relevance of the techniques in expert systems and full text contexts is also discussed. Finally, recommendations for further research are suggested, concentrating on survey techniques in real-life retrieval situations
  4. Ashford, J.H.: Full text retrieval in document management : a review (1995) 0.00
    0.004037807 = product of:
      0.02826465 = sum of:
        0.02826465 = weight(_text_:with in 2054) [ClassicSimilarity], result of:
          0.02826465 = score(doc=2054,freq=4.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.30122137 = fieldWeight in 2054, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=2054)
      0.14285715 = coord(1/7)
    
    Abstract
    Full text management which applied to document management tends to be centred on text storage and retrieval. Recent developments are concerned with integration with relational database management system products to deliver document management services offering both the flexibility of text retrieval and the ability to support process based funnctions. There has been a move towards client server architectures, more user friendly user interfaces and more flexible and easier to understand retrieval. Advocates caution in choosing tasks for full text methods. Identifies document management functions for which the combined use of database management systems or special purpose tools should be considered
  5. Wenzel, F.: Semantische Eingrenzung im Freitext-Retrieval auf der Basis morphologischer Segmentierungen (1980) 0.00
    0.003568951 = product of:
      0.024982655 = sum of:
        0.024982655 = weight(_text_:with in 2037) [ClassicSimilarity], result of:
          0.024982655 = score(doc=2037,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.2662446 = fieldWeight in 2037, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.078125 = fieldNorm(doc=2037)
      0.14285715 = coord(1/7)
    
    Abstract
    The basic problem in freetext retrieval is that the retrieval language is not properly adapted to that of the author. Morphological segmentation, where words with the same root are grouped together in the inverted file, is a good eliminator of noise and information loss, providing high recall but low precision
  6. Sievert, M.C.: Full-text information retrieval : introduction (1996) 0.00
    0.003568951 = product of:
      0.024982655 = sum of:
        0.024982655 = weight(_text_:with in 4719) [ClassicSimilarity], result of:
          0.024982655 = score(doc=4719,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.2662446 = fieldWeight in 4719, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.078125 = fieldNorm(doc=4719)
      0.14285715 = coord(1/7)
    
    Abstract
    Introduces a special section devoted to full text information retrieval. Gives an overview of full text databases and research into them. There is a lack of a single definition of full text. Articles in the library related literature about full text have appeared with increasing frequnecy
  7. Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.00
    0.0030283553 = product of:
      0.021198487 = sum of:
        0.021198487 = weight(_text_:with in 4071) [ClassicSimilarity], result of:
          0.021198487 = score(doc=4071,freq=4.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.22591603 = fieldWeight in 4071, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.046875 = fieldNorm(doc=4071)
      0.14285715 = coord(1/7)
    
    Abstract
    Methods and tools for finding documents relevant to a user's needs in a document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static copora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with misspelled words or text that has been degraded by OCR techniques. In this article, we present experimentation results for the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English. TELLTALE uses several techniques based on n-grams (n character sequences of text). With these results we show that the dynamic linkage mechanisms in TELLTALE are tolerant of garbles in up to 30% of the characters in the body of the texts
  8. Blake, P.: Leading edge : Verity keeps it in the family (1997) 0.00
    0.0030283553 = product of:
      0.021198487 = sum of:
        0.021198487 = weight(_text_:with in 7398) [ClassicSimilarity], result of:
          0.021198487 = score(doc=7398,freq=4.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.22591603 = fieldWeight in 7398, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.046875 = fieldNorm(doc=7398)
      0.14285715 = coord(1/7)
    
    Abstract
    Verity Search 97 software will index and search e-mail, attachments, folders and documents on local and network disk drives. The Internet may be searched via the same front end and changes to particular documents or pages may be monitored. Documents may be viewed in their native formats including ASCII, HTML, PDF and popular word processors, with highlighted search terms. Agents may be launched into the Internet to retrieve information according to a user-specified profile. The software can index about 700 MB an hour. Describes the search technology which includes fuzzy logic and natural language. The Web version of Personal Search 97 works with Netscape Navigator or Microsoft Internet Explorer, while the Exchange version will work regardless of any attachment to an Exchange server. Search 97 Personal improves online time and access time and allows searches to be refined offline
  9. Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993) 0.00
    0.0030146649 = product of:
      0.021102654 = sum of:
        0.021102654 = product of:
          0.042205308 = sum of:
            0.042205308 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
              0.042205308 = score(doc=7701,freq=2.0), product of:
                0.13635688 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.038938753 = queryNorm
                0.30952093 = fieldWeight in 7701, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7701)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Date
    23. 1.1999 19:22:09
  10. Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.00
    0.0030146649 = product of:
      0.021102654 = sum of:
        0.021102654 = product of:
          0.042205308 = sum of:
            0.042205308 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
              0.042205308 = score(doc=4919,freq=2.0), product of:
                0.13635688 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.038938753 = queryNorm
                0.30952093 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Date
    22. 7.2000 17:48:06
  11. Zillmann, H.: OSIRIS und eLib : Information Retrieval und Search Engines in Full-text Databases (2001) 0.00
    0.0030146649 = product of:
      0.021102654 = sum of:
        0.021102654 = product of:
          0.042205308 = sum of:
            0.042205308 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
              0.042205308 = score(doc=5937,freq=2.0), product of:
                0.13635688 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.038938753 = queryNorm
                0.30952093 = fieldWeight in 5937, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5937)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Date
    14. 6.2001 12:22:31
  12. Dambeck, H.; Engler, T.: Gesucht und gefunden : Neun Volltext-Suchprogramme für den Desktop (2002) 0.00
    0.0030146649 = product of:
      0.021102654 = sum of:
        0.021102654 = product of:
          0.042205308 = sum of:
            0.042205308 = weight(_text_:22 in 1169) [ClassicSimilarity], result of:
              0.042205308 = score(doc=1169,freq=2.0), product of:
                0.13635688 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.038938753 = queryNorm
                0.30952093 = fieldWeight in 1169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1169)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    c't. 2002, H.22, S.190-197
  13. Perez, E.: Text enhancement : Controlled vocabulary vs. free text (1982) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 1355) [ClassicSimilarity], result of:
          0.019986123 = score(doc=1355,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 1355, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=1355)
      0.14285715 = coord(1/7)
    
    Abstract
    Full-text databases are becoming increasingly practical for publication and general use, since most publishers now use some form of computerized editorial and publication system. "Free text" retrieval from full-text databases is possible with many existing systems, but some information professionals are doubtful about the quality of information retrieval using this type of access. The "augmentation" or "enrichment" needed for practical operation of these systems is discussed
  14. Cochrane, P.A.: Subject access - free text and controlled : the case of Papua New Guinea (1985) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 1459) [ClassicSimilarity], result of:
          0.019986123 = score(doc=1459,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 1459, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
      0.14285715 = coord(1/7)
    
    Abstract
    The online catalogue can provide the user with efficient and effective access through a variety of access points. New interests in subject heading is indicated. Keyword access and free text searching are considered alternatice methods. An investigation is suggested into the symbiotic relationship between classification and subject heading
  15. Kristensen, J.: Expanding end-users' query statements for free text searching with a search-aid thesaurus (1993) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 6621) [ClassicSimilarity], result of:
          0.019986123 = score(doc=6621,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 6621, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=6621)
      0.14285715 = coord(1/7)
    
  16. Miller, C.: Standardization of full-text files on Dialog : IAC goes first (1995) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 2776) [ClassicSimilarity], result of:
          0.019986123 = score(doc=2776,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 2776, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=2776)
      0.14285715 = coord(1/7)
    
    Abstract
    Knight Rider Information Inc./Dialog is creating standards for full text journal databases, directories and full text reports with the aim of making multiple file searching more efficient. The 1st reloaded file adhering to the new standard was loaded in Feb. 95. This was prompted by the merger of IAC Index and ASAP files. Explains how the reload was carried out and the implications for searchers
  17. Turtle, H.; Flood, J.: Query evaluation : strategies and optimizations (1995) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 4087) [ClassicSimilarity], result of:
          0.019986123 = score(doc=4087,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 4087, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=4087)
      0.14285715 = coord(1/7)
    
    Abstract
    Discusses the 2 major query evaluation strategies used in large text retrieval systems and analyzes the performance of these strategies. Discusses several optimization techniques that can be used to reduce evaluation costs and present simulation results to compare the performance of these optimization techniques when evaluating natural language queries with a collection of full text legal materials
  18. Paijmans, H.: Gravity wells of meaning : detecting information rich passages in scientific texts (1997) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 7444) [ClassicSimilarity], result of:
          0.019986123 = score(doc=7444,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 7444, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=7444)
      0.14285715 = coord(1/7)
    
    Abstract
    Presents research in which 4 term weigthing schemes were used to detect information rich passages in texts and the results compared. Demonstrates that word categories and frequency derived weights have a close correlation but that weighting according to the first mention theory or the cue method shows no correlation with frequency based weights
  19. Witt, M.: Au sujet des mots-clés (1997) 0.00
    0.0028551605 = product of:
      0.019986123 = sum of:
        0.019986123 = weight(_text_:with in 1666) [ClassicSimilarity], result of:
          0.019986123 = score(doc=1666,freq=2.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.21299566 = fieldWeight in 1666, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0625 = fieldNorm(doc=1666)
      0.14285715 = coord(1/7)
    
    Abstract
    Information research using keywords is largely ignored by readers but in medium sized libraries such as that of the City of Science and Industry (Paris), with a multimedia collection of some 200.000 documents, it is the most effective tool. The reader, however, needs guidance in using this technique, and in refining searches to reduce noise. Examples are given demonstrating how keyword searches can identify non-indexed material, documents in other languages, and collective authors
  20. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.00
    0.0025236295 = product of:
      0.017665405 = sum of:
        0.017665405 = weight(_text_:with in 4300) [ClassicSimilarity], result of:
          0.017665405 = score(doc=4300,freq=4.0), product of:
            0.09383348 = queryWeight, product of:
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.038938753 = queryNorm
            0.18826336 = fieldWeight in 4300, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.409771 = idf(docFreq=10797, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
      0.14285715 = coord(1/7)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.