Search (37 results, page 1 of 2)

  • × theme_ss:"Volltextretrieval"
  1. Schmidt, J.: Full-text searching : as seen from a non-bibliographic searcher's point of view (1989) 0.10
    0.09877356 = product of:
      0.14816034 = sum of:
        0.08401519 = weight(_text_:bibliographic in 2876) [ClassicSimilarity], result of:
          0.08401519 = score(doc=2876,freq=4.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.4258017 = fieldWeight in 2876, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2876)
        0.06414515 = product of:
          0.1282903 = sum of:
            0.1282903 = weight(_text_:searching in 2876) [ClassicSimilarity], result of:
              0.1282903 = score(doc=2876,freq=8.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.6257241 = fieldWeight in 2876, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2876)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Examines searching capabilities and search results relating to the same full text data base made available by: a host that offers a command language designed for searching bibliographic data bases and a host that provides search facilities that have been specially designed for full text retrieval. Moreover, the CD-ROM format of an encyclopedia is compared with the equivalent on-line version of the same work, Academic American Encyclopedia. Results reveal that it is easier to search on those systems that offer searching facilities which have been specially designed for full text retrieval.
  2. Tenopir, C.: Full-text retrieval : systems and files (1994) 0.08
    0.079821095 = product of:
      0.119731635 = sum of:
        0.067894526 = weight(_text_:bibliographic in 2424) [ClassicSimilarity], result of:
          0.067894526 = score(doc=2424,freq=2.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.34409973 = fieldWeight in 2424, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0625 = fieldNorm(doc=2424)
        0.05183711 = product of:
          0.10367422 = sum of:
            0.10367422 = weight(_text_:searching in 2424) [ClassicSimilarity], result of:
              0.10367422 = score(doc=2424,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.5056614 = fieldWeight in 2424, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2424)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    State of the art review of the development of full text databases, encompassing: types of commercially available full text databases; online systems for full text databases; CD-ROM databases for full text databases; full text databases on magnetic discs or tapes; creation of full text databases; searching and display requirements for full text searching and software. Concludes that bibliographic information services without full text support solve only half of the retrieval problems
  3. Couvreur, T.R.; Benzel, R.N.; Miller, S.F.; Zeitler, D.N.; Lee, D.L.; Singhal, M.; Shivaratri, N.; Wong, W.Y.P.: ¬An analysis of performance and cost factors in searching large text databases using parallel search systems (1994) 0.07
    0.069843456 = product of:
      0.10476518 = sum of:
        0.05940771 = weight(_text_:bibliographic in 7657) [ClassicSimilarity], result of:
          0.05940771 = score(doc=7657,freq=2.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.30108726 = fieldWeight in 7657, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7657)
        0.04535747 = product of:
          0.09071494 = sum of:
            0.09071494 = weight(_text_:searching in 7657) [ClassicSimilarity], result of:
              0.09071494 = score(doc=7657,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.44245374 = fieldWeight in 7657, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7657)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    The results of modelling the performance of searching large text databases (>10 GBytes) via various parallel hardware architectures and search algorithms are discussed. The performance under load and the cost of each configuration are compared. Strengths, weaknesses, performance sensitivities, and search features supported for each configuration are also addressed. In addition, a common search workload used in the modelling is described. The search workload is derived from a set of searches run against the Chemical Abstracts file of bibliographic and abstract text available on STN International. This common workload is applied to all configurations modelled to provide a common basis of comparison
  4. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017) 0.06
    0.06160603 = product of:
      0.092409045 = sum of:
        0.06001085 = weight(_text_:bibliographic in 3868) [ClassicSimilarity], result of:
          0.06001085 = score(doc=3868,freq=4.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.30414405 = fieldWeight in 3868, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3868)
        0.032398194 = product of:
          0.06479639 = sum of:
            0.06479639 = weight(_text_:searching in 3868) [ClassicSimilarity], result of:
              0.06479639 = score(doc=3868,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.3160384 = fieldWeight in 3868, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3868)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
  5. Kristensen, J.; Järvelin, K.: ¬The effectiveness of a searching thesaurus in free-text searching in a full-text database (1990) 0.03
    0.034558073 = product of:
      0.10367422 = sum of:
        0.10367422 = product of:
          0.20734844 = sum of:
            0.20734844 = weight(_text_:searching in 2043) [ClassicSimilarity], result of:
              0.20734844 = score(doc=2043,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                1.0113229 = fieldWeight in 2043, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.125 = fieldNorm(doc=2043)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  6. Sclafani, F.: Controlled subject heading searching versus keyword searching (1999) 0.03
    0.034558073 = product of:
      0.10367422 = sum of:
        0.10367422 = product of:
          0.20734844 = sum of:
            0.20734844 = weight(_text_:searching in 3790) [ClassicSimilarity], result of:
              0.20734844 = score(doc=3790,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                1.0113229 = fieldWeight in 3790, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.125 = fieldNorm(doc=3790)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  7. Falk, H.: Document file searching (1998) 0.03
    0.026453016 = product of:
      0.07935905 = sum of:
        0.07935905 = product of:
          0.1587181 = sum of:
            0.1587181 = weight(_text_:searching in 2429) [ClassicSimilarity], result of:
              0.1587181 = score(doc=2429,freq=6.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.77413285 = fieldWeight in 2429, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2429)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Considers the importance of generating indexes when creating large document files, to facilitate searching, and evaluates 4 commercial document file index creation and searching software packages: QuickFind; Sonar; ZyIndex; and FastFind
  8. Tenopir, C.: Searching by controlled vocabulary or free text (1987) 0.02
    0.024436247 = product of:
      0.073308736 = sum of:
        0.073308736 = product of:
          0.14661747 = sum of:
            0.14661747 = weight(_text_:searching in 1350) [ClassicSimilarity], result of:
              0.14661747 = score(doc=1350,freq=2.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.7151132 = fieldWeight in 1350, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.125 = fieldNorm(doc=1350)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  9. DiMartino, D.; Zoe, L.R.: End-user full text searching : access or excess? (1996) 0.02
    0.024436247 = product of:
      0.073308736 = sum of:
        0.073308736 = product of:
          0.14661747 = sum of:
            0.14661747 = weight(_text_:searching in 7545) [ClassicSimilarity], result of:
              0.14661747 = score(doc=7545,freq=2.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.7151132 = fieldWeight in 7545, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.125 = fieldNorm(doc=7545)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  10. Quint, B.: Flipping for full-text (1991) 0.02
    0.02263151 = product of:
      0.067894526 = sum of:
        0.067894526 = weight(_text_:bibliographic in 4893) [ClassicSimilarity], result of:
          0.067894526 = score(doc=4893,freq=2.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.34409973 = fieldWeight in 4893, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0625 = fieldNorm(doc=4893)
      0.33333334 = coord(1/3)
    
    Abstract
    Provides tips for searchers of full text online databases and examines the coverage policies of full text database producers which may change without notification to users. Most full text newspaper files do not carry even bibliographic listings for syndicated columns not created by their own staff. Looks at the development of full-text CD-ROM databases and claims that full-text, though expensive, is the wave of the future
  11. Poler, E.M.: Overcoming limitations in database searches (1992) 0.02
    0.021598797 = product of:
      0.06479639 = sum of:
        0.06479639 = product of:
          0.12959278 = sum of:
            0.12959278 = weight(_text_:searching in 2329) [ClassicSimilarity], result of:
              0.12959278 = score(doc=2329,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.6320768 = fieldWeight in 2329, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2329)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Describes the advantages of full text databases and full text searching, particularly in the context of law databases, the drawbacks to such searching in terms of recall, precision and errors, and methods of overcoming some of these problems
  12. Sormunen, E.: Free-text searching in full-text databases : probing system limits (1993) 0.02
    0.021381717 = product of:
      0.06414515 = sum of:
        0.06414515 = product of:
          0.1282903 = sum of:
            0.1282903 = weight(_text_:searching in 7120) [ClassicSimilarity], result of:
              0.1282903 = score(doc=7120,freq=2.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.6257241 = fieldWeight in 7120, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7120)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  13. Wildemuth, B.M.: Measures of success in searching a full-text fact base (1990) 0.02
    0.021381717 = product of:
      0.06414515 = sum of:
        0.06414515 = product of:
          0.1282903 = sum of:
            0.1282903 = weight(_text_:searching in 2050) [ClassicSimilarity], result of:
              0.1282903 = score(doc=2050,freq=8.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.6257241 = fieldWeight in 2050, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2050)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The traditional measures of online searching proficiency (recall and precision) are less appropriate when applied to the searching of full text databases. The pilot study investigated and evaluated 5 measures of overall success in searching a full text data bank. Data was drawn from INQUIRER searches conducted by medical students at North Carolina Univ. at Chapel Hill. INQUIRER ia an online database of facts and concepts in microbiology. The 5 measures were: success/failure; precision; search term overlap; number of search cycles; and time per search. Concludes that the last 4 measures look promising for the evaluation of fact data bases such as ENQUIRER
  14. Hildreth, C.R.: ¬The use and understanding of keyword searching in a university online catalog (1997) 0.02
    0.021381717 = product of:
      0.06414515 = sum of:
        0.06414515 = product of:
          0.1282903 = sum of:
            0.1282903 = weight(_text_:searching in 7377) [ClassicSimilarity], result of:
              0.1282903 = score(doc=7377,freq=2.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.6257241 = fieldWeight in 7377, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7377)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  15. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.02
    0.020003617 = product of:
      0.06001085 = sum of:
        0.06001085 = weight(_text_:bibliographic in 4300) [ClassicSimilarity], result of:
          0.06001085 = score(doc=4300,freq=4.0), product of:
            0.19731061 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.05068286 = queryNorm
            0.30414405 = fieldWeight in 4300, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
      0.33333334 = coord(1/3)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
  16. Allen, E.E.: Searching, naturally (1998) 0.02
    0.018327186 = product of:
      0.054981556 = sum of:
        0.054981556 = product of:
          0.10996311 = sum of:
            0.10996311 = weight(_text_:searching in 2602) [ClassicSimilarity], result of:
              0.10996311 = score(doc=2602,freq=2.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.53633493 = fieldWeight in 2602, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2602)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
  17. Basch, R.: Searching full-text : tips and techniques (1992) 0.02
    0.017279036 = product of:
      0.05183711 = sum of:
        0.05183711 = product of:
          0.10367422 = sum of:
            0.10367422 = weight(_text_:searching in 5050) [ClassicSimilarity], result of:
              0.10367422 = score(doc=5050,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.5056614 = fieldWeight in 5050, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5050)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Gives tips for effective searching of full-text databases: think creatively about vocabulary and use of the OR connector to link synonyms; use truncation to pick up plurals and alternative work endings; anticipate international differences in spellimg, likie 'labor' and 'labour', and in vocabulary, like 'truck' and 'lorry'; avoid the AND operator, preferring proximity connectors like NEAR, WITHIN or PRE; use jargon instead of trying to anticipate all the abstract terminology that a writer might have used; and exploit any indexing available
  18. Kristensen, J.: Expanding end-users' query statements for free text searching with a search-aid thesaurus (1993) 0.02
    0.017279036 = product of:
      0.05183711 = sum of:
        0.05183711 = product of:
          0.10367422 = sum of:
            0.10367422 = weight(_text_:searching in 6621) [ClassicSimilarity], result of:
              0.10367422 = score(doc=6621,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.5056614 = fieldWeight in 6621, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6621)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Tests the effectiveness of a thesaurus as a search-aid in free text searching of a full text database. A set of queries was searched against a large full text database of newspaper articles. The thesaurus contained equivalence, hierarchical and associative relationships. Each query was searched in five modes: basic search, synonym search, narrower term search, related term search, and union of all previous searches. The searches were analyzed in terms of relative recall and precision
  19. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.02
    0.015871808 = product of:
      0.047615424 = sum of:
        0.047615424 = product of:
          0.09523085 = sum of:
            0.09523085 = weight(_text_:searching in 27) [ClassicSimilarity], result of:
              0.09523085 = score(doc=27,freq=6.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.46447968 = fieldWeight in 27, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.046875 = fieldNorm(doc=27)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    Homonymy is known to often cause false drops in free text searching in a full text database. The problem is quite common and difficult to avoid in Finnish, but nobody has examined it before. Reports on a study that examined the frequency of, and solutions to, the homonymy problem, based on searches made in a Finnish full text database containing about 55.000 newspaper articles. The results indicate that homonymy is not a very serious problem in full text searching, with only about 1 search result set out of 4 containing false drops caused by homonymy. Several other reasons for nonrelevance were much more common. However, in some set results there were a considerable number of homonymy errors, so the number seems to be very random. A study was also made into whether homonyms can be disambiguated by syntactic analysis. The result was that 75,2% of homonyms were disambiguated by this method. Verb homonyms were considerably easier to disambiguate than substantives. Although homonymy is not a very big problem it could perhaps easily be eliminated if there was a suitable syntactic analyzer in the IR system
    Footnote
    Übers. d. Titels: The homonymy problem in free text searching and the results of homonymy disambiguation
  20. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998) 0.02
    0.015119157 = product of:
      0.04535747 = sum of:
        0.04535747 = product of:
          0.09071494 = sum of:
            0.09071494 = weight(_text_:searching in 1682) [ClassicSimilarity], result of:
              0.09071494 = score(doc=1682,freq=4.0), product of:
                0.20502694 = queryWeight, product of:
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.05068286 = queryNorm
                0.44245374 = fieldWeight in 1682, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.0452914 = idf(docFreq=2103, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1682)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    At Rotterdam Ports Authority in the Netherlands the Habour information database includes a press cuttings service and various online databases. To enable research staff to have direct access to information the POINT (Point Information Net) was begun in 1993. Using Verity software POINT provides simultaneously full text searching on a range of databases. The software uses current Web indexing technqiues to overcome the problems of excessive recall and low precision. A key element is the system's ability to recognise word combinations
    Footnote
    Übers. d. Titels: Effective searching on full texts: 1. full-text-retrieval on the Harbour information database