Search (27 results, page 1 of 2)

  • × theme_ss:"Volltextretrieval"
  1. Cochrane, P.A.: Subject access - free text and controlled : the case of Papua New Guinea (1985) 0.02
    0.0241358 = product of:
      0.112633735 = sum of:
        0.058800567 = weight(_text_:subject in 1459) [ClassicSimilarity], result of:
          0.058800567 = score(doc=1459,freq=6.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.5475522 = fieldWeight in 1459, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
        0.026916584 = weight(_text_:classification in 1459) [ClassicSimilarity], result of:
          0.026916584 = score(doc=1459,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.28149095 = fieldWeight in 1459, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
        0.026916584 = weight(_text_:classification in 1459) [ClassicSimilarity], result of:
          0.026916584 = score(doc=1459,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.28149095 = fieldWeight in 1459, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
      0.21428572 = coord(3/14)
    
    Abstract
    The online catalogue can provide the user with efficient and effective access through a variety of access points. New interests in subject heading is indicated. Keyword access and free text searching are considered alternatice methods. An investigation is suggested into the symbiotic relationship between classification and subject heading
  2. Gross, T.; Taylor, A.G.; Joudrey, D.N.: Still a lot to lose : the role of controlled vocabulary in keyword searching (2015) 0.02
    0.016459066 = product of:
      0.076808974 = sum of:
        0.029704956 = weight(_text_:subject in 2007) [ClassicSimilarity], result of:
          0.029704956 = score(doc=2007,freq=2.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.27661324 = fieldWeight in 2007, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2007)
        0.023552012 = weight(_text_:classification in 2007) [ClassicSimilarity], result of:
          0.023552012 = score(doc=2007,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 2007, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2007)
        0.023552012 = weight(_text_:classification in 2007) [ClassicSimilarity], result of:
          0.023552012 = score(doc=2007,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 2007, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2007)
      0.21428572 = coord(3/14)
    
    Abstract
    In their 2005 study, Gross and Taylor found that more than a third of records retrieved by keyword searches would be lost without subject headings. A review of the literature since then shows that numerous studies, in various disciplines, have found that a quarter to a third of records returned in a keyword search would be lost without controlled vocabulary. Other writers, though, have continued to suggest that controlled vocabulary be discontinued. Addressing criticisms of the Gross/Taylor study, this study replicates the search process in the same online catalog, but after the addition of automated enriched metadata such as tables of contents and summaries. The proportion of results that would be lost remains high.
    Source
    Cataloging and classification quarterly. 53(2015) no.1, S.1-39
  3. Freitext in Informationssystemen (1985) 0.02
    0.016313914 = product of:
      0.114197396 = sum of:
        0.057098698 = weight(_text_:classification in 2036) [ClassicSimilarity], result of:
          0.057098698 = score(doc=2036,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5971325 = fieldWeight in 2036, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.09375 = fieldNorm(doc=2036)
        0.057098698 = weight(_text_:classification in 2036) [ClassicSimilarity], result of:
          0.057098698 = score(doc=2036,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5971325 = fieldWeight in 2036, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.09375 = fieldNorm(doc=2036)
      0.14285715 = coord(2/14)
    
    Footnote
    Deutsche Fassung von 'Free text in information systems' in: International classification 12(1985) H.2, S.95-98. Wegen einiger Ungereimtheiten sollte die englische Fassung benutzt werden
    Source
    International classification. 12(1985) no.1, S.23-26
  4. Free text in information systems: capabilities and limitations (1985) 0.02
    0.016313914 = product of:
      0.114197396 = sum of:
        0.057098698 = weight(_text_:classification in 2045) [ClassicSimilarity], result of:
          0.057098698 = score(doc=2045,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5971325 = fieldWeight in 2045, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.09375 = fieldNorm(doc=2045)
        0.057098698 = weight(_text_:classification in 2045) [ClassicSimilarity], result of:
          0.057098698 = score(doc=2045,freq=4.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5971325 = fieldWeight in 2045, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.09375 = fieldNorm(doc=2045)
      0.14285715 = coord(2/14)
    
    Footnote
    Diese Empfehlungen liegen auch in deutscher Übersetzung vor (abgedruckt ebenfalls in International classification), leider ist die Übersetzung nicht in allen Aussagen recht gelungen, so daß das Original vorzuziehen ist
    Source
    International classification. 12(1985), S.95-98
  5. Kristensen, J.; Järvelin, K.: ¬The effectiveness of a searching thesaurus in free-text searching in a full-text database (1990) 0.02
    0.015380906 = product of:
      0.107666336 = sum of:
        0.053833168 = weight(_text_:classification in 2043) [ClassicSimilarity], result of:
          0.053833168 = score(doc=2043,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5629819 = fieldWeight in 2043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.125 = fieldNorm(doc=2043)
        0.053833168 = weight(_text_:classification in 2043) [ClassicSimilarity], result of:
          0.053833168 = score(doc=2043,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.5629819 = fieldWeight in 2043, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.125 = fieldNorm(doc=2043)
      0.14285715 = coord(2/14)
    
    Source
    International classification. 17(1990), S.77-84
  6. Paijmans, H.: Gravity wells of meaning : detecting information rich passages in scientific texts (1997) 0.01
    0.013485613 = product of:
      0.09439929 = sum of:
        0.03799808 = product of:
          0.07599616 = sum of:
            0.07599616 = weight(_text_:schemes in 7444) [ClassicSimilarity], result of:
              0.07599616 = score(doc=7444,freq=2.0), product of:
                0.16067243 = queryWeight, product of:
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.03002521 = queryNorm
                0.4729882 = fieldWeight in 7444, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.3512506 = idf(docFreq=569, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7444)
          0.5 = coord(1/2)
        0.056401204 = product of:
          0.11280241 = sum of:
            0.11280241 = weight(_text_:texts in 7444) [ClassicSimilarity], result of:
              0.11280241 = score(doc=7444,freq=4.0), product of:
                0.16460659 = queryWeight, product of:
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.03002521 = queryNorm
                0.6852849 = fieldWeight in 7444, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7444)
          0.5 = coord(1/2)
      0.14285715 = coord(2/14)
    
    Abstract
    Presents research in which 4 term weigthing schemes were used to detect information rich passages in texts and the results compared. Demonstrates that word categories and frequency derived weights have a close correlation but that weighting according to the first mention theory or the cue method shows no correlation with frequency based weights
  7. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017) 0.01
    0.012503441 = product of:
      0.087524086 = sum of:
        0.051972847 = weight(_text_:subject in 3868) [ClassicSimilarity], result of:
          0.051972847 = score(doc=3868,freq=12.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.48397237 = fieldWeight in 3868, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3868)
        0.035551235 = weight(_text_:bibliographic in 3868) [ClassicSimilarity], result of:
          0.035551235 = score(doc=3868,freq=4.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.30414405 = fieldWeight in 3868, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3868)
      0.14285715 = coord(2/14)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
  8. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.01
    0.012503441 = product of:
      0.087524086 = sum of:
        0.051972847 = weight(_text_:subject in 4300) [ClassicSimilarity], result of:
          0.051972847 = score(doc=4300,freq=12.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.48397237 = fieldWeight in 4300, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
        0.035551235 = weight(_text_:bibliographic in 4300) [ClassicSimilarity], result of:
          0.035551235 = score(doc=4300,freq=4.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.30414405 = fieldWeight in 4300, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
      0.14285715 = coord(2/14)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
  9. Meunier, J.-G.; Bertrand-Gastaldy, S.; Lebel, H.: ¬A call for enhanced representation of content as a means of improving online full-text retrieval (1987) 0.01
    0.0067291465 = product of:
      0.047104023 = sum of:
        0.023552012 = weight(_text_:classification in 2049) [ClassicSimilarity], result of:
          0.023552012 = score(doc=2049,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 2049, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2049)
        0.023552012 = weight(_text_:classification in 2049) [ClassicSimilarity], result of:
          0.023552012 = score(doc=2049,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.24630459 = fieldWeight in 2049, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2049)
      0.14285715 = coord(2/14)
    
    Source
    International classification. 14(1987), S.2-10
  10. Blair, D.C.: Full text retrieval : Evaluation and implications (1986) 0.01
    0.00576784 = product of:
      0.04037488 = sum of:
        0.02018744 = weight(_text_:classification in 2047) [ClassicSimilarity], result of:
          0.02018744 = score(doc=2047,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.21111822 = fieldWeight in 2047, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=2047)
        0.02018744 = weight(_text_:classification in 2047) [ClassicSimilarity], result of:
          0.02018744 = score(doc=2047,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.21111822 = fieldWeight in 2047, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.046875 = fieldNorm(doc=2047)
      0.14285715 = coord(2/14)
    
    Source
    International classification. 13(1986), S.18-23
  11. Voorbij, H.: Title keywords and subject descriptors : a comparison of subject search entries of books in the humanities and social sciences (1998) 0.01
    0.00525005 = product of:
      0.0735007 = sum of:
        0.0735007 = weight(_text_:subject in 4721) [ClassicSimilarity], result of:
          0.0735007 = score(doc=4721,freq=24.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.68444026 = fieldWeight in 4721, product of:
              4.8989797 = tf(freq=24.0), with freq of:
                24.0 = termFreq=24.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4721)
      0.071428575 = coord(1/14)
    
    Abstract
    In order to compare the value of subject descriptors and title keywords as entries to subject searches, two studies were carried out. Both studies concentrated on monographs in the humanities and social sciences, held by the online public access catalogue of the National Library of the Netherlands. In the first study, a comparison was made by subject librarians between the subject descriptors and the title keywords of 475 records. They could express their opinion on a scale from 1 (descriptor is exactly or almost the same as word in title) to 7 (descriptor does not appear in title at all). It was concluded that 37 per cent of the records are considerably enhanced by a subject descriptor, and 49 per cent slightly or considerably enhanced. In the second study, subject librarians performed subject searches using title keywords and subject descriptors on the same topic. The relative recall amounted to 48 per cent and 86 per cent respectively. Failure analysis revealed the reasons why so many records that were found by subject descriptors were not found by title keywords. First, although completely meaningless titles hardly ever appear, the title of a publication does not always offer sufficient clues for title keyword searching. In those cases, descriptors may enhance the record of a publication. A second and even more important task of subject descriptors is controlling the vocabulary. Many relevant titles cannot be retrieved by title keyword searching because of the wide diversity of ways of expressing a topic. Descriptors take away the burden of vocabulary control from the user.
  12. Sclafani, F.: Controlled subject heading searching versus keyword searching (1999) 0.00
    0.004849789 = product of:
      0.067897044 = sum of:
        0.067897044 = weight(_text_:subject in 3790) [ClassicSimilarity], result of:
          0.067897044 = score(doc=3790,freq=2.0), product of:
            0.10738805 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.03002521 = queryNorm
            0.63225883 = fieldWeight in 3790, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.125 = fieldNorm(doc=3790)
      0.071428575 = coord(1/14)
    
  13. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.00
    0.004806533 = product of:
      0.03364573 = sum of:
        0.016822865 = weight(_text_:classification in 4088) [ClassicSimilarity], result of:
          0.016822865 = score(doc=4088,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.17593184 = fieldWeight in 4088, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4088)
        0.016822865 = weight(_text_:classification in 4088) [ClassicSimilarity], result of:
          0.016822865 = score(doc=4088,freq=2.0), product of:
            0.09562149 = queryWeight, product of:
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.03002521 = queryNorm
            0.17593184 = fieldWeight in 4088, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1847067 = idf(docFreq=4974, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4088)
      0.14285715 = coord(2/14)
    
    Abstract
    So far, methods for ellipsis and anaphor resolution have been developed and the effects of anaphor resolution have been analyzed in the context of statistical information retrieval of scientific abstracts. No significant improvements has been observed. Analyzes the effects of ellipsis and anaphor resolution on proximity searching in a full text database. Anaphora and ellipsis are classified on the basis of the type of their correlates / antecedents rather than, as traditional, on the basis of their own linguistic type. The classification differentiates proper names and common nouns of basic words, compound words, and phrases. The study was carried out in a newspaper article database containing 55.000 full text articles. A set of 154 keyword pairs in different categories was created. Human resolution of keyword ellipsis and anaphora was performed to identify sentences and paragraphs which would match proximity searches after resolution. Findings indicate that ellipsis and anaphor resolution is most relevant for proper name phrases and only marginal in the other keyword categories. Therefore the recall effect of restricted resolution of proper name phrases only was analyzed for keyword pairs containing at least 1 proper name phrase. Findings indicate a recall increase of 38.2% in sentence searches, and 28.8% in paragraph searches when proper name ellipsis were resolved. The recall increase was 17.6% sentence searches, and 19.8% in paragraph searches when proper name anaphora were resolved. Some simple and computationally justifiable resolution method might be developed only for proper name phrases to support keyword based full text information retrieval. Discusses elements of such a method
  14. Schmidt, J.: Full-text searching : as seen from a non-bibliographic searcher's point of view (1989) 0.00
    0.0035551237 = product of:
      0.04977173 = sum of:
        0.04977173 = weight(_text_:bibliographic in 2876) [ClassicSimilarity], result of:
          0.04977173 = score(doc=2876,freq=4.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.4258017 = fieldWeight in 2876, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2876)
      0.071428575 = coord(1/14)
    
    Abstract
    Examines searching capabilities and search results relating to the same full text data base made available by: a host that offers a command language designed for searching bibliographic data bases and a host that provides search facilities that have been specially designed for full text retrieval. Moreover, the CD-ROM format of an encyclopedia is compared with the equivalent on-line version of the same work, Academic American Encyclopedia. Results reveal that it is easier to search on those systems that offer searching facilities which have been specially designed for full text retrieval.
  15. Quint, B.: Flipping for full-text (1991) 0.00
    0.002872974 = product of:
      0.04022163 = sum of:
        0.04022163 = weight(_text_:bibliographic in 4893) [ClassicSimilarity], result of:
          0.04022163 = score(doc=4893,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.34409973 = fieldWeight in 4893, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0625 = fieldNorm(doc=4893)
      0.071428575 = coord(1/14)
    
    Abstract
    Provides tips for searchers of full text online databases and examines the coverage policies of full text database producers which may change without notification to users. Most full text newspaper files do not carry even bibliographic listings for syndicated columns not created by their own staff. Looks at the development of full-text CD-ROM databases and claims that full-text, though expensive, is the wave of the future
  16. Tenopir, C.: Full-text retrieval : systems and files (1994) 0.00
    0.002872974 = product of:
      0.04022163 = sum of:
        0.04022163 = weight(_text_:bibliographic in 2424) [ClassicSimilarity], result of:
          0.04022163 = score(doc=2424,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.34409973 = fieldWeight in 2424, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0625 = fieldNorm(doc=2424)
      0.071428575 = coord(1/14)
    
    Abstract
    State of the art review of the development of full text databases, encompassing: types of commercially available full text databases; online systems for full text databases; CD-ROM databases for full text databases; full text databases on magnetic discs or tapes; creation of full text databases; searching and display requirements for full text searching and software. Concludes that bibliographic information services without full text support solve only half of the retrieval problems
  17. Couvreur, T.R.; Benzel, R.N.; Miller, S.F.; Zeitler, D.N.; Lee, D.L.; Singhal, M.; Shivaratri, N.; Wong, W.Y.P.: ¬An analysis of performance and cost factors in searching large text databases using parallel search systems (1994) 0.00
    0.002513852 = product of:
      0.035193928 = sum of:
        0.035193928 = weight(_text_:bibliographic in 7657) [ClassicSimilarity], result of:
          0.035193928 = score(doc=7657,freq=2.0), product of:
            0.11688946 = queryWeight, product of:
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.03002521 = queryNorm
            0.30108726 = fieldWeight in 7657, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.893044 = idf(docFreq=2449, maxDocs=44218)
              0.0546875 = fieldNorm(doc=7657)
      0.071428575 = coord(1/14)
    
    Abstract
    The results of modelling the performance of searching large text databases (>10 GBytes) via various parallel hardware architectures and search algorithms are discussed. The performance under load and the cost of each configuration are compared. Strengths, weaknesses, performance sensitivities, and search features supported for each configuration are also addressed. In addition, a common search workload used in the modelling is described. The search workload is derived from a set of searches run against the Chemical Abstracts file of bibliographic and abstract text available on STN International. This common workload is applied to all configurations modelled to provide a common basis of comparison
  18. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.00
    0.0024926048 = product of:
      0.034896467 = sum of:
        0.034896467 = product of:
          0.069792934 = sum of:
            0.069792934 = weight(_text_:texts in 1150) [ClassicSimilarity], result of:
              0.069792934 = score(doc=1150,freq=2.0), product of:
                0.16460659 = queryWeight, product of:
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.03002521 = queryNorm
                0.42399842 = fieldWeight in 1150, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1150)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
    Abstract
    This paper presents a probabilistic technique to retrieve passages from texts having a large size or heterogeneous semantic content. The proposed technique is independent on any supporting auxiliary data, such as text structure, topic organization, or pre-defined text segments. A Bayesian framework implements the probabilistic technique. We carried out experiments to compare the probabilistique technique to one based on a text segmentation algorithm. In particular, the probabilistique technique is more effective than, or as effective as the one based on the text segmentation to retrieve small passages. Results show that passage size affects passage retrieval performance. Results do also suggest that text organization and query generality may have an impact on the difference in effectiveness between the two techniques
  19. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998) 0.00
    0.0024926048 = product of:
      0.034896467 = sum of:
        0.034896467 = product of:
          0.069792934 = sum of:
            0.069792934 = weight(_text_:texts in 1682) [ClassicSimilarity], result of:
              0.069792934 = score(doc=1682,freq=2.0), product of:
                0.16460659 = queryWeight, product of:
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.03002521 = queryNorm
                0.42399842 = fieldWeight in 1682, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1682)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
    Footnote
    Übers. d. Titels: Effective searching on full texts: 1. full-text-retrieval on the Harbour information database
  20. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 2. full-text retrieval bij de HavenInformatieBank (1998) 0.00
    0.0024926048 = product of:
      0.034896467 = sum of:
        0.034896467 = product of:
          0.069792934 = sum of:
            0.069792934 = weight(_text_:texts in 2368) [ClassicSimilarity], result of:
              0.069792934 = score(doc=2368,freq=2.0), product of:
                0.16460659 = queryWeight, product of:
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.03002521 = queryNorm
                0.42399842 = fieldWeight in 2368, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.4822793 = idf(docFreq=499, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2368)
          0.5 = coord(1/2)
      0.071428575 = coord(1/14)
    
    Footnote
    Übers. d. Titels: Effective searching on full texts: 1. full-text-retrieval on the Harbour information database