Search (18 results, page 1 of 1)

  • × theme_ss:"Volltextretrieval"
  1. Cochrane, P.A.: Subject access - free text and controlled : the case of Papua New Guinea (1985) 0.06
    0.05656203 = product of:
      0.11312406 = sum of:
        0.092054576 = weight(_text_:subject in 1459) [ClassicSimilarity], result of:
          0.092054576 = score(doc=1459,freq=6.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.5475522 = fieldWeight in 1459, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0625 = fieldNorm(doc=1459)
        0.02106948 = product of:
          0.04213896 = sum of:
            0.04213896 = weight(_text_:classification in 1459) [ClassicSimilarity], result of:
              0.04213896 = score(doc=1459,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.28149095 = fieldWeight in 1459, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1459)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Abstract
    The online catalogue can provide the user with efficient and effective access through a variety of access points. New interests in subject heading is indicated. Keyword access and free text searching are considered alternatice methods. An investigation is suggested into the symbiotic relationship between classification and subject heading
  2. Gross, T.; Taylor, A.G.; Joudrey, D.N.: Still a lot to lose : the role of controlled vocabulary in keyword searching (2015) 0.03
    0.032470033 = product of:
      0.064940065 = sum of:
        0.046504267 = weight(_text_:subject in 2007) [ClassicSimilarity], result of:
          0.046504267 = score(doc=2007,freq=2.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.27661324 = fieldWeight in 2007, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2007)
        0.018435795 = product of:
          0.03687159 = sum of:
            0.03687159 = weight(_text_:classification in 2007) [ClassicSimilarity], result of:
              0.03687159 = score(doc=2007,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.24630459 = fieldWeight in 2007, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2007)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Abstract
    In their 2005 study, Gross and Taylor found that more than a third of records retrieved by keyword searches would be lost without subject headings. A review of the literature since then shows that numerous studies, in various disciplines, have found that a quarter to a third of records returned in a keyword search would be lost without controlled vocabulary. Other writers, though, have continued to suggest that controlled vocabulary be discontinued. Addressing criticisms of the Gross/Taylor study, this study replicates the search process in the same online catalog, but after the addition of automated enriched metadata such as tables of contents and summaries. The proportion of results that would be lost remains high.
    Source
    Cataloging and classification quarterly. 53(2015) no.1, S.1-39
  3. Voorbij, H.: Title keywords and subject descriptors : a comparison of subject search entries of books in the humanities and social sciences (1998) 0.03
    0.028767055 = product of:
      0.11506822 = sum of:
        0.11506822 = weight(_text_:subject in 4721) [ClassicSimilarity], result of:
          0.11506822 = score(doc=4721,freq=24.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.68444026 = fieldWeight in 4721, product of:
              4.8989797 = tf(freq=24.0), with freq of:
                24.0 = termFreq=24.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4721)
      0.25 = coord(1/4)
    
    Abstract
    In order to compare the value of subject descriptors and title keywords as entries to subject searches, two studies were carried out. Both studies concentrated on monographs in the humanities and social sciences, held by the online public access catalogue of the National Library of the Netherlands. In the first study, a comparison was made by subject librarians between the subject descriptors and the title keywords of 475 records. They could express their opinion on a scale from 1 (descriptor is exactly or almost the same as word in title) to 7 (descriptor does not appear in title at all). It was concluded that 37 per cent of the records are considerably enhanced by a subject descriptor, and 49 per cent slightly or considerably enhanced. In the second study, subject librarians performed subject searches using title keywords and subject descriptors on the same topic. The relative recall amounted to 48 per cent and 86 per cent respectively. Failure analysis revealed the reasons why so many records that were found by subject descriptors were not found by title keywords. First, although completely meaningless titles hardly ever appear, the title of a publication does not always offer sufficient clues for title keyword searching. In those cases, descriptors may enhance the record of a publication. A second and even more important task of subject descriptors is controlling the vocabulary. Many relevant titles cannot be retrieved by title keyword searching because of the wide diversity of ways of expressing a topic. Descriptors take away the burden of vocabulary control from the user.
  4. Sclafani, F.: Controlled subject heading searching versus keyword searching (1999) 0.03
    0.026573867 = product of:
      0.10629547 = sum of:
        0.10629547 = weight(_text_:subject in 3790) [ClassicSimilarity], result of:
          0.10629547 = score(doc=3790,freq=2.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.63225883 = fieldWeight in 3790, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.125 = fieldNorm(doc=3790)
      0.25 = coord(1/4)
    
  5. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017) 0.02
    0.02034138 = product of:
      0.08136552 = sum of:
        0.08136552 = weight(_text_:subject in 3868) [ClassicSimilarity], result of:
          0.08136552 = score(doc=3868,freq=12.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.48397237 = fieldWeight in 3868, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3868)
      0.25 = coord(1/4)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
  6. Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018) 0.02
    0.02034138 = product of:
      0.08136552 = sum of:
        0.08136552 = weight(_text_:subject in 4300) [ClassicSimilarity], result of:
          0.08136552 = score(doc=4300,freq=12.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.48397237 = fieldWeight in 4300, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4300)
      0.25 = coord(1/4)
    
    Abstract
    Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
  7. Witt, M.: Au sujet des mots-clés (1997) 0.01
    0.013286933 = product of:
      0.053147733 = sum of:
        0.053147733 = weight(_text_:subject in 1666) [ClassicSimilarity], result of:
          0.053147733 = score(doc=1666,freq=2.0), product of:
            0.16812018 = queryWeight, product of:
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.04700564 = queryNorm
            0.31612942 = fieldWeight in 1666, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.576596 = idf(docFreq=3361, maxDocs=44218)
              0.0625 = fieldNorm(doc=1666)
      0.25 = coord(1/4)
    
    Footnote
    Übers. des Titels: On the subject of keywords
  8. Freitext in Informationssystemen (1985) 0.01
    0.01117378 = product of:
      0.04469512 = sum of:
        0.04469512 = product of:
          0.08939024 = sum of:
            0.08939024 = weight(_text_:classification in 2036) [ClassicSimilarity], result of:
              0.08939024 = score(doc=2036,freq=4.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.5971325 = fieldWeight in 2036, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2036)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Footnote
    Deutsche Fassung von 'Free text in information systems' in: International classification 12(1985) H.2, S.95-98. Wegen einiger Ungereimtheiten sollte die englische Fassung benutzt werden
    Source
    International classification. 12(1985) no.1, S.23-26
  9. Free text in information systems: capabilities and limitations (1985) 0.01
    0.01117378 = product of:
      0.04469512 = sum of:
        0.04469512 = product of:
          0.08939024 = sum of:
            0.08939024 = weight(_text_:classification in 2045) [ClassicSimilarity], result of:
              0.08939024 = score(doc=2045,freq=4.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.5971325 = fieldWeight in 2045, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2045)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Footnote
    Diese Empfehlungen liegen auch in deutscher Übersetzung vor (abgedruckt ebenfalls in International classification), leider ist die Übersetzung nicht in allen Aussagen recht gelungen, so daß das Original vorzuziehen ist
    Source
    International classification. 12(1985), S.95-98
  10. Kristensen, J.; Järvelin, K.: ¬The effectiveness of a searching thesaurus in free-text searching in a full-text database (1990) 0.01
    0.01053474 = product of:
      0.04213896 = sum of:
        0.04213896 = product of:
          0.08427792 = sum of:
            0.08427792 = weight(_text_:classification in 2043) [ClassicSimilarity], result of:
              0.08427792 = score(doc=2043,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.5629819 = fieldWeight in 2043, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.125 = fieldNorm(doc=2043)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Source
    International classification. 17(1990), S.77-84
  11. Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993) 0.01
    0.006368615 = product of:
      0.02547446 = sum of:
        0.02547446 = product of:
          0.05094892 = sum of:
            0.05094892 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
              0.05094892 = score(doc=7701,freq=2.0), product of:
                0.16460574 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04700564 = queryNorm
                0.30952093 = fieldWeight in 7701, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7701)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Date
    23. 1.1999 19:22:09
  12. Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.01
    0.006368615 = product of:
      0.02547446 = sum of:
        0.02547446 = product of:
          0.05094892 = sum of:
            0.05094892 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
              0.05094892 = score(doc=4919,freq=2.0), product of:
                0.16460574 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04700564 = queryNorm
                0.30952093 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Date
    22. 7.2000 17:48:06
  13. Zillmann, H.: OSIRIS und eLib : Information Retrieval und Search Engines in Full-text Databases (2001) 0.01
    0.006368615 = product of:
      0.02547446 = sum of:
        0.02547446 = product of:
          0.05094892 = sum of:
            0.05094892 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
              0.05094892 = score(doc=5937,freq=2.0), product of:
                0.16460574 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04700564 = queryNorm
                0.30952093 = fieldWeight in 5937, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5937)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Date
    14. 6.2001 12:22:31
  14. Dambeck, H.; Engler, T.: Gesucht und gefunden : Neun Volltext-Suchprogramme für den Desktop (2002) 0.01
    0.006368615 = product of:
      0.02547446 = sum of:
        0.02547446 = product of:
          0.05094892 = sum of:
            0.05094892 = weight(_text_:22 in 1169) [ClassicSimilarity], result of:
              0.05094892 = score(doc=1169,freq=2.0), product of:
                0.16460574 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04700564 = queryNorm
                0.30952093 = fieldWeight in 1169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1169)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Source
    c't. 2002, H.22, S.190-197
  15. Sievert, M.E.; McKinin, E.J.: Why full-text misses some relevant documents : an analysis of documents not retrieved by CCML or MEDIS (1989) 0.00
    0.004776461 = product of:
      0.019105844 = sum of:
        0.019105844 = product of:
          0.03821169 = sum of:
            0.03821169 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
              0.03821169 = score(doc=3564,freq=2.0), product of:
                0.16460574 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04700564 = queryNorm
                0.23214069 = fieldWeight in 3564, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3564)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Date
    9. 1.1996 10:22:31
  16. Meunier, J.-G.; Bertrand-Gastaldy, S.; Lebel, H.: ¬A call for enhanced representation of content as a means of improving online full-text retrieval (1987) 0.00
    0.0046089487 = product of:
      0.018435795 = sum of:
        0.018435795 = product of:
          0.03687159 = sum of:
            0.03687159 = weight(_text_:classification in 2049) [ClassicSimilarity], result of:
              0.03687159 = score(doc=2049,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.24630459 = fieldWeight in 2049, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2049)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Source
    International classification. 14(1987), S.2-10
  17. Blair, D.C.: Full text retrieval : Evaluation and implications (1986) 0.00
    0.003950528 = product of:
      0.015802111 = sum of:
        0.015802111 = product of:
          0.031604223 = sum of:
            0.031604223 = weight(_text_:classification in 2047) [ClassicSimilarity], result of:
              0.031604223 = score(doc=2047,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.21111822 = fieldWeight in 2047, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2047)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Source
    International classification. 13(1986), S.18-23
  18. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.00
    0.0032921063 = product of:
      0.013168425 = sum of:
        0.013168425 = product of:
          0.02633685 = sum of:
            0.02633685 = weight(_text_:classification in 4088) [ClassicSimilarity], result of:
              0.02633685 = score(doc=4088,freq=2.0), product of:
                0.14969917 = queryWeight, product of:
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.04700564 = queryNorm
                0.17593184 = fieldWeight in 4088, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.1847067 = idf(docFreq=4974, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4088)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Abstract
    So far, methods for ellipsis and anaphor resolution have been developed and the effects of anaphor resolution have been analyzed in the context of statistical information retrieval of scientific abstracts. No significant improvements has been observed. Analyzes the effects of ellipsis and anaphor resolution on proximity searching in a full text database. Anaphora and ellipsis are classified on the basis of the type of their correlates / antecedents rather than, as traditional, on the basis of their own linguistic type. The classification differentiates proper names and common nouns of basic words, compound words, and phrases. The study was carried out in a newspaper article database containing 55.000 full text articles. A set of 154 keyword pairs in different categories was created. Human resolution of keyword ellipsis and anaphora was performed to identify sentences and paragraphs which would match proximity searches after resolution. Findings indicate that ellipsis and anaphor resolution is most relevant for proper name phrases and only marginal in the other keyword categories. Therefore the recall effect of restricted resolution of proper name phrases only was analyzed for keyword pairs containing at least 1 proper name phrase. Findings indicate a recall increase of 38.2% in sentence searches, and 28.8% in paragraph searches when proper name ellipsis were resolved. The recall increase was 17.6% sentence searches, and 19.8% in paragraph searches when proper name anaphora were resolved. Some simple and computationally justifiable resolution method might be developed only for proper name phrases to support keyword based full text information retrieval. Discusses elements of such a method