Search (38 results, page 1 of 2)

  • × theme_ss:"Automatisches Indexieren"
  1. Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.13
    0.1268906 = product of:
      0.19033588 = sum of:
        0.16725978 = weight(_text_:specialist in 5001) [ClassicSimilarity], result of:
          0.16725978 = score(doc=5001,freq=2.0), product of:
            0.32440975 = queryWeight, product of:
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.04866305 = queryNorm
            0.51558185 = fieldWeight in 5001, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5001)
        0.0230761 = product of:
          0.0461522 = sum of:
            0.0461522 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
              0.0461522 = score(doc=5001,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.2708308 = fieldWeight in 5001, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5001)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    A study was done to test the effectiveness of retrieval using title word searching. It was based on actual search profiles used in the Mechanized Information Center at Ohio State University, in order ro replicate as closely as possible actual searching conditions. Fewer than 50% of the relevant titles were retrieved by keywords in titles. The low rate of retrieval can be attributes to three sources: titles themselves, user and information specialist ignorance of the subject vocabulary in use, and to general language problems. Across fields it was found that the social sciences had the best retrieval rate, with science having the next best, and arts and humanities the lowest. Ways to enhance and supplement keyword in title searching on the computer and in printed indexes are discussed.
    Date
    14. 3.1996 13:22:21
  2. Ward, M.L.: ¬The future of the human indexer (1996) 0.11
    0.10876336 = product of:
      0.16314504 = sum of:
        0.14336552 = weight(_text_:specialist in 7244) [ClassicSimilarity], result of:
          0.14336552 = score(doc=7244,freq=2.0), product of:
            0.32440975 = queryWeight, product of:
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.04866305 = queryNorm
            0.44192728 = fieldWeight in 7244, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.046875 = fieldNorm(doc=7244)
        0.019779515 = product of:
          0.03955903 = sum of:
            0.03955903 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
              0.03955903 = score(doc=7244,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.23214069 = fieldWeight in 7244, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7244)
          0.5 = coord(1/2)
      0.6666667 = coord(2/3)
    
    Abstract
    Considers the principles of indexing and the intellectual skills involved in order to determine what automatic indexing systems would be required in order to supplant or complement the human indexer. Good indexing requires: considerable prior knowledge of the literature; judgement as to what to index and what depth to index; reading skills; abstracting skills; and classification skills, Illustrates these features with a detailed description of abstracting and indexing processes involved in generating entries for the mechanical engineering database POWERLINK. Briefly assesses the possibility of replacing human indexers with specialist indexing software, with particular reference to the Object Analyzer from the InTEXT automatic indexing system and using the criteria described for human indexers. At present, it is unlikely that the automatic indexer will replace the human indexer, but when more primary texts are available in electronic form, it may be a useful productivity tool for dealing with large quantities of low grade texts (should they be wanted in the database)
    Date
    9. 2.1997 18:44:22
  3. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.06
    0.06371801 = product of:
      0.19115403 = sum of:
        0.19115403 = weight(_text_:specialist in 1911) [ClassicSimilarity], result of:
          0.19115403 = score(doc=1911,freq=2.0), product of:
            0.32440975 = queryWeight, product of:
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.04866305 = queryNorm
            0.5892364 = fieldWeight in 1911, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.666449 = idf(docFreq=152, maxDocs=44218)
              0.0625 = fieldNorm(doc=1911)
      0.33333334 = coord(1/3)
    
    Abstract
    Discusses research into the application of adaptive pattern recognition technology to enable effective retrieval from scanned document images. Describes application at the British Library of Excalibur EFS software which uses adaptive pattern recognition technology to provide access to digital information in its native forms, fuzzy searching retrieval and automatic indexing capabilities. It was used to make specialist printed catalogues and indexes accessible on computer via content based indexes
  4. Keller, A.: Attitudes among German- and English-speaking librarians toward (automatic) subject indexing (2015) 0.02
    0.017799769 = product of:
      0.053399306 = sum of:
        0.053399306 = product of:
          0.10679861 = sum of:
            0.10679861 = weight(_text_:librarians in 2629) [ClassicSimilarity], result of:
              0.10679861 = score(doc=2629,freq=4.0), product of:
                0.21798341 = queryWeight, product of:
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.04866305 = queryNorm
                0.48993918 = fieldWeight in 2629, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2629)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The survey described in this article investigates the attitudes of librarians in German- and English-speaking countries toward subject indexing in general, and automatic subject indexing in particular. The results show great similarity between attitudes in both language areas. Respondents agree that the current quality standards should be upheld and dismiss critical voices claiming that subject indexing has lost relevance. With regard to automatic subject indexing, respondents demonstrate considerable skepticism-both with regard to the likely timeframe and the expected quality of such systems. The author considers how this low acceptance poses a difficulty for those involved in change management.
  5. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.01758179 = product of:
      0.052745372 = sum of:
        0.052745372 = product of:
          0.105490744 = sum of:
            0.105490744 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.105490744 = score(doc=402,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  6. Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.02
    0.015384067 = product of:
      0.0461522 = sum of:
        0.0461522 = product of:
          0.0923044 = sum of:
            0.0923044 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
              0.0923044 = score(doc=262,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.5416616 = fieldWeight in 262, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=262)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    20.10.2000 12:22:23
  7. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02
    0.015384067 = product of:
      0.0461522 = sum of:
        0.0461522 = product of:
          0.0923044 = sum of:
            0.0923044 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
              0.0923044 = score(doc=6265,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.5416616 = fieldWeight in 6265, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6265)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Information outlook. 9(2005) no.8, S.22-23
  8. Bloomfield, M.: Indexing : neglected and poorly understood (2001) 0.02
    0.015256945 = product of:
      0.045770835 = sum of:
        0.045770835 = product of:
          0.09154167 = sum of:
            0.09154167 = weight(_text_:librarians in 5439) [ClassicSimilarity], result of:
              0.09154167 = score(doc=5439,freq=4.0), product of:
                0.21798341 = queryWeight, product of:
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.04866305 = queryNorm
                0.41994786 = fieldWeight in 5439, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5439)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The growth of the Internet has highlighted the use of machine indexing. The difficulties in using the Internet as a searching device can be frustrating. The use of the term "Python" is given as an example. Machine indexing is noted as "rotten" and human indexing as "capricious." The problem seems to be a lack of a theoretical foundation for the art of indexing. What librarians have learned over the last hundred years has yet to yield a consistent approach to what really works best in preparing index terms and in the ability of our customers to search the various indexes. An attempt is made to consider the elements of indexing, their pros and cons. The argument is made that machine indexing is far too prolific in its production of index terms. Neither librarians nor computer programmers have made much progress to improve Internet indexing. Human indexing has had the same problems for over fifty years.
  9. Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.01
    0.013186343 = product of:
      0.03955903 = sum of:
        0.03955903 = product of:
          0.07911806 = sum of:
            0.07911806 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
              0.07911806 = score(doc=58,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.46428138 = fieldWeight in 58, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=58)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    14. 6.2015 22:12:44
  10. Hauer, M.: Automatische Indexierung (2000) 0.01
    0.013186343 = product of:
      0.03955903 = sum of:
        0.03955903 = product of:
          0.07911806 = sum of:
            0.07911806 = weight(_text_:22 in 5887) [ClassicSimilarity], result of:
              0.07911806 = score(doc=5887,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.46428138 = fieldWeight in 5887, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=5887)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Wissen in Aktion: Wege des Knowledge Managements. 22. Online-Tagung der DGI, Frankfurt am Main, 2.-4.5.2000. Proceedings. Hrsg.: R. Schmidt
  11. Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.01
    0.013186343 = product of:
      0.03955903 = sum of:
        0.03955903 = product of:
          0.07911806 = sum of:
            0.07911806 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
              0.07911806 = score(doc=2051,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.46428138 = fieldWeight in 2051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2051)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    14. 6.2015 22:12:56
  12. Hauer, M.: Tiefenindexierung im Bibliothekskatalog : 17 Jahre intelligentCAPTURE (2019) 0.01
    0.013186343 = product of:
      0.03955903 = sum of:
        0.03955903 = product of:
          0.07911806 = sum of:
            0.07911806 = weight(_text_:22 in 5629) [ClassicSimilarity], result of:
              0.07911806 = score(doc=5629,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.46428138 = fieldWeight in 5629, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=5629)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    B.I.T.online. 22(2019) H.2, S.163-166
  13. Wolfe, EW.: a case study in automated metadata enhancement : Natural Language Processing in the humanities (2019) 0.01
    0.012586337 = product of:
      0.03775901 = sum of:
        0.03775901 = product of:
          0.07551802 = sum of:
            0.07551802 = weight(_text_:librarians in 5236) [ClassicSimilarity], result of:
              0.07551802 = score(doc=5236,freq=2.0), product of:
                0.21798341 = queryWeight, product of:
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.04866305 = queryNorm
                0.3464393 = fieldWeight in 5236, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5236)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions.
  14. Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.01
    0.010988619 = product of:
      0.032965858 = sum of:
        0.032965858 = product of:
          0.065931715 = sum of:
            0.065931715 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
              0.065931715 = score(doc=1952,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.38690117 = fieldWeight in 1952, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1952)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    16. 8.1998 12:51:22
  15. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.01
    0.010988619 = product of:
      0.032965858 = sum of:
        0.032965858 = product of:
          0.065931715 = sum of:
            0.065931715 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.065931715 = score(doc=4157,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  16. Tsareva, P.V.: Algoritmy dlya raspoznavaniya pozitivnykh i negativnykh vkhozdenii deskriptorov v tekst i protsedura avtomaticheskoi klassifikatsii tekstov (1999) 0.01
    0.010988619 = product of:
      0.032965858 = sum of:
        0.032965858 = product of:
          0.065931715 = sum of:
            0.065931715 = weight(_text_:22 in 374) [ClassicSimilarity], result of:
              0.065931715 = score(doc=374,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.38690117 = fieldWeight in 374, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=374)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    1. 4.2002 10:22:41
  17. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.01
    0.010988619 = product of:
      0.032965858 = sum of:
        0.032965858 = product of:
          0.065931715 = sum of:
            0.065931715 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.065931715 = score(doc=2759,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    1. 2.2016 18:25:22
  18. Lowe, D.B.; Dollinger, I.; Koster, T.; Herbert, B.E.: Text mining for type of research classification (2021) 0.01
    0.01078829 = product of:
      0.032364868 = sum of:
        0.032364868 = product of:
          0.064729735 = sum of:
            0.064729735 = weight(_text_:librarians in 720) [ClassicSimilarity], result of:
              0.064729735 = score(doc=720,freq=2.0), product of:
                0.21798341 = queryWeight, product of:
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.04866305 = queryNorm
                0.296948 = fieldWeight in 720, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.479444 = idf(docFreq=1362, maxDocs=44218)
                  0.046875 = fieldNorm(doc=720)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Abstract
    This project brought together undergraduate students in Computer Science with librarians to mine abstracts of articles from the Texas A&M University Libraries' institutional repository, OAKTrust, in order to probe the creation of new metadata to improve discovery and use. The mining operation task consisted simply of classifying the articles into two categories of research type: basic research ("for understanding," "curiosity-based," or "knowledge-based") and applied research ("use-based"). These categories are fundamental especially for funders but are also important to researchers. The mining-to-classification steps took several iterations, but ultimately, we achieved good results with the toolkit BERT (Bidirectional Encoder Representations from Transformers). The project and its workflows represent a preview of what may lie ahead in the future of crafting metadata using text mining techniques to enhance discoverability.
  19. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01
    0.008790895 = product of:
      0.026372686 = sum of:
        0.026372686 = product of:
          0.052745372 = sum of:
            0.052745372 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.052745372 = score(doc=4709,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    31. 7.1996 9:22:19
  20. Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01
    0.008790895 = product of:
      0.026372686 = sum of:
        0.026372686 = product of:
          0.052745372 = sum of:
            0.052745372 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
              0.052745372 = score(doc=6752,freq=2.0), product of:
                0.17040971 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04866305 = queryNorm
                0.30952093 = fieldWeight in 6752, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6752)
          0.5 = coord(1/2)
      0.33333334 = coord(1/3)
    
    Date
    6. 3.1997 16:22:15