Search (9 results, page 1 of 1)

  • theme_ss:"Kataloganreicherung"
  1. Tseng, Y.-H.: Automatic cataloguing and searching for retrospective data by use of OCR text (2001) 0.03
    0.031158824 = product of:
      0.062317647 = sum of:
        0.062317647 = product of:
          0.124635294 = sum of:
            0.124635294 = weight(_text_:500 in 5421) [ClassicSimilarity], result of:
              0.124635294 = score(doc=5421,freq=2.0), product of:
                0.3075407 = queryWeight, product of:
                  6.113391 = idf(docFreq=265, maxDocs=44218)
                  0.050306078 = queryNorm
                0.40526438 = fieldWeight in 5421, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.113391 = idf(docFreq=265, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5421)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This article describes our efforts to support information retrieval from OCR-degraded text. In particular, we report our approach to an automatic cataloging and searching contest for books in multiple languages. In this contest, 500 books in English, German, French, and Italian, published from the 1770s to the 1970s, were scanned into images and OCRed to digital text. The goal is to use only automatic methods to extract information for sophisticated searching. We adopted the vector space retrieval model, an n-gram indexing method, and a special weighting scheme to tackle this problem. Although the performance of this approach is slightly inferior to that of the best approach, which is mainly based on regular expression matching, one advantage of our approach is that it is less language dependent and less layout sensitive, and is thus readily applicable to other languages and document collections. Problems of OCR text retrieval for some Asian languages are also discussed in this article, and solutions are suggested.
  2. Gratch, B.; Settel, B.; Atherton, P.: Characteristics of book indexes for subject retrieval in the humanities and social sciences (1978) 0.02
    0.023855226 = product of:
      0.047710452 = sum of:
        0.047710452 = product of:
          0.095420904 = sum of:
            0.095420904 = weight(_text_:22 in 1061) [ClassicSimilarity], result of:
              0.095420904 = score(doc=1061,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.5416616 = fieldWeight in 1061, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1061)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Indexer. 11(1978), pp. 14-22
  3. Ingwersen, P.; Wormell, I.: Modern indexing and retrieval techniques matching different types of information needs (1989) 0.02
    0.023855226 = product of:
      0.047710452 = sum of:
        0.047710452 = product of:
          0.095420904 = sum of:
            0.095420904 = weight(_text_:22 in 7322) [ClassicSimilarity], result of:
              0.095420904 = score(doc=7322,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.5416616 = fieldWeight in 7322, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7322)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    International forum on information and documentation. 14(1989), pp. 17-22
  4. Lam, V.-T.: Enhancing subject access to monographs in Online Public Access Catalogs : table of contents added to bibliographic records (2000) 0.01
    0.010223668 = product of:
      0.020447336 = sum of:
        0.020447336 = product of:
          0.040894672 = sum of:
            0.040894672 = weight(_text_:22 in 1187) [ClassicSimilarity], result of:
              0.040894672 = score(doc=1187,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.23214069 = fieldWeight in 1187, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1187)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05
  5. Leissing, U.; Rädler, K.; Hauer, M.: Query-Expansion durch Fachthesauri : Erfahrungsbericht zu dandelon.com, Vorarlberger Parlamentsinformationssystem und vorarlberg.at (2010) 0.01
    0.010223668 = product of:
      0.020447336 = sum of:
        0.020447336 = product of:
          0.040894672 = sum of:
            0.040894672 = weight(_text_:22 in 3728) [ClassicSimilarity], result of:
              0.040894672 = score(doc=3728,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.23214069 = fieldWeight in 3728, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3728)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Wissensspeicher in digitalen Räumen: Nachhaltigkeit - Verfügbarkeit - semantische Interoperabilität. Proceedings der 11. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Konstanz, 20. bis 22. Februar 2008. Ed. by J. Sieglerschmidt and H.P. Ohly
  6. Rädler, K.: Kataloganreicherung mit digitalen Inhaltsverzeichnissen eröffnet neue Geschäftsfelder : Erfahrungen aus der Vorarlberger Landesbibliothek (2008) 0.01
    0.008519724 = product of:
      0.017039448 = sum of:
        0.017039448 = product of:
          0.034078896 = sum of:
            0.034078896 = weight(_text_:22 in 1942) [ClassicSimilarity], result of:
              0.034078896 = score(doc=1942,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.19345059 = fieldWeight in 1942, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1942)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 6.2008 17:14:24
  7. Hauer, M.: Collaborative Catalog Enrichment : Digitalisierung und Information Retrieval (2011) 0.01
    0.008519724 = product of:
      0.017039448 = sum of:
        0.017039448 = product of:
          0.034078896 = sum of:
            0.034078896 = weight(_text_:22 in 160) [ClassicSimilarity], result of:
              0.034078896 = score(doc=160,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.19345059 = fieldWeight in 160, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=160)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Die Kraft der digitalen Unordnung: 32. Arbeits- und Fortbildungstagung der ASpB e. V., Sektion 5 im Deutschen Bibliotheksverband, 22.-25. September 2009 in der Universität Karlsruhe. Ed. by Jadwiga Warmbrunn et al.
  8. Barnes, S.; McCue, J.: Linking library records to bibliographic databases : an analysis of common data elements in BIOSIS, Agricola, and the OPAC (1991) 0.01
    0.006815779 = product of:
      0.013631558 = sum of:
        0.013631558 = product of:
          0.027263116 = sum of:
            0.027263116 = weight(_text_:22 in 520) [ClassicSimilarity], result of:
              0.027263116 = score(doc=520,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.15476047 = fieldWeight in 520, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=520)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    8. 1.2007 17:22:25
  9. Ikas, W.-V.; Litten, F.: World Wide Web und Catalogue Enrichment : Möglichkeiten des verbesserten Nachweises von mikroverfilmten Handschriften und Inkunabeln (2007) 0.01
    0.006815779 = product of:
      0.013631558 = sum of:
        0.013631558 = product of:
          0.027263116 = sum of:
            0.027263116 = weight(_text_:22 in 323) [ClassicSimilarity], result of:
              0.027263116 = score(doc=323,freq=2.0), product of:
                0.17616332 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.050306078 = queryNorm
                0.15476047 = fieldWeight in 323, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=323)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 5.2007 11:19:21
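
The explain trees above all apply the same arithmetic, so the final scores can be checked by hand. The sketch below recomputes two of them, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names (`classic_idf`, `classic_tf`, `explain_score`) are hypothetical helpers introduced here for illustration and are not part of any Lucene or Solr API. The inputs are copied from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as printed in the explain output,
    # e.g. idf(docFreq=265, maxDocs=44218).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def explain_score(freq, doc_freq, max_docs, query_norm, field_norm,
                  coords=(0.5, 0.5)):
    # queryWeight = idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm
    # term score  = queryWeight * fieldWeight
    # The listing then multiplies by two coord(1/2) factors.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    score = query_weight * field_weight
    for coord in coords:
        score *= coord
    return score


# Result 1 (doc 5421, term "500") and result 2 (doc 1061, term "22").
first = explain_score(2.0, 265, 44218, 0.050306078, 0.046875)
second = explain_score(2.0, 3622, 44218, 0.050306078, 0.109375)
print(f"{first:.6f} {second:.6f}")
```

The only inputs that differ among results 2 to 9 are the fieldNorm values (0.109375 down to 0.03125), which is why those hits share the same term statistics but receive different scores. Lucene computes in single precision, so the recomputed values can differ from the listing in the last digits.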