Document (#21384)

Author
Peis, E.
Fernandez-Molina, J.C.
Title
Enrichment of bibliographic records of online catalogs through OCR and SGML technology
Source
Information technology and libraries. 17(1998) no.3, p.161-172
Year
1998
Abstract
Reports results of research into the feasibility of using OCR scanner technology to capture contents pages of collective monographs, extract the bibliographic information of each individual work, and process it with a standardized language for tagging electronic documents, such as SGML. By this means, data can be used as electronic information or stored in OPACs, thus providing additional access points. Outlines a pilot system to test the initial hypotheses, show the feasibility of achieving the suggested goals, and develop the tasks required for them to be carried out as automatically as possible.
Theme
Catalog enrichment (Kataloganreicherung)
Object
SGML

Similar documents (author)

  1. Fernández-Molina, J.C.; Peis, E.: The moral rights of authors in the age of digital information (2001) 3.16
    3.1630027 = sum of:
      3.1630027 = product of:
        4.744504 = sum of:
          1.9799455 = weight(author_txt:molina in 580) [ClassicSimilarity], result of:
            1.9799455 = score(doc=580,freq=1.0), product of:
              0.5114476 = queryWeight, product of:
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.05779991 = queryNorm
              3.8712578 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.4375 = fieldNorm(doc=580)
          2.7645583 = weight(author_txt:peis in 580) [ClassicSimilarity], result of:
            2.7645583 = score(doc=580,freq=1.0), product of:
              0.6389245 = queryWeight, product of:
                1.1176972 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.05779991 = queryNorm
              4.326894 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.4375 = fieldNorm(doc=580)
        0.6666667 = coord(2/3)
    
  2. Peis, E.; Moya, F. de; Fernández-Molina, J.C.: Encoded archival description (EAD) conversion : a methodological proposal (2000) 2.26
    2.2592876 = sum of:
      2.2592876 = product of:
        3.3889313 = sum of:
          1.4142467 = weight(author_txt:molina in 897) [ClassicSimilarity], result of:
            1.4142467 = score(doc=897,freq=1.0), product of:
              0.5114476 = queryWeight, product of:
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.05779991 = queryNorm
              2.765184 = fieldWeight in 897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.3125 = fieldNorm(doc=897)
          1.9746847 = weight(author_txt:peis in 897) [ClassicSimilarity], result of:
            1.9746847 = score(doc=897,freq=1.0), product of:
              0.6389245 = queryWeight, product of:
                1.1176972 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.05779991 = queryNorm
              3.0906386 = fieldWeight in 897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.3125 = fieldNorm(doc=897)
        0.6666667 = coord(2/3)
    
  3. Fernandez, C.W.: Semantic relationships between title phrases and LCSH (1991) 1.12
    1.1228243 = sum of:
      1.1228243 = product of:
        3.3684728 = sum of:
          3.3684728 = weight(author_txt:fernandez in 1632) [ClassicSimilarity], result of:
            3.3684728 = score(doc=1632,freq=1.0), product of:
              0.57462746 = queryWeight, product of:
                1.0599676 = boost
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.05779991 = queryNorm
              5.8620114 = fieldWeight in 1632, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.379218 = idf(docFreq=9, maxDocs=43556)
                0.625 = fieldNorm(doc=1632)
        0.33333334 = coord(1/3)
    
  4. Molina, M.P.: Interdisciplinary approaches to the concept and practice of written documentary content analysis (WTDCA) (1994) 0.94
    0.94283116 = sum of:
      0.94283116 = product of:
        2.8284934 = sum of:
          2.8284934 = weight(author_txt:molina in 6144) [ClassicSimilarity], result of:
            2.8284934 = score(doc=6144,freq=1.0), product of:
              0.5114476 = queryWeight, product of:
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.05779991 = queryNorm
              5.530368 = fieldWeight in 6144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.625 = fieldNorm(doc=6144)
        0.33333334 = coord(1/3)
    
  5. Molina, M.P.: Documentary abstracting : toward a methodological approach (1995) 0.94
    0.94283116 = sum of:
      0.94283116 = product of:
        2.8284934 = sum of:
          2.8284934 = weight(author_txt:molina in 1856) [ClassicSimilarity], result of:
            2.8284934 = score(doc=1856,freq=1.0), product of:
              0.5114476 = queryWeight, product of:
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.05779991 = queryNorm
              5.530368 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.625 = fieldNorm(doc=1856)
        0.33333334 = coord(1/3)
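
The score trees above follow the pattern that the explain output itself spells out: each matching term contributes queryWeight times fieldWeight, and the sum over terms is scaled by coord. As a minimal sketch, the arithmetic for the first entry (doc 580) can be re-checked as follows. The helper names `term_score` and `doc_score` are hypothetical and are not part of any Lucene API; all numbers are copied from the listing, and tf = sqrt(freq) is assumed from the tf/termFreq pairs shown in the trees.

```python
import math

def term_score(freq, idf, query_norm, field_norm, boost=1.0):
    """One 'weight(...)' node: queryWeight * fieldWeight."""
    query_weight = boost * idf * query_norm               # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm     # tf * idf * fieldNorm
    return query_weight * field_weight

def doc_score(term_scores, query_terms):
    """Sum of matching term scores, scaled by coord(matched/total)."""
    coord = len(term_scores) / query_terms
    return coord * sum(term_scores)

QUERY_NORM = 0.05779991  # queryNorm shown in the author-similarity trees

# Values taken from the explain tree for doc 580
molina = term_score(1.0, 8.848589, QUERY_NORM, 0.4375)
peis = term_score(1.0, 9.890043, QUERY_NORM, 0.4375, boost=1.1176972)

# Two of the three author query terms matched, hence coord(2/3)
total = doc_score([molina, peis], query_terms=3)
print(f"molina={molina:.4f} peis={peis:.4f} total={total:.4f}")
```

Under these assumptions the result agrees with the listed 3.1630027 up to rounding. Entries 3 to 5 match only one of the three author terms, which is why their higher per-term weights are scaled down by coord(1/3).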
    

Similar documents (content)

  1. Lupovici, C.: L'information secondaire du document primaire : format MARC ou SGML? (1997) 0.17
    0.1691913 = sum of:
      0.1691913 = product of:
        0.8459565 = sum of:
          0.09533143 = weight(abstract_txt:tagging in 1890) [ClassicSimilarity], result of:
            0.09533143 = score(doc=1890,freq=1.0), product of:
              0.16124073 = queryWeight, product of:
                1.1155032 = boost
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.02291996 = queryNorm
              0.59123665 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3065243 = idf(docFreq=215, maxDocs=43556)
                0.09375 = fieldNorm(doc=1890)
          0.103575096 = weight(abstract_txt:pilot in 1890) [ClassicSimilarity], result of:
            0.103575096 = score(doc=1890,freq=1.0), product of:
              0.17040706 = queryWeight, product of:
                1.1467724 = boost
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.02291996 = queryNorm
              0.6078099 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.09375 = fieldNorm(doc=1890)
          0.031791445 = weight(abstract_txt:using in 1890) [ClassicSimilarity], result of:
            0.031791445 = score(doc=1890,freq=1.0), product of:
              0.09769391 = queryWeight, product of:
                1.2279543 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.02291996 = queryNorm
              0.3254189 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.09375 = fieldNorm(doc=1890)
          0.09968974 = weight(abstract_txt:bibliographic in 1890) [ClassicSimilarity], result of:
            0.09968974 = score(doc=1890,freq=3.0), product of:
              0.14511776 = queryWeight, product of:
                1.4966103 = boost
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.02291996 = queryNorm
              0.68695754 = fieldWeight in 1890, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.09375 = fieldNorm(doc=1890)
          0.5155688 = weight(abstract_txt:sgml in 1890) [ClassicSimilarity], result of:
            0.5155688 = score(doc=1890,freq=5.0), product of:
              0.3660399 = queryWeight, product of:
                2.3769095 = boost
                6.718958 = idf(docFreq=142, maxDocs=43556)
                0.02291996 = queryNorm
              1.4085044 = fieldWeight in 1890, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.718958 = idf(docFreq=142, maxDocs=43556)
                0.09375 = fieldNorm(doc=1890)
        0.2 = coord(5/25)
    
  2. Foott, D.: Scanner technology and the addition of contents pages to library records (1993) 0.11
    0.11076291 = sum of:
      0.11076291 = product of:
        0.55381453 = sum of:
          0.08695029 = weight(abstract_txt:contents in 6579) [ClassicSimilarity], result of:
            0.08695029 = score(doc=6579,freq=2.0), product of:
              0.13591754 = queryWeight, product of:
                1.024168 = boost
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.02291996 = queryNorm
              0.6397282 = fieldWeight in 6579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.078125 = fieldNorm(doc=6579)
          0.08181177 = weight(abstract_txt:capture in 6579) [ClassicSimilarity], result of:
            0.08181177 = score(doc=6579,freq=1.0), product of:
              0.16443036 = queryWeight, product of:
                1.1264825 = boost
                6.3685966 = idf(docFreq=202, maxDocs=43556)
                0.02291996 = queryNorm
              0.4975466 = fieldWeight in 6579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3685966 = idf(docFreq=202, maxDocs=43556)
                0.078125 = fieldNorm(doc=6579)
          0.06783027 = weight(abstract_txt:bibliographic in 6579) [ClassicSimilarity], result of:
            0.06783027 = score(doc=6579,freq=2.0), product of:
              0.14511776 = queryWeight, product of:
                1.4966103 = boost
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.02291996 = queryNorm
              0.4674154 = fieldWeight in 6579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.078125 = fieldNorm(doc=6579)
          0.07084043 = weight(abstract_txt:technology in 6579) [ClassicSimilarity], result of:
            0.07084043 = score(doc=6579,freq=2.0), product of:
              0.14937995 = queryWeight, product of:
                1.5184294 = boost
                4.2922387 = idf(docFreq=1618, maxDocs=43556)
                0.02291996 = queryNorm
              0.47422987 = fieldWeight in 6579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2922387 = idf(docFreq=1618, maxDocs=43556)
                0.078125 = fieldNorm(doc=6579)
          0.24638174 = weight(abstract_txt:scanner in 6579) [ClassicSimilarity], result of:
            0.24638174 = score(doc=6579,freq=1.0), product of:
              0.34290767 = queryWeight, product of:
                1.6267545 = boost
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.02291996 = queryNorm
              0.7185075 = fieldWeight in 6579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.196897 = idf(docFreq=11, maxDocs=43556)
                0.078125 = fieldNorm(doc=6579)
        0.2 = coord(5/25)
    
  3. Electronic cataloging : AACR2 and metadata for serials and monographs (2003) 0.10
    0.10357593 = sum of:
      0.10357593 = product of:
        0.36991403 = sum of:
          0.03433948 = weight(abstract_txt:additional in 4080) [ClassicSimilarity], result of:
            0.03433948 = score(doc=4080,freq=1.0), product of:
              0.12957856 = queryWeight, product of:
                5.6535244 = idf(docFreq=414, maxDocs=43556)
                0.02291996 = queryNorm
              0.26500896 = fieldWeight in 4080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6535244 = idf(docFreq=414, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.03688988 = weight(abstract_txt:contents in 4080) [ClassicSimilarity], result of:
            0.03688988 = score(doc=4080,freq=1.0), product of:
              0.13591754 = queryWeight, product of:
                1.024168 = boost
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.02291996 = queryNorm
              0.27141368 = fieldWeight in 4080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.015895722 = weight(abstract_txt:using in 4080) [ClassicSimilarity], result of:
            0.015895722 = score(doc=4080,freq=1.0), product of:
              0.09769391 = queryWeight, product of:
                1.2279543 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.02291996 = queryNorm
              0.16270944 = fieldWeight in 4080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.06734662 = weight(abstract_txt:achieving in 4080) [ClassicSimilarity], result of:
            0.06734662 = score(doc=4080,freq=1.0), product of:
              0.20302424 = queryWeight, product of:
                1.2517205 = boost
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.02291996 = queryNorm
              0.33171713 = fieldWeight in 4080, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0766325 = idf(docFreq=99, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.09956021 = weight(abstract_txt:monographs in 4080) [ClassicSimilarity], result of:
            0.09956021 = score(doc=4080,freq=2.0), product of:
              0.20911469 = queryWeight, product of:
                1.2703568 = boost
                7.181993 = idf(docFreq=89, maxDocs=43556)
                0.02291996 = queryNorm
              0.47610337 = fieldWeight in 4080, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.181993 = idf(docFreq=89, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.04984487 = weight(abstract_txt:bibliographic in 4080) [ClassicSimilarity], result of:
            0.04984487 = score(doc=4080,freq=3.0), product of:
              0.14511776 = queryWeight, product of:
                1.4966103 = boost
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.02291996 = queryNorm
              0.34347877 = fieldWeight in 4080, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2305613 = idf(docFreq=1721, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
          0.06603726 = weight(abstract_txt:electronic in 4080) [ClassicSimilarity], result of:
            0.06603726 = score(doc=4080,freq=5.0), product of:
              0.14764431 = queryWeight, product of:
                1.5095823 = boost
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.02291996 = queryNorm
              0.44727266 = fieldWeight in 4080, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.046875 = fieldNorm(doc=4080)
        0.28 = coord(7/25)
    
  4. Corthouts, J.; Philips, R.: SGML: a librarian's perception (1996) 0.10
    0.10302041 = sum of:
      0.10302041 = product of:
        0.64387757 = sum of:
          0.061483137 = weight(abstract_txt:contents in 5159) [ClassicSimilarity], result of:
            0.061483137 = score(doc=5159,freq=1.0), product of:
              0.13591754 = queryWeight, product of:
                1.024168 = boost
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.02291996 = queryNorm
              0.45235616 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7901587 = idf(docFreq=361, maxDocs=43556)
                0.078125 = fieldNorm(doc=5159)
          0.026492868 = weight(abstract_txt:using in 5159) [ClassicSimilarity], result of:
            0.026492868 = score(doc=5159,freq=1.0), product of:
              0.09769391 = queryWeight, product of:
                1.2279543 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.02291996 = queryNorm
              0.2711824 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.078125 = fieldNorm(doc=5159)
          0.08525374 = weight(abstract_txt:electronic in 5159) [ClassicSimilarity], result of:
            0.08525374 = score(doc=5159,freq=3.0), product of:
              0.14764431 = queryWeight, product of:
                1.5095823 = boost
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.02291996 = queryNorm
              0.5774265 = fieldWeight in 5159, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.078125 = fieldNorm(doc=5159)
          0.4706478 = weight(abstract_txt:sgml in 5159) [ClassicSimilarity], result of:
            0.4706478 = score(doc=5159,freq=6.0), product of:
              0.3660399 = queryWeight, product of:
                2.3769095 = boost
                6.718958 = idf(docFreq=142, maxDocs=43556)
                0.02291996 = queryNorm
              1.2857828 = fieldWeight in 5159, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.718958 = idf(docFreq=142, maxDocs=43556)
                0.078125 = fieldNorm(doc=5159)
        0.16 = coord(4/25)
    
  5. The electronic Vatican Library (1994) 0.10
    0.10278626 = sum of:
      0.10278626 = product of:
        0.5139313 = sum of:
          0.07135297 = weight(abstract_txt:carried in 371) [ClassicSimilarity], result of:
            0.07135297 = score(doc=371,freq=1.0), product of:
              0.13292053 = queryWeight, product of:
                1.0128134 = boost
                5.7259655 = idf(docFreq=385, maxDocs=43556)
                0.02291996 = queryNorm
              0.53680927 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7259655 = idf(docFreq=385, maxDocs=43556)
                0.09375 = fieldNorm(doc=371)
          0.103575096 = weight(abstract_txt:pilot in 371) [ClassicSimilarity], result of:
            0.103575096 = score(doc=371,freq=1.0), product of:
              0.17040706 = queryWeight, product of:
                1.1467724 = boost
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.02291996 = queryNorm
              0.6078099 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.483306 = idf(docFreq=180, maxDocs=43556)
                0.09375 = fieldNorm(doc=371)
          0.031791445 = weight(abstract_txt:using in 371) [ClassicSimilarity], result of:
            0.031791445 = score(doc=371,freq=1.0), product of:
              0.09769391 = queryWeight, product of:
                1.2279543 = boost
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.02291996 = queryNorm
              0.3254189 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4711347 = idf(docFreq=3679, maxDocs=43556)
                0.09375 = fieldNorm(doc=371)
          0.08353125 = weight(abstract_txt:electronic in 371) [ClassicSimilarity], result of:
            0.08353125 = score(doc=371,freq=2.0), product of:
              0.14764431 = queryWeight, product of:
                1.5095823 = boost
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.02291996 = queryNorm
              0.5657601 = fieldWeight in 371, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.26723 = idf(docFreq=1659, maxDocs=43556)
                0.09375 = fieldNorm(doc=371)
          0.22368051 = weight(abstract_txt:feasibility in 371) [ClassicSimilarity], result of:
            0.22368051 = score(doc=371,freq=1.0), product of:
              0.35871217 = queryWeight, product of:
                2.3529975 = boost
                6.651365 = idf(docFreq=152, maxDocs=43556)
                0.02291996 = queryNorm
              0.62356544 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.651365 = idf(docFreq=152, maxDocs=43556)
                0.09375 = fieldNorm(doc=371)
        0.2 = coord(5/25)
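
The idf and tf figures in these trees appear consistent with idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). That reading is inferred from the numbers listed here rather than taken from the catalog's configuration, so the sketch below should be treated as a check under that assumption. The function names `classic_idf` and `classic_tf` are hypothetical. The example rebuilds the `sgml` term weight for doc 1890 and then the document total for the first content match.

```python
import math

MAX_DOCS = 43556          # maxDocs shown in every idf(...) line
QUERY_NORM = 0.02291996   # queryNorm shown in the content-similarity trees

def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    """Assumed tf form: square root of the raw term frequency."""
    return math.sqrt(freq)

# 'sgml' in doc 1890: docFreq=142, freq=5.0, boost=2.3769095, fieldNorm=0.09375
idf_sgml = classic_idf(142)
tf_sgml = classic_tf(5.0)
field_weight = tf_sgml * idf_sgml * 0.09375
query_weight = 2.3769095 * idf_sgml * QUERY_NORM
sgml_score = query_weight * field_weight

# Remaining four term scores are copied from the listing for doc 1890
term_scores = [0.09533143, 0.103575096, 0.031791445, 0.09968974, sgml_score]
doc_total = (5 / 25) * sum(term_scores)   # coord(5/25): 5 of 25 query terms matched

print(f"idf={idf_sgml:.4f} tf={tf_sgml:.4f} term={sgml_score:.4f} doc={doc_total:.4f}")
```

The rare, boosted term `sgml` supplies most of the sum for this entry, while a common term such as `using` (docFreq=3679) contributes very little because its idf is low.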