Document (#7844)

Author
Greenrich, E.
Title
CD-ROM data preparation enhancements
Source
Proceedings of the 14th National Online Meeting 1993, New York, 4-6 May 1993. Ed.: M.E. Williams
Imprint
Medford, NJ : Learned Information
Year
1993
Pages
S.159-163
Abstract
Describes a number of improvements to the process of data preparation for the production of CD-ROM databases: imaging, optical character recognition (OCR) for data vcapture and input; automatic indexing (machine aided indexing); field tagging; and search performance enhancing features (data compression and encoding)

Similar documents (content)

  1. Broadhurst, R.: ¬The digitisation of library material (1993) 0.21
    0.20518462 = sum of:
      0.20518462 = product of:
        0.85493594 = sum of:
          0.045617584 = weight(abstract_txt:describes in 6256) [ClassicSimilarity], result of:
            0.045617584 = score(doc=6256,freq=2.0), product of:
              0.067481324 = queryWeight, product of:
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.017646553 = queryNorm
              0.6760031 = fieldWeight in 6256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.03834746 = weight(abstract_txt:process in 6256) [ClassicSimilarity], result of:
            0.03834746 = score(doc=6256,freq=1.0), product of:
              0.075729154 = queryWeight, product of:
                1.0593507 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.017646553 = queryNorm
              0.50637645 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.13228182 = weight(abstract_txt:recognition in 6256) [ClassicSimilarity], result of:
            0.13228182 = score(doc=6256,freq=1.0), product of:
              0.17289092 = queryWeight, product of:
                1.6006423 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.017646553 = queryNorm
              0.7651173 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.15951818 = weight(abstract_txt:character in 6256) [ClassicSimilarity], result of:
            0.15951818 = score(doc=6256,freq=1.0), product of:
              0.19587493 = queryWeight, product of:
                1.7037177 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.017646553 = queryNorm
              0.814388 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.23055181 = weight(abstract_txt:optical in 6256) [ClassicSimilarity], result of:
            0.23055181 = score(doc=6256,freq=1.0), product of:
              0.25039044 = queryWeight, product of:
                1.9262697 = boost
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.017646553 = queryNorm
              0.9207692 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.24861906 = weight(abstract_txt:imaging in 6256) [ClassicSimilarity], result of:
            0.24861906 = score(doc=6256,freq=1.0), product of:
              0.26330656 = queryWeight, product of:
                1.9753273 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.017646553 = queryNorm
              0.94421905 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
        0.24 = coord(6/25)
    
  2. Guenette, D.R.: Document imaging, CD-ROM, and CD-R : a starting point (1996) 0.18
    0.1794749 = sum of:
      0.1794749 = product of:
        0.74781215 = sum of:
          0.024192378 = weight(abstract_txt:describes in 4986) [ClassicSimilarity], result of:
            0.024192378 = score(doc=4986,freq=1.0), product of:
              0.067481324 = queryWeight, product of:
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.017646553 = queryNorm
              0.3585048 = fieldWeight in 4986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
          0.028760593 = weight(abstract_txt:process in 4986) [ClassicSimilarity], result of:
            0.028760593 = score(doc=4986,freq=1.0), product of:
              0.075729154 = queryWeight, product of:
                1.0593507 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.017646553 = queryNorm
              0.37978232 = fieldWeight in 4986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
          0.3729286 = weight(abstract_txt:imaging in 4986) [ClassicSimilarity], result of:
            0.3729286 = score(doc=4986,freq=4.0), product of:
              0.26330656 = queryWeight, product of:
                1.9753273 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.017646553 = queryNorm
              1.4163285 = fieldWeight in 4986, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
          0.1864643 = weight(abstract_txt:compression in 4986) [ClassicSimilarity], result of:
            0.1864643 = score(doc=4986,freq=1.0), product of:
              0.26330656 = queryWeight, product of:
                1.9753273 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.017646553 = queryNorm
              0.7081643 = fieldWeight in 4986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
          0.071200274 = weight(abstract_txt:indexing in 4986) [ClassicSimilarity], result of:
            0.071200274 = score(doc=4986,freq=1.0), product of:
              0.17460728 = queryWeight, product of:
                2.2748585 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017646553 = queryNorm
              0.40777382 = fieldWeight in 4986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
          0.06426602 = weight(abstract_txt:data in 4986) [ClassicSimilarity], result of:
            0.06426602 = score(doc=4986,freq=1.0), product of:
              0.20546545 = queryWeight, product of:
                3.489857 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017646553 = queryNorm
              0.31278262 = fieldWeight in 4986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=4986)
        0.24 = coord(6/25)
    
  3. Silvester, J.P.; Genuardi, M.T.; Klingbiel, P.H.: Machine-aided indexing at NASA (1994) 0.17
    0.17357546 = sum of:
      0.17357546 = product of:
        0.7232311 = sum of:
          0.024192378 = weight(abstract_txt:describes in 8503) [ClassicSimilarity], result of:
            0.024192378 = score(doc=8503,freq=1.0), product of:
              0.067481324 = queryWeight, product of:
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.017646553 = queryNorm
              0.3585048 = fieldWeight in 8503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
          0.08998282 = weight(abstract_txt:machine in 8503) [ClassicSimilarity], result of:
            0.08998282 = score(doc=8503,freq=2.0), product of:
              0.12857631 = queryWeight, product of:
                1.3803483 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.017646553 = queryNorm
              0.69983983 = fieldWeight in 8503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
          0.08186182 = weight(abstract_txt:production in 8503) [ClassicSimilarity], result of:
            0.08186182 = score(doc=8503,freq=1.0), product of:
              0.15209636 = queryWeight, product of:
                1.5013005 = boost
                5.74105 = idf(docFreq=385, maxDocs=44218)
                0.017646553 = queryNorm
              0.5382234 = fieldWeight in 8503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.74105 = idf(docFreq=385, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
          0.14004624 = weight(abstract_txt:input in 8503) [ClassicSimilarity], result of:
            0.14004624 = score(doc=8503,freq=2.0), product of:
              0.17267741 = queryWeight, product of:
                1.5996537 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.017646553 = queryNorm
              0.8110281 = fieldWeight in 8503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
          0.28645545 = weight(abstract_txt:aided in 8503) [ClassicSimilarity], result of:
            0.28645545 = score(doc=8503,freq=2.0), product of:
              0.27824408 = queryWeight, product of:
                2.030585 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017646553 = queryNorm
              1.0295115 = fieldWeight in 8503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
          0.1006924 = weight(abstract_txt:indexing in 8503) [ClassicSimilarity], result of:
            0.1006924 = score(doc=8503,freq=2.0), product of:
              0.17460728 = queryWeight, product of:
                2.2748585 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017646553 = queryNorm
              0.5766793 = fieldWeight in 8503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=8503)
        0.24 = coord(6/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.16
    0.16371635 = sum of:
      0.16371635 = product of:
        0.68215144 = sum of:
          0.060675588 = weight(abstract_txt:automatic in 208) [ClassicSimilarity], result of:
            0.060675588 = score(doc=208,freq=1.0), product of:
              0.1245682 = queryWeight, product of:
                1.3586632 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017646553 = queryNorm
              0.48708728 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.06362746 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.06362746 = score(doc=208,freq=1.0), product of:
              0.12857631 = queryWeight, product of:
                1.3803483 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.017646553 = queryNorm
              0.49486148 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.08186182 = weight(abstract_txt:production in 208) [ClassicSimilarity], result of:
            0.08186182 = score(doc=208,freq=1.0), product of:
              0.15209636 = queryWeight, product of:
                1.5013005 = boost
                5.74105 = idf(docFreq=385, maxDocs=44218)
                0.017646553 = queryNorm
              0.5382234 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.74105 = idf(docFreq=385, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.09902765 = weight(abstract_txt:input in 208) [ClassicSimilarity], result of:
            0.09902765 = score(doc=208,freq=1.0), product of:
              0.17267741 = queryWeight, product of:
                1.5996537 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.017646553 = queryNorm
              0.5734835 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.2025546 = weight(abstract_txt:aided in 208) [ClassicSimilarity], result of:
            0.2025546 = score(doc=208,freq=1.0), product of:
              0.27824408 = queryWeight, product of:
                2.030585 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017646553 = queryNorm
              0.72797453 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
          0.17440435 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.17440435 = score(doc=208,freq=6.0), product of:
              0.17460728 = queryWeight, product of:
                2.2748585 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017646553 = queryNorm
              0.9988378 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=208)
        0.24 = coord(6/25)
    
  5. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.14
    0.13962741 = sum of:
      0.13962741 = product of:
        0.69813704 = sum of:
          0.037173737 = weight(abstract_txt:databases in 3092) [ClassicSimilarity], result of:
            0.037173737 = score(doc=3092,freq=1.0), product of:
              0.08985771 = queryWeight, product of:
                1.1539471 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.017646553 = queryNorm
              0.41369557 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.105093196 = weight(abstract_txt:automatic in 3092) [ClassicSimilarity], result of:
            0.105093196 = score(doc=3092,freq=3.0), product of:
              0.1245682 = queryWeight, product of:
                1.3586632 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.017646553 = queryNorm
              0.8436599 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.11020599 = weight(abstract_txt:machine in 3092) [ClassicSimilarity], result of:
            0.11020599 = score(doc=3092,freq=3.0), product of:
              0.12857631 = queryWeight, product of:
                1.3803483 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.017646553 = queryNorm
              0.85712516 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.28645545 = weight(abstract_txt:aided in 3092) [ClassicSimilarity], result of:
            0.28645545 = score(doc=3092,freq=2.0), product of:
              0.27824408 = queryWeight, product of:
                2.030585 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017646553 = queryNorm
              1.0295115 = fieldWeight in 3092, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.15920866 = weight(abstract_txt:indexing in 3092) [ClassicSimilarity], result of:
            0.15920866 = score(doc=3092,freq=5.0), product of:
              0.17460728 = queryWeight, product of:
                2.2748585 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.017646553 = queryNorm
              0.91181 = fieldWeight in 3092, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
        0.2 = coord(5/25)