Document (#16032)

Author
Alexander, M.
Title
Retrieving digital data with fuzzy matching
Source
New library world. 97(1996) no.1131, pp.28-31
Year
1996
Abstract
Briefly describes the Excalibur EFS system, which uses adaptive pattern recognition technology as an aid to automatic indexing, and how it is being tested at the British Library for indexing and retrieving scanned images from the library's holdings. Notes that Excalibur EFS can support a wide range of fuzzy searching, compensate for errors produced by OCR conversion of scanned images, reduce indexing costs, and require far less storage space than more traditional indexes.
Theme
Automatic indexing
Object
Excalibur EFS

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:alexander in 1980) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 1980, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=1980)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:alexander in 152) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 152, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=152)
    
  3. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:alexander in 3428) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 3428, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=3428)
    
  4. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:alexander in 4687) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 4687, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=4687)
    
  5. Alexander, K.: Kompendium der visuellen Information und Kommunikation (2007) 5.60
    5.5967755 = sum of:
      5.5967755 = weight(author_txt:alexander in 2648) [ClassicSimilarity], result of:
        5.5967755 = fieldWeight in 2648, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.954841 = idf(docFreq=14, maxDocs=42740)
          0.625 = fieldNorm(doc=2648)
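
The author scores above are all identical because each hit matches the single term "alexander" once in a field with the same norm. As a minimal sketch of how such a fieldWeight could be recomputed, the Python below assumes the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures listed. The function names are illustrative and are not part of Lucene; fieldNorm is copied from the explain output rather than derived.

```python
import math

def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the author_txt:alexander entries above.
author_idf = classic_idf(doc_freq=14, max_docs=42740)
author_score = field_weight(freq=1.0, doc_freq=14, max_docs=42740, field_norm=0.625)
print(f"idf={author_idf:.4f} score={author_score:.4f}")
```

Since tf, idf, and fieldNorm are the same for all five hits, the author list cannot rank them against each other; the order shown is effectively arbitrary among ties.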
    

Similar documents (content)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.76
    0.7641202 = sum of:
      0.7641202 = product of:
        1.9103005 = sum of:
          0.09630497 = weight(abstract_txt:indexes in 1980) [ClassicSimilarity], result of:
            0.09630497 = score(doc=1980,freq=2.0), product of:
              0.108945765 = queryWeight, product of:
                1.0036305 = boost
                5.7148557 = idf(docFreq=382, maxDocs=42740)
                0.018994648 = queryNorm
              0.88397163 = fieldWeight in 1980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7148557 = idf(docFreq=382, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.075570755 = weight(abstract_txt:british in 1980) [ClassicSimilarity], result of:
            0.075570755 = score(doc=1980,freq=1.0), product of:
              0.116776936 = queryWeight, product of:
                1.0390757 = boost
                5.9166875 = idf(docFreq=312, maxDocs=42740)
                0.018994648 = queryNorm
              0.6471377 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9166875 = idf(docFreq=312, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.120944135 = weight(abstract_txt:recognition in 1980) [ClassicSimilarity], result of:
            0.120944135 = score(doc=1980,freq=2.0), product of:
              0.12681417 = queryWeight, product of:
                1.0828108 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.018994648 = queryNorm
              0.9537115 = fieldWeight in 1980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.12604694 = weight(abstract_txt:pattern in 1980) [ClassicSimilarity], result of:
            0.12604694 = score(doc=1980,freq=2.0), product of:
              0.13035654 = queryWeight, product of:
                1.0978299 = boost
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.018994648 = queryNorm
              0.96694 = fieldWeight in 1980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.17147884 = weight(abstract_txt:adaptive in 1980) [ClassicSimilarity], result of:
            0.17147884 = score(doc=1980,freq=2.0), product of:
              0.1600485 = queryWeight, product of:
                1.2164506 = boost
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.018994648 = queryNorm
              1.0714179 = fieldWeight in 1980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.11585283 = weight(abstract_txt:images in 1980) [ClassicSimilarity], result of:
            0.11585283 = score(doc=1980,freq=1.0), product of:
              0.19561508 = queryWeight, product of:
                1.9018875 = boost
                5.414848 = idf(docFreq=516, maxDocs=42740)
                0.018994648 = queryNorm
              0.592249 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.414848 = idf(docFreq=516, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.08949988 = weight(abstract_txt:indexing in 1980) [ClassicSimilarity], result of:
            0.08949988 = score(doc=1980,freq=1.0), product of:
              0.18852833 = queryWeight, product of:
                2.2867444 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.018994648 = queryNorm
              0.4747291 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.23134139 = weight(abstract_txt:fuzzy in 1980) [ClassicSimilarity], result of:
            0.23134139 = score(doc=1980,freq=1.0), product of:
              0.31019405 = queryWeight, product of:
                2.3949718 = boost
                6.8187037 = idf(docFreq=126, maxDocs=42740)
                0.018994648 = queryNorm
              0.7457957 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8187037 = idf(docFreq=126, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.3809384 = weight(abstract_txt:scanned in 1980) [ClassicSimilarity], result of:
            0.3809384 = score(doc=1980,freq=1.0), product of:
              0.4325481 = queryWeight, product of:
                2.8281398 = boost
                8.051972 = idf(docFreq=36, maxDocs=42740)
                0.018994648 = queryNorm
              0.8806845 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.051972 = idf(docFreq=36, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
          0.5023223 = weight(abstract_txt:excalibur in 1980) [ClassicSimilarity], result of:
            0.5023223 = score(doc=1980,freq=1.0), product of:
              0.52013916 = queryWeight, product of:
                3.1012976 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.018994648 = queryNorm
              0.965746 = fieldWeight in 1980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.109375 = fieldNorm(doc=1980)
        0.4 = coord(10/25)
    
  2. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 0.23
    0.23188284 = sum of:
      0.23188284 = product of:
        0.96617854 = sum of:
          0.07266442 = weight(abstract_txt:storage in 4687) [ClassicSimilarity], result of:
            0.07266442 = score(doc=4687,freq=1.0), product of:
              0.11376336 = queryWeight, product of:
                1.0255808 = boost
                5.8398447 = idf(docFreq=337, maxDocs=42740)
                0.018994648 = queryNorm
              0.638733 = fieldWeight in 4687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8398447 = idf(docFreq=337, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
          0.075570755 = weight(abstract_txt:british in 4687) [ClassicSimilarity], result of:
            0.075570755 = score(doc=4687,freq=1.0), product of:
              0.116776936 = queryWeight, product of:
                1.0390757 = boost
                5.9166875 = idf(docFreq=312, maxDocs=42740)
                0.018994648 = queryNorm
              0.6471377 = fieldWeight in 4687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9166875 = idf(docFreq=312, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
          0.11424779 = weight(abstract_txt:library's in 4687) [ClassicSimilarity], result of:
            0.11424779 = score(doc=4687,freq=2.0), product of:
              0.12208898 = queryWeight, product of:
                1.0624461 = boost
                6.0497622 = idf(docFreq=273, maxDocs=42740)
                0.018994648 = queryNorm
              0.9357748 = fieldWeight in 4687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0497622 = idf(docFreq=273, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
          0.08552042 = weight(abstract_txt:recognition in 4687) [ClassicSimilarity], result of:
            0.08552042 = score(doc=4687,freq=1.0), product of:
              0.12681417 = queryWeight, product of:
                1.0828108 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.018994648 = queryNorm
              0.6743759 = fieldWeight in 4687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
          0.11585283 = weight(abstract_txt:images in 4687) [ClassicSimilarity], result of:
            0.11585283 = score(doc=4687,freq=1.0), product of:
              0.19561508 = queryWeight, product of:
                1.9018875 = boost
                5.414848 = idf(docFreq=516, maxDocs=42740)
                0.018994648 = queryNorm
              0.592249 = fieldWeight in 4687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.414848 = idf(docFreq=516, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
          0.5023223 = weight(abstract_txt:excalibur in 4687) [ClassicSimilarity], result of:
            0.5023223 = score(doc=4687,freq=1.0), product of:
              0.52013916 = queryWeight, product of:
                3.1012976 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.018994648 = queryNorm
              0.965746 = fieldWeight in 4687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.109375 = fieldNorm(doc=4687)
        0.24 = coord(6/25)
    
  3. Townsend, J.: Multimedia - myth or reality? (1994) 0.20
    0.20498154 = sum of:
      0.20498154 = product of:
        0.85408974 = sum of:
          0.061538294 = weight(abstract_txt:briefly in 729) [ClassicSimilarity], result of:
            0.061538294 = score(doc=729,freq=1.0), product of:
              0.11285377 = queryWeight, product of:
                1.0214726 = boost
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.018994648 = queryNorm
              0.5452923 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
          0.073303215 = weight(abstract_txt:recognition in 729) [ClassicSimilarity], result of:
            0.073303215 = score(doc=729,freq=1.0), product of:
              0.12681417 = queryWeight, product of:
                1.0828108 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.018994648 = queryNorm
              0.5780365 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
          0.108040236 = weight(abstract_txt:pattern in 729) [ClassicSimilarity], result of:
            0.108040236 = score(doc=729,freq=2.0), product of:
              0.13035654 = queryWeight, product of:
                1.0978299 = boost
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.018994648 = queryNorm
              0.8288057 = fieldWeight in 729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
          0.10393187 = weight(abstract_txt:adaptive in 729) [ClassicSimilarity], result of:
            0.10393187 = score(doc=729,freq=1.0), product of:
              0.1600485 = queryWeight, product of:
                1.2164506 = boost
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.018994648 = queryNorm
              0.64937735 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
          0.07671419 = weight(abstract_txt:indexing in 729) [ClassicSimilarity], result of:
            0.07671419 = score(doc=729,freq=1.0), product of:
              0.18852833 = queryWeight, product of:
                2.2867444 = boost
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.018994648 = queryNorm
              0.40691066 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.34038 = idf(docFreq=1513, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
          0.43056196 = weight(abstract_txt:excalibur in 729) [ClassicSimilarity], result of:
            0.43056196 = score(doc=729,freq=1.0), product of:
              0.52013916 = queryWeight, product of:
                3.1012976 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.018994648 = queryNorm
              0.8277823 = fieldWeight in 729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.09375 = fieldNorm(doc=729)
        0.24 = coord(6/25)
    
  4. Picture content retrieval (1996) 0.20
    0.1988617 = sum of:
      0.1988617 = product of:
        0.9943085 = sum of:
          0.08205106 = weight(abstract_txt:briefly in 45) [ClassicSimilarity], result of:
            0.08205106 = score(doc=45,freq=1.0), product of:
              0.11285377 = queryWeight, product of:
                1.0214726 = boost
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.018994648 = queryNorm
              0.72705644 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.125 = fieldNorm(doc=45)
          0.097737625 = weight(abstract_txt:recognition in 45) [ClassicSimilarity], result of:
            0.097737625 = score(doc=45,freq=1.0), product of:
              0.12681417 = queryWeight, product of:
                1.0828108 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.018994648 = queryNorm
              0.7707153 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.125 = fieldNorm(doc=45)
          0.10186132 = weight(abstract_txt:pattern in 45) [ClassicSimilarity], result of:
            0.10186132 = score(doc=45,freq=1.0), product of:
              0.13035654 = queryWeight, product of:
                1.0978299 = boost
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.018994648 = queryNorm
              0.78140557 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.125 = fieldNorm(doc=45)
          0.13857584 = weight(abstract_txt:adaptive in 45) [ClassicSimilarity], result of:
            0.13857584 = score(doc=45,freq=1.0), product of:
              0.1600485 = queryWeight, product of:
                1.2164506 = boost
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.018994648 = queryNorm
              0.8658365 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.125 = fieldNorm(doc=45)
          0.5740826 = weight(abstract_txt:excalibur in 45) [ClassicSimilarity], result of:
            0.5740826 = score(doc=45,freq=1.0), product of:
              0.52013916 = queryWeight, product of:
                3.1012976 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.018994648 = queryNorm
              1.1037097 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.125 = fieldNorm(doc=45)
        0.2 = coord(5/25)
    
  5. Brown, S.: Developments in information retrieval systems : RetrievalWare from Excalibur (1996) 0.18
    0.18245147 = sum of:
      0.18245147 = product of:
        1.1403217 = sum of:
          0.12217203 = weight(abstract_txt:recognition in 5021) [ClassicSimilarity], result of:
            0.12217203 = score(doc=5021,freq=1.0), product of:
              0.12681417 = queryWeight, product of:
                1.0828108 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.018994648 = queryNorm
              0.9633941 = fieldWeight in 5021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.15625 = fieldNorm(doc=5021)
          0.12732665 = weight(abstract_txt:pattern in 5021) [ClassicSimilarity], result of:
            0.12732665 = score(doc=5021,freq=1.0), product of:
              0.13035654 = queryWeight, product of:
                1.0978299 = boost
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.018994648 = queryNorm
              0.97675693 = fieldWeight in 5021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2512445 = idf(docFreq=223, maxDocs=42740)
                0.15625 = fieldNorm(doc=5021)
          0.1732198 = weight(abstract_txt:adaptive in 5021) [ClassicSimilarity], result of:
            0.1732198 = score(doc=5021,freq=1.0), product of:
              0.1600485 = queryWeight, product of:
                1.2164506 = boost
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.018994648 = queryNorm
              1.0822957 = fieldWeight in 5021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926692 = idf(docFreq=113, maxDocs=42740)
                0.15625 = fieldNorm(doc=5021)
          0.71760327 = weight(abstract_txt:excalibur in 5021) [ClassicSimilarity], result of:
            0.71760327 = score(doc=5021,freq=1.0), product of:
              0.52013916 = queryWeight, product of:
                3.1012976 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.018994648 = queryNorm
              1.3796371 = fieldWeight in 5021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.15625 = fieldNorm(doc=5021)
        0.16 = coord(4/25)
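
The content scores combine several matching terms. As a hedged sketch, the Python below recomputes the last hit (doc 5021), assuming each term contributes queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and that the sum is scaled by coord = matched terms / query terms. The boost, queryNorm, and fieldNorm values are copied from the explain output, not derived; the helper names are illustrative only.

```python
import math

MAX_DOCS = 42740
QUERY_NORM = 0.018994648   # copied from the explain output
FIELD_NORM = 0.15625       # fieldNorm(doc=5021), copied from the explain output
QUERY_TERMS = 25           # denominator of coord(4/25)

def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf in the form that reproduces the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """One term's contribution: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight

# (term, boost, docFreq, termFreq) for the four terms matched in doc 5021.
matched = [
    ("recognition", 1.0828108, 243, 1.0),
    ("pattern",     1.0978299, 223, 1.0),
    ("adaptive",    1.2164506, 113, 1.0),
    ("excalibur",   3.1012976,  16, 1.0),
]

raw_sum = sum(term_score(boost, df, freq) for _, boost, df, freq in matched)
coord = len(matched) / QUERY_TERMS
content_score = raw_sum * coord
print(f"sum={raw_sum:.4f} coord={coord:.2f} score={content_score:.4f}")
```

The coord factor explains why this hit ranks last despite having the largest raw sum of the five: it matches only 4 of the 25 query terms, whereas the top hit matches 10.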