Document (#16029)

Author
Alexander, M.
Title
Retrieving digital data with fuzzy matching
Source
New library world. 97(1996) no.1131, S.28-31
Year
1996
Abstract
Briefly describes the Excalibur EFS system which makes use of adaptive pattern recognition technology as an aid to automatic indexing and how it is being tested at the British Library for the indexing and retrieval of scanned images from the library's holdings. Notes how Excalibur EFS can support a wide degree of fuzzy searching, compensate for the errors produced by OCR conversion of scanned images, reduce the costs of indexing, and require far less storage space than more traditional indexes
Theme
Automatisches Indexieren
Object
Excalibur EFS

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.57
    5.568259 = sum of:
      5.568259 = weight(author_txt:alexander in 1977) [ClassicSimilarity], result of:
        5.568259 = fieldWeight in 1977, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.909214 = idf(docFreq=15, maxDocs=43556)
          0.625 = fieldNorm(doc=1977)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.57
    5.568259 = sum of:
      5.568259 = weight(author_txt:alexander in 149) [ClassicSimilarity], result of:
        5.568259 = fieldWeight in 149, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.909214 = idf(docFreq=15, maxDocs=43556)
          0.625 = fieldNorm(doc=149)
    
  3. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.57
    5.568259 = sum of:
      5.568259 = weight(author_txt:alexander in 3425) [ClassicSimilarity], result of:
        5.568259 = fieldWeight in 3425, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.909214 = idf(docFreq=15, maxDocs=43556)
          0.625 = fieldNorm(doc=3425)
    
  4. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.57
    5.568259 = sum of:
      5.568259 = weight(author_txt:alexander in 4684) [ClassicSimilarity], result of:
        5.568259 = fieldWeight in 4684, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.909214 = idf(docFreq=15, maxDocs=43556)
          0.625 = fieldNorm(doc=4684)
    
  5. Alexander, K.: Kompendium der visuellen Information und Kommunikation (2007) 5.57
    5.568259 = sum of:
      5.568259 = weight(author_txt:alexander in 2645) [ClassicSimilarity], result of:
        5.568259 = fieldWeight in 2645, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.909214 = idf(docFreq=15, maxDocs=43556)
          0.625 = fieldNorm(doc=2645)
    

Similar documents (content)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.76
    0.7645181 = sum of:
      0.7645181 = product of:
        1.9112952 = sum of:
          0.096672595 = weight(abstract_txt:indexes in 1977) [ClassicSimilarity], result of:
            0.096672595 = score(doc=1977,freq=2.0), product of:
              0.109149456 = queryWeight, product of:
                1.0084625 = boost
                5.7259655 = idf(docFreq=385, maxDocs=43556)
                0.018902231 = queryNorm
              0.8856901 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7259655 = idf(docFreq=385, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.07589919 = weight(abstract_txt:british in 1977) [ClassicSimilarity], result of:
            0.07589919 = score(doc=1977,freq=1.0), product of:
              0.11703635 = queryWeight, product of:
                1.0442618 = boost
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.018902231 = queryNorm
              0.64850956 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.11877307 = weight(abstract_txt:recognition in 1977) [ClassicSimilarity], result of:
            0.11877307 = score(doc=1977,freq=2.0), product of:
              0.12520778 = queryWeight, product of:
                1.0801017 = boost
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.018902231 = queryNorm
              0.94860774 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.12586626 = weight(abstract_txt:pattern in 1977) [ClassicSimilarity], result of:
            0.12586626 = score(doc=1977,freq=2.0), product of:
              0.13014442 = queryWeight, product of:
                1.1011888 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.018902231 = queryNorm
              0.9671277 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.172539 = weight(abstract_txt:adaptive in 1977) [ClassicSimilarity], result of:
            0.172539 = score(doc=1977,freq=2.0), product of:
              0.1605995 = queryWeight, product of:
                1.2232666 = boost
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.018902231 = queryNorm
              1.0743433 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.11609242 = weight(abstract_txt:images in 1977) [ClassicSimilarity], result of:
            0.11609242 = score(doc=1977,freq=1.0), product of:
              0.19575307 = queryWeight, product of:
                1.9099337 = boost
                5.422221 = idf(docFreq=522, maxDocs=43556)
                0.018902231 = queryNorm
              0.5930554 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.422221 = idf(docFreq=522, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.08983915 = weight(abstract_txt:indexing in 1977) [ClassicSimilarity], result of:
            0.08983915 = score(doc=1977,freq=1.0), product of:
              0.18887748 = queryWeight, product of:
                2.2977338 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.018902231 = queryNorm
              0.47564778 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.23200157 = weight(abstract_txt:fuzzy in 1977) [ClassicSimilarity], result of:
            0.23200157 = score(doc=1977,freq=1.0), product of:
              0.3105751 = queryWeight, product of:
                2.4057324 = boost
                6.8297725 = idf(docFreq=127, maxDocs=43556)
                0.018902231 = queryNorm
              0.74700636 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8297725 = idf(docFreq=127, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.37907347 = weight(abstract_txt:scanned in 1977) [ClassicSimilarity], result of:
            0.37907347 = score(doc=1977,freq=1.0), product of:
              0.43084553 = queryWeight, product of:
                2.8335104 = boost
                8.044216 = idf(docFreq=37, maxDocs=43556)
                0.018902231 = queryNorm
              0.87983614 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.044216 = idf(docFreq=37, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
          0.5045385 = weight(abstract_txt:excalibur in 1977) [ClassicSimilarity], result of:
            0.5045385 = score(doc=1977,freq=1.0), product of:
              0.52131736 = queryWeight, product of:
                3.1168442 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.018902231 = queryNorm
              0.96781445 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.109375 = fieldNorm(doc=1977)
        0.4 = coord(10/25)
    
  2. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 0.23
    0.23223908 = sum of:
      0.23223908 = product of:
        0.9676629 = sum of:
          0.07267566 = weight(abstract_txt:storage in 4684) [ClassicSimilarity], result of:
            0.07267566 = score(doc=4684,freq=1.0), product of:
              0.113698654 = queryWeight, product of:
                1.0292637 = boost
                5.8440723 = idf(docFreq=342, maxDocs=43556)
                0.018902231 = queryNorm
              0.63919544 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8440723 = idf(docFreq=342, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
          0.07589919 = weight(abstract_txt:british in 4684) [ClassicSimilarity], result of:
            0.07589919 = score(doc=4684,freq=1.0), product of:
              0.11703635 = queryWeight, product of:
                1.0442618 = boost
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.018902231 = queryNorm
              0.64850956 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.92923 = idf(docFreq=314, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
          0.11447187 = weight(abstract_txt:library's in 4684) [ClassicSimilarity], result of:
            0.11447187 = score(doc=4684,freq=2.0), product of:
              0.122166425 = queryWeight, product of:
                1.066903 = boost
                6.057785 = idf(docFreq=276, maxDocs=43556)
                0.018902231 = queryNorm
              0.9370158 = fieldWeight in 4684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.057785 = idf(docFreq=276, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
          0.08398524 = weight(abstract_txt:recognition in 4684) [ClassicSimilarity], result of:
            0.08398524 = score(doc=4684,freq=1.0), product of:
              0.12520778 = queryWeight, product of:
                1.0801017 = boost
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.018902231 = queryNorm
              0.67076695 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
          0.11609242 = weight(abstract_txt:images in 4684) [ClassicSimilarity], result of:
            0.11609242 = score(doc=4684,freq=1.0), product of:
              0.19575307 = queryWeight, product of:
                1.9099337 = boost
                5.422221 = idf(docFreq=522, maxDocs=43556)
                0.018902231 = queryNorm
              0.5930554 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.422221 = idf(docFreq=522, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
          0.5045385 = weight(abstract_txt:excalibur in 4684) [ClassicSimilarity], result of:
            0.5045385 = score(doc=4684,freq=1.0), product of:
              0.52131736 = queryWeight, product of:
                3.1168442 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.018902231 = queryNorm
              0.96781445 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.109375 = fieldNorm(doc=4684)
        0.24 = coord(6/25)
    
  3. Townsend, J.: Multimedia - myth or reality? (1994) 0.21
    0.20533518 = sum of:
      0.20533518 = product of:
        0.8555633 = sum of:
          0.061649613 = weight(abstract_txt:briefly in 726) [ClassicSimilarity], result of:
            0.061649613 = score(doc=726,freq=1.0), product of:
              0.11291391 = queryWeight, product of:
                1.0257056 = boost
                5.8238697 = idf(docFreq=349, maxDocs=43556)
                0.018902231 = queryNorm
              0.5459878 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8238697 = idf(docFreq=349, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
          0.07198735 = weight(abstract_txt:recognition in 726) [ClassicSimilarity], result of:
            0.07198735 = score(doc=726,freq=1.0), product of:
              0.12520778 = queryWeight, product of:
                1.0801017 = boost
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.018902231 = queryNorm
              0.5749431 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
          0.10788537 = weight(abstract_txt:pattern in 726) [ClassicSimilarity], result of:
            0.10788537 = score(doc=726,freq=2.0), product of:
              0.13014442 = queryWeight, product of:
                1.1011888 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.018902231 = queryNorm
              0.82896656 = fieldWeight in 726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
          0.10457443 = weight(abstract_txt:adaptive in 726) [ClassicSimilarity], result of:
            0.10457443 = score(doc=726,freq=1.0), product of:
              0.1605995 = queryWeight, product of:
                1.2232666 = boost
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.018902231 = queryNorm
              0.6511504 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
          0.07700499 = weight(abstract_txt:indexing in 726) [ClassicSimilarity], result of:
            0.07700499 = score(doc=726,freq=1.0), product of:
              0.18887748 = queryWeight, product of:
                2.2977338 = boost
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.018902231 = queryNorm
              0.4076981 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3487797 = idf(docFreq=1529, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
          0.43246153 = weight(abstract_txt:excalibur in 726) [ClassicSimilarity], result of:
            0.43246153 = score(doc=726,freq=1.0), product of:
              0.52131736 = queryWeight, product of:
                3.1168442 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.018902231 = queryNorm
              0.8295552 = fieldWeight in 726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.09375 = fieldNorm(doc=726)
        0.24 = coord(6/25)
    
  4. Picture content retrieval (1996) 0.20
    0.19918919 = sum of:
      0.19918919 = product of:
        0.99594593 = sum of:
          0.082199484 = weight(abstract_txt:briefly in 42) [ClassicSimilarity], result of:
            0.082199484 = score(doc=42,freq=1.0), product of:
              0.11291391 = queryWeight, product of:
                1.0257056 = boost
                5.8238697 = idf(docFreq=349, maxDocs=43556)
                0.018902231 = queryNorm
              0.7279837 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8238697 = idf(docFreq=349, maxDocs=43556)
                0.125 = fieldNorm(doc=42)
          0.09598314 = weight(abstract_txt:recognition in 42) [ClassicSimilarity], result of:
            0.09598314 = score(doc=42,freq=1.0), product of:
              0.12520778 = queryWeight, product of:
                1.0801017 = boost
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.018902231 = queryNorm
              0.76659083 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.125 = fieldNorm(doc=42)
          0.1017153 = weight(abstract_txt:pattern in 42) [ClassicSimilarity], result of:
            0.1017153 = score(doc=42,freq=1.0), product of:
              0.13014442 = queryWeight, product of:
                1.1011888 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.018902231 = queryNorm
              0.78155714 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.125 = fieldNorm(doc=42)
          0.13943258 = weight(abstract_txt:adaptive in 42) [ClassicSimilarity], result of:
            0.13943258 = score(doc=42,freq=1.0), product of:
              0.1605995 = queryWeight, product of:
                1.2232666 = boost
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.018902231 = queryNorm
              0.86820054 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.125 = fieldNorm(doc=42)
          0.5766154 = weight(abstract_txt:excalibur in 42) [ClassicSimilarity], result of:
            0.5766154 = score(doc=42,freq=1.0), product of:
              0.52131736 = queryWeight, product of:
                3.1168442 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.018902231 = queryNorm
              1.1060736 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.125 = fieldNorm(doc=42)
        0.2 = coord(5/25)
    
  5. Brown, S.: Developments in information retrieval systems : RetrievalWare from Excalibur (1996) 0.18
    0.18274927 = sum of:
      0.18274927 = product of:
        1.142183 = sum of:
          0.11997892 = weight(abstract_txt:recognition in 5018) [ClassicSimilarity], result of:
            0.11997892 = score(doc=5018,freq=1.0), product of:
              0.12520778 = queryWeight, product of:
                1.0801017 = boost
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.018902231 = queryNorm
              0.95823854 = fieldWeight in 5018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1327267 = idf(docFreq=256, maxDocs=43556)
                0.15625 = fieldNorm(doc=5018)
          0.12714413 = weight(abstract_txt:pattern in 5018) [ClassicSimilarity], result of:
            0.12714413 = score(doc=5018,freq=1.0), product of:
              0.13014442 = queryWeight, product of:
                1.1011888 = boost
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.018902231 = queryNorm
              0.9769464 = fieldWeight in 5018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.252457 = idf(docFreq=227, maxDocs=43556)
                0.15625 = fieldNorm(doc=5018)
          0.1742907 = weight(abstract_txt:adaptive in 5018) [ClassicSimilarity], result of:
            0.1742907 = score(doc=5018,freq=1.0), product of:
              0.1605995 = queryWeight, product of:
                1.2232666 = boost
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.018902231 = queryNorm
              1.0852506 = fieldWeight in 5018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9456043 = idf(docFreq=113, maxDocs=43556)
                0.15625 = fieldNorm(doc=5018)
          0.72076917 = weight(abstract_txt:excalibur in 5018) [ClassicSimilarity], result of:
            0.72076917 = score(doc=5018,freq=1.0), product of:
              0.52131736 = queryWeight, product of:
                3.1168442 = boost
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.018902231 = queryNorm
              1.382592 = fieldWeight in 5018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.848589 = idf(docFreq=16, maxDocs=43556)
                0.15625 = fieldNorm(doc=5018)
        0.16 = coord(4/25)