Document (#16031)

Author
Alexander, M.
Title
Retrieving digital data with fuzzy matching
Source
New library world. 97(1996) no.1131, p.28-31
Year
1996
Abstract
Briefly describes the Excalibur EFS system, which uses adaptive pattern recognition technology as an aid to automatic indexing, and how it is being tested at the British Library for indexing and retrieving scanned images from the library's holdings. Notes that Excalibur EFS can support a wide degree of fuzzy searching, compensate for the errors produced by OCR conversion of scanned images, reduce the costs of indexing, and require far less storage space than more traditional indexes.
Theme
Automatic indexing
Object
Excalibur EFS

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:alexander in 1911) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 1911, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=1911)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:alexander in 151) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 151, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=151)
    
  3. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:alexander in 2427) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 2427, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=2427)
    
  4. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:alexander in 3686) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 3686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=3686)
    
  5. Alexander, K.: Kompendium der visuellen Information und Kommunikation (2007) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:alexander in 647) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 647, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=647)
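
The author scores above all reduce to the same arithmetic: tf times idf times fieldNorm. The following is a minimal sketch of that calculation, assuming the standard Lucene ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and are not part of Lucene's API; the input values are copied from the explain output.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor as defined by ClassicSimilarity."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:alexander entries above.
score = field_weight(freq=1.0, doc_freq=15, max_docs=44218, field_norm=0.625)
print(f"{score:.4f}")  # should land close to the 5.5776863 reported above
```

Because the query has a single term and every matching record has the same termFreq and fieldNorm, all five author matches receive an identical score. Small differences in the last digits are expected, since Lucene computes in single precision.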
    

Similar documents (content)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.77
    0.7651445 = sum of:
      0.7651445 = product of:
        1.9128612 = sum of:
          0.0966969 = weight(abstract_txt:indexes in 1911) [ClassicSimilarity], result of:
            0.0966969 = score(doc=1911,freq=2.0), product of:
              0.109085925 = queryWeight, product of:
                1.0126743 = boost
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.018796984 = queryNorm
              0.8864287 = fieldWeight in 1911, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7307405 = idf(docFreq=389, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.07594365 = weight(abstract_txt:british in 1911) [ClassicSimilarity], result of:
            0.07594365 = score(doc=1911,freq=1.0), product of:
              0.11699429 = queryWeight, product of:
                1.0487399 = boost
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.018796984 = queryNorm
              0.64912266 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.117824145 = weight(abstract_txt:recognition in 1911) [ClassicSimilarity], result of:
            0.117824145 = score(doc=1911,freq=2.0), product of:
              0.12444666 = queryWeight, product of:
                1.0816259 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.018796984 = queryNorm
              0.9467843 = fieldWeight in 1911, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.1251857 = weight(abstract_txt:pattern in 1911) [ClassicSimilarity], result of:
            0.1251857 = score(doc=1911,freq=2.0), product of:
              0.12957767 = queryWeight, product of:
                1.1036987 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018796984 = queryNorm
              0.96610546 = fieldWeight in 1911, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.17197984 = weight(abstract_txt:adaptive in 1911) [ClassicSimilarity], result of:
            0.17197984 = score(doc=1911,freq=2.0), product of:
              0.16013223 = queryWeight, product of:
                1.2269442 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.018796984 = queryNorm
              1.0739864 = fieldWeight in 1911, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.116188906 = weight(abstract_txt:images in 1911) [ClassicSimilarity], result of:
            0.116188906 = score(doc=1911,freq=1.0), product of:
              0.19571471 = queryWeight, product of:
                1.9182808 = boost
                5.427791 = idf(docFreq=527, maxDocs=44218)
                0.018796984 = queryNorm
              0.59366465 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.427791 = idf(docFreq=527, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.089687265 = weight(abstract_txt:indexing in 1911) [ClassicSimilarity], result of:
            0.089687265 = score(doc=1911,freq=1.0), product of:
              0.18852313 = queryWeight, product of:
                2.305836 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.018796984 = queryNorm
              0.47573614 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.23301741 = weight(abstract_txt:fuzzy in 1911) [ClassicSimilarity], result of:
            0.23301741 = score(doc=1911,freq=1.0), product of:
              0.31124756 = queryWeight, product of:
                2.419098 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.018796984 = queryNorm
              0.7486562 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.38035354 = weight(abstract_txt:scanned in 1911) [ClassicSimilarity], result of:
            0.38035354 = score(doc=1911,freq=1.0), product of:
              0.4314913 = queryWeight, product of:
                2.8483047 = boost
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.018796984 = queryNorm
              0.88148606 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
          0.5059838 = weight(abstract_txt:excalibur in 1911) [ClassicSimilarity], result of:
            0.5059838 = score(doc=1911,freq=1.0), product of:
              0.52192104 = queryWeight, product of:
                3.1325848 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018796984 = queryNorm
              0.96946436 = fieldWeight in 1911, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.109375 = fieldNorm(doc=1911)
        0.4 = coord(10/25)
    
  2. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 0.23
    0.23239626 = sum of:
      0.23239626 = product of:
        0.96831775 = sum of:
          0.07242838 = weight(abstract_txt:storage in 3686) [ClassicSimilarity], result of:
            0.07242838 = score(doc=3686,freq=1.0), product of:
              0.11335558 = queryWeight, product of:
                1.0323023 = boost
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.018796984 = queryNorm
              0.63894856 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
          0.07594365 = weight(abstract_txt:british in 3686) [ClassicSimilarity], result of:
            0.07594365 = score(doc=3686,freq=1.0), product of:
              0.11699429 = queryWeight, product of:
                1.0487399 = boost
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.018796984 = queryNorm
              0.64912266 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.934836 = idf(docFreq=317, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
          0.114458755 = weight(abstract_txt:library's in 3686) [ClassicSimilarity], result of:
            0.114458755 = score(doc=3686,freq=2.0), product of:
              0.12206554 = queryWeight, product of:
                1.0712281 = boost
                6.0620975 = idf(docFreq=279, maxDocs=44218)
                0.018796984 = queryNorm
              0.9376828 = fieldWeight in 3686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0620975 = idf(docFreq=279, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
          0.083314255 = weight(abstract_txt:recognition in 3686) [ClassicSimilarity], result of:
            0.083314255 = score(doc=3686,freq=1.0), product of:
              0.12444666 = queryWeight, product of:
                1.0816259 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.018796984 = queryNorm
              0.66947764 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
          0.116188906 = weight(abstract_txt:images in 3686) [ClassicSimilarity], result of:
            0.116188906 = score(doc=3686,freq=1.0), product of:
              0.19571471 = queryWeight, product of:
                1.9182808 = boost
                5.427791 = idf(docFreq=527, maxDocs=44218)
                0.018796984 = queryNorm
              0.59366465 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.427791 = idf(docFreq=527, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
          0.5059838 = weight(abstract_txt:excalibur in 3686) [ClassicSimilarity], result of:
            0.5059838 = score(doc=3686,freq=1.0), product of:
              0.52192104 = queryWeight, product of:
                3.1325848 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018796984 = queryNorm
              0.96946436 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.109375 = fieldNorm(doc=3686)
        0.24 = coord(6/25)
    
  3. Townsend, J.: Multimedia - myth or reality? (1994) 0.21
    0.20525852 = sum of:
      0.20525852 = product of:
        0.85524386 = sum of:
          0.061718848 = weight(abstract_txt:briefly in 660) [ClassicSimilarity], result of:
            0.061718848 = score(doc=660,freq=1.0), product of:
              0.112913735 = queryWeight, product of:
                1.0302885 = boost
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.018796984 = queryNorm
              0.5466018 = fieldWeight in 660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
          0.07141222 = weight(abstract_txt:recognition in 660) [ClassicSimilarity], result of:
            0.07141222 = score(doc=660,freq=1.0), product of:
              0.12444666 = queryWeight, product of:
                1.0816259 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.018796984 = queryNorm
              0.573838 = fieldWeight in 660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
          0.10730202 = weight(abstract_txt:pattern in 660) [ClassicSimilarity], result of:
            0.10730202 = score(doc=660,freq=2.0), product of:
              0.12957767 = queryWeight, product of:
                1.1036987 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018796984 = queryNorm
              0.82809037 = fieldWeight in 660, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
          0.10423553 = weight(abstract_txt:adaptive in 660) [ClassicSimilarity], result of:
            0.10423553 = score(doc=660,freq=1.0), product of:
              0.16013223 = queryWeight, product of:
                1.2269442 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.018796984 = queryNorm
              0.6509341 = fieldWeight in 660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
          0.0768748 = weight(abstract_txt:indexing in 660) [ClassicSimilarity], result of:
            0.0768748 = score(doc=660,freq=1.0), product of:
              0.18852313 = queryWeight, product of:
                2.305836 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.018796984 = queryNorm
              0.40777382 = fieldWeight in 660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
          0.43370044 = weight(abstract_txt:excalibur in 660) [ClassicSimilarity], result of:
            0.43370044 = score(doc=660,freq=1.0), product of:
              0.52192104 = queryWeight, product of:
                3.1325848 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018796984 = queryNorm
              0.83096945 = fieldWeight in 660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.09375 = fieldNorm(doc=660)
        0.24 = coord(6/25)
    
  4. Picture content retrieval (1996) 0.20
    0.19918428 = sum of:
      0.19918428 = product of:
        0.9959214 = sum of:
          0.0822918 = weight(abstract_txt:briefly in 6975) [ClassicSimilarity], result of:
            0.0822918 = score(doc=6975,freq=1.0), product of:
              0.112913735 = queryWeight, product of:
                1.0302885 = boost
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.018796984 = queryNorm
              0.7288024 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.125 = fieldNorm(doc=6975)
          0.09521629 = weight(abstract_txt:recognition in 6975) [ClassicSimilarity], result of:
            0.09521629 = score(doc=6975,freq=1.0), product of:
              0.12444666 = queryWeight, product of:
                1.0816259 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.018796984 = queryNorm
              0.7651173 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.125 = fieldNorm(doc=6975)
          0.10116531 = weight(abstract_txt:pattern in 6975) [ClassicSimilarity], result of:
            0.10116531 = score(doc=6975,freq=1.0), product of:
              0.12957767 = queryWeight, product of:
                1.1036987 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018796984 = queryNorm
              0.7807311 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.125 = fieldNorm(doc=6975)
          0.1389807 = weight(abstract_txt:adaptive in 6975) [ClassicSimilarity], result of:
            0.1389807 = score(doc=6975,freq=1.0), product of:
              0.16013223 = queryWeight, product of:
                1.2269442 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.018796984 = queryNorm
              0.8679121 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.125 = fieldNorm(doc=6975)
          0.5782673 = weight(abstract_txt:excalibur in 6975) [ClassicSimilarity], result of:
            0.5782673 = score(doc=6975,freq=1.0), product of:
              0.52192104 = queryWeight, product of:
                3.1325848 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018796984 = queryNorm
              1.1079593 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.125 = fieldNorm(doc=6975)
        0.2 = coord(5/25)
    
  5. Brown, S.: Developments in information retrieval systems : RetrievalWare from Excalibur (1996) 0.18
    0.1827259 = sum of:
      0.1827259 = product of:
        1.1420369 = sum of:
          0.11902036 = weight(abstract_txt:recognition in 4952) [ClassicSimilarity], result of:
            0.11902036 = score(doc=4952,freq=1.0), product of:
              0.12444666 = queryWeight, product of:
                1.0816259 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.018796984 = queryNorm
              0.9563966 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.15625 = fieldNorm(doc=4952)
          0.12645665 = weight(abstract_txt:pattern in 4952) [ClassicSimilarity], result of:
            0.12645665 = score(doc=4952,freq=1.0), product of:
              0.12957767 = queryWeight, product of:
                1.1036987 = boost
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.018796984 = queryNorm
              0.9759139 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2458487 = idf(docFreq=232, maxDocs=44218)
                0.15625 = fieldNorm(doc=4952)
          0.17372587 = weight(abstract_txt:adaptive in 4952) [ClassicSimilarity], result of:
            0.17372587 = score(doc=4952,freq=1.0), product of:
              0.16013223 = queryWeight, product of:
                1.2269442 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.018796984 = queryNorm
              1.0848901 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.15625 = fieldNorm(doc=4952)
          0.72283405 = weight(abstract_txt:excalibur in 4952) [ClassicSimilarity], result of:
            0.72283405 = score(doc=4952,freq=1.0), product of:
              0.52192104 = queryWeight, product of:
                3.1325848 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.018796984 = queryNorm
              1.3849491 = fieldWeight in 4952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.15625 = fieldNorm(doc=4952)
        0.16 = coord(4/25)
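
The content scores combine several term scores and then scale the sum by a coordination factor. The following is a minimal sketch of that calculation for the fifth result (doc 4952), assuming the standard Lucene ClassicSimilarity definitions: queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and coord = matching terms / query terms. The helper names are illustrative and are not part of Lucene's API; boosts, document frequencies, and norms are copied from the explain output.

```python
import math

MAX_DOCS = 44218          # maxDocs in the explain output
QUERY_NORM = 0.018796984  # queryNorm, identical for every term of the query
FIELD_NORM = 0.15625      # fieldNorm(doc=4952)
COORD = 4 / 25            # 4 of the 25 query terms match doc 4952

# (term, boost, docFreq, termFreq) copied from the explain tree for doc 4952.
TERMS = [
    ("recognition", 1.0816259, 263, 1.0),
    ("pattern", 1.1036987, 232, 1.0),
    ("adaptive", 1.2269442, 115, 1.0),
    ("excalibur", 3.1325848, 16, 1.0),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


total = COORD * sum(term_score(b, df, f) for _, b, df, f in TERMS)
print(f"{total:.4f}")  # should land close to the 0.1827259 reported above
```

The rare term "excalibur" (docFreq=16) dominates the sum because idf enters each term score twice, once through queryWeight and once through fieldWeight. The coord factor then penalises records that match only a few of the 25 query terms, which is why doc 4952 ranks last despite its high per-term scores.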