Document (#16032)

Author
Alexander, M.
Title
Retrieving digital data with fuzzy matching
Source
New library world. 97(1996) no.1131, S.28-31
Year
1996
Abstract
Briefly describes the Excalibur EFS system which makes use of adaptive pattern recognition technology as an aid to automatic indexing and how it is being tested at the British Library for the indexing and retrieval of scanned images from the library's holdings. Notes how Excalibur EFS can support a wide degree of fuzzy searching, compensate for the errors produced by OCR conversion of scanned images, reduce the costs of indexing, and require far less storage space than more traditional indexes
Theme
Automatisches Indexieren
Object
Excalibur EFS

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:alexander in 2980) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 2980, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=2980)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:alexander in 1152) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 1152, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=1152)
    
  3. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:alexander in 4428) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 4428, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=4428)
    
  4. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:alexander in 5687) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 5687, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=5687)
    
  5. Alexander, K.: Kompendium der visuellen Information und Kommunikation (2007) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:alexander in 2648) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 2648, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=2648)
    

Similar documents (content)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.76
    0.7644924 = sum of:
      0.7644924 = product of:
        1.9112309 = sum of:
          0.09646499 = weight(abstract_txt:indexes in 2980) [ClassicSimilarity], result of:
            0.09646499 = score(doc=2980,freq=2.0), product of:
              0.10899814 = queryWeight, product of:
                1.0080503 = boost
                5.7216015 = idf(docFreq=384, maxDocs=43254)
                0.01889815 = queryNorm
              0.885015 = fieldWeight in 2980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7216015 = idf(docFreq=384, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.07564271 = weight(abstract_txt:british in 2980) [ClassicSimilarity], result of:
            0.07564271 = score(doc=2980,freq=1.0), product of:
              0.1167779 = queryWeight, product of:
                1.0434052 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889815 = queryNorm
              0.64774853 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.119297385 = weight(abstract_txt:recognition in 2980) [ClassicSimilarity], result of:
            0.119297385 = score(doc=2980,freq=2.0), product of:
              0.12558176 = queryWeight, product of:
                1.0820216 = boost
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.01889815 = queryNorm
              0.9499579 = fieldWeight in 2980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.12546378 = weight(abstract_txt:pattern in 2980) [ClassicSimilarity], result of:
            0.12546378 = score(doc=2980,freq=2.0), product of:
              0.1298728 = queryWeight, product of:
                1.1003523 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.01889815 = queryNorm
              0.9660513 = fieldWeight in 2980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.17204466 = weight(abstract_txt:adaptive in 2980) [ClassicSimilarity], result of:
            0.17204466 = score(doc=2980,freq=2.0), product of:
              0.16029996 = queryWeight, product of:
                1.2224733 = boost
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.01889815 = queryNorm
              1.0732671 = fieldWeight in 2980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.11603101 = weight(abstract_txt:images in 2980) [ClassicSimilarity], result of:
            0.11603101 = score(doc=2980,freq=1.0), product of:
              0.19569302 = queryWeight, product of:
                1.9101845 = boost
                5.421016 = idf(docFreq=519, maxDocs=43254)
                0.01889815 = queryNorm
              0.59292364 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.421016 = idf(docFreq=519, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.08962334 = weight(abstract_txt:indexing in 2980) [ClassicSimilarity], result of:
            0.08962334 = score(doc=2980,freq=1.0), product of:
              0.18858352 = queryWeight, product of:
                2.296599 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.01889815 = queryNorm
              0.4752448 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.2313251 = weight(abstract_txt:fuzzy in 2980) [ClassicSimilarity], result of:
            0.2313251 = score(doc=2980,freq=1.0), product of:
              0.3099853 = queryWeight, product of:
                2.4041314 = boost
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.01889815 = queryNorm
              0.7462454 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.822815 = idf(docFreq=127, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.38191935 = weight(abstract_txt:scanned in 2980) [ClassicSimilarity], result of:
            0.38191935 = score(doc=2980,freq=1.0), product of:
              0.43301907 = queryWeight, product of:
                2.8414576 = boost
                8.063927 = idf(docFreq=36, maxDocs=43254)
                0.01889815 = queryNorm
              0.881992 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063927 = idf(docFreq=36, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
          0.50341856 = weight(abstract_txt:excalibur in 2980) [ClassicSimilarity], result of:
            0.50341856 = score(doc=2980,freq=1.0), product of:
              0.5205695 = queryWeight, product of:
                3.1154947 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01889815 = queryNorm
              0.9670535 = fieldWeight in 2980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.109375 = fieldNorm(doc=2980)
        0.4 = coord(10/25)
    
  2. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 0.23
    0.23191097 = sum of:
      0.23191097 = product of:
        0.9662957 = sum of:
          0.072753854 = weight(abstract_txt:storage in 5687) [ClassicSimilarity], result of:
            0.072753854 = score(doc=5687,freq=1.0), product of:
              0.113785416 = queryWeight, product of:
                1.0299495 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.01889815 = queryNorm
              0.63939524 = fieldWeight in 5687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
          0.07564271 = weight(abstract_txt:british in 5687) [ClassicSimilarity], result of:
            0.07564271 = score(doc=5687,freq=1.0), product of:
              0.1167779 = queryWeight, product of:
                1.0434052 = boost
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.01889815 = queryNorm
              0.64774853 = fieldWeight in 5687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.922272 = idf(docFreq=314, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
          0.114093594 = weight(abstract_txt:library's in 5687) [ClassicSimilarity], result of:
            0.114093594 = score(doc=5687,freq=2.0), product of:
              0.121902734 = queryWeight, product of:
                1.0660545 = boost
                6.0508275 = idf(docFreq=276, maxDocs=43254)
                0.01889815 = queryNorm
              0.9359396 = fieldWeight in 5687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0508275 = idf(docFreq=276, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
          0.08435599 = weight(abstract_txt:recognition in 5687) [ClassicSimilarity], result of:
            0.08435599 = score(doc=5687,freq=1.0), product of:
              0.12558176 = queryWeight, product of:
                1.0820216 = boost
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.01889815 = queryNorm
              0.6717217 = fieldWeight in 5687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
          0.11603101 = weight(abstract_txt:images in 5687) [ClassicSimilarity], result of:
            0.11603101 = score(doc=5687,freq=1.0), product of:
              0.19569302 = queryWeight, product of:
                1.9101845 = boost
                5.421016 = idf(docFreq=519, maxDocs=43254)
                0.01889815 = queryNorm
              0.59292364 = fieldWeight in 5687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.421016 = idf(docFreq=519, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
          0.50341856 = weight(abstract_txt:excalibur in 5687) [ClassicSimilarity], result of:
            0.50341856 = score(doc=5687,freq=1.0), product of:
              0.5205695 = queryWeight, product of:
                3.1154947 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01889815 = queryNorm
              0.9670535 = fieldWeight in 5687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.109375 = fieldNorm(doc=5687)
        0.24 = coord(6/25)
    
  3. Townsend, J.: Multimedia - myth or reality? (1994) 0.20
    0.2049528 = sum of:
      0.2049528 = product of:
        0.85397005 = sum of:
          0.06152808 = weight(abstract_txt:briefly in 1729) [ClassicSimilarity], result of:
            0.06152808 = score(doc=1729,freq=1.0), product of:
              0.11277063 = queryWeight, product of:
                1.0253465 = boost
                5.819773 = idf(docFreq=348, maxDocs=43254)
                0.01889815 = queryNorm
              0.54560375 = fieldWeight in 1729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.819773 = idf(docFreq=348, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
          0.072305135 = weight(abstract_txt:recognition in 1729) [ClassicSimilarity], result of:
            0.072305135 = score(doc=1729,freq=1.0), product of:
              0.12558176 = queryWeight, product of:
                1.0820216 = boost
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.01889815 = queryNorm
              0.57576144 = fieldWeight in 1729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
          0.107540384 = weight(abstract_txt:pattern in 1729) [ClassicSimilarity], result of:
            0.107540384 = score(doc=1729,freq=2.0), product of:
              0.1298728 = queryWeight, product of:
                1.1003523 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.01889815 = queryNorm
              0.82804394 = fieldWeight in 1729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
          0.10427482 = weight(abstract_txt:adaptive in 1729) [ClassicSimilarity], result of:
            0.10427482 = score(doc=1729,freq=1.0), product of:
              0.16029996 = queryWeight, product of:
                1.2224733 = boost
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.01889815 = queryNorm
              0.6504981 = fieldWeight in 1729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
          0.07682 = weight(abstract_txt:indexing in 1729) [ClassicSimilarity], result of:
            0.07682 = score(doc=1729,freq=1.0), product of:
              0.18858352 = queryWeight, product of:
                2.296599 = boost
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.01889815 = queryNorm
              0.4073527 = fieldWeight in 1729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.345095 = idf(docFreq=1524, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
          0.4315016 = weight(abstract_txt:excalibur in 1729) [ClassicSimilarity], result of:
            0.4315016 = score(doc=1729,freq=1.0), product of:
              0.5205695 = queryWeight, product of:
                3.1154947 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01889815 = queryNorm
              0.82890296 = fieldWeight in 1729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.09375 = fieldNorm(doc=1729)
        0.24 = coord(6/25)
    
  4. Picture content retrieval (1996) 0.20
    0.19884059 = sum of:
      0.19884059 = product of:
        0.9942029 = sum of:
          0.082037434 = weight(abstract_txt:briefly in 45) [ClassicSimilarity], result of:
            0.082037434 = score(doc=45,freq=1.0), product of:
              0.11277063 = queryWeight, product of:
                1.0253465 = boost
                5.819773 = idf(docFreq=348, maxDocs=43254)
                0.01889815 = queryNorm
              0.72747165 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.819773 = idf(docFreq=348, maxDocs=43254)
                0.125 = fieldNorm(doc=45)
          0.09640685 = weight(abstract_txt:recognition in 45) [ClassicSimilarity], result of:
            0.09640685 = score(doc=45,freq=1.0), product of:
              0.12558176 = queryWeight, product of:
                1.0820216 = boost
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.01889815 = queryNorm
              0.76768196 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.125 = fieldNorm(doc=45)
          0.10139006 = weight(abstract_txt:pattern in 45) [ClassicSimilarity], result of:
            0.10139006 = score(doc=45,freq=1.0), product of:
              0.1298728 = queryWeight, product of:
                1.1003523 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.01889815 = queryNorm
              0.7806874 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.125 = fieldNorm(doc=45)
          0.1390331 = weight(abstract_txt:adaptive in 45) [ClassicSimilarity], result of:
            0.1390331 = score(doc=45,freq=1.0), product of:
              0.16029996 = queryWeight, product of:
                1.2224733 = boost
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.01889815 = queryNorm
              0.8673308 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.125 = fieldNorm(doc=45)
          0.5753355 = weight(abstract_txt:excalibur in 45) [ClassicSimilarity], result of:
            0.5753355 = score(doc=45,freq=1.0), product of:
              0.5205695 = queryWeight, product of:
                3.1154947 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01889815 = queryNorm
              1.105204 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.125 = fieldNorm(doc=45)
        0.2 = coord(5/25)
    
  5. Brown, S.: Developments in information retrieval systems : RetrievalWare from Excalibur (1996) 0.18
    0.18243308 = sum of:
      0.18243308 = product of:
        1.1402068 = sum of:
          0.12050857 = weight(abstract_txt:recognition in 6021) [ClassicSimilarity], result of:
            0.12050857 = score(doc=6021,freq=1.0), product of:
              0.12558176 = queryWeight, product of:
                1.0820216 = boost
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.01889815 = queryNorm
              0.9596025 = fieldWeight in 6021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1414557 = idf(docFreq=252, maxDocs=43254)
                0.15625 = fieldNorm(doc=6021)
          0.12673756 = weight(abstract_txt:pattern in 6021) [ClassicSimilarity], result of:
            0.12673756 = score(doc=6021,freq=1.0), product of:
              0.1298728 = queryWeight, product of:
                1.1003523 = boost
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.01889815 = queryNorm
              0.9758592 = fieldWeight in 6021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.245499 = idf(docFreq=227, maxDocs=43254)
                0.15625 = fieldNorm(doc=6021)
          0.17379135 = weight(abstract_txt:adaptive in 6021) [ClassicSimilarity], result of:
            0.17379135 = score(doc=6021,freq=1.0), product of:
              0.16029996 = queryWeight, product of:
                1.2224733 = boost
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.01889815 = queryNorm
              1.0841634 = fieldWeight in 6021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9386463 = idf(docFreq=113, maxDocs=43254)
                0.15625 = fieldNorm(doc=6021)
          0.7191694 = weight(abstract_txt:excalibur in 6021) [ClassicSimilarity], result of:
            0.7191694 = score(doc=6021,freq=1.0), product of:
              0.5205695 = queryWeight, product of:
                3.1154947 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01889815 = queryNorm
              1.381505 = fieldWeight in 6021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.15625 = fieldNorm(doc=6021)
        0.16 = coord(4/25)