Search (2 results, page 1 of 1)

  • × author_ss:"Condit, A."
  • × language_ss:"e"
  • × year_i:[1990 TO 2000}
  1. Tagheva, K.; Borsack, J.; Condit, A.: Effects of OCR errors on ranking and feedback using the vector space model (1996) 0.02
    0.016153201 = product of:
      0.0969192 = sum of:
        0.0969192 = weight(_text_:ranking in 4951) [ClassicSimilarity], result of:
          0.0969192 = score(doc=4951,freq=2.0), product of:
            0.20271951 = queryWeight, product of:
              5.4090285 = idf(docFreq=537, maxDocs=44218)
              0.03747799 = queryNorm
            0.47809508 = fieldWeight in 4951, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4090285 = idf(docFreq=537, maxDocs=44218)
              0.0625 = fieldNorm(doc=4951)
      0.16666667 = coord(1/6)
    
  2. Taghva, K.; Borsack, J.; Condit, A.: Evaluation of model-based retrieval effectiveness with OCR text (1996) 0.01
    0.014134051 = product of:
      0.084804304 = sum of:
        0.084804304 = weight(_text_:ranking in 4485) [ClassicSimilarity], result of:
          0.084804304 = score(doc=4485,freq=2.0), product of:
            0.20271951 = queryWeight, product of:
              5.4090285 = idf(docFreq=537, maxDocs=44218)
              0.03747799 = queryNorm
            0.4183332 = fieldWeight in 4485, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.4090285 = idf(docFreq=537, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4485)
      0.16666667 = coord(1/6)
    
    Abstract
    Reports on experiments with retrieval from OCR-generated text using systems based on standard models of retrieval. Shows that average precision and recall is not affected by OCR errors across systems for several collections. Both the actual and the simulation experiments include full text and abstract length documents. The ranking and feedback methods associated with the retrieval models are generally not robust enough to deal with OCR errors. OCR errors and garbage strings generated from the mistranslation of graphic objects increase the size of the index significantly. Describes the problems of applying OCR text within an information retrieval environment and offers solutions