Taylor, S.L.: Integrating natural language understanding with document structure analysis (1994)
0.02
0.016588641 = product of:
0.066354565 = sum of:
0.066354565 = product of:
0.13270913 = sum of:
0.13270913 = weight(_text_:processing in 1794) [ClassicSimilarity], result of:
0.13270913 = score(doc=1794,freq=10.0), product of:
0.18956426 = queryWeight, product of:
4.048147 = idf(docFreq=2097, maxDocs=44218)
0.046827413 = queryNorm
0.7000747 = fieldWeight in 1794, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
4.048147 = idf(docFreq=2097, maxDocs=44218)
0.0546875 = fieldNorm(doc=1794)
0.5 = coord(1/2)
0.25 = coord(1/4)
- Abstract
- Document understanding, the interpretation of a document from its image form, is a technology area which benefits greatly from the integration of natural language processing with image processing. Develops a prototype of an Intelligent Document Understanding System (IDUS) which employs several technologies: image processing, optical character recognition, document structure analysis and text understanding in a cooperative fashion. Discusses those areas of research during development of IDUS where it is found that the most benefit from the integration of natural language processing and image processing occured: document structure analysis, OCR correction, and text analysis. Discusses 2 applications which are supported by IDUS: text retrieval and automatic generation of hypertext links