Taylor, S.L.: Integrating natural language understanding with document structure analysis (1994)
0.00
0.0015706726 = product of:
0.021989416 = sum of:
0.021989416 = weight(_text_:retrieval in 1794) [ClassicSimilarity], result of:
0.021989416 = score(doc=1794,freq=2.0), product of:
0.09399342 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.031073077 = queryNorm
0.23394634 = fieldWeight in 1794, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=1794)
0.071428575 = coord(1/14)
- Abstract
- Document understanding, the interpretation of a document from its image form, is a technology area which benefits greatly from the integration of natural language processing with image processing. Develops a prototype of an Intelligent Document Understanding System (IDUS) which employs several technologies: image processing, optical character recognition, document structure analysis and text understanding in a cooperative fashion. Discusses those areas of research during development of IDUS where it is found that the most benefit from the integration of natural language processing and image processing occured: document structure analysis, OCR correction, and text analysis. Discusses 2 applications which are supported by IDUS: text retrieval and automatic generation of hypertext links