A model guided document image analysis scheme
2002Vol. 70, pp. 1137–1141
Citations Over TimeTop 14% of 2002 papers
Abstract
This paper presents a new model-based document image segmentation scheme that uses XML-DTDs (eXtensible Markup Language Document Type Definitions). Given a document image, the algorithm has the ability to select the appropriate model. A new wavelet-based tool has been designed for distinguishing text from non-text regions and characterization of font sizes. Our model-based analysis scheme makes use of this tool for identifying the logical components of a document image.
Related Papers
- → Image-based logical document structure recognition(2014)11 cited
- → A model guided document image analysis scheme(2002)13 cited
- → Semi-Structured Document Classification(2005)2 cited
- Logical Structure Extraction of Form Document Image(2000)
- → Semi-Structured Document Classification(2009)