Document Structure and Layout Analysis (2008)
Anoop M. Namboodiri, Anil K. Jain
A document image is composed of a variety of physical entities or regions such as text blocks, lines, words, figures, tables, and background. We could also assign functional or logical labels such as...
On Using Classical Poetry Structure for Indian Language Post-Processing (2008)
Anoop M. Namboodiri, P. J. Narayanan, C. V. Jawahar
Post-processors are critical to the performance of language recognizers like OCRs, speech recognizers, etc. Dictionary-based post-processing commonly employ either an algorithmic approach or a...
Support Vector Machine based Hierarchical Classifiers for Large Class Problems (2008)
Tejo Krishna Chalasani, Anoop M. Namboodiri, C. V. Jawahar
One of the prime challenges in designing a classifier for large-class problems such as Indian language OCRs is the presence of a large similar looking character set. The nature of the character set...
Learning Segmentation of Documents with Complex Scripts (2008)
Anoop M. Namboodiri, C. V. Jawahar
Abstract. Most of the state-of-the-art segmentation algorithms are designed to handle complex document layouts and backgrounds, while assuming a simple script structure such as in Roman script. They...
Repudiation Detection in Handwritten Documents (2008)
Sachin Gupta, Anoop M. Namboodiri
Abstract. Forensic document verification presents a different and interesting set of challenges as opposed to traditional writer identification and verification tasks using natural handwriting. The...
Multimedia Document Authentication using On-line Signatures as Watermarks (2007)
Anoop M. Namboodiri, Anil K. Jain
Authentication of digital documents is an important concern as digital documents are replacing the traditional paper-based documents for o#cial and legal purposes. This is especially true in the case...
Structure in on-line documents (2001)
Anil K. Jain, Anoop M. Namboodiri
We present a hierarchical approach for extracting homogeneous regions in on-line documents. The problem of identifying and processing ruled and unruled tables, text and drawings is addressed. The...