Results 1 - 10
of
12
Address Block Location on Envelopes Using Gabor Filters
- Pattern Recognition
, 1992
"... This paper is organized as follows. Section 2 briefly describes several approaches to address block location reported in the literature. Section 3 introduces the well-known multichannel filtering method and, in particular, Gabor filters for texture-based segmentation. In Section 4, we justify that t ..."
Abstract
-
Cited by 28 (1 self)
- Add to MetaCart
(Show Context)
This paper is organized as follows. Section 2 briefly describes several approaches to address block location reported in the literature. Section 3 introduces the well-known multichannel filtering method and, in particular, Gabor filters for texture-based segmentation. In Section 4, we justify that the problem of address block location can be viewed as a "texture segmentation" problem. Section 5 gives experimental results which demonstrate the feasibility of the proposed method. Finally, Section 6 presents a summary of our work and few concluding remarks. 2 Approaches to Address Block Location Several researchers have dealt with the problem of locating address blocks in images of mail pieces [2, 10, 13, 14]. Expert systems utilizing AI techniques have been built for locating address blocks in the presence of other text and graphic information on the face of the mail piece. Another popular approach is to extract simple geometric features which characterize address blocks. This section contains a brief summary of these approaches reported in the literature. 2.1 An Expert Systems Approach An expert system for automatic sorting of mail pieces has been developed by Wang et al [10, 13]. In this scheme, the address blocks are located by the Address Block Location Subsystem (ABLS). The system is capable of using multispectral images (gray scale, color, infrared, or color under ultraviolet illumination). The ABLS consists of six major components: (i) Mail statistical database (MSD): contains statistics of the geometric features of address labels on various mail items. This database has been generated after 3 careful analysis of a large number of mail pieces. (ii) Tool box: contains image processing tools for thresholding, segmentation, labeling connected components, discrimina...
AUTOMATIC DOCUMENT PROCESSING: A SURVEY
, 1996
"... Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysis and document understanding. A document has two structures: geometric (layout) ..."
Abstract
-
Cited by 15 (1 self)
- Add to MetaCart
Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysis and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, uch as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.
Detection of Courtesy Amount Block on Bank Checks
- Journal of Electronic Imaging
, 1995
"... This paper presents a multi-staged technique for locating the courtesy amount block on bank checks. In the case of a check processing system, many of the proposed methods are not acceptable, due to the the presence of many fonts and text sizes, as well as the short length of many text strings. This ..."
Abstract
-
Cited by 11 (3 self)
- Add to MetaCart
(Show Context)
This paper presents a multi-staged technique for locating the courtesy amount block on bank checks. In the case of a check processing system, many of the proposed methods are not acceptable, due to the the presence of many fonts and text sizes, as well as the short length of many text strings. This paper will describe particular methods chosen to implement a Courtesy Amount Block Locator (CABL). First, the connected components in the image are identified. Next, strings are constructed on the basis of proximity and horizontal alignment of characters. Finally a set of rules and heuristics are applied to these strings to choose the correct one. The chosen string is only reported if it passes a verification test, which includes an attempt to recognize the currency sign. Keywords: check analysis and processing, block detection, courtesy amount recognition, image processing, heuristics rules, segmentation 1 Introduction Trillions of dollars change hands each year in the form of handwritten ...
Statistical Zone Finding
, 1996
"... We propose a statistical technique of zone finding for the class of documents that are neither rigidly structured like tax forms nor very unstructured like magazine pages or engineering drawings. Given an initial window assumed to contain the final zone (bounding box) of interest, and a `signature & ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
We propose a statistical technique of zone finding for the class of documents that are neither rigidly structured like tax forms nor very unstructured like magazine pages or engineering drawings. Given an initial window assumed to contain the final zone (bounding box) of interest, and a `signature ' of the target, we propose to locate the final zone by a combination of simple outside in and inside out searches based on the assumption that the coordinates of the target have unimodal distribution. Results are presented in the bank check domain, and the applicability of the technique to other domains is discussed. 0. Introduction Proc 13th ICPR, Vienna (1996) Vol III, pp 818-822 Real world Optical Character Recognition (OCR) systems rarely enjoy the luxury, often taken for granted in more academic systems, of working with clearly delineated text zones. In fact, the task of zoning, or region extraction, i.e. identifying and precisely demarcating the zone(s) containing the text to be recog...
Postal Address Block Location By Contour Clustering
- PROCEEDINGS OF ICDAR 2003
, 2003
"... We have developed a well performing algorithm for locating address blocks in postal parcel images. Both machine printed and handwritten addresses are processed by the algorithm. The algorithm is invariant to the image orientation and scale, and it works with high noise images. It could also serve as ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
We have developed a well performing algorithm for locating address blocks in postal parcel images. Both machine printed and handwritten addresses are processed by the algorithm. The algorithm is invariant to the image orientation and scale, and it works with high noise images. It could also serve as an additional step after other address block location algorithms.
Segmentation of Envelopes and Address Block Location by Salient Features and Hypothesis Testing
"... Abstract. Although nowadays there are working systems for sorting mail in some constrained ways, segmenting gray level images of envelopes and locating address blocks in them is still a difficult problem. Pattern Recognition research has contributed greatly to this area since the problem concerns fe ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
Abstract. Although nowadays there are working systems for sorting mail in some constrained ways, segmenting gray level images of envelopes and locating address blocks in them is still a difficult problem. Pattern Recognition research has contributed greatly to this area since the problem concerns feature design, extraction, recognition, and also the image segmentation if one deals with the original gray level images from the beginning. This paper presents a segmentation and address block location algorithm based on feature selection in wavelet space. The aim is to automatically separate in postal envelopes the regions related to background, stamps, rubber stamps, and the address blocks. First, a typical image of a postal envelope is decomposed using Mallat algorithm and Haar basis. High frequency channel outputs are analyzed to locate salient points in order to separate the background. A statistical hypothesis test is taken to decide upon more consistent regions in order to clean out some noise left. The selected points are projected back to the original gray level image, where the evidence from the wavelet space is used to start a growing process to include the pixels more likely to belong to the regions of stamps, rubber stamps, and written area. Besides the new features and a growing process controlled by the salient points presented here, a fully comprehensive experimental setup was run by separating and classifying blocks in the envelopes, and validating results by a pixel to pixel accuracy measure using a ground truth database
submitted to Special Issue on Analysis of Historical Documents, International Journal on Document Analysis and Recognition, Springer, 2006. Text Line Segmentation of Historical Documents: a Survey
"... ..."
(Show Context)
unknown title
"... We propose a statistical technique of zone finding for the class of documents that are neither rigidly structured like tax forms nor very unstructured like magazine pages or engineering drawings. Given an initial window assumed to contain the final zone (bounding box) of interest, and a ‘signature’ ..."
Abstract
- Add to MetaCart
(Show Context)
We propose a statistical technique of zone finding for the class of documents that are neither rigidly structured like tax forms nor very unstructured like magazine pages or engineering drawings. Given an initial window assumed to contain the final zone (bounding box) of interest, and a ‘signature’ of the target, we propose to locate the final zone by a combination of simple outside in and inside out searches based on the assumption that the coordinates of the target have unimodal distribution. Results are presented in the bank check domain, and the applicability of the technique to other domains is discussed. Real world Optical Character Recognition (OCR) systems rarely enjoy the luxury, often taken for granted in more
Document Analysis And Recognition By Computers
, 1999
"... and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed in this ch ..."
Abstract
- Add to MetaCart
and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed in this chapter. Keywords: Document processing, Document analysis and understanding, Geometric and Logical structures, Hierarchical and no-hierarchical methods, Tree transform, Formatting knowledge, Description languages, Texture analysis. 1. Introduction Documents contain knowledge. Precisely, they are medium for transferring knowledge. In fact, much knowledge is acquired from documents such as technical reports, 2 Handbook of Pattern Recognition and Computer Vision government files, newspapers, books, journals, magazines, letters, bank cheques, to name a few. The acquisition of knowledge from such documents by an information system can involve an extensi
A Theory Of Document Object Locator Combination
, 1998
"... Traditional approaches to document object location use a single locator that is expected to locate as many instances of the object class of interest as possible. However, if the class includes subclasses with diverse visual characteristics or is not characterized by easily computable visual features ..."
Abstract
- Add to MetaCart
Traditional approaches to document object location use a single locator that is expected to locate as many instances of the object class of interest as possible. However, if the class includes subclasses with diverse visual characteristics or is not characterized by easily computable visual features, it is difficult for the single locator to account for wide variation in object characteristics within the class. As a result, increasingly complex models of objects to be located are used. An alternative approach is to combine the decisions of multiple locators, each of which is suitable for certain image conditions. This approach utilizes a collection of simple locators that complement one another, rather than relying on one complex locator. An effective method for combining the location results is vital to the success of this approach. This thesis presents a theory of combining the results of multiple document object locators tuned to different object characteristics. The purpose of the ...