• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Feature extraction methods for character recognition-a survey, Pattern Recognition. (1996)

by D Trier, A K Jain
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 268
Next 10 →

Clustering by compression

by Rudi Cilibrasi, Paul M. B. Vitányi - IEEE Transactions on Information Theory , 2005
"... Abstract—We present a new method for clustering based on compression. The method does not use subject-specific features or background knowledge, and works as follows: First, we determine a parameter-free, universal, similarity distance, the normalized compression distance or NCD, computed from the l ..."
Abstract - Cited by 297 (25 self) - Add to MetaCart
Abstract—We present a new method for clustering based on compression. The method does not use subject-specific features or background knowledge, and works as follows: First, we determine a parameter-free, universal, similarity distance, the normalized compression distance or NCD, computed from the lengths of compressed data files (singly and in pairwise concatenation). Second, we apply a hierarchical clustering method. The NCD is not restricted to a specific application area, and works across application area boundaries. A theoretical precursor, the normalized information distance, co-developed by one of the authors, is provably optimal. However, the optimality comes at the price of using the noncomputable notion of Kolmogorovcomplexity. We propose axioms to capture the real-world setting, and show that the NCD approximates optimality. To extract a hierarchy of clusters from the distance matrix, we determine a dendrogram (ternary tree) by a new quartet method and a fast heuristic to implement it. The method is implemented and available as public software, and is robust under choice of different compressors. To substantiate our claims of universality and robustness, we report evidence of successful application in areas as diverse as genomics, virology, languages, literature, music, handwritten digits, astronomy, and combinations of objects from completely different domains, using statistical, dictionary, and block sorting compressors. In genomics, we presented new evidence for major questions in Mammalian evolution, based on whole-mitochondrial genomic analysis: the Eutherian orders and the Marsupionta hypothesis against the Theria hypothesis. Index Terms—Heterogenous data analysis, hierarchical unsupervised clustering, Kolmogorovcomplexity, normalized compression distance, parameter-free data mining, quartet tree method, universal dissimilarity distance. I.
(Show Context)

Citation Context

...ingle decimal digit recognition accuracy of 85%. The current state-of-the-art for this problem, after half a century of interactive feature-driven classification research, in the upper ninety % level =-=[32, 14]-=-. All experiments are bench marked on the standard NIST Special Data Base 19 (optical character recognition database). 5.6 Astronomy As a proof of principle we clustered data from unknown objects, for...

Goal-Directed Evaluation of Binarization Methods

by Øivind Due Trier, Anil K. Jain , 1995
"... This paper presents a methodology for evaluation of low-level image analysis methods, using binarization (two-level thresholding) as an example. Binarization of scanned gray scale images is the first step in most document image analysis systems. Selection of an appropriate binarization method for an ..."
Abstract - Cited by 190 (10 self) - Add to MetaCart
This paper presents a methodology for evaluation of low-level image analysis methods, using binarization (two-level thresholding) as an example. Binarization of scanned gray scale images is the first step in most document image analysis systems. Selection of an appropriate binarization method for an input image domain is a difficult problem. Typically, a human expert evaluates the binarized images according to his/her visual criteria. However, to conduct an objective evaluation, one needs to investigate how well the subsequent image analysis steps will perform on the binarized image. We call this approach goal-directed evaluation, and it can be used to evaluate other low-level image processing methods as well. Our evaluation of binarization methods is in the context of digit recognition, so we define the performance of the character recognition module as the objective measure. Eleven different locally adaptive binarization methods were evaluated, and Niblack's method gave the best perf...

Word spotting for historical documents

by T. M. Rath, R. Manmatha - INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION , 2007
"... Searching and indexing historical handwritten collections is a very challenging problem. We describe an approach called word spotting which involves grouping word images into clusters of similar words by using image matching to find similarity. By annotating “interesting ” clusters, an index that li ..."
Abstract - Cited by 82 (8 self) - Add to MetaCart
Searching and indexing historical handwritten collections is a very challenging problem. We describe an approach called word spotting which involves grouping word images into clusters of similar words by using image matching to find similarity. By annotating “interesting ” clusters, an index that links words to the locations where they occur can be built automatically. Image similarities computed using a number of different techniques including dynamic time warping are compared. The word similarities are then used for clustering
(Show Context)

Citation Context

...e I ∈ Rh×w are referred to as I(r, c), where r and c indicate the row and column index of the pixel. Our goal was to choose a variety of features presented in handwriting recognition literature (e.g. =-=[35]-=-), such that an approximate reconstruction of a word from its features would be possible. 3.1 Projection Profile Projection profiles capture the distribution of ink along one of the two dimensions in ...

Representation and recognition of handwritten digits using deformable templates

by Anil K. Jain, Douglas Zongker - IEEE Transactions on Pattern Analysis and Machine Intelligence , 1997
"... Abstract—We investigate the application of deformable templates to recognition of handprinted digits. Two characters are matched by deforming the contour of one to fit the edge strengths of the other, and a dissimilarity measure is derived from the amount of deformation needed, the goodness of fit o ..."
Abstract - Cited by 75 (2 self) - Add to MetaCart
Abstract—We investigate the application of deformable templates to recognition of handprinted digits. Two characters are matched by deforming the contour of one to fit the edge strengths of the other, and a dissimilarity measure is derived from the amount of deformation needed, the goodness of fit of the edges, and the interior overlap between the deformed shapes. Classification using the minimum dissimilarity results in recognition rates up to 99.25 percent on a 2,000 character subset of NIST Special Database 1. Additional experiments on an independent test data were done to demonstrate the robustness of this method. Multidimensional scaling is also applied to the 2,000 – 2,000 proximity matrix, using the dissimilarity measure as a distance, to embed the patterns as points in low-dimensional spaces. A nearest neighbor classifier is applied to the resulting pattern matrices. The classification accuracies obtained in the derived feature space demonstrate that there does exist a good low-dimensional representation space. Methods to reduce the computational requirements, the primary limiting factor of this method, are discussed. Index Terms—Digit recognition, deformable template, feature extraction, multidimensional scaling, clustering, nearest neighbor classification. 1
(Show Context)

Citation Context

...tion to handprinted digit recognition. Therefore, our literature review includes only similar approaches; for a wider survey of digit recognition in general, refer to the recent paper by Trier et al. =-=[14]-=-. The goal of this paper is to investigate the deformation of character image outlines as a source of information for recognition. We show that a combination of the deformation energy required to matc...

Distance sets for shape filters and shape recognition

by Cosmin Grigorescu, Nicolai Petkov - IEEE TRANS. IMAGE PROCESSING , 2003
"... We introduce a novel rich local descriptor of an image point, we call the (labeled) distance set, which is determined by the spatial arrangement of image features around that point. We describe a two-dimensional (2-D) visual object by the set of (labeled) distance sets associated with the feature p ..."
Abstract - Cited by 62 (9 self) - Add to MetaCart
We introduce a novel rich local descriptor of an image point, we call the (labeled) distance set, which is determined by the spatial arrangement of image features around that point. We describe a two-dimensional (2-D) visual object by the set of (labeled) distance sets associated with the feature points of that object. Based on a dissimilarity measure between (labeled) distance sets and a dissimilarity measure between sets of (labeled) distance sets, we address two problems that are often encountered in object recognition: object segmentation, for which we formulate a distance sets shape filter, and shape matching. The use of the shape filter is illustrated on printed and handwritten character recognition and detection of traffic signs in complex scenes. The shape comparison procedure is illustrated on handwritten character classification, COIL-20 database object recognition and MPEG-7 silhouette database retrieval.

A scale space approach for automatically segmenting words from historical handwritten documents

by R. Manmatha, Ieee Computer Society, Jamie L. Rothfeder - IEEE Trans. on Pat , 2005
"... Abstract—Many libraries, museums, and other organizations contain large collections of handwritten historical documents, for example, the papers of early presidents like George Washington at the Library of Congress. The first step in providing recognition/ retrieval tools is to automatically segment ..."
Abstract - Cited by 59 (2 self) - Add to MetaCart
Abstract—Many libraries, museums, and other organizations contain large collections of handwritten historical documents, for example, the papers of early presidents like George Washington at the Library of Congress. The first step in providing recognition/ retrieval tools is to automatically segment handwritten pages into words. State of the art segmentation techniques like the gap metrics algorithm have been mostly developed and tested on highly constrained documents like bank checks and postal addresses. There has been little work on full handwritten pages and this work has usually involved testing on clean artificial documents created for the purpose of research. Historical manuscript images, on the other hand, contain a great deal of noise and are much more challenging. Here, a novel scale space algorithm for automatically segmenting handwritten (historical) documents into words is described. First, the page is cleaned to remove margins. This is followed by a gray-level projection profile algorithm for finding lines in images. Each line image is then filtered with an anisotropic Laplacian at several scales. This procedure produces blobs which correspond to portions of characters at small scales and to words at larger scales. Crucial to the algorithm is scale selection, that is, finding the optimum scale at which blobs correspond to words. This is done by finding the maximum over scale of the extent or area of the blobs. This scale maximum is estimated using three different approaches. The blobs recovered at the optimum scale are then bounded with a rectangular box to recover the words. A postprocessing filtering step is performed to eliminate boxes of unusual size which are unlikely to correspond to words. The approach is tested on a number of different data sets and it is shown that, on 100 sampled documents from the George Washington corpus of handwritten document images, a total error rate of 17 percent is observed. The technique
(Show Context)

Citation Context

...d based on aspect ratio, height, and length filters. 5.2 Margin Removal with Projection Profiles Projection profiles, sometimes called projection histograms, have been used widely in document imaging =-=[29]-=-, [24]. However, most of this work uses binary images, while we use gray-scale images. Computing a projection profile is much faster than using a LOG filter or a Hough Transform. Let fðx; yÞ be the in...

Offline Cursive Script Word Recognition -- a Survey

by Tal Steinherz, Ehud Rivlin, Nathan Intrator , 1999
"... We review the field of offline cursive word recognition. We mainly deal with the various methods that were proposed to realize the core of recognition in a word recognition system. These methods are discussed in view of the two most important properties of such a system: the size and nature of the l ..."
Abstract - Cited by 59 (3 self) - Add to MetaCart
We review the field of offline cursive word recognition. We mainly deal with the various methods that were proposed to realize the core of recognition in a word recognition system. These methods are discussed in view of the two most important properties of such a system: the size and nature of the lexicon involved, and whether or not a segmentation stage is present. We classify the field into three categories: segmentation-free methods, which compare a sequence of observations derived from a word image with similar references of words in the lexicon; segmentation-based methods, that look for the best match between consecutive sequences of primitive segments and letters of a possible word; and the perception-oriented approach, that relates to methods that perform a human-like reading technique, in which anchor features found all over the word are used to bootstrap a few candidates for a final evaluation phase.

A Discriminative Learning Framework with Pairwise Constraints for Video Object Classification

by Rong Yan, Jian Zhang, Jie Yang, Alexander Hauptmann, Er Hauptmann - In Proc. of CVPR , 2004
"... In video object classification, insufficient labeled data may at times be easily augmented with pairwise constraints on sample points, i.e, whether they are in the same class or not. In this paper, we proposed a discriminative learning approach which incorporates pairwise constraints into a conventi ..."
Abstract - Cited by 38 (5 self) - Add to MetaCart
In video object classification, insufficient labeled data may at times be easily augmented with pairwise constraints on sample points, i.e, whether they are in the same class or not. In this paper, we proposed a discriminative learning approach which incorporates pairwise constraints into a conventional margin-based learning framework. The proposed approach offers several advantages over existing approaches dealing with pairwise constraints. First, as opposed to learning distance metrics, the new approach derives its classification power by directly modeling the decision boundary. Second, most previous work handles labeled data by converting them to pairwise constraints and thus leads to much more computation. The proposed approach can handle pairwise constraints together with labeled data so that the computation is greatly reduced. The proposed approach is evaluated on a people classification task with two surveillance video datasets.
(Show Context)

Citation Context

...ea in computer vision community for last two decades, though problems of interest have been changing over time, from automatic target recognition (ATR) [18], [19], Optical Character Recognition (OCR) =-=[20]-=-, to face detection and recognition [21]. Visual object recognition has made great progress in recent years because of advances in learning theories, which is evident in several recent papers [22], [2...

Performance evaluation of pattern classifiers for handwritten character recognition

by Cheng-lin Liu, Hiroshi Sako, Hiromichi Fujisawa - International Journal on Document Analysis and Recognition , 2002
"... Abstract. This paper describes a performance evaluation study in which some efficient classifiers are tested in handwritten digit recognition. The evaluated classifiers include a statistical classifier (modified quadratic discriminant function, MQDF), three neural classifiers, and an LVQ (learning v ..."
Abstract - Cited by 36 (3 self) - Add to MetaCart
Abstract. This paper describes a performance evaluation study in which some efficient classifiers are tested in handwritten digit recognition. The evaluated classifiers include a statistical classifier (modified quadratic discriminant function, MQDF), three neural classifiers, and an LVQ (learning vector quantization) classifier. They are efficient in that high accuracies can be achieved at moderate memory space and computation cost. The performance is measured in terms of classification accuracy, sensitivity to training sample size, ambiguity rejection, and outlier resistance. The outlier resistance of neural classifiers is enhanced by training with synthesized outlier data. The classifiers are tested on a large data set extracted from NIST SD19. As results, the test accuracies of the evaluated classifiers are comparable to or higher than those of the nearest neighbor (1-NN) rule and regularized discriminant analysis (RDA). It is shown that neural classifiers are more susceptible to small sample size than MQDF, although they yield higher accuracies on large sample size. As a neural classifier, the polynomial classifier (PC) gives the highest accuracy and performs best in ambiguity rejection. On the other hand, MQDF is superior in outlier rejection even though it is not trained with outlier data. The results indicate that pattern classifiers have complementary advantages and they should be appropriately combined to achieve higher performance.
(Show Context)

Citation Context

...que properties compared to traditional statistical and neural classifiers. For character recognition, the pattern classifiers are usually used for classification based on heuristic feature extraction =-=[15]-=- so that a relatively simple classifier can achieve high accuracy. The efficiency of feature extraction and the simplicity of classification algorithms are preferable for real-time recognition on low-...

Symbol Recognition: Current Advances and Perspectives

by Josep Lladós , Ernest Valveny , Gemma Sánchez , Enric Martí - In GREC – Algorithms and Applications , 2002
"... Abstract. The recognition of symbols in graphic documents is an intensive research activity in the community of pattern recognition and document analysis. A key issue in the interpretation of maps, engineering drawings, diagrams, etc. is the recognition of domain dependent symbols according to a sy ..."
Abstract - Cited by 31 (12 self) - Add to MetaCart
Abstract. The recognition of symbols in graphic documents is an intensive research activity in the community of pattern recognition and document analysis. A key issue in the interpretation of maps, engineering drawings, diagrams, etc. is the recognition of domain dependent symbols according to a symbol database. In this work we first review the most outstanding symbol recognition methods from two different points of view: application domains and pattern recognition methods. In the second part of the paper, open and unaddressed problems involved in symbol recognition are described, analyzing their current state of art and discussing future research challenges. Thus, issues such as symbol representation, matching, segmentation, learning, scalability of recognition methods and performance evaluation are addressed in this work. Finally, we discuss the perspectives of symbol recognition concerning to new paradigms such as user interfaces in handheld computers or document database and WWW indexing by graphical content.
(Show Context)

Citation Context

...we have classified symbol recognition approaches according to these two issues. The selection of the feature space depends on the properties of the patterns to be classified. The main criterion must be to minimize the distance among patterns belonging to the same class and to maximize the distance among patterns belonging to different classes. Additional interesting properties of feature space are invariance to affine transformations and robustness to noise and distortion. An interesting survey of feature extraction methods, applied to the related area of character recognition can be found in [89]. In symbol recognition, only a subset of all these features have been employed. We will classify them into four groups: those based on the pixels of the image, geometric features, geometric moments and image transformations. The simplest feature space is the image space itself. The feature vector is composed of one feature for each pixel value. Usually, the image is first normalized to a fixed size. The main advantages are simplicity, low complexity and direct correspondence with visual appearance. However, the representation is not rotation invariant and it is very sensitive to noise and dis...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University