• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Efficient search strategy in structural analysis for handwritten mathematical expression recognition (0)

by T H Rhee, J H Kim
Venue:Pattern Recognition
Add To MetaCart

Tools

Sorted by:
Results 1 - 6 of 6

Recognition and Retrieval of Mathematical Expressions

by R. Zanibbi, D. Blostein - INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION
"... Document recognition and retrieval technologies complement one another, providing improved access to increasingly large document collections. While recognition and retrieval of textual information is fairly mature, with wide-spread availability of Optical Character Recognition (OCR) and text-based ..."
Abstract - Cited by 31 (10 self) - Add to MetaCart
Document recognition and retrieval technologies complement one another, providing improved access to increasingly large document collections. While recognition and retrieval of textual information is fairly mature, with wide-spread availability of Optical Character Recognition (OCR) and text-based search engines, recognition and retrieval of graphics such as images, figures, tables, diagrams, and mathematical expressions are in comparatively early stages of research. This paper surveys the state of the art in recognition and retrieval of mathematical expressions, organized around four key problems in math retrieval (query construction, normalization, indexing, and relevance feedback), and four key problems in math recognition (detecting expressions, detecting and classifying symbols, analyzing symbol layout, and constructing a representation of meaning). Of special interest is the machine learning problem of jointly optimizing the component algorithms in a math recognition system, and developing effective indexing, retrieval and relevance feedback algorithms for math retrieval. Another important open problem is developing user interfaces that seamlessly integrate recognition and retrieval. Activity in these important research areas is increasing, in part because math notation provides an excellent domain for studying problems common to many document and graphics recognition and retrieval applications, and also because mature applications will likely provide substantial benefits for education, research, and mathematical literacy.

A new approach for recognizing handwritten mathematics using relational grammars and fuzzy sets

by Scott MacLean, George Labahn
"... ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
Abstract not found

Baseline Extraction-Driven Parsing of Handwritten Mathematical Expressions

by Lei Hu, Kevin Hart, Richard Pospesel, Richard Zanibbi
"... We generalize recursive baseline extraction algorithms for symbol layout analysis in math expressions so that handwritten strokes may be provided as input. Specifically, baseline extraction is used for lexical analysis in a modified LL(1) parser, returning a set of candidate symbols when the leftmos ..."
Abstract - Cited by 2 (2 self) - Add to MetaCart
We generalize recursive baseline extraction algorithms for symbol layout analysis in math expressions so that handwritten strokes may be provided as input. Specifically, baseline extraction is used for lexical analysis in a modified LL(1) parser, returning a set of candidate symbols when the leftmost or next symbol along the current baseline (from left-to-right) is requested by the parser. Candidate symbols are used to produce a forest of parse trees, and the highest ranked parse returned. Hidden Markov Models (HMMs) are used for symbol classification, and horizontal adjacency between symbols is determined using two probabilistic quadratic classifiers, one for ascenders (e.g. ‘A’) and another for centered and descender symbols (e.g. ‘y ’ and ‘x’). The system placed second in the CROHME 2011 handwritten math recognition competition. 1.
(Show Context)

Citation Context

... and context is often needed for disambiguation [12]. To exploit contextual/linguistic constraints, commonly stochastic or fuzzy language models are used when recognizing handwritten expressions (see =-=[2, 4, 7, 9, 14]-=-, and [15]). A grammar defines legal symbol layouts, and a two-dimensional parser searches through alternative segmentation, classification and layout hypotheses as productions are applied. The most l...

IJDAR DOI 10.1007/s10032-012-0184-x ORIGINAL PAPER

by Fuzzy Sets, Scott Maclean, George Labahn
"... Abstract We present a new approach for parsing twodimensional input using relational grammars and fuzzy sets. A fast, incremental parsing algorithm is developed, motivated by the two-dimensional structure of written mathematics. The approach reports all identifiable parses of the input. The parses a ..."
Abstract - Add to MetaCart
Abstract We present a new approach for parsing twodimensional input using relational grammars and fuzzy sets. A fast, incremental parsing algorithm is developed, motivated by the two-dimensional structure of written mathematics. The approach reports all identifiable parses of the input. The parses are represented as a fuzzy set, in which the membership grade of a parse measures the similarity between it and the handwritten input. To identify and report parses efficiently, we adapt and apply existing techniques such as rectangular partitions and shared parse forests, and introduce new ideas such as relational classes and interchangeability. We also present a correction mechanism that allows users to navigate parse results and choose the correct interpretation in case of recognition errors or ambiguity. Such corrections are incorporated into subsequent incremental recognition results. Finally, we include two empirical evaluations of our recognizer. One uses a novel user-oriented correction count metric, while the other replicates the CROHME 2011 math recognition contest. Both evaluations demonstrate the effectiveness of our proposed approach. 1
(Show Context)

Citation Context

...the potential classes of expressions that might be obtained from any node of the shared parse forest are known. This formulation is similar in some ways to the scoring rules developed by Rhee and Kim =-=[32]-=- to represent all parse trees with identical structure, but differing symbol identities by a single tree with indeterminate terminal symbols. Working in a cost-minimization framework, they defined the...

2009 10th International Conference on Document Analysis and Recognition Utilizing Consistency Context for Handwritten Mathematical Expression Recognition

by Kang Kim, Taik Heon Rhee, Jae Seung Lee, Jin Hyung Kim
"... This paper presents a rule-based approach that utilizes some types of contextual information to improve the accuracy of handwritten mathematical expression(ME) recognition. Mining context from corpus is not practical for ME recognition due to the complexity originated from 2-D nature of MEs. For pra ..."
Abstract - Add to MetaCart
This paper presents a rule-based approach that utilizes some types of contextual information to improve the accuracy of handwritten mathematical expression(ME) recognition. Mining context from corpus is not practical for ME recognition due to the complexity originated from 2-D nature of MEs. For practicality, we identify typical types of consistencies that are often found in customary usage and general patterns in MEs. We aim to increase these consistencies in recognition results by correcting symbol labels and/or spatial relationships among symbols. Such consistencies are easily encoded as condition-action pairs. Preliminary interpretations generated by the base recognizer are reordered by increasing or decreasing scores by the rules. Although our approach is not complete, it easily implements even global context among distant symbols. Experimental results show that our approach is useful to increase the accuracy of handwritten ME recognition. 1.
(Show Context)

Citation Context

...oard and mouse interface is inconvenient. Handwriting would be convenient to input MEs if it is supported by a good recognizer. Although several handwritten ME recognition systems have been developed =-=[1, 6, 9, 11, 12]-=-, it is still hard to obtain robust recognition results for complex inputs. It is well-known that context in MEs can help to improve recognition accuracy by resolving ambiguities in recognition result...

Symbol Knowledge Extraction from a Simple Graphical Language

by Jinpeng Li, Harold Mouchere, Christian Viard-gaudin
"... Abstract—In this paper, we study the problem of symbol knowledge extraction. We assume that some unknown symbols are used to compose a handwritten message, and from a dataset of handwritten samples, we would like to recover the symbol set used in the corresponding language. We applied our approach o ..."
Abstract - Add to MetaCart
Abstract—In this paper, we study the problem of symbol knowledge extraction. We assume that some unknown symbols are used to compose a handwritten message, and from a dataset of handwritten samples, we would like to recover the symbol set used in the corresponding language. We applied our approach on online handwriting, and select the domain of numerical expressions, mixing digits and operators, to test the ability to retrieve the corresponding symbol classes. The proposed method is based on three steps: a quantization of the stroke space, a description of the layout of strokes with a relational graph, and the extraction of an optimal lexicon using a minimum description length algorithm. At the symbol level, a recall rate of 74 % is obtained on the test dataset produced by 100 writers. Keywords-online handwriting; knowledge extraction; mini-mum description length; spatial relation; I.
(Show Context)

Citation Context

... spatial relation below “B” which corresponds to the following subsequence (−, B,−). These graphemes and spatial relations are organized in a relational graph inspired by a symbol relation tree (SRT) =-=[2]-=-. From the relational graph, the sequences containing graphemes and spatial relations are extracted, they are next processed by the third step, which computes the lexicon. The main idea of the develop...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University