Ground truth data for document image analysis
- in Proceedings of the Symposium on Document Image Understanding and Technology
PixLabeler: User Interface for Pixel-Level Labeling of Elements in Document Images
Cited by 11 (1 self)
We present a user interface design for labeling elements in document images at a pixel level. Labels are represented by overlay color, which might map to such terms as “handwriting”, “machine print”, “graphics”, etc. The primary purpose is to streamline processes for manual production of groundtruth data, which is necessary for training algorithms and evaluating performance. Unlike general paint-type programs, the UI design is targeted specifically toward selection of collections of foreground pixels that are likely to be meaningful elements in a document image analysis context. Our implementation, called PixLabeler, is available for download and allows customized plug-ins for bootstrapping according to the labeling task.
The Architecture of TRUEVIZ: A GroundTRUth/Metadata Editing and VISualiZing Toolkit
- Pattern Recognition, 2003
Web-based Cooperative Document Understanding
- 2001
Cited by 5 (0 self)
a Web-based framework for cooperative document understanding. We begin by exposing our motivations for designing a new document understanding environment. We then describe the different levels of cooperation we intend to support and how Web technologies can help us in this respect. Finally, we present Edelweiss, the framework we are currently developing based on this approach.
Doclib: A document processing research tool
- In Proc. Symposium on Document Image Understanding Technology, 2005
Cited by 5 (2 self)
Often, valuable document processing intellectual capital is lost due to staff transitions or project restructuring prior to technology transfer. Furthermore, hardware and software integrity, dependencies, and compatibility are critical components that often impede technology migration. While many open source tools attempt to mitigate these issues, they do not always address the specific design needs and tailored processes that Government organizations must adhere to. This paper addresses the need for a common document processing research vehicle through which institutions can develop and share research-related software and applications across academic, business, and Government domains.
GEDI – A Groundtruthing Environment for Document Images
Cited by 4 (2 self)
In this paper, we describe a freely available, highly configurable document image annotation tool called GEDI – Groundtruthing Environment for Document Images. Its basic structure involves two types of files: an image file and a corresponding .xml file in GEDI format. When users begin ground truthing an image, they can configure the interface to allow the creation of different types of zones, each of which may have a custom set of “attributes”. The output is compatible with the UMD DocLib architecture [2] and has been used in numerous funded and unfunded programs to create datasets in multiple languages. GEDI has been developed and released to the community as a comprehensive tool that we hope will ease the burden of document annotation and encourage additional sharing of data.
Automatic ground-truth generation for document image analysis and understanding
- Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), 2007
Cited by 4 (0 self)
Ground-Truth Production and Benchmarking Scenarios Creation with DocMining
- Third International Workshop on Document Layout Interpretation and its Applications (DLIA2003). August 2, 2003
Cited by 2 (0 self)
In this paper we present the DocMining platform and its application to ground-truth dataset production and page segmentation evaluation. DocMining is a highly modular framework dedicated to document interpretation, in which document processing tasks are modeled as scenarios. We present here two scenarios that use PDF documents, found on the web or produced from XML files, as the basis of the ground-truth dataset.
GROUNDTRUTH GENERATION AND DOCUMENT IMAGE DEGRADATION
- Gang Zi, 2005
The problem of generating synthetic data for the training and evaluation of document analysis systems has been widely addressed in recent years. With the increased interest in processing multilingual sources, however, there is a tremendous need to be able to rapidly generate data in new languages and scripts, without the need to develop specialized systems. We have developed a system that uses the language support of the MS Windows operating system, combined with custom print drivers, to render TIFF images simultaneously with Windows Enhanced Metafile directives. The metafile information is parsed to generate zone, line, word, and character ground truth, including location, font information, and content, in any language supported by Windows. The resulting images can be physically or synthetically degraded by our degradation modules and used for training and evaluating Optical Character Recognition (OCR) systems. Our document image degradation methodology incorporates several often-encountered types of noise at the page and pixel levels. Examples of OCR evaluation and synthetically degraded document images are given to demonstrate its effectiveness.