C.L.: Recovery of Distorted Document Images from Bound Volumes (2001) [5 citations — 1 self]
Abstract:
Document images scanned from thick bound volumes must be restored before they are suitable for human reading and text retrieval. The main problem with scanning bound volumes is that perspective distortion always occurs. This distortion degrades the scanned images in two ways: 1) a shadow at the book spine area, and 2) warping of the words in the shadow. In this paper, we develop a restoration system to solve these two problems. First, the boundary between the shadow and the clean area is detected. Then the system applies a modified Niblack’s method to remove the shadow. The system uses connected component analysis to improve noise reduction and to adjust the location and orientation of the warped words in the shadow area, i.e. the words within the boundary detected earlier. The implementation results for each step are presented. Our system will be used in the text retrieval
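The abstract does not say how the authors modified Niblack’s method, so the following is only a minimal sketch of the classic, unmodified technique, which sets a per-pixel threshold T = m + k·s from the local mean m and local standard deviation s. The function name `niblack_binarize`, the window size, and k = -0.2 (a commonly quoted default) are assumptions for illustration, not values taken from the paper.

```python
from math import sqrt


def niblack_binarize(image, window=15, k=-0.2):
    """Classic Niblack thresholding (NOT the paper's modified variant).

    image  -- list of rows of grayscale values (0 = black, 255 = white)
    window -- side length of the square neighbourhood (assumed value)
    k      -- weight of the local standard deviation (assumed value)
    Returns a list of rows where 1 marks ink and 0 marks background.
    """
    height, width = len(image), len(image[0])
    half = window // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Clip the neighbourhood at the image borders.
            y0, y1 = max(0, y - half), min(height, y + half + 1)
            x0, x1 = max(0, x - half), min(width, x + half + 1)
            values = [image[j][i] for j in range(y0, y1) for i in range(x0, x1)]
            n = len(values)
            mean = sum(values) / n
            variance = sum((v - mean) ** 2 for v in values) / n
            threshold = mean + k * sqrt(variance)
            # Pixels darker than the local threshold are treated as ink.
            row.append(1 if image[y][x] < threshold else 0)
        result.append(row)
    return result


# A dark stroke on a bright page and the same stroke inside a "shadow".
clean = [[200] * 5 for _ in range(5)]
clean[2][2] = 0
shadow = [[100] * 5 for _ in range(5)]
shadow[2][2] = 20

clean_mask = niblack_binarize(clean, window=3)
shadow_mask = niblack_binarize(shadow, window=3)
```

Because the threshold follows the local brightness, the stroke is picked out in both the bright and the shadowed patch, where a single global threshold such as 128 would mark the entire shadowed patch as ink. This brute-force version recomputes each window from scratch; it is meant to show the formula, not to be fast.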
Citations
| 42 | Nonlinear global and local document degradation models – Kanungo, Haralick, et al. - 1994 |
| 21 | Adaptive document image binarization – Sauvola, Pietikainen |
| 10 | State of the art of document image degradation modeling – Baird - 2000 |
| 8 | Document Image Quality: Making Fine Discriminations – Baird - 1999 |
| 4 | Identification of Text-Only Areas in Mixed-Type Documents – Strouthopoulos, Papamarkos, Chamzas - 1997 |
| 4 | Restoration of images scanned from thick bound documents – Zhang, Tan - 2001 |

