Border Noise Removal of Camera-Captured Document Images using Page-Frame Detection

Syed Saqib Bukhari, Faisal Shafait, Thomas Breuel
4th International Workshop on Camera-Based Document Analysis and Recognition, Lecture Notes in Computer Science, Beijing, China, Springer, 9/2011

Abstract:

Camera-captured document images usually contain two main types of marginal noise: textual noise (coming from neighboring pages) and non-textual noise (resulting from the page surrounding and/or binarization process). These types of marginal noise degrade the performance of the preprocessing (dewarping) of camera-captured document images and subsequent document digitization/recognition processes. Page frame detection is one of the newly investigated areas in document image processing, which is used to remove border noise and to identify the actual content area of document images. In this paper, we present a new technique for page frame detection of camera-captured document images. We use text and nontext contents information to find the page frame of document images. We evaluate our algorithm on the DFKI-I (CBDAR 2007 Dewarping Contest) dataset. Experimental results show the effectiveness of our method in comparison to other stateof- the-art page frame detection approaches.

Files:

  Bukhari-Page-Frame-CBDAR11.pdf

BibTex:

@inproceedings{ BUKH2011,
	Title = {Border Noise Removal of Camera-Captured Document Images using Page-Frame Detection},
	Author = {Syed Saqib Bukhari and Faisal Shafait and Thomas Breuel},
	BookTitle = {4th International Workshop on Camera-Based Document Analysis and Recognition},
	Month = {9},
	Year = {2011},
	Series = {Lecture Notes in Computer Science},
	Publisher = {Springer}
}

     
Last modified:: 30.08.2016