Dewarping of Document Images using Coupled-Snakes

Syed Saqib Bukhari, Faisal Shafait, Thomas Breuel
Proceedings of the Third International Workshop on Camera-Based Document Analysis and Recognition, Barcelona, Spain, Online, 7/2009

Abstract:

Traditional OCR systems are designed for planar (dewarped) images and the accuracy is reduced when applied on warped images. Therefore, developing new OCR techniques for warped images or developing dewarping techniques are the possible solutions for improving OCR accuracy camera-captured documents. Among different types of dewarping techniques, curled textlines information based dewarping techniques are the most popular ones, but are sensitive to high degrees of curl and variable line spacing. In this paper we build a novel dewarping approach based on curled textlines information, which has been extracted using ridges based modified active contour model (coupled snakes). Our dewarping approach is less sensitive different direction of curl and variable line spacing. Experimental results show that OCR error rate, from warped to dewarped documents, has been reduced from 5.15% to 1.92% on the dataset of CBDAR 2007 document image dewarping contest. We also report the performance of our method in comparison with other state-of-the-art methods.

Files:

  2009-IUPR-21Aug_1705.pdf

BibTex:

@inproceedings{ BUKH2009,
	Title = {Dewarping of Document Images using Coupled-Snakes},
	Author = {Syed Saqib Bukhari and Faisal Shafait and Thomas Breuel},
	BookTitle = {Proceedings of the Third International Workshop on Camera-Based Document Analysis and Recognition},
	Month = {7},
	Year = {2009},
	Publisher = {Online}
}

     
Last modified:: 30.08.2016