A Hybrid Approach for Skew Detection and Correction in the Multi-script Scanned Document

Main Article Content

M. Ramanan

Abstract

Skew detection and correction of a scanned document is a very important step in Optical Character Recognition because skew of scanned document is reducing the accuracy of text line approach for skew detection and correction to calculate the skew angle on multi-script scanned document using Radon transform, Hough transform, Harries corner, Wiener filter and smearing algorithm. In this paper, a proposed approach is compared existing skew detection and correction techniques for printed documents having different scripts: English, Tamil, Sinhala and mixed-script. A proposed hybrid method is tested on 160 documents. The overall testing results is 90.62% for skew detection and correction.

Keywords:
OCR, skew detection and correction, radon transform, Hough transform, Harris corner.

Article Details

How to Cite
Ramanan, M. (2019). A Hybrid Approach for Skew Detection and Correction in the Multi-script Scanned Document. Asian Journal of Research in Computer Science, 4(2), 1-8. https://doi.org/10.9734/ajrcos/2019/v4i230112
Section
Original Research Article

References

Salem Saleh Bafjaish, Mohd Sanusi Azmi, Hairulnizam Mahdin. Skew detection and correction of Mushaf Al-Quran script using hough transform. International Journal of Advanced Computer Science and Applications. 2018;9(8):402-409.

Abdelhak Boukharouba. A new algorithm for skew correction and baseline detection based on the randomized hough transform. Journal of King Saud University– Computer and Information Sciences. 2017; 29-38.

Rubani Jyoti Rani. Skew detection and correction in text document image using projection profile technique. International Journal of Computer Sciences and Engineering. 2018;6(7):986-990.

Ramandeep Kaur, Seema, Sunil Kumar. A hybrid approach to detect and correct a skew in scanned document images using fast Fourier transform and nearest neighbor algorithm. International Journal of Advances in Electronics and Computer Science. 2016;3(5):1-6.

Subrahmanyam MSLB, Vijaya Kumar V, Eswara Reddy B. A new algorithm for skew detection of Telugu language document based on principle-axis farthest pairs quadrilateral (PFPQ). International Journal of Image, Graphics and Signal Processing. 2018;47-58.

Papandreou A, Gatos B. A course to fine skew estimation technique for handwritten documents. International conference on document analysis and recognition (IEEE); 2013.

Aditya B, Kumar A, Yadav BM. Skew Detection in handwritten documents. Inter-national Journal of Computer Applications. 2013;69:17-18.

Jipeng T, Kumar HG, Chethan HK. 'A skew correction for Chinese character using hough transform. International Journal of Advanced Computer Science and Applications. 2011;45-48.

Singh R, Kaur R. 'Improved skew detection and correction approach using discrete fourier algorithm. International Journal of Soft Computing and Engineering. 2013; 3(4):55-57.

Kaur L, Jindal S. Skew detection technique for various scripts. International Journal of Scientific & Engineering Research. 2011;2(9):1-3.

Rao KR, Kim DN, Hwang JJ. Fast Fourier Transform - Algorithms and Applications, Springer, Dordre ht; 2010.

Mehta S, Walia E, Dutta M. Fast frequency domain method to detect skew in a document image. Seventh International Conference on Graphic and Image Processing. 2015;9817.

Yan H. Skew correction of document images using interline cross-correlation. CVGIP Graphical Models and Image Processing. 1993;55(6):538-543.

Li ST, Shen QH, Sun J. Skew detection using wavelet decomposition and projection profile analysis. Patter Recognition Letters. 2007;28(5):555-562.

Basavanna M, Gornale SS. Skew detection and skew correction in scanned document image using principal component analysis. International Journal of Scientific & Engineering Research. 2015;6(1):1414-1417.

Steinherz T, Intrator N, Rivlin E. “Skew detection via principal components analysis. Proceedings of the International Conference on Document Analysis and Recognition. 1999;153-156,

Aithal K, Rajesh G, Acharya D, Siddalingaswary P. A fast and novel skew estimation approach using radon trans-form. International Journal of Computer Information Systems and Industrial Management Applications. 2013;5:337-344.

Ahmed Gari, Ghizlane Khaissidi, Mostafa Mrabti, Driss Chenouni, Mounim El Yacoubi. Skew detection and correction based on Hough transform and Harris corners, 2017 International Conference on Wireless Technologies, Embedded and Intelligent Systems (WITS); 2017.

Ramanan M, Ramanan A, Charles EYA. A preprocessing method for printed Tamil documents: Skew correction and textual classification. IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS). 2015;495-500.

Harris C, Stephens M. A combined corner and edge detector. Proceedings of the 4th Alvey Vision Conference. 1988;147-151.

Shafii M. Optical character recognition of printed persian arabic documents. Electronic Thesis and Dissertations; 2014.