>
Fa   |   Ar   |   En
   An Approach to Skew Detection of Printed Documents  
   
نویسنده Brodić Darko ,Mello Carlos A. B. ,Maluckov Čedomir A. ,Milivojević Zoran N.
منبع journal of universal computer science - 2014 - دوره : 20 - شماره : 4 - صفحه:488 -506
چکیده    In this paper, we propose an approach to estimate the text skew for printed documents. this is an important step to prevent errors in further stages of an automatic document processing system (as text segmentation). our approach is based on the statistical analysis of the height of the connected components. in a nutshell, our algorithm is comprised of four steps: (i) removal of redundant data; (ii) establishment of the connected components, which represent filled convex hulls around each text element; (iii) enlargement of these components using morphological erosion; (iv) removal of the largest connected component to identify the first estimation of text skew. according to it, the connected components are enlarged by oriented morphological erosion and the longest of them is extracted. statistical moments are applied to this longest component to evaluate its orientation and the global text skew of the document is identified. at the end of this process, the original document is rotated back based on the calculated angle. the performance of the proposed algorithm is examined by testing on a custom dataset. the results support the robustness of our approach.
کلیدواژه Document image analysis ,Connected component analysis ,Statistical analysis ,Moment based method ,Skew estimation
آدرس University of Belgrade, Technical Faculty in Bor, Serbia, Universidade Federal de Pernambuco, Centro de Informática, Brazil, University of Belgrade, Technical Faculty in Bor, Serbia, College of Applied Technical Sciences Niš, Serbia
پست الکترونیکی zoran.milivojevic@vtsnis.edu.rs
 
     
   
Authors
  
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved