|
|
script-independent handwritten text line segmentation using directional 2d filters
|
|
|
|
|
نویسنده
|
ziaratban majid
|
منبع
|
رايانش نرم و فناوري اطلاعات - 2020 - دوره : 9 - شماره : 1 - صفحه:46 -60
|
چکیده
|
Text line segmentation is an important stage of the optical character recognition (ocr) algorithms. to analyze and recognize a document, text lines have to be segmented accurately. text line segmentation of handwritten documents is more difficult than that of machineprinted ones. curved and multiskewed text lines, overlapping text lines, and very small text lines are the main challenges. most of the proposed approaches did not consider local features of text lines in a document image. in our proposed method, both global and local features are considered. the proposed method is based on using directional 2d anisotropic filters. the parameters of our method are tuned based on a main global parameter which is computed for each document, separately. hence, the proposed method is a datasetindependent method. a document is divided into several blocks for which some local characteristics are calculated. in each block, text regions are detected by using local characteristics such as the block skew. in order to estimate the skew of text regions in a block, a novel text block skew estimation algorithm is proposed in this paper. experimental results show that the proposed method outperforms all the stateoftheart methods on three standard datasets. our final fmeasure are 0.54%, 0.03%, and 0.02% greater than the winner of icdar2013 text line segmentation contests on icdar2013, icdar09, and hit-mw datasets, respectively. the experiments proved that the proposed method can accurately segment text lines of complicated handwritings.
|
کلیدواژه
|
text line segmentation ,handwritten documents ,scriptindependent method ,directional 2d filters
|
آدرس
|
golestan university, faculty of engineering, department of electrical engineering, iran
|
پست الکترونیکی
|
m.ziaratban@ gu.ac.ir
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|