Publications:HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

From ISLAB/CAISR

Do not edit this section

Keep all hand-made modifications below

Title HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation
Author Yaregal Assabie and Josef Bigun
Year 2009
PublicationType Conference Paper
Journal
HostPublication Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
Conference 10th International Conference on Document Analysis and Recognition, ICDAR '09, July 26-29, Barcelona, Spain, 2009
DOI http://dx.doi.org/10.1109/ICDAR.2009.50
Diva url http://hh.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:408379
Abstract Amharic is the official language of Ethiopia and uses Ethiopic script for writing. In this paper, we present writer-independent HMM-based Amharic word recognition for offline handwritten text. The underlying units of the recognition system are a set of primitive strokes whose combinations form handwritten Ethiopic characters. For each character, possibly occurring sequences of primitive strokes and their spatial relationships, collectively termed as primitive structural features, are stored as feature list. Hidden Markov models for Amharic words are trained with such sequences of structural features of characters constituting words. The recognition phase does not require segmentation of characters but only requires text line detection and extraction of structural features in each text line. Text lines and primitive structural features are extracted by making use of direction field tensor. The performance of the recognition system is tested by a database of unconstrained handwritten documents collected from various sources.