Publications:A comprehensive Dataset for Ethiopic Handwriting Recognition


Do not edit this section

Keep all hand-made modifications below

Title A comprehensive Dataset for Ethiopic Handwriting Recognition
Author Yaregal Assabie and Josef Bigun
Year 2009
PublicationType Book Chapter
HostPublication Proceedings SSBA '09 : Symposium on Image Analysis, Halmstad University, Halmstad, March 18-20, 2009
Diva url
Abstract Ethiopic script is used by several languages in Ethiopia for writing. We present a comprehensive dataset of handwritten Ethiopic script called DEHR (Dataset for Ethiopic Handwriting Recognition) captured both offline and online. The offline dataset includes isolated characters, Ethiopian church documents and ordinary handwritten texts dealing with various real-life issues. The ordinary texts and isolated characters were freely written by several participants. The church documents are written in Geez and Amharic languages whereas the language for ordinary texts is Amharic only. The online dataset was collected by using two Digimemo devices of different sizes. For isolated characters and online dataset, all the 265 character samples used by Amharic language are included. The dataset is intended to set a benchmark for training and/or testing handwriting recognition, character and word segmentation, and text line detection. The dataset is can be accessed by contacting the authors or via