Publications:A comprehensive Dataset for Ethiopic Handwriting Recognition
From ISLAB/CAISR
Title | A comprehensive Dataset for Ethiopic Handwriting Recognition |
---|---|
Author | Yaregal Assabie and Josef Bigun |
Year | 2009 |
PublicationType | Book Chapter |
Journal | |
HostPublication | Proceedings SSBA '09 : Symposium on Image Analysis, Halmstad University, Halmstad, March 18-20, 2009 |
Conference | |
DOI | |
Diva url | http://hh.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:728384 |
Abstract | Ethiopic script is used by several languages in Ethiopia for writing. We present a comprehensive dataset of handwritten Ethiopic script called DEHR (Dataset for Ethiopic Handwriting Recognition) captured both offline and online. The offline dataset includes isolated characters, Ethiopian church documents and ordinary handwritten texts dealing with various real-life issues. The ordinary texts and isolated characters were freely written by several participants. The church documents are written in Geez and Amharic languages whereas the language for ordinary texts is Amharic only. The online dataset was collected by using two Digimemo devices of different sizes. For isolated characters and online dataset, all the 265 character samples used by Amharic language are included. The dataset is intended to set a benchmark for training and/or testing handwriting recognition, character and word segmentation, and text line detection. The dataset is can be accessed by contacting the authors or via http://www.hh.se/staff/josef/. |