Difference between revisions of "Publications:Recognition of Printed Sinhala Characters"
(Created page with "<div style='display: none'> == Do not edit this section == </div> {{PublicationSetupTemplate|Author=L. Premaratne, Josef Bigun |PID=408415 |Name=Premaratne, L. (Högskolan i H...") |
(No difference)
|
Revision as of 12:50, 13 March 2014
[[Abstract::Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised through a multi-level filtering process using the Linear Symmetry [LS] feature [1]. The recognised character is then segmented to identify the associated modifier symbol/s. Since the use of LS recognises characters prior to segmentation, the most difficult task of separating touching characters is easily solved. A method to determine the skew angle of the script is also presented. Experiments conducted so far for widely used fonts of different sizes yield encouraging results.
|]]Title | Recognition of Printed Sinhala Characters |
---|---|
Author | L. Premaratne and Josef Bigun |
Year | 2001 |
PublicationType | Conference Paper |
Journal | |
HostPublication | Proceedings of the fifth Asian Conference on Computer Vision : ACCV 2002 |
Conference | ACCV2002: The 5th Asian Conference on Computer Vision, 22-25 January 2002, Melbourne, Australia. |
DOI | |
Diva url | http://hh.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:408415 |
Abstract | Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised through a multi-level filtering process using the Linear Symmetry (LS) feature (1). The recognised character is then segmented to identify the associated modifier symbol/s. Since the use of LS recognises characters prior to segmentation, the most difficult task of separating touching characters is easily solved. A method to determine the skew angle of the script is also presented. Experiments conducted so far for widely used fonts of different sizes yield encouraging results. |