Publications:Recognition of Printed Sinhala Characters

From ISLAB/CAISR
Revision as of 12:50, 13 March 2014 by SlawekBot (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Do not edit this section

[[Abstract::

Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised through a multi-level filtering process using the Linear Symmetry [LS] feature [1]. The recognised character is then segmented to identify the associated modifier symbol/s. Since the use of LS recognises characters prior to segmentation, the most difficult task of separating touching characters is easily solved. A method to determine the skew angle of the script is also presented. Experiments conducted so far for widely used fonts of different sizes yield encouraging results.

|]]

Keep all hand-made modifications below

Title Recognition of Printed Sinhala Characters
Author L. Premaratne and Josef Bigun
Year 2001
PublicationType Conference Paper
Journal
HostPublication Proceedings of the fifth Asian Conference on Computer Vision : ACCV 2002
Conference ACCV2002: The 5th Asian Conference on Computer Vision, 22-25 January 2002, Melbourne, Australia.
DOI
Diva url http://hh.diva-portal.org/smash/record.jsf?searchId=1&pid=diva2:408415
Abstract Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised through a multi-level filtering process using the Linear Symmetry (LS) feature (1). The recognised character is then segmented to identify the associated modifier symbol/s. Since the use of LS recognises characters prior to segmentation, the most difficult task of separating touching characters is easily solved. A method to determine the skew angle of the script is also presented. Experiments conducted so far for widely used fonts of different sizes yield encouraging results.