OPTICAL CHARACTER RECOGNITION FOR SINHALA

Optical Character Recognition (OCR) for printed Sinhala documents, based on Tesseract 3.01 and trained by the Software Development Unit of the University of Colombo School of Computing (UCSC). The OCR process is divided into two stages: input and processing, handled by the Tesseract OCR engine, and post-processing, handled by an engine developed at UCSC. Please note: files loaded…

RESEARCH IN SINHALA NUMERALS

Sinhala numerals were used in the Kandyan Convention, signed between the Kandyan chieftains and the British governor in 1815. The fruition of extensive research initiated by ICTA, entitled "Numerations in the Sinhala Language", authored by Harsha Wijayawardhana, Head of the Software Development Unit, School of Computing, University of Colombo, and edited by Aruni Goonetilleke, Programme Manager, Information Infrastructure, ICTA,…