*** Welcome to piglix ***

OCR in Indian Languages


Optical character recognition (Also known as OCR) is the process of converting the image into text. OCR for English and other European languages has been able to achieve a high percentage of accuracy in conversion. But the OCR for Indian Languages were not able to achieve the kind of accuracy they achieved. This is mostly due to the complexity of Indian language, lack of standard representation, encoding, support of operating system and keyboard. Centre for Development of Advanced Computing (C-DAC) and Technology Development for Indian Languages, the premier R&D organisation of the Ministry of Electronics and Information Technology (Also known as MeitY) of India has done many projects for OCR. Their projects include OCR for Malayalam, Odia, Punjabi, Telugu and Devanagari script.


...
Wikipedia

...