Call for Paper - November 2022 Edition
IJCA solicits original research papers for the November 2022 Edition. Last date of manuscript submission is October 20, 2022. Read More

Translation of Brahmi Inscriptions to Sinhala using Natural Language Processing

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2020
Nethmi Hettiarachchi, Indrachapa Pathiraja, Dilum Madawala

Nethmi Hettiarachchi, Indrachapa Pathiraja and Dilum Madawala. Translation of Brahmi Inscriptions to Sinhala using Natural Language Processing. International Journal of Computer Applications 177(30):19-25, January 2020. BibTeX

	author = {Nethmi Hettiarachchi and Indrachapa Pathiraja and Dilum Madawala},
	title = {Translation of Brahmi Inscriptions to Sinhala using Natural Language Processing},
	journal = {International Journal of Computer Applications},
	issue_date = {January 2020},
	volume = {177},
	number = {30},
	month = {Jan},
	year = {2020},
	issn = {0975-8887},
	pages = {19-25},
	numpages = {7},
	url = {},
	doi = {10.5120/ijca2020919776},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


The recognition and translation of handwritten characters that were written without constraints is challenging. In this particular domain of interest “ancient inscriptions”, character recognition and translation is more curtail due to the wide variety of endemic writing styles. Research project named “Brahmi to Sinhala Translator”, is a result of an idea of coming up with a software solution to translate the ancient inscriptions written in “Mula Brahmi” language to Sinhala. According to the facts that found through the researchers, it is clear that, there is a huge gap between the Archeology and modern technology. Therefor archeologists still follow manual procedures to get their work done. “Brahmi to Sinhala Translator” is based on image processing, character recognition, text mining and natural language processing [1]. This project has been divided into four different functionalities. Removing the noise of the scanned image of the stain paper, recognition of letter patterns of Brahmi language, identification the corresponding Sinhala letter to the Brahmi letter and performing word and sentence break down and represent Brahmi Script in Sinhala. Since the proposed solution is a step by step approach, it will be able to provide a user friendly environment yet robust and accurate. Therefor this will be a great innovation for not only the field of archeology but also for the information technology.


  1. A. Manpreet Kaur, "Combination Method for Powerline Interference Reduction in ECG," International Journal of Computer Applications, vol. Volume 1– No.14, 2010.
  2. R. P. P. B. Anoop Kunchukuttan, "Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent," in Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, Denver, Colorado, 2015.
  3. A. l. o. o. p. MaesschalckD.Jouan-RimbaudD.L.Massart, "The Mahalanobis distance," Chemometrics and Intelligent Laboratory Systems, vol. 50, no. Issue 1, 4 January 2000, p. 18, 4 01 2000.
  4. C. M. Sankari, "Object Matching using Skeletonization based," International Journal of Computer Applications , vol. Volume 28, no. No.7, 2011 August.
  5. P. K. B. Gaurav Kumar, "A Detailed Review of Feature Extraction in Image Processing Systems," in 2014 Fourth International Conference on Advanced Computing & Communication Technologies, Rohtak, India, 2014.
  6. D. H. C. L. N. U. Viraj Welgama, "Towards a Sinhala Wordnet," in Conference on Human Language Technology for Development, 2011.
  7. G. G. Chowdhury, Natural Language Processing, 2005.
  8. Parascript, "PARASCRIPT," 2014. [Online]. Available: [Accessed 20 08 2015].
  9. T. S. J. P. Yannis Assael, "Restoring ancient text using deep learning: a case study on Greek epigraphy," Empirical Methods in Natural Language Processing (EMNLP) 2019, 2019.
  10. N. d. Silva, "Survey on Publicly Available Sinhala Natural Language Processing Tools and Research," arXiv:1906.02358v4 [cs.CL] 22 Jul 2019, p. 16, 2019.
  11. I. S. Nurmamatovna, "To recognize the manuscript texts of Arabic letters in ancient Uzbek script," World Scientific News, vol. 115, pp. 160-173, 2019.


Image Processing; Wavelet Decomposition; Character Recognition; Text Mining; Natural Language Processing