CFP last date
20 May 2024
Reseach Article

An Approach to Sort Unicode based Bengali Text using Trie

by Ranit Debnath Akash, U. Khyoi Nu, Biswapriyo Chakrabarty
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 163 - Number 11
Year of Publication: 2017
Authors: Ranit Debnath Akash, U. Khyoi Nu, Biswapriyo Chakrabarty
10.5120/ijca2017913764

Ranit Debnath Akash, U. Khyoi Nu, Biswapriyo Chakrabarty . An Approach to Sort Unicode based Bengali Text using Trie. International Journal of Computer Applications. 163, 11 ( Apr 2017), 18-22. DOI=10.5120/ijca2017913764

@article{ 10.5120/ijca2017913764,
author = { Ranit Debnath Akash, U. Khyoi Nu, Biswapriyo Chakrabarty },
title = { An Approach to Sort Unicode based Bengali Text using Trie },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2017 },
volume = { 163 },
number = { 11 },
month = { Apr },
year = { 2017 },
issn = { 0975-8887 },
pages = { 18-22 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume163/number11/27439-2017913764/ },
doi = { 10.5120/ijca2017913764 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:09:55.699947+05:30
%A Ranit Debnath Akash
%A U. Khyoi Nu
%A Biswapriyo Chakrabarty
%T An Approach to Sort Unicode based Bengali Text using Trie
%J International Journal of Computer Applications
%@ 0975-8887
%V 163
%N 11
%P 18-22
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper proposes a sorting algorithm for Unicode based Bangla texts using Trie.. Bengali texts can not be sorted using the Unicode character scheme as Unicode character sequence is different from the Bangla Academy character sequence. Moreover, Bengali, an Indo-Aryan language spoken by approximately 200 million people  has some distinct properties with its diacritic signs. In this paper, we have sorted Bangla texts based on the Bangla Academy character order using an efficient information retrieval data structure. Our proposed algorithm is more memory efficient and is applicable to any unicode based Bangla text.

References
  1. https://en.wikipedia.org/wiki/Bengali_language
  2. http://www.listsworld.com/top-10-languages-most-spoken-worldwide/
  3. http://timesofindia.indiatimes.com/india/Nearly-60-of-Indians-speak-a-language-other-than-Hindi/articleshow/36922157.cms
  4. https://en.wikipedia.org/wiki/Bangla_Academy
  5. http://forum.daffodilvarsity.edu.bd/index.php?topic=11714.0
  6. Aamira Shabnam, Debakar Shamanta Piklu, “An Easily Comprehendible Unicode Based Sorting Algorithm for Bangla Words”
  7. Aamira Shabnam, Tapashee Tabassum Urmi, Md. Saiful Islam, “A Faster Approach to Sort Unicode Represented Bengali Words”
  8. Partha Sarathi Kar, Shantanu Mandal, Labiba Jahan, “An Improved Unicode Based Sorting Algorithm for Bengali Words”
  9. Md. Mahfuzur Rahaman, “A Revised Unicode based Sorting Algorithm for Bengali Texts”
  10. বাংলা ডাটাসেট (কর্পাস) of http://scdnlab.com/corpus/
Index Terms

Computer Science
Information Sciences

Keywords

Sorting Algorithm Unicode Bengali text Trie