CFP last date
22 April 2024
Reseach Article

Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System

by Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 74 - Number 11
Year of Publication: 2013
Authors: Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal
10.5120/12929-9841

Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal . Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System. International Journal of Computer Applications. 74, 11 ( July 2013), 20-22. DOI=10.5120/12929-9841

@article{ 10.5120/12929-9841,
author = { Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal },
title = { Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System },
journal = { International Journal of Computer Applications },
issue_date = { July 2013 },
volume = { 74 },
number = { 11 },
month = { July },
year = { 2013 },
issn = { 0975-8887 },
pages = { 20-22 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume74/number11/12929-9841/ },
doi = { 10.5120/12929-9841 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:41:59.090251+05:30
%A Aaron M. Oirere
%A Ratnadeep R. Deshmukh
%A Pukhraj P. Shrishrimal
%T Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System
%J International Journal of Computer Applications
%@ 0975-8887
%V 74
%N 11
%P 20-22
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Speech corpus being the basic requirement for the development of Automatic speech recognition (ASR) system, it should be done with much accuracy in order to enhance the performance of the system. This paper describes the proposed procedure to abide while collecting the speech corpus of Swahili language from the native and non native speaker for the development of Automatic Speech Recognition system in Swahili language.

References
  1. Pukhraj Shrishrimal, R. R. Deshmukh, Vishal Waghmare, 2012 "Indian Language Speech Database: A Review", International Journal of Computer Application (IJCA), Vol 47, No. 5, (June – 2012), pp. 17-21.
  2. Guy De Pauw and Gilles-Maurice de Schryver,2008 "Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes" Lexikos 18 (AFRILEX-reeks/series 18: 2008): 303-318
  3. G. De Pauw, G. M. de Schryver, and P. W. Wagacha, 2006 "Data-driven part-of-speech tagging of Kiswahili". In P. Sojka, I. Kope?cek, and K. Pala, editors, Proceedings of Text, Speech and Dialogue, 9th International Conference, volume 4188 of Lecture Notes in Computer Science, pages 197–204, Berlin, Germany, Springer Verlag.
  4. Gakuru, Mucemi Iraki, Frederick K. Tucker, Roger Shalonova, Ksenia Ngugi, Kamanda 2005, "Development of a Kiswahili text to speech system", In INTERSPEECH-2005, 1481-1484.
  5. http://en. wikipedia. org/wiki/Languages_of_the_Democratic_Republic_of_the_Congo dated 27/06/2012
  6. E. A. Alpers, 1975 "Ivory and Slaves in East Central Africa", London, pp. 98– 99 ;
  7. T. Vernet 2002, "Les cités-Etats Swahili et la puissance omanaise" (1650– 1720), Journal des Africanistes, 72(2), pp. 102–105.
  8. Thomas J. Hinnebusch, 1992 "Ethnologue list of countries where Swahili is spoken", "Swahili", International Encyclopedia of Linguistics, Oxford, pp. 99–106
  9. David Dalby, 1999/2000, "The Linguasphere Register of the World's Languages and Speech Communities", Linguasphere Press, Volume Two, pg. 733–735
  10. Arvi Hurskainen, 2004 "Helsinki Corpus of Swahili. Compilers": Institute for Asian and African Studies (University of Helsinki) and CSC.
  11. Guy De Pauw, Peter Waiganjo Wagacha, Gilles-Maurice de Schryver, 2011 "Exploring the SAWA corpus: collection and deployment of a parallel corpus English—Swahili", International Journal of Lang Resources & Evaluation, Springer Verlag, vol 45, pp 331-344.
  12. Deen, Kamil Ud 2002 "The acquisition of Swahili verbal morphology", Palmela, Portugal. Costa, Joao & Freitas, Maria (Eds), in the proceedings to G. A. L. A conference (2002c) pp. 41-48.
  13. Gakuru, Mucemi , Frederick K. Iraki, Roger Tucker, Ksenia Shalonova, Kamanda Ngugi, 2005"Development of a Kiswahili text to speech system", In INTERSPEECH-2005, pp1481-1484.
  14. Hadrien Gelas, Laurent Besacier, F. Pellegrino, 2012 "Developments of Swahili resources for an automatic speech recognition system", SLTU – Workshop on Spoken Language Technologies for Under-Resourced Languages, Cape-Town, South Africa.
  15. Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal, Vishal B. Waghmare, "Swahili Text and Speech Corpus: A Review", Asian Journal of Computer Science and Information Technology, Vol. 2, No. 11, (Nov-2012), pp. 286-290.
Index Terms

Computer Science
Information Sciences

Keywords

Swahili Swahili Text corpus Phonetics Text Corpus and Speech Corpus Automatic Speech Recognition