CFP last date
20 May 2024
Reseach Article

Improved Identification of Protein Coding Region using Wavelet Transform

by Rajbir Singh, Guriqbal Singh, Dheeraj Pal Kaur
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 92 - Number 1
Year of Publication: 2014
Authors: Rajbir Singh, Guriqbal Singh, Dheeraj Pal Kaur
10.5120/15975-4864

Rajbir Singh, Guriqbal Singh, Dheeraj Pal Kaur . Improved Identification of Protein Coding Region using Wavelet Transform. International Journal of Computer Applications. 92, 1 ( April 2014), 32-37. DOI=10.5120/15975-4864

@article{ 10.5120/15975-4864,
author = { Rajbir Singh, Guriqbal Singh, Dheeraj Pal Kaur },
title = { Improved Identification of Protein Coding Region using Wavelet Transform },
journal = { International Journal of Computer Applications },
issue_date = { April 2014 },
volume = { 92 },
number = { 1 },
month = { April },
year = { 2014 },
issn = { 0975-8887 },
pages = { 32-37 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume92/number1/15975-4864/ },
doi = { 10.5120/15975-4864 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:13:10.442272+05:30
%A Rajbir Singh
%A Guriqbal Singh
%A Dheeraj Pal Kaur
%T Improved Identification of Protein Coding Region using Wavelet Transform
%J International Journal of Computer Applications
%@ 0975-8887
%V 92
%N 1
%P 32-37
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Identification of protein coding regions is an important topic in genomic examination. The various coding DNA model-independent methods are used where there is an existence of specific pattern of nucleotides. These methods presume the window length required for an examination of a DNA region. The DNA model depending coding methods catches more specific features of coding DNA. The model independent methods capture universal features of coding region. We introduce a method which is independent from the window length. Therefore objective of this paper is to improve the reorganization of protein coding region using wavelet transform with improved thresholding algorithms. This novel transform is adapt to examine periodic signal components and presents the advantage of being independent of the window length. Wavelet transform has a noisy signal, for this an improved wavelet transform threshold method has been used to improve the effects of denoising. The eukaryote data sets are used to analogize the results of proposed method with other previous methods. Finally the output obtained illustrates that the proposed method gives the better results with respect to identification accuracy. This method avoids sources of errors and makes a tool for detailed probe of the nucleotide occurrence.

References
  1. A. Marin, oliver et al. "On the Origin of the Periodicity of Three in Protein Coding DNA Sequences," J. Theoretical, vol. 167, no. 4 pp. 413-414, 1994.
  2. Lopez-Villasenor et al. "A Simple Model to Explain Three-Base Periodicity in Coding DNA," vol. 580, no. 27, pp. 6413-6422, 2006.
  3. Trifonov E. Elucidating sequences codes: three codes for evaluation, Ann NY Acad sci. 1999, 870: 330-338.
  4. Kinghom B et al. "A. periodicity of DNA in exons". BMC Molecular Biology 2004.
  5. S. Ramachandran et al. "Prediction of probable genes by Fourier analysis of genomic sequences," CABIOS, vol. 13, no. 3, pp. 263-270, 1997.
  6. B. Yoon, et al. "The Role of Signal-Processing Concepts in Genomics and Proteomics," J. Franklin Inst. , vol. 341, no. 1-2, pp. 111-135, 2004.
  7. T. W. Fox and A. Carreira, "A digital signal processing method for gene prediction with improved noise suppression," EURASIP Journal on Applied Signal Processing, vol. 2004, no. 1, pp. 108-114, 2004.
  8. Y. Zhang, S. C. Agner, et al. "Signal Processing Techniques in Genomic Engineering," Proc. IEEE, vol. 90, no. 12, pp. 1822-1833, 2002.
  9. Datta S. , Asif A. , "A Fast DFT-Based Gene Prediction Algorithm for Identification of Protein Coding Regions," Proceedings of the 30th International Conference on Acoustics, Speech, and Signal Processing, 2005.
  10. H. Yan, and M. Yang, "Pattern Recognition Techniques for the Emerging Field of Bioinformatics: A Review," Pattern Recognition, vol. 38, no. 11, pp. 2055-2073, 2005.
  11. C. N. Moore, and J. C. Nelson, "Preliminary Wavelet Analysis of Genomic Sequences," Proc. IEEE Bioinformatics Conf. , pp. 509-510, 2003.
  12. F. Chen and Y. -T. Zhang, "A DNA Structure-Based Bionic Wavelet Transform and Its Applications to DNA Sequence Analysis," Applied Bionics and Biomechanics, vol. 1, no. 1, pp. 3-9, 2003.
  13. Zhao Ruizhen et al. "Three Novel Models of Threshold Estimator for Wavelet Coefficients". 01/2001; DOI:10. 1007/3-540-45333-4_19 In proceeding of: Wavelet Analysis and Its Applications, Second International Conference, WAA 2001, Hong Kong, China, December 18-20, 2001.
  14. R. F. Voss. Evolution of Long-range Fractal Correlations and 1/f noise in DNA base sequences. Physical Review Letters, 1992, 68(25):3805-3808.
  15. L. F. Costa and R. M. Cesar Jr. , Shape Analysis and Classification: Theory and Practice. CRC Press, 2001.
  16. Y. T. Chan, Wavelet Basics. Kluwer Academic, 1995.
  17. J. P. Carrer, H, Zana, Y, et al. "Identification of protein coding region using the modified gabor wavelet transform" IEEE/ACM transactions on commercial biology and bioinformatics 2008 vol. 5 pp. 198-207.
  18. Omid Abbasi, et al. "identification of exonic region in DNA sequences using cross-correlation and noise suppression by discrete wavelet transform," BMC Bioinformatics 2011; 12:430
Index Terms

Computer Science
Information Sciences

Keywords

Protein coding regions Digital signal processing Wavelet transforms Sequence analysis Wavelet thresholding.