Research Article

Empirical Analysis and Performance Evaluation of various GPU Implementations of Protein BLAST

by Sita Rani, O. P. Gupta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 151 - Number 7
Year of Publication: 2016
Authors: Sita Rani, O. P. Gupta
10.5120/ijca2016911779

Sita Rani, O. P. Gupta. Empirical Analysis and Performance Evaluation of various GPU Implementations of Protein BLAST. International Journal of Computer Applications. 151, 7 (Oct 2016), 22-27. DOI=10.5120/ijca2016911779

@article{ 10.5120/ijca2016911779,
author = { Sita Rani and O. P. Gupta },
title = { Empirical Analysis and Performance Evaluation of various GPU Implementations of Protein BLAST },
journal = { International Journal of Computer Applications },
issue_date = { Oct 2016 },
volume = { 151 },
number = { 7 },
month = { Oct },
year = { 2016 },
issn = { 0975-8887 },
pages = { 22-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume151/number7/26245-2016911779/ },
doi = { 10.5120/ijca2016911779 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:56:28.508494+05:30
%A Sita Rani
%A O. P. Gupta
%T Empirical Analysis and Performance Evaluation of various GPU Implementations of Protein BLAST
%J International Journal of Computer Applications
%@ 0975-8887
%V 151
%N 7
%P 22-27
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Bioinformatics applications are compute- and data-intensive by nature. As molecular databases grow daily with the experiments performed in molecular biology, deliberate steps are needed to accelerate bioinformatics applications. Considerable effort has already gone into optimizing most bioinformatics algorithms, and many applications have benefited greatly from Graphics Processing Units (GPUs). Compute Unified Device Architecture (CUDA) is a hardware and software platform used to exploit the multi-threaded architecture of GPUs. The Basic Local Alignment Search Tool (BLAST) is one of the most frequently used algorithms in bioinformatics. Several GPU implementations of protein BLAST have been proposed, and their authors claim different speedups. Because these implementations were evaluated on different hardware platforms and with different databases, their performance is difficult to compare accurately. In this paper, four GPU implementations of protein BLAST are explored in detail. To compare their performance, these GPU versions of BLAST are implemented on a common hardware platform: an NVIDIA M2050 GPU with 448 processing cores and 3 GB of memory, together with two hex-core Intel Xeon 2.93 GHz processors. Experiments are performed on a 2.38 GB protein database. Performance is analyzed and compared with standard NCBI-BLASTP, using execution time as the metric. In this environment, the speedups obtained by the different implementations ranged from 2.3X to 9.8X.
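The abstract reports speedup derived from execution time. As a minimal sketch, assuming the conventional definition (baseline execution time divided by accelerated execution time), the calculation could look like the following. The function name `speedup` and the timing values are hypothetical and used only for illustration; they are not measurements from the paper.

```python
def speedup(baseline_seconds: float, accelerated_seconds: float) -> float:
    """Return how many times faster the accelerated run is than the baseline.

    Assumes the conventional definition: baseline time / accelerated time.
    """
    if baseline_seconds <= 0 or accelerated_seconds <= 0:
        raise ValueError("execution times must be positive")
    return baseline_seconds / accelerated_seconds


# Placeholder timings for illustration only; NOT measurements from the paper.
example = speedup(baseline_seconds=120.0, accelerated_seconds=30.0)
print(f"{example:.1f}X")  # prints "4.0X"
```

Under this definition, a reported 9.8X means the GPU version finished the same search in roughly one tenth of the NCBI-BLASTP execution time on the same machine.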

Index Terms

Computer Science
Information Sciences

Keywords

BLAST, Bioinformatics, CUDA, GPU, Sequence Alignment, Thread