Call for Paper - November 2019 Edition
IJCA solicits original research papers for the November 2019 Edition. Last date of manuscript submission is October 21, 2019. Read More

K-Harmonic Means Granular Computing Model for Protein Sequence Motif Identification

Print
PDF
IJCA Special Issue on International Conference on Communication, Computing and Information Technology
© 2013 by IJCA Journal
ICCCMIT - Number 1
Year of Publication: 2013
Authors:
M Chitralegha
K Thangavel

M Chitralegha and K Thangavel. Article: K-Harmonic Means Granular Computing Model for Protein Sequence Motif Identification. IJCA Special Issue on International Conference on Communication, Computing and Information Technology ICCCMIT(1):17-23, February 2013. Full text available. BibTeX

@article{key:article,
	author = {M Chitralegha and K Thangavel},
	title = {Article: K-Harmonic Means Granular Computing Model for Protein Sequence Motif Identification},
	journal = {IJCA Special Issue on International Conference on Communication, Computing and Information Technology},
	year = {2013},
	volume = {ICCCMIT},
	number = {1},
	pages = {17-23},
	month = {February},
	note = {Full text available}
}

Abstract

Bioinformatics is concerned with creation and advancement of algorithms using techniques such as computational intelligence, applied mathematics and statistics to solve biological problems. Sequence analysis, protein structure alignment analysis and prediction, gene finding are said to be major research efforts done in the area of bioinformatics. Proteins are considered as one of the most important elements in the process of life. The activities and functions of proteins can be determined by protein sequence motifs. Identifying such motifs is one of the crucial tasks in the area of bioinformatics. In this study, Singular Value Decomposition (SVD) is adopted to select significant sequence segments and then K-Harmonic Means granular computing model is proposed to generate protein sequence motif information efficiently. Experimental result shows that K-Harmonic granular computing model outperforms K-Means granular technique.

References

  • O. Alter, P. O Brown, D. Botstein, "Singular value decomposition for genome-wide expression data preprocessing and modeling", PNAS, Vol. 97, No. 18, pp. 10101-10106, 2000.
  • T K Attwood, M E Beck, A J Bleasby, K. Degtyarenko, DJP. Smityh: Progress with the PRINTS protein fingerprint database. Nucleic Acids Res 1996, 24:182-183.
  • B. Chen, P. C Tai, R. Harrision and Y. Pan, "FIK Model: Novel Efficient Granular Computing Model for Protein Sequence Motifs and Structure Information Discovery", in IEEE proc, 6th symposium on Bioinformatics and Bio Engineering (BIBE), Washington DC, 2006, pp. 20-26.
  • B. Chen, P. C Tai, R. Harrison and Y. Pan, "FGK Model: An Efficient Granular Computing Model for Protein Sequence Motifs Information Discovery", in IASTED proc. International conference on Computational and Systems Biology (CASB), Dallas 2006, pp. 56-61.
  • D. L Davies, and D. W Buldin, "A cluster separation measure", IEEE Trans. Pattern Recogn. Machine Intell. , 1,224-227, 1979.
  • David W. Mount, Sequence and Genome Analysis, Cold Spring Harbor Laboratory Press, New York, 2001.
  • K. F Han and D. Baker, "Recurring local sequence motifs in proteins", J. Mol. Bio, Vol. 251, No. 1, pp. 176-187, 1995.
  • S. Henikoff, J. G. Henikoff and S. Pietrokovski, "Blocks+: a non redundant database of protein Alignment blocks derived from multiple compilation", Bioinformatics, Vol. 15, No. 6, pp. 417-479, 1999.
  • N. Hullo, C. J. A Sigrist, V. Le Saux, P. S Langendijk-Genevaux, L. Bordoli, A. Gattiker, E. De Castro, P. Bucher, and A. Bairoch, "Recent improvements to the PROSITE database", Nucleic Acids Res, Vol. 32, Database issue: D134-137, 2004.
  • W. Kabsch and C. Sander, "Dictionary of protein secondary structure pattern recognition of hydrogen-bonded and geometrical features", Biopolymers, Vol. 22, pp. 2577-2637, 1983.
  • Margaret H. Dunham, Data Mining- Introductory and Advanced Concepts, Pearson Education, 2006.
  • . C. Sander and R. Schneider, "Database of Homology-derived protein structures and the structural meaning of sequence alignment", Proteins: Struct. Funct. Genet. , vol. 9, No. 1, pp. 56-68, 1991.
  • C. Sander and R. Schneider, "Database of similarity derived protein structures and the structural meaning of sequence alignment, "Proteins: Struct. Funct. Gent. Vol. 9, No. 1, pp. 56-68, 1991.
  • G. Wang and R. L Dunbrack,Jr. , "PISCES: a protein sequence culling server", Bioinformatics,Vol. 19, No. 12, pp. 1589-1591,2003.
  • W. Zhong, G. Altun, R. Harrison, P. C Tai and Yi Pan, "Improved K-Means Clustering algorithm for Exploring Local Protein Sequence motifs Representing Common Structural Property", IEEE transactions on Nanobioscience, Vol. 4, No. 3, pp. 255-265, 2005.
  • B. Zhang, M. Hsu, U. Dayal, K-Harmonic means- a data clustering algorithm, Technical report HPL-1999-124. Hewlett Packard laboratories, 1999.