CFP last date
22 April 2024
Reseach Article

A Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm

by R. Rambabu, P. Srinivasa Rao
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 150 - Number 11
Year of Publication: 2016
Authors: R. Rambabu, P. Srinivasa Rao
10.5120/ijca2016911666

R. Rambabu, P. Srinivasa Rao . A Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm. International Journal of Computer Applications. 150, 11 ( Sep 2016), 42-46. DOI=10.5120/ijca2016911666

@article{ 10.5120/ijca2016911666,
author = { R. Rambabu, P. Srinivasa Rao },
title = { A Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { Sep 2016 },
volume = { 150 },
number = { 11 },
month = { Sep },
year = { 2016 },
issn = { 0975-8887 },
pages = { 42-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume150/number11/26141-2016911666/ },
doi = { 10.5120/ijca2016911666 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:55:45.089201+05:30
%A R. Rambabu
%A P. Srinivasa Rao
%T A Group Average Cluster Analysis of Few IGF1R Sequences using Modified Group Average Link Clustering Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 150
%N 11
%P 42-46
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering techniques have been widely used in the fields of information technology, biomedical sciences. Cluster analysis deals with the identification of a set of objects into subsets with some sort of similarities. Such groups are assigned to have similar function. In this paper, a modified group average clustering program was written in python language and applied on a dataset of IGF1R protein sequences to generate orthologous clusters of sequences and the phylogenetic trees were presented.

References
  1. Jiang, D., Tang, C. and Zhang, A., 2004. Cluster analysis for gene expression data: a survey. IEEE Transactions on knowledge and data engineering, 16(11), pp.1370-1386.
  2. https://www.cse.buffalo.edu/DBGROUP/bioinformatics/papers/survey.pdf
  3. Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J. and Church, G.M., 1999. Systematic determination of genetic network architecture. Nature genetics, 22(3), pp.281-285.
  4. Eisen, M.B., Spellman, P.T., Brown, P.O. and Botstein, D., 1998. Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences, 95(25), pp.14863-14868.
  5. Jain, A.K. and Dubes, R.C., 1988. Algorithms for clustering data. Prentice-Hall, Inc..
  6. Shamir, R. and Sharan, R., 2002. Algorithmic approaches to clustering gene expression data. In In.
  7. Jiang, T., Xu, Y. and Zhang, M.Q., 2002. Current topics in computational molecular biology. MIT Press.
  8. Rao, S.G. and Govardhan, A., 2014. Assessing h-and g-Indices of Scientific Papers using k-Means Clustering. International Journal of Computer Applications, 100(11).
  9. Rao, S.G. and Govardhan, A., 2015. Investigation of Validity Metrics for Modified K-Means Clustering Algorithm. i-Manager's Journal on Computer Science, 3(2), p.33.
  10. http://www.expasy.org
  11. Lipman, D.J. and Pearson, W.R., 1985. Rapid and sensitive protein similarity searches. Science, 227(4693), pp.1435-1441.
  12. Rossum, V.G. 2006. "PEP 3000 -- Python 3000". Python Software Foundation. http://www.python.org/dev/peps/pep-3000
  13. Olson, C.F., 1995. Parallel algorithms for hierarchical clustering. Parallel computing, 21(8), pp.1313-1325.
  14. Berkhin, P., 2006. A survey of clustering data mining techniques. In Grouping multidimensional data (pp. 25-71). Springer Berlin Heidelberg.
  15. http://www.ebi.ac.uk/clustalw
  16. Zhou, H. and Zhou, Y., 2005. SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures. Bioinformatics, 21(18), pp.3615-3621.
  17. Needleman, S.B. and Wunsch, C.D., 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of molecular biology, 48(3), pp.443-453.
Index Terms

Computer Science
Information Sciences

Keywords

IGF1R clusters group average clustering python program.