CFP last date
22 April 2024
Reseach Article

Clustering Indus Texts using K-means

by Nisha Yadav, Ambuja Salgaonkar, Mayank Vahia
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 162 - Number 1
Year of Publication: 2017
Authors: Nisha Yadav, Ambuja Salgaonkar, Mayank Vahia
10.5120/ijca2017913207

Nisha Yadav, Ambuja Salgaonkar, Mayank Vahia . Clustering Indus Texts using K-means. International Journal of Computer Applications. 162, 1 ( Mar 2017), 16-21. DOI=10.5120/ijca2017913207

@article{ 10.5120/ijca2017913207,
author = { Nisha Yadav, Ambuja Salgaonkar, Mayank Vahia },
title = { Clustering Indus Texts using K-means },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2017 },
volume = { 162 },
number = { 1 },
month = { Mar },
year = { 2017 },
issn = { 0975-8887 },
pages = { 16-21 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume162/number1/27207-2017913207/ },
doi = { 10.5120/ijca2017913207 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:07:45.923567+05:30
%A Nisha Yadav
%A Ambuja Salgaonkar
%A Mayank Vahia
%T Clustering Indus Texts using K-means
%J International Journal of Computer Applications
%@ 0975-8887
%V 162
%N 1
%P 16-21
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

One of the most important undeciphered scripts of the ancient world is the Indus script. Earlier studies had focused on the correlations between signs in the Indus texts using various statistical and computational techniques such as N-grams or Markov chains. In the present study, K-means clustering, an unsupervised machine learning technique is used to identify clusters of similar texts without making any assumptions about its content. The technique is effective in extracting significant clusters and patterns in the script. Nine clusters are extracted from this study. The texts in each cluster share a common set of structural elements and are more similar to each other than the texts in other clusters. The clusters, as extracted from the study, reveal inherent patterns due to adjacent and non-adjacent dependencies between signs in the Indus texts. These clusters have definitive patterns in the usage of the signs but are only weakly associated to any archaeological site or medium of writing. The characteristic signature features of each cluster are identified in the study. The study provides a good handle to extract the logic of writing in the Indus script.

References
  1. Jain, A. K. and Dubes, R. C. 1988 Algorithms for Clustering Data. Upper Saddle River, NJ, USA: Prentice-Hall, Inc.
  2. Han, J., Kamber, M., and Pei, J. 2011 Data Mining: Concepts and Techniques. San Francisco, California: Morgan Kaufmann Publishers.
  3. Myatt, G. J. and Johnson, W. P. 2009 Making Sense of Data II: A Practical Guide to Data Visualization, Advanced Data Mining Methods, and Applications. New Jersey: John Wiley and Sons, Inc.
  4. Kenoyer, J. M. 1998 Ancient Cities of the Indus Valley Civilization. Oxford: Oxford University Press.
  5. Possehl, G. L. 2002 The Indus Civilization: A Contemporary Perspective. New Delhi: Vistaar Publications.
  6. Wright, R. P. 2010 The Ancient Indus – Urbanism, Economy and Society. New York: Cambridge University Press.
  7. Vahia, M. N. and Yadav, N. 2011. Reconstructing the History of Harappan Civilisation. Journal of Social Evolution and History. 10, 67 - 86.
  8. Possehl, G. L. 1996 Indus Age: The Writing System. New Delhi: Oxford & IBH Publishing Co. Pvt. Ltd.
  9. Mahadevan, I. 2002. Aryan or Dravidian or Neither? A Study of Recent Attempts to Decipher the Indus Script (1995-2000). Electronic Journal of Vedic Studies. 8.
  10. Parpola, A. 1994 Deciphering the Indus Script. Cambridge: Cambridge University Press.
  11. Parpola, A. 2005. Study of the Indus Script. In Proceedings of the International Conference of Eastern Studies,Tokyo: The Tôhô Gakkai, , 28-66.
  12. Yadav, N., Vahia, M. N., Mahadevan, I. and Joglekar, H. 2008. A Statistical Approach for Pattern Search in Indus Writing. International Journal of Dravidian Linguistics. vol. XXXVII, pp. 39-52.
  13. Yadav, N., Vahia, M. N., Mahadevan, I. and Joglekar, H. 2008. Segmentation of Indus Texts. International Journal of Dravidian Linguistics. vol. XXXVII, pp. 53-72.
  14. Yadav, N., Joglekar, H., Rao, R. P. N, Vahia, M. N., Adhikari, R. and Mahadevan, I. 2010. Statistical Analysis of the Indus Script Using n-grams. PLoS ONE. vol. 5.
  15. Yadav, N. and Salgaonkar, A. 2012. Statistical Studies of the Indus Script. Man and Environment. vol. XXXVII pp. 1-7.
  16. Yadav, N., Salgaonkar, A. and Vahia, M. N. 2014. Computational Techniques for Inferring the Syntax of Un-Deciphered Scripts. International Journal of Computer Science and Applications. Vol. 11, No. 2, pp. 50-61.
  17. Yadav, N. 2013. Sensitivity of Indus Script to Site and Type of Object. Scripta, vol. 5, pp. 67-103.
  18. Rao, R. P. N., Yadav, N., Vahia, M. N., Joglekar, H., Adhikari, R. and Mahadevan, I. 2009. A Markov Model of the Indus Script. Proceedings of the National Academy of Sciences. vol. 106, pp. 13685-13690.
  19. Rao, R. P. N., Yadav, N., Vahia, M. N., Joglekar, H., Adhikari, R. and Mahadevan, I. 2009. Entropic Evidence for Linguistic Structure in the Indus Script. Science. vol. 324, p. 1165.
  20. Rao, R. P. N., Yadav, N., Vahia, M. N., Joglekar, H. Adhikari, R. and Mahadevan, I. 2010. Entropy, the Indus Script and Language: A Reply to R. Sproat. Computational Linguistics. vol. 36, pp. 795-805.
  21. Yadav, N. and Vahia, M. N. 2011. Indus Script: A Study of its Sign Design. Scripta, vol. 3, pp. 133-172.
  22. Vahia, M. N. and Yadav, N. 2010. Harappan Geometry and Symmetry: A Study of Geometrical Patterns on Indus Objects. Indian Journal of History of Science. vol. 45, pp. 343-368.
  23. Yadav, N. and Vahia, M. N. 2011. Classification of Patterns on Indus Objects. International Journal of Dravidian Linguistics. vol. 40, pp. 89-114.
  24. Sinha, S., Yadav, N. and Vahia, M. N. 2011. In Square Circle: Geometric Knowledge of the Indus Civilization. In Math Unlimited: Essays in Mathematics, R. Sujatha, H. N. Ramaswamy and C. S. Yogananda, Eds., ed Enfield: Science Publishers. pp. 451-462.
  25. Mahadevan, I. 1977. The Indus Script: Texts, Concordance and Tables. New Delhi: Archaeological Survey of India.
Index Terms

Computer Science
Information Sciences

Keywords

Indus texts ancient script undeciphered script