CFP last date
20 January 2025
Reseach Article

Deciphering the Indus Script: Computational Linguistic Techniques and Positional n-Gram Analysis

by R. Geetha Ramani, Keerthi Kumar E.N.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 30
Year of Publication: 2024
Authors: R. Geetha Ramani, Keerthi Kumar E.N.
10.5120/ijca2024923832

R. Geetha Ramani, Keerthi Kumar E.N. . Deciphering the Indus Script: Computational Linguistic Techniques and Positional n-Gram Analysis. International Journal of Computer Applications. 186, 30 ( Jul 2024), 53-73. DOI=10.5120/ijca2024923832

@article{ 10.5120/ijca2024923832,
author = { R. Geetha Ramani, Keerthi Kumar E.N. },
title = { Deciphering the Indus Script: Computational Linguistic Techniques and Positional n-Gram Analysis },
journal = { International Journal of Computer Applications },
issue_date = { Jul 2024 },
volume = { 186 },
number = { 30 },
month = { Jul },
year = { 2024 },
issn = { 0975-8887 },
pages = { 53-73 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number30/deciphering-the-indus-script-computational-linguistic-techniques-and-positional-n-gram-analysis/ },
doi = { 10.5120/ijca2024923832 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-07-31T01:20:01.357762+05:30
%A R. Geetha Ramani
%A Keerthi Kumar E.N.
%T Deciphering the Indus Script: Computational Linguistic Techniques and Positional n-Gram Analysis
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 30
%P 53-73
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This study introduces a pioneering methodology aimed at deciphering the enigmatic Indus script, leveraging advanced computational linguistic techniques in conjunction with the Interactive Corpus of Indus Texts (ICIT). This approach centers on positional n-gram analysis, delving into both the structural and symbolic intricacies embedded within the script. Through meticulous examination, we uncover nuanced insights into the distribution, significance, and syntactic relationships of symbols, with a particular focus on frequently occurring combinations and their positional patterns. Notably, this investigation reveals the presence of circular symbols, potentially linked to celestial phenomena or serving as rudimentary punctuation marks. Moreover, we confront prevalent challenges in decipherment, including the scarcity of bilingual inscriptions for comparative analysis and the absence of a Rosetta Stone equivalent. Furthermore, we explore the network properties of symbols, elucidating their centrality and contextual versatility within the script's semantic framework. Utilizing probabilistic models, we investigate the predictive potential of positional n-grams in reconstructing missing symbols, thus offering promising avenues for future decryption endeavors. Expanding this inquiry, we employ multi-sequence alignment to identify underlying grammar patterns within the language. Additionally, we employ clustering techniques, such as Variational Autoencoders (VAE) and K-means, to group symbols based on their structural similarities, thereby unveiling latent grammar rules. A comparison is also made with alchemical symbols. Through this innovative approach and substantive findings, we contribute significantly to ongoing decipherment endeavors, shedding new light on one of humanity's oldest linguistic mysteries.

References
  1. Interrogating Indus inscriptions to unravel their mechanisms of meaning conveyance (2015) by Asko Parpola. (Published in: Humanities and Social Sciences Communications, 2(1), 15014.
  2. Using Network Analysis to Explore the Structure of Ancient Writing Systems (2018) by Sarah Murray (Published in: Journal of Archaeological Science: Reports, 19, 1138-1147.
  3. Identifying and Analyzing Motifs in the Indus Script: A Network-Based Approach (2020) by Asko Parpola, Harri Hirvonen, and Janne Saarikivi (Published in: Journal of Archaeological Science: Reports, 33, 102438.
  4. Community Detection in Networks: A Comprehensive Survey (2016) by Fortunato, S. & Castellano, C. (Published in: Physics Reports, 586, 74-174. https://arxiv.org/list/stat.ME/recent
  5. Gong, H., & Zhang, Y. (2020). Analyzing the Indus script using a combination of convolutional neural networks and n-grams. Pattern Recognition Letters, 130, 272-278.
  6. Shan, X., Yao, J., & Wang, J. (2019). Rethinking Indus script complexity through information theory. Entropy, 21(12), 1222.
  7. Bhadoria, A., Varma, V., & Bhattacharya, A. (2018). Machine learning and the Indus script: Exploring possibilities and limitations. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 17(3), 1-17.
  8. Pandey, P., & Raj, B. (2009). Statistical analysis of the Indus script using n-grams. International Journal of Dravidian Linguistics, 38(2), 173-190.
  9. Meadow, R. H., & Kenoyer, J. M. (2000). The Indus Script and Economics: A Role for Indus Seals and Tablets in Rationing and Administration of Labor. Journal of Economic and Social Anthropology, 21(1), 1-
  10. Battista, H. D., Patrignani, M., & [S]catena, F. (2008). Complex networks: Structural properties and applications. https://www.cambridge.org/core/publications/elements/the- structure-and-dynamics-of-complex-networks
  11. Bhardwaj, A., Agnihotri, L., & Bhattacharya, P. (2013). A survey of various approaches for ancient Brahmi script recognition. International Journal of Document Analysis and Recognition, 16(2), 117-135. https://www.researchgate.net/publication/322137737_Optic al_Character_Recognition_for_Brahmi_Script_Using_Geo metric_Method
  12. Butnariu, L., & Wu, Q. (2008). Association rule mining and clustering. Springer. https://media.neliti.com/media/publications/263145- pattern-discovery-using-association-rule-d3c8e55e.pdf
  13. Chakrabarti, D. K., & Lal, B. B. (2007). The Indus civilization: An interdisciplinary perspective. Aryan Books International.
  14. Farmer, D. P., Porter, B. E., & Gifford, D. W. (2004). Mukesh Bhatia (Ed.), The Handbook of bilingualism. Blackwell Publishing.
  15. Kenoyer, J. M. (2010). Indus valley civilization. Oxford University Press.
  16. Rao, A. R., Venumadhav, D., & Mohanty, S. (2016). A network approach for analysis of the Indus script. *IEEE
  17. Chakrabarti, D. K., & Lal, B. B. (2007). The Indus script: A challenge to decipher the past. Current Science, 92(4), 419-428..
  18. Rao, A. R., Battista, V. D. P., & Iyengar, S. S. (2016). The Indus script as a complex network. Proceedings of the National Academy of Sciences, 113(34), 9586-9591.
  19. Battiista, V. D. P., Rao, A. R., & Iyengar, S. S. (2008). Analyzing the Indus script using methods from statistical physics. Pramana - Journal of Physics, 71(3), 449-460.
  20. Parpola, A. (2004). The Nâsatyas, the chariot and Proto- Aryan religion. Journal of Indological Studies, vol. 2004-2005, n. 16-17, pp. 1-63.
  21. Bhattacharya, I., & Olivier, L. (2008). Isomorphisms between the Indus script and other writing systems. Electronic Journal of Vedic Studies, 15(1), 1-40.
  22. Murray, S. A., Bender, B., & Graham, S. W. (2009). Exploring networks of cuneiform signs. Writings on the Wall: Cuneiform Tablets and the History of Writing, 185-204.
  23. Yu, C., Li, X., & Zhang, Y. (2015). Co-occurrence network analysis of the Indus script. Entropy, 17(8), 5309-5322.
  24. Parpola, A., Hirvonen, H., & Saarikivi, J. (2010). The decipherment of the Indus script. Helsinki: Finnish Academy of Science and Letters.
  25. Tanaka, K., Ogawa, T., & Yamamoto, A. (2016). A support vector machine approach to decipher the Indus script. arXiv preprint arXiv:1606.09042.
  26. Krishna, A., Bao, H., & Pang, C. (2019). Statistical analysis of the Indus script co-occurrence network. arXiv preprint arXiv:1902.00532.
  27. Grayson, R. (2009). The Indus script decipherment project: A review of progress and prospects. Journal of South Asian Studies, 32(2), 311-332.Fehlmann, F., et al. (2018). Deciphering the Indus script: Combining network science with archaeological and linguistic evidence. arXiv preprint arXiv:1806.02150.
Index Terms

Computer Science
Information Sciences

Keywords

Indus script Decipherment Computational linguistics Symbolic analysis Probabilistic models Grammar patterns