International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 186 - Number 7 |
Year of Publication: 2024 |
Authors: Ashwini G. Pawar, Nita V. Patil |
10.5120/ijca2024923414 |
Ashwini G. Pawar, Nita V. Patil . Comparative Study of Techniques for Spoken Language Dialect Identification. International Journal of Computer Applications. 186, 7 ( Feb 2024), 47-58. DOI=10.5120/ijca2024923414
Identifying variations in spoken language resulting from regional or socioeconomic factors, known as dialect identification, is a significant problem in natural language processing and linguistics. This study contrasts various dialect identification approaches and evaluates their effectiveness. Techniques include deep learning, transfer learning, classical acoustic feature analysis, machine learning (ML) algorithms, phonetic and phonological analysis, lexical and grammatical feature extraction, prosodic analysis, and phonological analysis. Meticulous application of these techniques is applied to a hand-picked dataset of multiple dialects, with their performance assessed using accepted evaluation measures. These findings reveal that different techniques capture dialectal variations differently. Phonetic and phonological analysis excels at detecting minute pronunciation changes, while acoustic feature-based ML demonstrates resilience in dialect discrimination. Lexical and grammatical factors effectively recognize small differences in vocabulary and grammar usage. Prosodic features enhance dialect identification through intonation and rhythm patterns. Moreover, deep learning models showcase their capacity to learn intricate patterns from large datasets, and transfer learning techniques are effective in scenarios with limited dialect-specific data. Multilingual and cross-lingual approaches leverage shared linguistic properties for enhanced identification accuracy. Ensemble methods harness the strengths of multiple techniques, resulting in improved overall performance. This study underscores the significance of a diversified approach to dialect identification, with the choice of technique depending on factors such as available resources, data availability, and dialect complexity.