International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 31
Year of Publication: 2024
Authors: Sahil Panchbhaiya, Pranav Menon, Rishikesh Lingayat, Nikhita Mangaonkar
DOI: 10.5120/ijca2024923887
Sahil Panchbhaiya, Pranav Menon, Rishikesh Lingayat, Nikhita Mangaonkar. Hindi Pronunciation Analysis for Speech Impaired using MFCC and DTW. International Journal of Computer Applications. 186, 31 (Aug 2024), 48-54. DOI=10.5120/ijca2024923887
The aim of this experiment is to teach speech-impaired learners the pronunciation of Hindi syllables by providing word breakdowns, sounds, and examples of their usage. Once the speaker is familiar with the syllables, a voice sample is taken from the user and compared against predefined reference data to check whether the speaker is pronouncing the syllable correctly. Feature matching is performed using Mel-Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). Speech analysis is a two-step process: in the first phase, MFCC is used to extract fourteen features; in the second phase, three classifiers, k-Nearest Neighbour (KNN), Support Vector Machine (SVM), and Dynamic Time Warping (DTW), are evaluated to determine which combination gives the most accurate and precise feature matching.
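
The matching step described above can be illustrated with a minimal sketch of DTW over sequences of fourteen-dimensional feature frames. This is not the authors' implementation: the function names (`dtw_distance`, `matches_reference`), the Euclidean frame distance, the length normalisation, and the acceptance threshold are all assumptions made for illustration, and the toy frames stand in for real MFCC vectors, which would normally come from an audio feature-extraction library.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two feature frames of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW: minimum cumulative frame distance over all monotonic alignments."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch seq_b's frame
                                 cost[i][j - 1],      # stretch seq_a's frame
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def matches_reference(sample, reference, threshold):
    """Hypothetical decision rule: accept if the length-normalised DTW cost is small."""
    normalised = dtw_distance(sample, reference) / (len(sample) + len(reference))
    return normalised <= threshold


# Toy stand-ins for MFCC frames (14 coefficients per frame, as in the abstract).
reference = [[float(t)] * 14 for t in range(5)]
slow = [frame for frame in reference for _ in range(2)]   # same syllable, spoken slower
different = [[float(t) + 3.0] * 14 for t in range(5)]     # a different sound

print(dtw_distance(reference, slow))
print(matches_reference(slow, reference, threshold=0.5))
print(matches_reference(different, reference, threshold=0.5))
```

Because DTW lets one frame align with several frames of the other sequence, the slower utterance aligns with the reference at zero cost, while the shifted sequence does not. In practice the threshold would have to be tuned on recorded data.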