Emerging Trends in Computer Science and Information Technology (ETCSIT2012) |
Foundation of Computer Science USA |
ETCSIT - Number 4 |
April 2012 |
Authors: Borawake Madhuri P |
b6f8a9d3-5f84-447b-8cca-f99e66a06db8 |
Borawake Madhuri P . Innovative Technique for Audio Segmentation. Emerging Trends in Computer Science and Information Technology (ETCSIT2012). ETCSIT, 4 (April 2012), 27-30.
Speech segmentation is the process of identifying the boundaries between words, syllables, or phonemes in spoken natural languages. The term applies both to the mental processes used by humans, and to artificial processes of processing. Speech segmentation is an important sub problem of speech recognition, and cannot be adequately solved in isolation. The lowest level of speech segmentation is the breakup and classification of the sound signal into a string of phones. The difficulty of this problem is compounded by the phenomenon of co-articulation of speech sounds, where one may be modified in various ways by the adjacent sounds: it may blend smoothly with them, fuse with them, split, or even disappear. This phenomenon may happen between adjacent words just as easily as within a single word. The notion that speech is produced like writing, as a sequence of distinct vowels and consonants. In fact, the way we produce vowels depends on the surrounding consonants and the way we produce consonants depends on the surrounding vowels. Therefore, even with the best algorithms, the result of phonetic segmentation will usually be very distant from the standard written language.