Title :
Machine analysis and synthesis of spoken Telugu vowels
Author :
Maddela, Venkata Krishna Rao
Author_Institution :
Dept. of Electron. & Commun. Eng., CMR Inst. of Technol., Hyderabad, India
Abstract :
In this paper, the acoustic characteristics of spoken intrinsic Telugu vowels are studied. A spectrogram is built using short-time linear prediction (LP) analysis of the vowel signal. The trajectories of the spectral peaks (formants) are tracked over time, and the first three formant frequencies are estimated from these trajectories. While the analysis is straightforward for monophthong vowels, it is more involved for diphthongs, nasals, and conjuncts. The vowels are then resynthesized with a cascade formant synthesizer from the estimated formant frequencies, formant bandwidths, and, in the case of diphthongs, formant slopes/transitions. As the study is mainly aimed at vowel quality rather than the naturalness of the sound, the synthesis is carried out using three excitation sources: a band-limited impulse train, a band-limited triangular wave train, and the Liljencrants-Fant (LF) glottal source model. For each vowel, each formant frequency is varied over a range and the quality of the synthesized vowel is assessed subjectively. The range of each formant frequency over which the vowel color/quality is maintained is determined. In each case, the minimum number of formants required to maintain the vowel quality is also determined. The results of this study are useful in Telugu text-to-speech systems, Telugu transcription systems, and Indian music transcription systems.
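The analysis/synthesis loop described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation. It assumes a plain (not band-limited) impulse train as the excitation, the autocorrelation method for LP analysis, and a standard second-order digital resonator for each formant. The function names, the sampling rate, and the formant and bandwidth values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import numpy as np

def resonator(x, freq, bw, fs):
    """Second-order digital resonator modelling one formant (assumed form)."""
    c = -np.exp(-2.0 * np.pi * bw / fs)
    b = 2.0 * np.exp(-np.pi * bw / fs) * np.cos(2.0 * np.pi * freq / fs)
    a = 1.0 - b - c  # normalizes the gain to 1 at 0 Hz
    y = np.zeros(len(x))
    y1 = y2 = 0.0
    for n, xn in enumerate(x):
        yn = a * xn + b * y1 + c * y2
        y[n] = yn
        y2, y1 = y1, yn
    return y

def synthesize_vowel(formants, bandwidths, fs=10000, f0=100.0, dur=0.2):
    """Cascade formant synthesis driven by a plain impulse train (simplification)."""
    n = int(fs * dur)
    period = int(round(fs / f0))
    source = np.zeros(n)
    source[::period] = 1.0
    out = source
    for f, bw in zip(formants, bandwidths):
        out = resonator(out, f, bw, fs)  # resonators connected in series
    return out

def lpc(frame, order):
    """LP polynomial [1, -a1, ..., -ap] via the autocorrelation method."""
    w = frame * np.hamming(len(frame))
    r = np.correlate(w, w, mode="full")[len(w) - 1 : len(w) + order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1 : order + 1])
    return np.concatenate(([1.0], -a))

def estimate_formants(frame, fs, order=6):
    """Formant frequencies (Hz) from the angles of the LP polynomial roots."""
    roots = np.roots(lpc(frame, order))
    roots = roots[np.imag(roots) > 0]          # one root per conjugate pair
    freqs = np.angle(roots) * fs / (2.0 * np.pi)
    bws = -np.log(np.abs(roots)) * fs / np.pi  # pole radius -> bandwidth
    keep = (freqs > 90.0) & (bws < 400.0)      # drop implausible candidates
    return np.sort(freqs[keep])

# Hypothetical formant targets and bandwidths, for illustration only.
true_formants = (700.0, 1200.0, 2500.0)
signal = synthesize_vowel(true_formants, (80.0, 90.0, 120.0))
estimated = estimate_formants(signal[800:1200], fs=10000)
```

Repeating `estimate_formants` on successive frames would give formant trajectories over time. Varying one entry of `true_formants` while holding the others fixed mirrors the perturbation experiment, although the listening assessment itself cannot be reproduced in code.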
Keywords :
acoustic signal processing; natural language processing; speech processing; speech synthesis; Indian music transcription systems; LP analysis; Liljencrants-Fant glottal source model; Telugu text-to-speech systems; Telugu transcription systems; band limited triangular wave train; cascade formant synthesizer; formant bandwidths; formant slopes; monophthong vowels; short-time linear prediction analysis; spectrogram; spoken Telugu vowel machine analysis; spoken Telugu vowel synthesis; spoken intrinsic Telugu vowel acoustic characteristics; vowel color; vowel quality; vowel signal; Conjuncts; Nasals; Telugu; diphthongs; formant tracking; formants; isolated vowels; peak detection; vowel synthesis;
Conference_Title :
Computational Intelligence and Information Technology, 2013. CIIT 2013. Third International Conference on
Conference_Location :
Mumbai
DOI :
10.1049/cp.2013.2577