DocumentCode :
1196184
Title :
Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies
Author :
Prasanna, S. R Mahadeva ; Reddy, B. V Sandeep ; Krishnamoorthy, P.
Author_Institution :
Dept. of Electron. & Commun. Eng., Indian Inst. of Technol. Guwahati, Guwahati
Volume :
17
Issue :
4
fYear :
2009
fDate :
5/1/2009 12:00:00 AM
Firstpage :
556
Lastpage :
565
Abstract :
Vowel onset point (VOP) is the instant at which the onset of vowel takes place during speech production. There are significant changes occurring in the energies of excitation source, spectral peaks, and modulation spectrum at the VOP. This paper demonstrates the independent use of each of these three energies in detecting the VOPs. Since each of these energies represents a different aspect of speech production, it may be possible that they contain complementary information about the VOP. The individual evidences are therefore combined for detecting the VOPs. The error rates measured as the ratio of missing and spurious to the total number of VOPs evaluated on the sentences taken from the TIMIT database are 6.92%, 8.8%, 6.13%, and 4.0% for source, spectral peaks, modulation spectrum, and combined information, respectively. The performance of the combined method for VOP detection is improved by 2.13% compared to the best performing individual VOP detection method.
Keywords :
speech processing; TIMIT database; excitation source; modulation spectrum energy; spectral peaks; speech production; vowel onset point detection; Automatic speech recognition; Data mining; Databases; Error analysis; Feature extraction; Lips; Shape; Speech processing; Speech recognition; Tongue; Modulation spectrum and combining; source; spectral peaks; vowel onset point (VOP);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.2010884
Filename :
4802173
Link To Document :
بازگشت