Title :
A syllable-based prosody modeling for L1 and L2 English speeches
Author :
Wei-Fan Chen ; Chin-Kuan Kuo ; Yih-Ru Wang ; Sin-Horng Chen
Author_Institution :
Dept. of Electr. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Abstract :
In this paper, a statistical prosody modeling approach for L1 and L2 English speeches is proposed. The study focuses on the modeling of two prosodic-acoustic features: syllable duration and log-pitch contour. Several major affecting factors (AFs) that influence the variations of these two features are considered. They include lexical stress, word length, nearby break type, phonemic constituent of syllable, and prosodic state. A sequential optimization procedure is adopted to automatically train the two models from the TWNAESOP corpus recorded in Taiwan. Experimental results showed that most AFs estimated agreed well with our prior linguistic knowledge. The differences in the prosody of L1 and L2 speeches were also explored.
Keywords :
acoustic signal processing; linguistics; natural language processing; optimisation; speech processing; statistical analysis; text analysis; AF; L1 English speech; L2 English speech; TWNAESOP corpus; Taiwan; affecting factor; lexical stress; linguistic knowledge; log-pitch contour; phonemic constituent; prosodic state; prosodic-acoustic feature; sequential optimization procedure; statistical prosody modeling; syllable duration; syllable-based prosody modeling; word length; Polynomials; Pragmatics; Scattering; Speech; Standards; Stress; Vectors; L2 English speech; Prosody modeling; duration modeling; pitch contour modeling;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423464