Title :
Lexical stress classification for language learning using spectral and segmental features
Author :
Ferrer, Luciana ; Bratt, Harry ; Richey, Colleen ; Franco, Hugo ; Abrash, Victor ; Precoda, Kristin
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
Abstract :
We present a system for detecting lexical stress in English words spoken by English learners. The system uses both spectral and segmental features to detect three levels of stress for each syllable in a word. The segmental features are computed on the vowels and include normalized energy, pitch, spectral tilt and duration measurements. The spectral features are computed at the frame level and are modeled by one Gaussian Mixture Model (GMM) for each stress class. These GMMs are used to obtain segmental posteriors, which are then appended to the segmental features to obtain a final set of GMMs. The segmental GMMs are used to obtain posteriors for each stress class. The system was tested on English speech from native English-speaking children and from Japanese-speaking children with variable levels of English proficiency. Our algorithm results in an error rate of approximately 13% on native data and 20% on Japanese non-native data.
Keywords :
Gaussian processes; learning systems; mixture models; natural languages; Gaussian mixture model; language learning; lexical stress classification; segmental features; segmental posteriors; spectral features; Adaptation models; Computational modeling; Data models; Feature extraction; Hidden Markov models; Speech; Stress; Computer-aided language learning; Gaussian Mixture Models; Stress classification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6855099