DocumentCode :
2991612
Title :
Collection of phoneme samples using time alignment and spectral stationarity of speech signals
Author :
Haltsonen, Seppo ; Ruusunen, Pekka
Author_Institution :
Helsinki University of Technology, Espoo, Finland
Volume :
10
fYear :
1985
fDate :
31138
Firstpage :
1561
Lastpage :
1564
Abstract :
An automatic method for collecting a large number of phoneme samples to be used as training data for speech recognition is described. Time alignment and spectral stationarity of speech signals are used to transfer phoneme labels from a hand labeled utterance of a standard speaker to a similar utterance of another speaker for whom training data are needed. Experimental results based on speech data obtained from eight male speakers show that automatically obtained training data almost yield the same phoneme recognition accuracy as hand labeled training data.
Keywords :
Automatic speech recognition; Loudspeakers; Physics; Prototypes; Speech recognition; Training data; Viterbi algorithm; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
Type :
conf
DOI :
10.1109/ICASSP.1985.1168214
Filename :
1168214
Link To Document :
بازگشت