Title :
Collection of phoneme samples using time alignment and spectral stationarity of speech signals
Author :
Haltsonen, Seppo ; Ruusunen, Pekka
Author_Institution :
Helsinki University of Technology, Espoo, Finland
Abstract :
An automatic method for collecting a large number of phoneme samples to be used as training data for speech recognition is described. Time alignment and spectral stationarity of speech signals are used to transfer phoneme labels from a hand labeled utterance of a standard speaker to a similar utterance of another speaker for whom training data are needed. Experimental results based on speech data obtained from eight male speakers show that automatically obtained training data almost yield the same phoneme recognition accuracy as hand labeled training data.
Keywords :
Automatic speech recognition; Loudspeakers; Physics; Prototypes; Speech recognition; Training data; Viterbi algorithm; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
DOI :
10.1109/ICASSP.1985.1168214