DocumentCode :
2999044
Title :
Automatic alignment of speech with phonetic transcriptions in real time
Author :
Torkkola, Kari
Author_Institution :
Lab. of Inf. & Comput. Sci., Helsinki Univ. of Technol., Espoo, Finland
fYear :
1988
fDate :
11-14 Apr 1988
Firstpage :
611
Abstract :
A system that aligns speech waveforms with the corresponding phonetic transcriptions is described. The alignment is based mainly on labeling speech frames, spaced one centisecond apart, with phonetic classes. A novel method based on neural network principles is used to accomplish the labeling. Another major source of information is spectral stationarity. The alignment is performed in two main stages. First, a list of phonetic events having stationary properties is constructed, and the phonetic transcription is roughly aligned with this list. A more detailed boundary refinement is then carried out using heuristic speech-specific knowledge. The system runs in real time on a standard IBM PC/AT. It is used for on-line speaker enrollment and syntactic correction analysis, in addition to establishing a database for speech recognition research.
Keywords :
neural nets; speech analysis and processing; speech recognition; IBM PC/AT; automatic speech alignment; boundary refinement; database; heuristic speech-specific knowledge; neural network principles; online speaker enrollment; phonetic classes; phonetic events; phonetic transcriptions; spectral stationarity; speech frames; speech recognition research; speech waveforms; stationary properties; syntactic correction analysis; Computer science; Dynamic programming; Labeling; Laboratories; Natural languages; Neurons; Prototypes; Refining; Robustness; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1988.196659
Filename :
196659