DocumentCode
3071436
Title
A procedure for automatic alignment of phonetic transcriptions with continuous speech
Author
Leung, Hong C. ; Zue, Victor W.
Author_Institution
Massachusetts Institute of Technology, Cambridge, Massachusetts
Volume
9
fYear
1984
fDate
30742
Firstpage
73
Lastpage
76
Abstract
A system for automatic alignment of phonetic transcriptions with continuous speech has been developed. The speech signal is first segmented into broad classes using a non-parametric pattern classifier. A knowledge-based dynamic programming algorithm then aligns the broad classes with the phonetic transcriptions. These broad classes provide "islands of reliability" for more detailed segmentation and refinement of boundaries. By doing alignment at the phonetic level, the system can often tolerate inter- and intra-speaker variability. The system was evaluated on sixty sentences spoken by three speakers, two male and one female. 93% of the segments are mapped into only one phoneme, and 70% of the time the offset between the boundary found by the automatic alignment system and the one marked by a hand transcriber is less than 10 ms. The performance can be improved by applying more heuristic rules.
Keywords
Displays; Dynamic programming; Heuristic algorithms; Humans; Labeling; Laboratories; Signal processing; Speech processing; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type
conf
DOI
10.1109/ICASSP.1984.1172426
Filename
1172426