DocumentCode :
3008231
Title :
A demiphoneme network representation of speech and automatic labeling techniques for speech data base construction
Author :
Tanaka, Kazuyo ; Hayamizu, Satoru ; Ohta, Kozo
Author_Institution :
Electrotechnical Laboratory, Ibaraki, Japan
Volume :
11
fYear :
1986
fDate :
April 1986
Firstpage :
309
Lastpage :
312
Abstract :
An automatic labeling technique for known speech samples is proposed for constructing a fine speech database with which to investigate the acoustic-phonetic characteristics of speech. An acoustically compact descriptive unit called the demiphoneme (DPH) is introduced, and a word (or sentence) is represented by a network of DPHs that covers the acoustic variation found in utterances of that word (or sentence). An input speech sample is segmented and labeled with the optimal DPH sequence by the following algorithm: (a) generating possible DPH sequences from an input phoneme sequence by rule; (b) segmenting the sample parameter sequence, where the resulting segments (called 'SEG's) provide the candidate DPH boundaries; (c) determining the optimal correspondence between the SEG sequence and each of the DPH sequences generated in (a); (d) selecting the minimum-error DPH sequence and the corresponding SEG boundaries. The feasibility of the method is confirmed by applying it to a word set containing 53 city names.
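Steps (c) and (d) of the abstract can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation: the scalar SEG features, the per-DPH `templates`, the absolute-difference cost, and the function names `align` and `label` are all assumptions made for illustration, and steps (a) and (b) are taken as already done. Each DPH is assumed to cover one or more consecutive SEGs.

```python
from math import inf

def align(segs, dph_seq, cost):
    """Assign each DPH one or more consecutive SEGs at minimum total cost.

    Returns (total_cost, boundaries); boundaries[k] is the index of the
    first SEG assigned to dph_seq[k]. Illustrative sketch only.
    """
    n, m = len(segs), len(dph_seq)
    if m == 0 or n < m:
        return inf, []
    # best[i][j]: minimum cost of labeling segs[0..i] with dph_seq[0..j],
    # where segs[i] is assigned to dph_seq[j].
    best = [[inf] * m for _ in range(n)]
    advanced = [[False] * m for _ in range(n)]
    best[0][0] = cost(segs[0], dph_seq[0])
    for i in range(1, n):
        for j in range(min(i + 1, m)):
            stay = best[i - 1][j]                      # same DPH continues
            move = best[i - 1][j - 1] if j > 0 else inf  # next DPH starts
            if move < stay:
                prev, advanced[i][j] = move, True
            else:
                prev = stay
            if prev < inf:
                best[i][j] = prev + cost(segs[i], dph_seq[j])
    # Backtrack to recover where each DPH starts.
    boundaries = [0] * m
    j = m - 1
    for i in range(n - 1, 0, -1):
        if advanced[i][j]:
            boundaries[j] = i
            j -= 1
    return best[n - 1][m - 1], boundaries

def label(segs, candidates, cost):
    """Step (d): pick the candidate DPH sequence with the smallest error."""
    scored = [(align(segs, seq, cost), seq) for seq in candidates]
    (err, bounds), seq = min(scored, key=lambda r: r[0][0])
    return seq, bounds, err

# Toy data (hypothetical): one scalar feature per SEG, one template per DPH.
templates = {"a1": 0.1, "a2": 1.0, "b": 0.5}
segs = [0.1, 0.2, 0.9, 1.0, 0.5]
candidates = [["a1", "a2", "b"], ["a1", "b"]]
cost = lambda seg, dph: abs(seg - templates[dph])
print(label(segs, candidates, cost))
```

With this toy input the three-unit candidate wins, with DPH boundaries at SEG indices 0, 2, and 4. A real system would use multidimensional acoustic parameters and a distance measure suited to them.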
Keywords :
Automatic speech recognition; Cities and towns; Dynamic programming; Labeling; Laboratories; Signal resolution;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1169174
Filename :
1169174