مرکز منطقه ای اطلاع رساني علوم و فناوري - A demiphoneme network representation of speech and automatic labeling techniques for speech data base construction

DocumentCode :

3008231

Title :

A demiphoneme network representation of speech and automatic labeling techniques for speech data base construction

Author :

Tanaka, Kazuyo ; Hayamizu, Satoru ; Ohta, Kozo

Author_Institution :

Electrotechnical Laboratory, Ibaraki, Japan

Volume :

fYear :

1986

fDate :

31503

Firstpage :

309

Lastpage :

312

Abstract :

An automatic labeling technique for known speech samples is proposed to construct a fine speech data base for investigating the acoustic-phonetic characteristics of speech. An acoustically compact descriptive unit called Demiphoneme (DPH) is introduced, and a word (or sentence) is represented by a network using DPHs which cover the acoustic variation contained in the utterances of the word (or sentence). An input speech sample is segmented and labeled to the optimal DPH sequence by the following algorithm: (a) Generating possible DPH sequences from an input phoneme sequence by rules. (b) Segmentation of the sample parameter sequence. The resultant segments (called ´SEG´s) are the candidates of DPH boundaries. (c) Determining the optimal correspondence between the SEG sequence and each of the DPH sequences generated in (b). (d) Deciding the minimum error DPH sequence and corresponding SEG boundaries. The feasibility of the method is confirmed by applying it to a word set containing 53 city names.

Keywords :

Automatic speech recognition; Cities and towns; Dynamic programming; Labeling; Laboratories; Signal resolution;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.

Type :

conf

DOI :

10.1109/ICASSP.1986.1169174

Filename :

1169174

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3008231