Regional Information Center for Science and Technology (RICeST) - Glottal sensing for speech analysis and synthesis

DocumentCode :

3062853

Title :

Glottal sensing for speech analysis and synthesis

Author :

Yea, J.J. ; Krishnamurthy, A.K. ; Naik, J.M. ; Moore, G.P. ; Childers, D.G.

Author_Institution :

University of Florida Gainesville, FL

Volume :

fYear :

1983

fDate :

April 1983

Firstpage :

1332

Lastpage :

1335

Abstract :

Two-channel speech analysis, using the electroglottograph (EGG) signal together with the speech signal, is used to parse an utterance into voiced, unvoiced, and silence regions and to determine the pitch contour, all in real time. Results for two algorithms that use the speech signal alone are compared with our two-channel method. The effect that a source-tract interaction model, based on our data, has on the production of natural-sounding speech is reported. Three glottal source models are compared: Fant's, Guerin's, and the impulse model. The waveforms generated by these models excite a serial/parallel Klatt formant synthesizer to produce synthesized sentences. The major parameters for producing excellent-quality synthesized speech are 1) the "shape" of the glottal excitation waveform, 2) the first formant bandwidth, and 3) the locations of the higher (fourth and fifth) formant frequencies.
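
The two-channel idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, which the record does not describe. It assumes that vocal-fold vibration appears as energy in the EGG channel, so a frame with EGG energy is labeled voiced, a frame with speech energy but no EGG energy is labeled unvoiced, and anything else is silence. The function names, the frame length, and both thresholds are hypothetical values chosen for illustration.

```python
import math


def frame_energy(frame):
    """Mean squared amplitude of a frame after removing its DC offset."""
    mean = sum(frame) / len(frame)
    return sum((x - mean) ** 2 for x in frame) / len(frame)


def classify_frames(speech, egg, frame_len=160,
                    egg_thresh=1e-3, speech_thresh=1e-4):
    """Label each non-overlapping frame 'voiced', 'unvoiced', or 'silence'.

    frame_len and both thresholds are assumed values, not taken from the paper.
    """
    labels = []
    n = min(len(speech), len(egg))
    for start in range(0, n - frame_len + 1, frame_len):
        s = speech[start:start + frame_len]
        g = egg[start:start + frame_len]
        if frame_energy(g) > egg_thresh:
            labels.append("voiced")      # EGG activity: vocal folds vibrating
        elif frame_energy(s) > speech_thresh:
            labels.append("unvoiced")    # acoustic energy without EGG activity
        else:
            labels.append("silence")     # neither channel is active
    return labels


# Synthetic demo at an assumed 8 kHz sampling rate: one frame of each class.
fs, n = 8000, 160
quiet = [0.0] * n
hiss = [0.1 if i % 2 == 0 else -0.1 for i in range(n)]
tone = [0.5 * math.sin(2 * math.pi * 100 * i / fs) for i in range(n)]

print(classify_frames(quiet + hiss + tone, quiet + quiet + tone))
# prints ['silence', 'unvoiced', 'voiced']
```

Because the voicing decision rests on the EGG channel alone, the speech-channel threshold only has to separate frication from background noise. This is one plausible reason a two-channel method could be simpler than a method that uses the speech signal alone.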

Keywords :

Amplitude estimation; Area measurement; Autocorrelation; Performance analysis; Pins; Speech analysis; Speech synthesis; Time measurement; Velocity measurement; Yield estimation

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.

Type :

conf

DOI :

10.1109/ICASSP.1983.1172019

Filename :

1172019

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3062853