مرکز منطقه ای اطلاع رساني علوم و فناوري - Long-term feature averaging for speaker recognition

DocumentCode :

1089471

Title :

Long-term feature averaging for speaker recognition

Author :

Markel, John D. ; Oshika, Beatrice T. ; Gray, Augustine H., Jr.

Author_Institution :

Speech Communications Research Laboratory, Santa Barbara, CA

Volume :

Issue :

fYear :

1977

fDate :

8/1/1977 12:00:00 AM

Firstpage :

330

Lastpage :

337

Abstract :

The potential benefits of long-term parameter averaging for speaker recognition were investigated. Parameters studied were pitch, gain, and reflection coefficients. Parameter variability was computed over various averaging lengths from one frame averaging (in effect, no averaging) to 1000 frame averaging (about 70 s of speech). It was demonstrated that the between-to-within speaker variance ratio, measured over several speakers, was significantly increased by performing long-term averaging of the parameter sets. The reflection coefficient averages for k2and k6, respectively, were shown to produce the highest variance ratios.

Keywords :

Analysis of variance; Autocorrelation; Equations; Frequency estimation; Loudspeakers; Oral communication; Reflection; Speaker recognition; Speech analysis; Speech synthesis;

fLanguage :

English

Journal_Title :

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

0096-3518

Type :

jour

DOI :

10.1109/TASSP.1977.1162961

Filename :

1162961

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1089471