DocumentCode
395219
Title
Estimating number of speakers by the modulation characteristics of speech
Author
Arai, Takayuki
Author_Institution
Dept. of Electr. & Electron. Eng., Sophia Univ., Tokyo, Japan
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
A method for estimating the number of speakers in mixed speech signals was proposed. The algorithm was based on the modulation characteristics of speech, specifically that a single speech utterance typically has a distinct modulation pattern with a peak around 4-5 Hz. Having observed that the modulation peak decreases as the number of speakers increases, our estimation algorithm used the modulation-frequency region between 2 and 8 Hz. We obtained a novel parameter, which we called the "equivalent number of speakers," to estimate the number of simultaneous speakers when a speech signal contains multiple speakers.
Keywords
parameter estimation; spectral analysis; speech processing; 2 to 8 Hz; equivalent number of speakers; mixed speech signals; modulation characteristics; modulation pattern; simultaneous speakers; speaker number estimation; speech utterance; Ambient intelligence; Amplitude modulation; Auditory system; Frequency estimation; Frequency modulation; Humans; Loudspeakers; Signal analysis; Speech analysis; Speech enhancement
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202328
Filename
1202328