Title :
Estimating number of speakers by the modulation characteristics of speech
Author_Institution :
Dept. of Electr. & Electron. Eng., Sophia Univ., Tokyo, Japan
Abstract :
A method for estimating the number of speakers in mixed speech signals was proposed. The algorithm was based on the modulation characteristics of speech, specifically that a single speech utterance typically has a distinct modulation pattern with a peak around 4-5 Hz. Having observed that the modulation peak decreases as the number of speakers increases, we based our estimation algorithm on the modulation-frequency region between 2 and 8 Hz. We obtained a novel parameter, which we call the "equivalent number of speakers", to estimate the number of simultaneous speakers when a speech signal contains multiple speakers.
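The modulation measurement described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not define how the "equivalent number of speakers" is computed, so the code stops at a band-limited modulation index over 2 to 8 Hz. The function names, the 100 Hz envelope rate, the rectify-and-average envelope, and the synthetic "talker" (noise modulated by a single sinusoid) are all assumptions made for illustration.

```python
import numpy as np


def modulation_spectrum(signal, fs, env_fs=100):
    """Return (freqs, magnitude) of the amplitude-envelope spectrum.

    Assumed envelope extractor: rectify, then average over
    non-overlapping frames, giving an envelope sampled at env_fs Hz.
    """
    hop = int(fs // env_fs)
    n_frames = len(signal) // hop
    frames = np.abs(signal[: n_frames * hop]).reshape(n_frames, hop)
    envelope = frames.mean(axis=1)
    # Normalise by the mean so the result reflects modulation depth,
    # then subtract 1 to remove the DC term (assumes a non-silent input).
    envelope = envelope / envelope.mean() - 1.0
    window = np.hanning(n_frames)
    magnitude = np.abs(np.fft.rfft(envelope * window))
    freqs = np.fft.rfftfreq(n_frames, d=hop / fs)
    return freqs, magnitude


def band_modulation_index(signal, fs, lo=2.0, hi=8.0):
    """RMS of the envelope spectrum between lo and hi Hz (hypothetical index)."""
    freqs, magnitude = modulation_spectrum(signal, fs)
    band = (freqs >= lo) & (freqs <= hi)
    return float(np.sqrt(np.mean(magnitude[band] ** 2)))


def synthetic_talker(rng, fs, seconds, rate_hz):
    """Hypothetical stand-in for one talker: noise modulated at rate_hz."""
    n = int(fs * seconds)
    t = np.arange(n) / fs
    phase = rng.uniform(0.0, 2.0 * np.pi)
    modulator = 1.0 + 0.9 * np.sin(2.0 * np.pi * rate_hz * t + phase)
    return modulator * rng.standard_normal(n)
```

With these stand-in signals, summing several talkers whose modulation rates and phases differ flattens the combined envelope, so the index falls as talkers are added. Mapping that index to a speaker count would need a calibration that the abstract does not specify.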
Keywords :
parameter estimation; spectral analysis; speech processing; 2 to 8 Hz; equivalent number of speakers; mixed speech signals; modulation characteristics; modulation pattern; simultaneous speakers; speaker number estimation; speech utterance; Ambient intelligence; Amplitude modulation; Auditory system; Frequency estimation; Frequency modulation; Humans; Loudspeakers; Signal analysis; Speech analysis; Speech enhancement;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202328