DocumentCode :
2358754
Title :
Speaker discrimination in a conversation
Author :
Brümmer, J. N L
Author_Institution :
Datafusion Systems, Stellenbosch, South Africa
fYear :
1993
fDate :
34187
Firstpage :
156
Lastpage :
161
Abstract :
A method to automatically discriminate between speakers in a conversation between two people, with no prior training have been developed. It is intended as a preprocessing stage in a speaker recognition system. The speech is preprocessed by extracting the syllable nuclei and discarding fricatives and noise. Next, a multidimensional feature set is calculated for the whole conversation and an axis in the feature space is obtained that gives good discrimination between the two speakers. Speech that reaches either extreme on this axis is choosen as belonging to the two speakers respectively. The axis is found by taking a moving average of the feature vectors over a short time interval. This has the effect of reducing the variance in all directions in the feature space. Because the averaging relatively rarely goes over two speakers, the variance changes least in the direction in the direction separating the means of the speakers. This direction of least change of variance is found by eigenvector analysis of the covariance matrices before and after averaging
Keywords :
speaker recognition; speech processing; averaging; conversation; covariance matrices; eigenvector analysis; feature space; moving average; multidimensional feature set; speaker discrimination; speaker recognition system; syllable nuclei extraction; variance; Analysis of variance; Covariance matrix; Data mining; Error analysis; Filters; Multidimensional systems; Speaker recognition; Speech enhancement; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Signal Processing, 1993., Proceedings of the 1993 IEEE South African Symposium on
Conference_Location :
Jan Smuts Airport
Print_ISBN :
0-7803-1292-9
Type :
conf
DOI :
10.1109/COMSIG.1993.365853
Filename :
365853
Link To Document :
بازگشت