DocumentCode
3594331
Title
The relation between speech segment selectivity and source localization accuracy
Author
Aarabi, Parham ; Mahdavi, Alborz
Author_Institution
The Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, 10 Kings College Road, Ontario, Canada, M5S 3G4
Volume
1
fYear
2002
Abstract
An experimental analysis of the relation between speech signal segment power and the source direction-of-arrival-estimation accuracy is conducted. A total of 10 different speakers, including both male and female speakers, totaling to approximately 2 hours of speech are used to analyze the performance of the Phase Transform, the Maximum Likelihood, and the Unfiltered Cross Correlation time-delay estimation techniques. For female speakers, it is determined that the Phase Transform technique has a lower percentage of anomalies and a lower direction-of-arrival root mean-square error (DOA RMSE). Conversely, for male speakers, it is determined that the Unfiltered Cross Correlation has a lower percentage of anomalies although the Phase Transform has a lower DOA RMSE. The spatial distribution of the errors as well as the speech segment power relation to the errors are also presented.
Keywords
Artificial neural networks; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743707
Filename
5743707
Link To Document