DocumentCode :
642524
Title :
Source number estimation based on clustering of speech activity sequences for microphone array processing
Author :
Jafari, Ingrid ; Ito, Noboru ; Souden, Mehrez ; Araki, Shunsuke ; Nakatani, Takeshi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
fYear :
2013
fDate :
22-25 Sept. 2013
Firstpage :
1
Lastpage :
6
Abstract :
In this paper we introduce a novel technique for source number estimation based on the clustering of speech activity sequences. Speech activity sequences, represented by posterior probability time series of speech activity, are modeled as a mixture of Watson distributions. To enable source number estimation, an adaptive Dirichlet prior probability is imposed upon the mixture weights to promote the formation of empty clusters. The proposed source number estimation technique was evaluated on reverberant over-, even- and under-determined settings with accuracy between 75% and 100%.
Keywords :
microphone arrays; speech processing; time series; Watson distributions; adaptive Dirichlet prior probability; microphone array processing; mixture weights; posterior probability time series; reverberant even-determined setting; reverberant over-determined setting; reverberant under-determined setting; source number estimation technique; speech activity; speech activity sequence clustering; Accuracy; Adaptation models; Clustering algorithms; Estimation; Microphones; Reverberation; Speech; Source number estimation; clustering; mixture model; speech activity sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2013 IEEE International Workshop on
Conference_Location :
Southampton
ISSN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2013.6661990
Filename :
6661990
Link To Document :
بازگشت