Title :
A privacy-preserving and language-independent speaking detecting and speaker diarization approach for spontaneous conversation using microphones
Author :
Ni Zhang ; Yaginuma, Yoshinori
Author_Institution :
Human Centric Comput. Lab., Fujitsu Labs. Ltd., Kawasaki, Japan
Abstract :
Conversation conveys important social signals of human interaction that indicates interest, service-awareness, persuasiveness, etc. In this paper, the authors employ the most common setting of using microphones to capture spontaneous conversation, and introduce a privacy-preserving and language-independent speech processing approach that can detect speaking and separate speakers in high accuracy for such setting. Experimental results have validated that the approach can deliver accurate speaking recognition results in Japanese, English and Chinese conversation, and can be processed in real time applications.
Keywords :
data privacy; microphones; speaker recognition; speech processing; human interaction; language independent speech detection; language independent speech processing; microphone; privacy preserving approach; social signal; speaker diarization approach; speaker separation; spontaneous conversation; Hidden Markov Model; conversation characteristics measuring; interaction pattern analysis; speaking style; speech processing; voiced/unvoiced sound;
Conference_Titel :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2196-9
DOI :
10.1109/ICoSP.2012.6491534