• DocumentCode
    1843070
  • Title

    A privacy-preserving and language-independent speaking detecting and speaker diarization approach for spontaneous conversation using microphones

  • Author

    Ni Zhang ; Yaginuma, Yoshinori

  • Author_Institution
    Human Centric Comput. Lab., Fujitsu Labs. Ltd., Kawasaki, Japan
  • Volume
    1
  • fYear
    2012
  • fDate
    21-25 Oct. 2012
  • Firstpage
    499
  • Lastpage
    502
  • Abstract
    Conversation conveys important social signals of human interaction that indicates interest, service-awareness, persuasiveness, etc. In this paper, the authors employ the most common setting of using microphones to capture spontaneous conversation, and introduce a privacy-preserving and language-independent speech processing approach that can detect speaking and separate speakers in high accuracy for such setting. Experimental results have validated that the approach can deliver accurate speaking recognition results in Japanese, English and Chinese conversation, and can be processed in real time applications.
  • Keywords
    data privacy; microphones; speaker recognition; speech processing; human interaction; language independent speech detection; language independent speech processing; microphone; privacy preserving approach; social signal; speaker diarization approach; speaker separation; spontaneous conversation; Hidden Markov Model; conversation characteristics measuring; interaction pattern analysis; speaking style; speech processing; voiced/unvoiced sound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing (ICSP), 2012 IEEE 11th International Conference on
  • Conference_Location
    Beijing
  • ISSN
    2164-5221
  • Print_ISBN
    978-1-4673-2196-9
  • Type

    conf

  • DOI
    10.1109/ICoSP.2012.6491534
  • Filename
    6491534