  • DocumentCode
    2139642
  • Title
    A new multi-parameter dual-threshold state discrimination algorithm for voice activity detection
  • Author
    Xinyan Zeng ; Guojun Zhao
  • Author_Institution
    Minist. of Educ. Key Lab. of Mech. Manuf. & Autom., Zhejiang Univ. of Technol., Hangzhou, China
  • fYear
    2013
  • fDate
    23-25 July 2013
  • Firstpage
    1239
  • Lastpage
    1243
  • Abstract
    Generally accepted voice activity detection (VAD) algorithms do not achieve the desired results. Against this background, a comprehensive algorithm featuring multiple parameters and dual-threshold state discrimination is proposed. The algorithm sets both a high and a low threshold based on fundamental characteristic parameters, namely the short-time average energy (STAE), the short-time zero-domain-crossing rate (ZDCR), and the time duration, with reference to the finite state method. The interval between the two thresholds helps to improve the detection of voice activity. The new algorithm was compared and analyzed in MATLAB and then tested experimentally on a DSP-based processing system. The results indicate that it is more effective and accurate than the widely accepted algorithm.
  • Keywords
    speech processing; DSP based processing system; STAE; VAD; ZDCR; comprehensive algorithm; finite state method; fundamental characteristic parameters; new multiparameter dual threshold state discrimination algorithm; short-time average energy; time duration; voice activity detection; zero domain crossing rate; Algorithm design and analysis; Interference; Signal processing algorithms; Signal to noise ratio; Speech; Speech recognition; voice activity detection (VAD); voice signal; zero-crossing rate (ZCR);
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Computation (ICNC), 2013 Ninth International Conference on
  • Conference_Location
    Shenyang
  • Type
    conf
  • DOI
    10.1109/ICNC.2013.6818168
  • Filename
    6818168
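
The abstract describes dual-threshold detection driven by short-time energy, a zero-crossing rate, and a duration constraint, organized as a finite state method. The sketch below illustrates that general idea only; it is not the authors' algorithm, whose exact rules are not given in this record. The state names (SILENCE, CANDIDATE, SPEECH), the function names, the threshold values, and the `min_speech` and `max_gap` parameters are all assumptions made for illustration.

```python
import math

def frame_features(samples, frame_len, hop):
    """Return one (energy, zcr) pair per frame.

    energy: mean squared amplitude (short-time average energy).
    zcr: fraction of adjacent sample pairs whose sign differs.
    """
    feats = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        crossings = sum(1 for a, b in zip(frame, frame[1:])
                        if (a >= 0) != (b >= 0))
        feats.append((energy, crossings / (frame_len - 1)))
    return feats

def detect_segments(feats, e_low, e_high, z_thr, min_speech=3, max_gap=2):
    """Hypothetical three-state detector (assumed rules, not the paper's).

    Returns a list of (start, end) frame indices, end exclusive.
    """
    SILENCE, CANDIDATE, SPEECH = 0, 1, 2
    state, start, gap = SILENCE, 0, 0
    segments = []
    for i, (energy, zcr) in enumerate(feats):
        above_low = energy > e_low or zcr > z_thr   # weak evidence of speech
        above_high = energy > e_high                # strong evidence of speech
        if state == SILENCE:
            if above_high:
                state, start, gap = SPEECH, i, 0
            elif above_low:
                state, start = CANDIDATE, i
        elif state == CANDIDATE:
            if above_high:
                state, gap = SPEECH, 0              # confirm the candidate
            elif not above_low:
                state = SILENCE                     # false alarm, discard
        else:  # SPEECH
            if above_low:
                gap = 0
            else:
                gap += 1
                if gap > max_gap:                   # pause too long: close segment
                    end = i - gap + 1
                    if end - start >= min_speech:   # duration constraint
                        segments.append((start, end))
                    state = SILENCE
    if state == SPEECH:                             # stream ended during speech
        end = len(feats) - gap
        if end - start >= min_speech:
            segments.append((start, end))
    return segments

# Synthetic input: silence, a sine burst, then silence again.
tone = [math.sin(2 * math.pi * n / 20) for n in range(800)]
signal = [0.0] * 800 + tone + [0.0] * 800
features = frame_features(signal, frame_len=100, hop=100)
segments = detect_segments(features, e_low=0.01, e_high=0.1, z_thr=0.5)
print(segments)
```

The low threshold lets a segment start and persist on weak evidence, while the high threshold must be crossed before a candidate is accepted as speech. The duration checks discard very short bursts and bridge short pauses. On real recordings the thresholds would normally be estimated from the noise floor rather than fixed by hand.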