• DocumentCode
    704697
  • Title

    Improving accuracy of glottal closure instant detection methods in nonstationary noise

  • Author

    Deshpande, Pranav S. ; Manikandan, M. Sabarimalai

  • Author_Institution
    Sch. of Electr. Sci., Indian Inst. of Technol. Bhubaneswar, Bhubaneswar, India
  • fYear
    2015
  • fDate
    19-20 Feb. 2015
  • Firstpage
    736
  • Lastpage
    741
  • Abstract
    Glottal closure instant (GCI) detection is crucial in all kinds of speech processing applications. A variety of GCI detection methods have been proposed, but the performance of most existing methods has been evaluated using only the voiced parts of the speech signal. Under noisy conditions, recorded speech consists of voiced, unvoiced, and non-speech parts. Thus, most GCI detection methods demand an accurate voice activity detection (VAD) design. In this paper, we present post-processing techniques to improve GCI detection accuracy and robustness in additive nonstationary noise environments. We study the performance of four methods: center of gravity (CoG), group delay function (GDF), zero frequency resonator (ZFR), and speech event detection using the residual excitation and a mean-based signal (SEDREAMS). The performance of each method with the proposed method-specific post-processing technique is tested and validated under both clean and noisy conditions. Experimental results show that the methods with post-processing outperform the conventional GCI detection methods under noisy conditions. Results further show that the proposed techniques not only improve the overall detection accuracy but also reduce complexity by avoiding the need for a VAD algorithm.
  • Keywords
    signal denoising; signal detection; speech synthesis; CoG; GCI detection method accuracy improvement; GDF; SEDREAMS; VAD algorithm; ZFR; additive nonstationary noisy environments; center of gravity; complexity reduction; glottal closure instant detection; group delay function; recorded speech signal; speech event detection using residual excitation and mean-based signal; speech processing applications; voice activity detection design; zero frequency resonator; Databases; Delays; Feature extraction; Larynx; Noise measurement; Speech; Speech processing; Epoch extraction; glottal closure instant
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-5990-7
  • Type

    conf

  • DOI
    10.1109/SPIN.2015.7095390
  • Filename
    7095390