• DocumentCode
    231539
  • Title
    Speech enhancement based on analysis-synthesis framework with improved pitch estimation and spectral envelope enhancement
  • Author
    Bin Liu ; Fuyuan Mo ; Jianhua Tao
  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    19-23 Oct. 2014
  • Firstpage
    461
  • Lastpage
    466
  • Abstract
    This paper presents a speech enhancement approach based on an analysis-synthesis framework. An improved multi-band summary correlogram (MBSC) algorithm is proposed for pitch estimation and voiced/unvoiced (V/UV) detection. The proposed pitch detection algorithm achieves a lower pitch detection error than the reference algorithm. A denoising autoencoder (DAE) is applied to enhance the line spectrum frequencies (LSFs), and its reconstruction loss can be decreased compared with the shallow model. The proposed approach is evaluated using the perceptual evaluation of speech quality (PESQ), and the experimental results show that it improves speech enhancement performance compared with the conventional speech enhancement approach. In addition, it could be applied to parametric speech coding, even at low bit rates and in low-SNR environments.
  • Keywords
    estimation theory; hearing; signal denoising; signal detection; speech coding; speech enhancement; speech synthesis; vocoders; DAE; LSF; MBSC algorithm; PESQ; SNR; V-UV detection; analysis-synthesis framework; denoising autoencoder; line spectrum frequencies; multiband summary correlogram algorithm; perceptual evaluation; pitch detection algorithm; pitch detection error; pitch estimation; reconstruction loss; reference algorithm; spectral envelope enhancement; speech coding; speech enhancement; speech quality; shallow model; voiced-unvoiced detection; Estimation; Finite impulse response filters; Noise; Noise measurement; Speech; Speech coding; Speech enhancement; analysis-synthesis framework; denoising autoencoder; multi-band summary correlogram; speech coding; speech enhancement;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing (ICSP), 2014 12th International Conference on
  • Conference_Location
    Hangzhou
  • ISSN
    2164-5221
  • Print_ISBN
    978-1-4799-2188-1
  • Type
    conf
  • DOI
    10.1109/ICOSP.2014.7015048
  • Filename
    7015048