• DocumentCode
    390489
  • Title

    Quality enhancement of CELP coded speech by using a voicing Gaussian mixture model

  • Author

    Raza, Dar Ghulam ; Chan, Cheung-Fat

  • Author_Institution
    Dept. of Comput. Eng. & IT, City Univ. of Hong Kong, Kowloon, China
  • Volume
    1
  • fYear
    2002
  • fDate
    26-30 Aug. 2002
  • Firstpage
    452
  • Abstract
    This paper presents a procedure to improve the quality of narrowband (0-4 kHz) CELP coded speech. The procedure is based on refining the pitch periodicity and reasserting the high frequency components (4-8 kHz) in the narrowband CELP decoded speech. The narrowband CELP decoded speech is first analyzed with a harmonic+noise analyzer and lowband information is extracted. By exploiting the lowband spectrum envelope V/UV information, the highband (4-8 kHz) spectrum envelope is recovered statistically by using a voiced/unvoiced Gaussian mixture model with interpolation. Lowband information along with the estimated highband information is then fed to the harmonic+noise synthesizer to re-synthesize wideband speech. Objective and subjective tests are performed to evaluate the quality of the re-synthesised wideband (0-8 kHz) speech. The results of experiments show that the re-synthesised wideband speech is pleasant to listen to with crispy characteristics and preferred over CELP coded speech.
  • Keywords
    Gaussian processes; harmonic analysis; linear predictive coding; spectral analysis; speech coding; speech enhancement; speech synthesis; 0 to 8 kHz; CELP coded speech; Gaussian mixture model; V/UV information; harmonic analysis; harmonic+noise synthesizer; high frequency components; highband spectrum envelope; interpolation; lowband spectrum envelope; narrowband speech; objective tests; pitch periodicity; speech quality enhancement; subjective tests; voiced/unvoiced Gaussian mixture model; wideband speech re-synthesis; Data mining; Decoding; Frequency; Harmonic analysis; Information analysis; Interpolation; Narrowband; Speech analysis; Speech enhancement; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, 2002 6th International Conference on
  • Print_ISBN
    0-7803-7488-6
  • Type

    conf

  • DOI
    10.1109/ICOSP.2002.1181089
  • Filename
    1181089