• DocumentCode
    2882550
  • Title

    A variable frame-rate scheme for sinusoidal transform coding

  • Author

    Li, Ning ; Cheetham, Barry M.

  • Author_Institution
    University of Manchester, United Kingdom
  • Volume
    4
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Sinusoidal transform coding (STC) is known to be capable of producing good communication-quality speech at bit rates below 4 kb/s. Discrete all-pole (DAP) modelling, which can be more accurate than conventional linear prediction (LP) analysis for voiced speech, is adopted to improve short-term spectral estimation, with a modification to accommodate unvoiced speech in STC. A more robust voicing cut-off frequency, derived by frequency-domain analysis-by-synthesis, divides the whole power spectrum into a lower voiced band and an upper unvoiced band and enhances STC performance. In view of the differently evolving characteristics of speech, in this paper we propose a variable frame-rate coding scheme based on further investigation of the potential causes of quality loss in reconstructed speech. This results in performance enhancement as well as bit-rate savings and leads to a more flexible and effective STC vocoder.
  • Keywords
    Encoding; Robustness;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5745589
  • Filename
    5745589