• DocumentCode
    179861
  • Title
    A fixed dimension and perceptually based dynamic sinusoidal model of speech

  • Author
    Hu, Qiong ; Stylianou, Yannis ; Richmond, Kristof ; Maia, Ranniery ; Yamagishi, Junichi ; Latorre, Javier
  • Author_Institution
    Centre for Speech Technology Research, University of Edinburgh, Edinburgh, UK
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    6270
  • Lastpage
    6274
  • Abstract
    This paper presents a fixed- and low-dimensional, perceptually based dynamic sinusoidal model of speech referred to as PDM (Perceptual Dynamic Model). To decrease and fix the number of sinusoidal components typically used in the standard sinusoidal model, we propose to use only one dynamic sinusoidal component per critical band. For each band, the sinusoid with the maximum spectral amplitude is selected and associated with the centre frequency of that critical band. The model is expanded at low frequencies by incorporating sinusoids at the boundaries of the corresponding bands while at the higher frequencies a modulated noise component is used. A listening test is conducted to compare speech reconstructed with PDM and state-of-the-art models of speech, where all models are constrained to use an equal number of parameters. The results show that PDM is clearly preferred in terms of quality over the other systems.
  • Keywords
    signal reconstruction; speech processing; vocoders; PDM; dynamic sinusoidal component; listening test; maximum spectral amplitude; modulated noise component; speech fixed dimension model; speech perceptually based dynamic sinusoidal model; speech reconstruction; standard sinusoidal model; Adaptation models; Frequency modulation; Harmonic analysis; Psychoacoustic models; Speech; Speech processing; Vocoders; Critical band; Sinusoidal Model; Vocoder
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • Conference_Location
    Florence
  • Type
    conf
  • DOI
    10.1109/ICASSP.2014.6854810
  • Filename
    6854810