• DocumentCode
    1112494
  • Title

    Auditory nerve representation criteria for speech analysis/Synthesis

  • Author

    Ghitza, Oded

  • Author_Institution
    AT&T Bell Laboratories, Murray Hill, NJ
  • Volume
    35
  • Issue
    6
  • fYear
    1987
  • fDate
    6/1/1987 12:00:00 AM
  • Firstpage
    736
  • Lastpage
    740
  • Abstract
    Traditional speech analysis/synthesis techniques are designed to produce synthesized speech with a spectrum (or waveform) that is as close as possible to the original. It is suggested, instead, that representations of the synthetic and the original speech be matched at the auditory nerve level. This concept has been used in conjunction with the sinusoidal representation of speech analysis/synthesis suggested by McAulay and Quatieri [6]. Based on informal listening, the synthesized speech is natural, including some tonal artifact, and highly intelligible for various kinds of speech material, in both quiet and noisy environments. The inherent dominance property of the auditory nerve representation reduces the number of sinusoidal components needed for synthesis by approximately 70 percent, offering a potential for reduced data rate.
  • Keywords
    Control system synthesis; Frequency; Humans; Nervous system; Quantization; Speech analysis; Speech enhancement; Speech synthesis; Vocoders; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1987.1165223
  • Filename
    1165223