• DocumentCode
    387935
  • Title

    Contributions of pitch, formant frequency and bandwidth to the perception of voice-personality

  • Author

    Takagi, Tohru ; Kuwabara, Hisao

  • Author_Institution
    NHK Science and Technical Research Laboratories, Tokyo, Japan
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    889
  • Lastpage
    892
  • Abstract
    To investigate the contributions of the resonance characteristics of vocal tract and the pitch frequency to the voice-personality, a pitch synchronous analysis/synthesis system has been developed which is capable of independent manipulation of formant frequencies, bandwidths, and pitch frequencies. This paper gives the results of perceptual experiments on voice personality for spectrum and pitch modified speech using this system. Experimental results show that the perception of voice-personality is significantly sensitive to the formant frequency shift and it is almost completely lost for the uniform shift of all formant frequencies larger than 5 percent. The manipulation of pitch frequency and formant bandwidths, on the other hand, is less sensitive to the perception of voice-personality. This study is important as a basic research for the speaker normalization on one hand, and for adding personal information to the non-personal synthetic speech on the other.
  • Keywords
    Bandwidth; Equations; Frequency estimation; Frequency synthesizers; Laboratories; Linear predictive coding; Resonance; Signal synthesis; Speech analysis; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1168977
  • Filename
    1168977