• DocumentCode
    3326679
  • Title

    Voice Source Modeling for Accurate Speech Analysis

  • Author

    Rahman, M. Shahidur ; Shimamura, Tetsuya

  • Author_Institution
    Dept. of Inf. & Comput. Sci., Saitama Univ.
  • fYear
    2005
  • fDate
    Oct. 28 2005-Nov. 1 2005
  • Firstpage
    305
  • Lastpage
    309
  • Abstract
    A two-pass least square method has been proposed for estimating the vocal tract parameters. A problem often encountered in conventional linear prediction analysis stems from the harmonic structure of the excitation source of voiced speech. This harmonic characteristic is coupled with the estimation of the autoregressive (AR) coefficients, which makes the vocal tract filter difficult to estimate. This paper models the effective voice source from the residual obtained through covariance analysis in the first pass; the modeled source is then used as input to the second-pass least square analysis. A better source-filter separation is thus achieved. The formant frequencies and bandwidths estimated using the proposed method for synthetic vowels are found to be accurate up to a factor of more than three (in percent) compared to the conventional method. Since the source characteristic is taken into account, local variations due to the positioning of the analysis window are reduced significantly. The validity of the proposed method is also verified by inspecting the spectra obtained from natural vowel sounds uttered by a high-pitched female speaker.
  • Keywords
    filtering theory; least squares approximations; source separation; speech processing; high-pitched female speaker; linear prediction analysis; source characteristic; source-filter separation; speech analysis; two-pass least square method; vocal tract filter; voice source modeling; Bandwidth; Frequency estimation; Harmonic analysis; Information analysis; Least squares methods; Loudspeakers; Phase estimation; Power harmonic filters; Speech analysis; System identification;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signals, Systems and Computers, 2005. Conference Record of the Thirty-Ninth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA
  • ISSN
    1058-6393
  • Print_ISBN
    1-4244-0131-3
  • Type

    conf

  • DOI
    10.1109/ACSSC.2005.1599756
  • Filename
    1599756