• DocumentCode
    336806
  • Title

    Log amplitude modeling of sinusoids in voiced speech

  • Author

    Malik, Najam ; Holmes, W. Harvey

  • Author_Institution
    Sch. of Electr. & Telecommun. Eng., New South Wales Univ., Sydney, NSW, Australia
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    465
  • Abstract
    We present an algorithm for all-pole (envelope) modeling of the amplitudes of sinusoids present in voiced speech segments which works even when the number of sinusoids is very small, as occurs with high-pitched speakers. In contrast to previous methods, this algorithm minimizes a squared error criterion in the log amplitude domain rather than the amplitude domain, and so is better matched to the properties of the human auditory system. A weighted iterative approach is used to get near optimal solutions to this otherwise nonlinear problem. This new frequency domain log amplitude modeling (LAM) algorithm gives impressive results, especially in the case of high pitched female voices where conventional linear prediction methods are inadequate. The algorithm can easily be generalized to develop pole-zero models
  • Keywords
    frequency-domain analysis; iterative methods; poles and zeros; spectral analysis; speech processing; all-pole modeling; envelope modeling; frequency domain; high pitched female voices; high-pitched speakers; human auditory system; linear prediction methods; log amplitude modeling algorithm; near optimal solutions; nonlinear problem; pole-zero models; sinusoids; squared error criterion; voiced speech; voiced speech segments; weighted iterative approach; Auditory system; Australia; Distortion measurement; Frequency domain analysis; Humans; Iterative algorithms; Iterative methods; Predictive models; Signal processing algorithms; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758163
  • Filename
    758163