• DocumentCode
    323751
  • Title

    A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition

  • Author

    Chen, Jingdong ; Xu, Bo ; Huang, Taiyi

  • Author_Institution
    Inst. of Autom., Acad. Sinica, Beijing, China
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    629
  • Abstract
    This paper presents a novel kind of speech feature which is the modified Mellin transform of the log-spectrum of the speech signal (short for MMTLS). Because of the scale invariance property of the modified Mellin transform, the new feature is insensitive to the variation of the vocal tract length among individual speakers, and thus it is more appropriate for speaker-independent speech recognition than the popular used cepstrum. The preliminary experiments show that the performance of the MMTLS-based method is much better in comparison with those of the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent for different outlier speakers
  • Keywords
    acoustic signal processing; feature extraction; spectral analysis; speech recognition; transforms; LPC-based method; MFC-based method; MMTLS-based method; acoustic feature; experiments; log-spectrum; modified Mellin transform; outlier speakers; performance; robust feature; speaker-independent speech recognition; speech signal; vocal tract length; Acoustic noise; Fourier transforms; Frequency estimation; Laboratories; Loudspeakers; Pattern recognition; Robustness; Shape; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675343
  • Filename
    675343