• DocumentCode
    2174149
  • Title

    Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model

  • Author

    Long, Yanhua ; Yan, Zhi-Jie ; Soong, Frank K. ; Dai, Lirong ; Guo, Wu

  • Author_Institution
    iFly Speech Lab., Univ. of Sci. & Technol. of China (USTC), Hefei, China
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4520
  • Lastpage
    4523
  • Abstract
    This paper proposes a feature extraction for speaker characterization by exploring the relationship between the two distinct components of the speech signal, one is harmonics accounting for the periodicity of the signal and the other is modulated noise accounting for the turbulences of the glottal airflow. The harmonic and noise parts of the speech signal are decomposed based on the Harmonic plus Noise Model approach. We estimate the spectral subband energy ratios (SSERs) as the speaker characteristic features, which are expected to reflect the interaction property of the vocal tract and glottal airflow of individual speakers for speaker verification. The speaker verification experiments based on a GMM-UBM system have shown the efficiency of the SSER features, reducing the error equal rate by 27.2% by combining with the conventional MFCC features.
  • Keywords
    feature extraction; harmonic analysis; modulation; speaker recognition; GMM-UBM system; SSER estimation; conventional MFCC feature; conventional Mel- Frequency Cepstral Coefficients features; feature extraction; glottal airflow turbulence; harmonic plus noise model; harmonic plus noise model approach; harmonic speech analysis; harmonics accounting; modulated noise accounting; signal periodicity; speaker characterization; speaker verification; spectral subband energy ratio estimation; speech signal decomposition; vocal tract; Atmospheric modeling; Bandwidth; Feature extraction; Harmonic analysis; Mel frequency cepstral coefficient; Noise; Speech; Harmonic Plus Noise Model; Speaker verification; Spectral Subband Energy Ratio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947359
  • Filename
    5947359