DocumentCode :
149482
Title :
Time-frequency reassigned cepstral coefficients for phone-level speech segmentation
Author :
Tryfou, Georgina ; Pellin, Marco ; Omologo, Maurizio
Author_Institution :
Fondazione Bruno Kessler-irst, Povo, Italy
fYear :
2014
fDate :
1-5 Sept. 2014
Firstpage :
2060
Lastpage :
2064
Abstract :
This paper studies feature extraction within the context of automatic speech segmentation at phonetic level. Current state-of-the-art solutions widely use cepstral features as a front-end for HMM based frameworks. Although the automatic segmentation results have reached the inter-annotator agreement, within a tolerance equal or higher than 20ms, the same is not true when a lower tolerance is considered. We propose a new set of cepstral features that derive from the time-frequency reassigned spectrogram and offer a sharper representation of the speech signal in the cepstral domain. The features are evaluated through a series of forced alignment experiments which demonstrate a better performance, compared to the traditional MFCC features, in aligning phone boundaries within a small distance from their true position.
Keywords :
cepstral analysis; feature extraction; hidden Markov models; mobile handsets; signal representation; speech processing; speech recognition; time-frequency analysis; tolerance analysis; HMM; cepstral domain; cepstral feature extraction; hidden Markov model; interannotator agreement; phone boundary alignment; phone level automatic speech segmentation; phonetic level; speech recognition; speech signal sharper representation; time-frequency reassigned cepstral coefficient; time-frequency reassigned spectrogram; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Time-frequency analysis; HMM; feature extraction; forced alignment; phonetic segmentation; reassigned spectrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon
Type :
conf
Filename :
6952752
Link To Document :
بازگشت