DocumentCode :
310562
Title :
Explicit, N-best formant features for vowel classification
Author :
Schmid, Philipp ; Barnard, Etienne
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
991
Abstract :
We demonstrate the use of explicit formant features for vowel and semi-vowel classification. The formant trajectories are approximated by either three line segments or Legendre polynomials. Together with formant amplitude, formant bandwidth, pitch, and segment duration, these formant features form a compact feature representation which performs as well (71.8%) as a cepstral-based feature representation (71.6%). The combination of the formant and cepstral feature improves the accuracy further to 73.4%. Additionally, we outline future experiments using our robust, N-best formant tracker
Keywords :
Legendre polynomials; approximation theory; cepstral analysis; feature extraction; signal representation; speech processing; speech recognition; tracking; Legendre polynomials; N-best formant features; N-best formant tracker; cepstral based feature representation; experiments; explicit formant features; formant amplitude; formant bandwidth; formant features; formant trajectories; line segments; pitch; segment duration; semivowel classification; speech recognition; vowel classification; Bandwidth; Cepstral analysis; Delay; Government; History; Mel frequency cepstral coefficient; Natural languages; Robustness; Speech; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596106
Filename :
596106
Link To Document :
بازگشت