Title :
Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an N-best solutions post-processing
Author :
Moudenc, T. ; Sokol, R. ; Mercier, G.
Author_Institution :
France Telecom, CNET, Lannion, France
Abstract :
We present investigations on using segmental phonetic features in an N-best solutions post processing of an HMM based ASR system. These phonetic features are extracted by means of neural-fuzzy networks. Specialized neural-fuzzy networks are defined to recognize specific phonetic features (consonant/vowel, voiced/unvoiced, ...). Each of these neural networks furnishes a segmental coefficient (resulting from the output layers) which enables the computation of a segmental post-processing score for the N-best solutions of an HMM based ASR system. This post-processing is based on the computation of segmental score for each solution respectively under the hypotheses of a correct solution and an incorrect solution. Preliminary experiments were conducted on 3 speaker-independent telephone databases. An error rate reduction up to 20% was achieved on the digit corpus
Keywords :
database management systems; errors; feature extraction; fuzzy neural nets; hidden Markov models; speech recognition; telephony; HMM; N-best solutions post-processing; consonant; digit corpus; error rate reduction; experiments; feature extraction; fuzzy neural networks; hidden Markov model; segmental coefficient; segmental phonetic features recognition; speaker-independent telephone database; speech recognition; vowel; Automatic speech recognition; Computer networks; Feature extraction; Hidden Markov models; Image segmentation; Intelligent networks; Neural networks; Probability distribution; Speech recognition; Telephony;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607123