Title :
Probabilistic and Bottle-Neck Features for LVCSR of Meetings
Author :
Grezl, Frantisek ; Karafiat, Martin ; Kontar, S. ; Cernocky, Jan
Author_Institution :
Speech@FIT Group, Brno Univ. of Technol., Czech Republic
Abstract :
In recent years, probabilistic features became an integral part of state-of-the-are LVCSR systems. In this work, we are exploring the possibility of obtaining the features directly from neural net without the necessity of converting output probabilities to features suitable for subsequent GMM-HMM system. We experimented with 5-layer MLP with bottle-neck in the middle layer. After training such a neural net, we used outputs of the bottle-neck as features for GMM-HMM recognition system. The benefits are twofold: first, improvement was gained when these features are used instead of the probabilistic features, second, the size of the system was reduced, as only part of the neural net is used. The experiments were performed on meetings recognition task defined in MST RT´05 evaluation.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; GMM; HMM; LVCSR; bottle-neck features; meetings recognition task; probabilistic features; recognition system; Discrete cosine transforms; Discrete transforms; Feature extraction; Hidden Markov models; Merging; NIST; Neural networks; Performance evaluation; Principal component analysis; Speech recognition; LVCSR; Probabilistic features; TRAP-based features; bottle-neck features; meeting recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367023