Title :
A tonotopic artificial neural network architecture for phoneme probability estimation
Author_Institution :
Dept. of Speech, Music & Hearing, R. Inst. of Technol., Stockholm, Sweden
Abstract :
A novel sparse ANN connection scheme is proposed. It is inspired by the so called tonotopic organization of the auditory nerve, and allows a more detailed representation of the speech spectrum to be input to an ANN than is commonly used. A consequence of the new connection scheme is that more resources are allocated to analysis within narrow frequency sub bands-a concept that has recently been investigated by others with so called sub band ASR. ANNs with the proposed architecture have been evaluated on the TIMIT database for phoneme recognition, and are found to give better phoneme recognition performance than ANNs based on standard mel frequency cepstrum input. The lowest achieved phone error rate, 26.7%, is very close to the lowest published result for the core test set of the TIMIT database
Keywords :
cepstral analysis; neural nets; probability; resource allocation; speech processing; speech recognition; TIMIT database; auditory nerve; connection scheme; narrow frequency sub bands; novel sparse ANN connection scheme; phone error rate; phoneme probability estimation; phoneme recognition performance; resource allocation; speech spectrum; standard mel frequency cepstrum input; sub band ASR; tonotopic artificial neural network architecture; tonotopic organization; Artificial neural networks; Auditory system; Automatic speech recognition; Cepstrum; Databases; Frequency; Hidden Markov models; Radio spectrum management; Resource management; Testing;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659000