DocumentCode :
2979162
Title :
A tonotopic artificial neural network architecture for phoneme probability estimation
Author :
Ström, Nikko
Author_Institution :
Dept. of Speech, Music & Hearing, R. Inst. of Technol., Stockholm, Sweden
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
156
Lastpage :
163
Abstract :
A novel sparse ANN connection scheme is proposed. It is inspired by the so called tonotopic organization of the auditory nerve, and allows a more detailed representation of the speech spectrum to be input to an ANN than is commonly used. A consequence of the new connection scheme is that more resources are allocated to analysis within narrow frequency sub bands-a concept that has recently been investigated by others with so called sub band ASR. ANNs with the proposed architecture have been evaluated on the TIMIT database for phoneme recognition, and are found to give better phoneme recognition performance than ANNs based on standard mel frequency cepstrum input. The lowest achieved phone error rate, 26.7%, is very close to the lowest published result for the core test set of the TIMIT database
Keywords :
cepstral analysis; neural nets; probability; resource allocation; speech processing; speech recognition; TIMIT database; auditory nerve; connection scheme; narrow frequency sub bands; novel sparse ANN connection scheme; phone error rate; phoneme probability estimation; phoneme recognition performance; resource allocation; speech spectrum; standard mel frequency cepstrum input; sub band ASR; tonotopic artificial neural network architecture; tonotopic organization; Artificial neural networks; Auditory system; Automatic speech recognition; Cepstrum; Databases; Frequency; Hidden Markov models; Radio spectrum management; Resource management; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.659000
Filename :
659000
Link To Document :
بازگشت