• DocumentCode
    2979162
  • Title

    A tonotopic artificial neural network architecture for phoneme probability estimation

  • Author

    Ström, Nikko

  • Author_Institution
    Dept. of Speech, Music & Hearing, R. Inst. of Technol., Stockholm, Sweden
  • fYear
    1997
  • fDate
    14-17 Dec 1997
  • Firstpage
    156
  • Lastpage
    163
  • Abstract
    A novel sparse ANN connection scheme is proposed. It is inspired by the so called tonotopic organization of the auditory nerve, and allows a more detailed representation of the speech spectrum to be input to an ANN than is commonly used. A consequence of the new connection scheme is that more resources are allocated to analysis within narrow frequency sub bands-a concept that has recently been investigated by others with so called sub band ASR. ANNs with the proposed architecture have been evaluated on the TIMIT database for phoneme recognition, and are found to give better phoneme recognition performance than ANNs based on standard mel frequency cepstrum input. The lowest achieved phone error rate, 26.7%, is very close to the lowest published result for the core test set of the TIMIT database
  • Keywords
    cepstral analysis; neural nets; probability; resource allocation; speech processing; speech recognition; TIMIT database; auditory nerve; connection scheme; narrow frequency sub bands; novel sparse ANN connection scheme; phone error rate; phoneme probability estimation; phoneme recognition performance; resource allocation; speech spectrum; standard mel frequency cepstrum input; sub band ASR; tonotopic artificial neural network architecture; tonotopic organization; Artificial neural networks; Auditory system; Automatic speech recognition; Cepstrum; Databases; Frequency; Hidden Markov models; Radio spectrum management; Resource management; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
  • Conference_Location
    Santa Barbara, CA
  • Print_ISBN
    0-7803-3698-4
  • Type

    conf

  • DOI
    10.1109/ASRU.1997.659000
  • Filename
    659000