• DocumentCode
    2280596
  • Title

    High performance telephone bandwidth speaker independent continuous digit recognition

  • Author

    Cosi, Piero ; Hosom, John-Paul ; Valente, Alberto

  • Author_Institution
    Ist. di Fonetica a Dialettologia, CNR, Padova, Italy
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    405
  • Lastpage
    408
  • Abstract
    The development of a high-performance telephone-bandwidth speaker independent connected digit recognizer for Italian is described. The CSLU Speech Toolkit was used to develop and implement the hybrid ANN/HMM system, which is trained on context-dependent categories to account for coarticulatory variation. Various front-end processing and system architectures were compared and, when the best features (MFCC with CMS + Δ) and network (4-layer fully connected feed-forward network) were considered, there was a 98.92% word recognition accuracy and a 92.62% sentence recognition accuracy on a test set of the FIELD continuous digits recognition task.
  • Keywords
    hidden Markov models; neural nets; speech recognition; 4-layer fully connected feed-forward network; Italian; coarticulatory variation; context dependent categories; front-end processing; high-performance telephone bandwidth speaker independent connected digit recognizer; hybrid ANN/HMM system; system architecture; Automatic speech recognition; Bandwidth; Collision mitigation; Feedforward systems; Hidden Markov models; Mel frequency cepstral coefficient; Natural languages; Speech recognition; System testing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034670
  • Filename
    1034670