• DocumentCode
    26951
  • Title

    A Multi-Views Multi-Learners Approach Towards Dysarthric Speech Recognition Using Multi-Nets Artificial Neural Networks

  • Author

    Shahamiri, Seyed Reza ; Binti Salim, Siti Salwah

  • Author_Institution
    Dept. of Software Eng., Univ. of Malaya, Kuala Lumpur, Malaysia
  • Volume
    22
  • Issue
    5
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    1053
  • Lastpage
    1063
  • Abstract
    Automatic speech recognition (ASR) can be very helpful for speakers who suffer from dysarthria, a neurological disability that damages the control of motor speech articulators. Although a few attempts have been made to apply ASR technologies to sufferers of dysarthria, previous studies show that such ASR systems have not attained an adequate level of performance. In this study, a dysarthric multi-networks speech recognizer (DM-NSR) model is provided using a realization of multi-views multi-learners approach called multi-nets artificial neural networks, which tolerates variability of dysarthric speech. In particular, the DM-NSR model employs several ANNs (as learners) to approximate the likelihood of ASR vocabulary words and to deal with the complexity of dysarthric speech. The proposed DM-NSR approach was presented as both speaker-dependent and speaker-independent paradigms. In order to highlight the performance of the proposed model over legacy models, multi-views single-learner models of the DM-NSRs were also provided and their efficiencies were compared in detail. Moreover, a comparison among the prominent dysarthric ASR methods and the proposed one is provided. The results show that the DM-NSR recorded improved recognition rate by up to 24.67% and the error rate was reduced by up to 8.63% over the reference model.
  • Keywords
    medical signal processing; neural nets; neurophysiology; speech; speech processing; speech recognition; automatic speech recognition; dysarthria; dysarthric multinetworks speech recognizer model; motor speech articulators; multinets artificial neural networks; neurological disability; Accuracy; Artificial neural networks; Databases; Hidden Markov models; Speech; Speech recognition; Vocabulary; Dysarthria; dysarthric speech recognition; multi-nets artificial neural networks; multi-views multi-learners (MVML);
  • fLanguage
    English
  • Journal_Title
    Neural Systems and Rehabilitation Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1534-4320
  • Type

    jour

  • DOI
    10.1109/TNSRE.2014.2309336
  • Filename
    6762967