مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature extraction using pre-trained convolutive bottleneck nets for dysarthric speech recognition

DocumentCode :

3716066

Title :

Feature extraction using pre-trained convolutive bottleneck nets for dysarthric speech recognition

Author :

Yuki Takashima;Toru Nakashika;Tetsuya Takiguchi;Yasuo Ariki

Author_Institution :

Graduate School of System Informatics, Kobe University, Japan

fYear :

2015

Firstpage :

1411

Lastpage :

1415

Abstract :

In this paper, we investigate the recognition of speech uttered by a person with an articulation disorder resulting from athetoid cerebral palsy based on a robust feature extraction method using pre-trained convolutive bottleneck networks (CBN). Generally speaking, the amount of speech data obtained from a person with an articulation disorder is limited because their burden is large due to strain on the speech muscles. Therefore, a trained CBN tends toward overfitting for a small corpus of training data. In our previous work, the experimental results showed speech recognition using features extracted from CBNs outperformed conventional features. However, the recognition accuracy strongly depends on the initial values of the convolution kernels. To prevent overfitting in the networks, we introduce in this paper a pre-training technique using a convolutional restricted Boltzmann machine (CRBM). Through word-recognition experiments, we confirmed its superiority in comparison to convolutional networks without pre-training.

Keywords :

"Feature extraction","Convolution","Speech","Speech recognition","Europe","Kernel"

Publisher :

ieee

Conference_Titel :

Signal Processing Conference (EUSIPCO), 2015 23rd European

Electronic_ISBN :

2076-1465

Type :

conf

DOI :

10.1109/EUSIPCO.2015.7362616

Filename :

7362616

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3716066