مرکز منطقه ای اطلاع رساني علوم و فناوري - Perceptual MVDR-based unsupervised built-in speaker normalization for Kazakh speech recognition

DocumentCode :

263849

Title :

Perceptual MVDR-based unsupervised built-in speaker normalization for Kazakh speech recognition

Author :

Yessenbayev, Zhandos ; Yapanel, Umit

Author_Institution :

Nazarbayev Univ. Res. & Innovation Syst., Astana, Kazakhstan

fYear :

2014

fDate :

15-17 Oct. 2014

Firstpage :

Lastpage :

Abstract :

In this work we present a novel approach to unsupervised speaker normalization on top of the Perceptual MVDR-based Built-in Speaker Normalization technique. We showed that the proposed method can be efficient for the task of phonetic recognition on TIMIT and then applied it to Kazakh speech recognition. From the experiments, we see that this method is able to improve the relative performance of ASR systems up to 20% The analysis of the optimal warp factor selection by the algorithm revealed a nice gender separation ability which may be used for gender/speaker classification tasks.

Keywords :

natural language processing; speech recognition; ASR systems; Kazakh speech recognition; TIMIT; gender classification tasks; gender separation ability; optimal warp factor selection; perceptual MVDR-based unsupervised built-in speaker normalization; phonetic recognition; speaker classification tasks; Acoustics; Algorithm design and analysis; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training; Kazakh speech recognition; Unsupervised speaker normalization; phone recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Application of Information and Communication Technologies (AICT), 2014 IEEE 8th International Conference on

Conference_Location :

Astana

Print_ISBN :

978-1-4799-4120-9

Type :

conf

DOI :

10.1109/ICAICT.2014.7035914

Filename :

7035914

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=263849