مرکز منطقه ای اطلاع رساني علوم و فناوري - Dynamic segmentation of vocal extract for Assamese Speech to Text Conversion using RNN

DocumentCode :

1923343

Title :

Dynamic segmentation of vocal extract for Assamese Speech to Text Conversion using RNN

Author :

Dutta, Krishna ; Sarma, Kandarpa Kumar

Author_Institution :

Dept. of Electron. & Commun. Technol., Gauhati Univ., Guwahati, India

fYear :

2012

fDate :

2-3 March 2012

Firstpage :

126

Lastpage :

131

Abstract :

The current work proposes a prototype Speech to Text Conversion System (STSC) in Assamese language using Linear Predictive Coding (LPC) and Recurrent Neural Network(RNN). The LPC features are extracted from utterances of isolated phonemes of Assamese language (a major language of North-East India). These are used to train a RNN by a proposed dynamic method. The proposed method segments an utterance with an optimal dynamic criterion to improve the success scores during testing of the STCS system. The proposed method dynamically adjusts the length of the windows required for recognizing different phonemes. The performance of the proposed method is compared with a conventional static RNN based STCS system which is trained using prior knowledge about length of windows required for recognizing different phonemes.

Keywords :

feature extraction; linear predictive coding; natural language processing; recurrent neural nets; speech recognition; Assamese language; Assamese speech to text conversion system; LPC; LPC feature extraction; RNN based STCS system; linear predictive coding; optimal dynamic criterion; phoneme recognition; recurrent neural network; vocal extract dynamic segmentation; Feature extraction; Finite impulse response filter; Speech; Speech processing; Speech recognition; Testing; Training; Dynamic Segmentation; LPC; Moving Average Filter; RNN; SCTS;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence and Signal Processing (CISP), 2012 2nd National Conference on

Conference_Location :

Guwahati, Assam

Print_ISBN :

978-1-4577-0719-3

Type :

conf

DOI :

10.1109/NCCISP.2012.6189692

Filename :

6189692

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1923343