مرکز منطقه ای اطلاع رساني علوم و فناوري - Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application

DocumentCode :

3108158

Title :

Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application

Author :

Dutta, K. ; Sarma, Kandarpa Kumar

Author_Institution :

Dept. of ECE, Gauhati Univ., Guwahati, India

fYear :

2012

fDate :

28-29 Dec. 2012

Firstpage :

600

Lastpage :

603

Abstract :

The current work proposes a prototype model for speech recognition in Assamese language using Linear Predictive Coding (LPC) and Mel frequency cepstral coefficient (MFCC). The speech recognition is a part of a speech to text conversion system. The LPC and MFCC features are extracted by two different Recurrent Neural Networks (RNN), which are used to recognize the vocal extract of Assamese language- a major language in the North Eastern part of India. In this work, decision block is designed by a combined framework of RNN block to extract the features. Using this combined architecture our system is able to generate 10% gain in the recognition rate than the case when individual architectures are used.

Keywords :

feature extraction; linear predictive coding; natural languages; recurrent neural nets; speech recognition; speech synthesis; Assamese language; India; LPC; MFCC; Mel frequency cepstral coefficient; RNN-based Assamese speech recognition; linear predictive coding; multiple feature extraction; recurrent neural networks; speech-to-text conversion application; Feature extraction; Mel frequency cepstral coefficient; Recurrent neural networks; Speech; Speech coding; Speech processing; Speech recognition; LPC; MFCC; Moving Average Filter; RNN;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Communications, Devices and Intelligent Systems (CODIS), 2012 International Conference on

Conference_Location :

Kolkata

Print_ISBN :

978-1-4673-4699-3

Type :

conf

DOI :

10.1109/CODIS.2012.6422274

Filename :

6422274

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3108158