مرکز منطقه ای اطلاع رساني علوم و فناوري - Modification of the speech feature extraction module for the improvement of the system for automatic lectures transcription

DocumentCode :

3648916

Title :

Modification of the speech feature extraction module for the improvement of the system for automatic lectures transcription

Author :

Josef Chaloupka;Petr Červa;Jan Silovský;Jindfích Žd´ánský;Jan Nouza

Author_Institution :

Laboratory of Computer Speech Processing, Institute of Information Technology and Electronics, Technical University of Liberec, Liberec, Czech Republic

fYear :

2012

Firstpage :

223

Lastpage :

226

Abstract :

This contribution is about experiments with different speech feature extraction methods and strategies where the goal has been to improve the result and the resulting recognition rate of the speech recognizer of an automatic audio speech signal transcription system. The extraction of speech features is based on MFCC (Mel Frequency Cepstral Coefficients) and PLP (Perceptual Linear Prediction), which are normally used in different transcription systems around the world. The speech recognizer with different speech features has been tested on our speech database where audio (or video) recordings from archives of university lectures are stored. The result from our experiments is that we get higher recognition rate if PLP based audio speech features are used.

Keywords :

"Speech recognition","Speech","Feature extraction","Hidden Markov models","Mel frequency cepstral coefficient","Databases","Vocabulary"

Publisher :

ieee

Conference_Titel :

ELMAR, 2012 Proceedings

ISSN :

1334-2630

Print_ISBN :

978-1-4673-1243-1

Type :

conf

Filename :

6338511

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3648916