مرکز منطقه ای اطلاع رساني علوم و فناوري - Adding Voicing Features into Speech Recognition Based on HMM in Slovak

DocumentCode :

2887880

Title :

Adding Voicing Features into Speech Recognition Based on HMM in Slovak

Author :

Kacur, Juraj ; Rozinaj, Gregor

Author_Institution :

Dept. of Telecommun., STU, Bratislava, Slovakia

fYear :

2009

fDate :

18-20 June 2009

Firstpage :

Lastpage :

Abstract :

This article discusses the impact of substituting some of the basic speech features with the voiced/ unvoiced information and possibly with the estimated pitch value. As a good measure of the signal´s voicing the average magnitude difference function was assumed, especially the ratio of its average value to its local minima found within the accepted ranges of the pitch. Furthermore, the pitch itself was used as an auxiliary feature to the base MFCC and PLP features. Experiments were performed on the professional database SPEECHDAT-SK for mobile applications working in harsh conditions, using various HMM models of context dependent and independent phonemes. All models were trained following the MASPER training scheme. In all cases the voicing feature brought improved results by more than 9% compared to the base systems. However the role of the pitch itself in the case of speaker independent ASR system evaluated over different tasks was not always so beneficial.

Keywords :

natural language processing; speech recognition; HMM; MASPER training scheme; MFCC; PLP features; SPEECHDAT-SK; Slovak; automatic speech recognition system; context dependent phonemes; context independent phonemes; mobile applications; pitch value estimation; professional database; voicing feature addition; Acceleration; Automatic speech recognition; Context modeling; Data mining; Frequency estimation; Hidden Markov models; Mel frequency cepstral coefficient; Robustness; Spatial databases; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Signals and Image Processing, 2009. IWSSIP 2009. 16th International Conference on

Conference_Location :

Chalkida

Print_ISBN :

978-1-4244-4530-1

Electronic_ISBN :

978-1-4244-4530-1

Type :

conf

DOI :

10.1109/IWSSIP.2009.5367743

Filename :

5367743

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2887880