مرکز منطقه ای اطلاع رساني علوم و فناوري - Isolated voiced digit recognition using inductive inference

DocumentCode :

312523

Title :

Isolated voiced digit recognition using inductive inference

Author :

Samouelian, A.

Author_Institution :

Dept. of Electr. & Comput. Eng., Wollongong Univ., NSW

Volume :

fYear :

1996

fDate :

26-29 Nov 1996

Firstpage :

119

Abstract :

This paper proposes the use of inductive inference “decision trees” for isolated digit recognition. The aim of this research is to demonstrate that inductive learning can provide an alternative approach to existing automatic speech recognition techniques such as dynamic time warping (DTW), hidden Markov modelling (HMM) and neural networks (NN). The construction of the decision tree is based on C4.5 inductive system developed by Quinlan (1986, 1993), The decision tree is generated automatically from the training speech database. The database contains labelled examples in the form of a feature vector and its corresponding label, for each frame. The feature vector may consist of any number of different feature sets and the label may be at the word, phonetic class or phoneme level. The recognition is performed at the frame level, using an inference engine to execute the decision tree and classify the firing of the rules. The proposed system has two main advantages. Firstly, it uses the data-driven approach to isolated word classification, thus attempting to solve the problem of inter and intra speaker speech variability, by the use of a large speech database. Secondly, it has the ability to generate decision trees using any combination of features (parametric or acoustic-phonetic). This allows the integration of features from existing signal processing techniques, that are currently used in HMM stochastic modelling, and acoustic-phonetic features, which have been the cornerstone of traditional knowledge based techniques. Isolated digit recognition results for Texas Instruments (TI) digit database, for speaker dependent and independent recognition, are presented

Keywords :

decision theory; feature extraction; inference mechanisms; learning by example; speech recognition; C4.5 inductive system; HMM stochastic modelling; Texas Instruments digit database; acoustic-phonetic features; data-driven approach; decision trees; feature vector; frame level; inductive inference; inductive learning; inference engine; inter speaker speech variability; intra speaker speech variability; isolated digit recognition; isolated word classification; phoneme level; phonetic class level; speaker dependent recognition; speaker independent recognition; speech recognition; speech recognition techniques; training speech database; word level; Acoustic signal processing; Automatic speech recognition; Classification tree analysis; Decision trees; Engines; Hidden Markov models; Loudspeakers; Neural networks; Spatial databases; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

TENCON '96. Proceedings., 1996 IEEE TENCON. Digital Signal Processing Applications

Conference_Location :

Perth, WA

Print_ISBN :

0-7803-3679-8

Type :

conf

DOI :

10.1109/TENCON.1996.608727

Filename :

608727

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=312523