مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic discovery of features underlying the perception of voicing

DocumentCode :

2940294

Title :

Automatic discovery of features underlying the perception of voicing

Author :

Damper, R.I.

Author_Institution :

Dept. of Electron. & Comput. Sci., Southampton Univ., UK

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

3459

Abstract :

Responses of both human and animal listeners to synthetic stop-consonant/vowel stimuli in which voice-onset time (VOT) is uniformly varied are known to be `categorical´ but an explanation of this phenomenon remains elusive. A `composite´ model consisting of a physiologically-realistic auditory model feeding its patterns of neural firing to an artificial neural network is described. When trained by (supervised) error back-propagation on the extreme, end-points of the VOT continuum, the composite model is capable of reproducing closely listeners´ behaviour in classical categorical-perception (CP) studies. However, whether the model also reproduces the so-called boundary-shift phenomenon-whereby the phoneme boundary moves with place of articulation-apparently depends upon the precise details of the auditory model and so, by implication, upon subtle aspects of peripheral auditory processing. A first attempt at unsupervised training has been unsuccessful: the likely reason for this is outlined. It is anticipated that future work comparing the model´s responses for unsupervised versus supervised training will help to elucidate the mechanisms of categorical perception

Keywords :

backpropagation; hearing; neural nets; physiological models; speech processing; unsupervised learning; animal listeners; artificial neural network; auditory model; boundary-shift phenomenon; categorical perception studies; composite model; error backpropagation; human listeners; neural firing patterns; peripheral auditory processing; phoneme boundary; physiologically-realistic auditory model; supervised training; synthetic stop-consonant/vowel stimuli; unsupervised training; voice-onset time; voicing; Animals; Artificial neural networks; Biological system modeling; Computer science; Heart; Humans; Intelligent systems; Intersymbol interference; Speech; Tail;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479730

Filename :

479730

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2940294