Automatic extraction of acoustic prototypes for large vocabulary speech recognition by using speaker-independent features

Author

Colla, Anna Maria

Author_Institution

Elettronica San Giorgio, Genova, Italy

fYear

1989

fDate

23-26 May 1989

Firstpage

93

Abstract

A description is presented of AUTOSEGM, a novel method for automatic extraction of acoustic prototypes for large-vocabulary speech recognition systems based on diphone-like subword segments. AUTOSEGM compares favorably with previous methods for automatic extraction of diphone-like prototypes in that it does not require a set of training templates derived from another talker to bootstrap the new talker´s prototypes. AUTOSEGM exploits only general speaker-independent acoustic/phonetic knowledge, in the form of a general language model and standard acoustic parameters (cepstral derivative, smoothed energy contour). The core of AUTOSEGM is a self-segmentation procedure: the speech material is segmented into pseudosyllables, within which some occurrences of the diphone-like segments are located. A recognition performance of about 78% correct for the top candidate was achieved in a large-vocabulary isolated word recognition task

Keywords

speech recognition; AUTOSEGM; acoustic prototypes; automatic extraction; diphone-like subword segments; isolated word recognition; large vocabulary speech recognition; pseudosyllables; self-segmentation procedure; speaker-independent features; Acoustic materials; Acoustic signal detection; Automatic speech recognition; Decoding; Loudspeakers; Maintenance; Natural languages; Prototypes; Speech recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Conference_Location

Glasgow

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1989.266371

Filename

266371