• DocumentCode
    3521003
  • Title

    Automatic extraction of acoustic prototypes for large vocabulary speech recognition by using speaker-independent features

  • Author

    Colla, Anna Maria

  • Author_Institution
    Elettronica San Giorgio, Genova, Italy
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    93
  • Abstract
    A description is presented of AUTOSEGM, a novel method for automatic extraction of acoustic prototypes for large-vocabulary speech recognition systems based on diphone-like subword segments. AUTOSEGM compares favorably with previous methods for automatic extraction of diphone-like prototypes in that it does not require a set of training templates derived from another talker to bootstrap the new talker´s prototypes. AUTOSEGM exploits only general speaker-independent acoustic/phonetic knowledge, in the form of a general language model and standard acoustic parameters (cepstral derivative, smoothed energy contour). The core of AUTOSEGM is a self-segmentation procedure: the speech material is segmented into pseudosyllables, within which some occurrences of the diphone-like segments are located. A recognition performance of about 78% correct for the top candidate was achieved in a large-vocabulary isolated word recognition task
  • Keywords
    speech recognition; AUTOSEGM; acoustic prototypes; automatic extraction; diphone-like subword segments; isolated word recognition; large vocabulary speech recognition; pseudosyllables; self-segmentation procedure; speaker-independent features; Acoustic materials; Acoustic signal detection; Automatic speech recognition; Decoding; Loudspeakers; Maintenance; Natural languages; Prototypes; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266371
  • Filename
    266371