• DocumentCode
    2913209
  • Title

    Supervised selection of prototypes for classification [speech recognition]

  • Author

    Das, Subrata

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    1990
  • fDate
    3-6 Apr 1990
  • Firstpage
    697
  • Abstract
    Given sufficient samples of data tagged with their class identities, three techniques for constructing supervised prototypes to represent these classes are examined. The first method consists of averaging the tokens of each class separately to obtain the prototypes. In the second approach, several tokens, picked uniformly from each class, are designated as prototypes. The third technique involves a systematic search procedure to select effective prototypes and discard obsolete ones. Approximately two hours of continuous speech data from each of two speakers were used for experimentation. Each centisecond frame of speech was labeled with one of 200 phonetic subunit names utilizing hidden Markov model training and Viterbi alignment procedures. Prototypes were determined from the first part of the data, whereas the last part served to measure the classification performance. Average accuracies ranged from 24.2% with 200 prototypes in the first, to 31.5% with 32000 prototypes in the second, to 38.5% with 2258 prototypes in the third method
  • Keywords
    Markov processes; learning systems; speech recognition; Viterbi alignment; hidden Markov model; learning systems; speech recognition; supervised prototypes; systematic search; Databases; Displays; Frequency; Hidden Markov models; Prototypes; Spectrogram; Speech; Speech recognition; Testing; Training data; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
  • Conference_Location
    Albuquerque, NM
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1990.115858
  • Filename
    115858