• DocumentCode
    2176296
  • Title

    Phoneme recognition using Boosted Binary Features

  • Author

    Roy, Anindya ; Magimai-Doss, Mathew ; Marcel, Sébastien

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4868
  • Lastpage
    4871
  • Abstract
    In this paper, we propose a novel parts-based binary-valued feature for ASR. This feature is extracted using boosted ensembles of simple threshold-based classifiers. Each such classifier looks at a specific pair of time-frequency bins located on the spectro-temporal plane. These features termed as Boosted Binary Features (BBF) are integrated into standard HMM-based system by using multilayer perceptron (MLP) and single layer perceptron (SLP). Preliminary studies on TIMIT phoneme recognition task show that BBF yields similar or better performance compared to MFCC (67.8% accuracy for BBF vs. 66.3% accuracy for MFCC) using MLP, while it yields significantly better performance than MFCC (62.8% accuracy for BBF vs. 45.9% for MFCC) using SLP. This demonstrates the potential of the proposed feature for speech recognition.
  • Keywords
    feature extraction; hidden Markov models; multilayer perceptrons; pattern classification; speech recognition; time-frequency analysis; ASR; BBF; HMM-based system; MLP; SLP; TIMIT phoneme recognition task; boosted binary feature; multilayer perceptron; parts-based binary-valued feature; simple threshold-based classifier; single layer perceptron; spectrotemporal plane; speech recognition; time-frequency bin; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Time frequency analysis; Training; Phoneme recognition; automatic speech recognition; binary features; boosting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947446
  • Filename
    5947446