• DocumentCode
    3674647
  • Title

    An image based approach for speech perception

  • Author

    Nguyen Quang Trung;Bui The Duy;Ma Thi Chau

  • Author_Institution
    Human Machine Interaction Laboratory, University of Engineering &
  • fYear
    2015
  • Firstpage
    208
  • Lastpage
    213
  • Abstract
    Classification of speech signal is one of the most vital problems in speech perception and spoken word recognition. Although, there have been many studies on the classification of speech signals but the results are still limited. In this paper, we propose an image based approach for speech signal classification based on the combination of Local Naïve Bayes Nearest Neighbor (LNBNN) and Scale-invariant Feature Transform (SIFT) features. The proposed approach allows training feature vectors to have different sizes and no re-training is needed for additional training data after training phase. With this approach, achieved classification results are very satisfactory. They are 72.8%, 100% and 95.0% on the ISOLET, Digits and Places databases, respectively.
  • Keywords
    "Speech","Speech recognition","Feature extraction","Databases","Hidden Markov models","Mel frequency cepstral coefficient","Training"
  • Publisher
    ieee
  • Conference_Titel
    Information and Computer Science (NICS), 2015 2nd National Foundation for Science and Technology Development Conference on
  • Print_ISBN
    978-1-4673-6639-7
  • Type

    conf

  • DOI
    10.1109/NICS.2015.7302192
  • Filename
    7302192