• DocumentCode
    3692935
  • Title

    Voice control for a gripper using Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models

  • Author

    Gustavo Velasco-Hernandez;Andrés Díaz-Toro

  • Author_Institution
    Universidad del Valle, Colombia
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This work presents an implementation of a speaker-dependent speech recognition system used to control a gripper. The application was made using MATLAB and the gripper was assembled using the Lego Mindstorm NXT robotic kit. Four commands are implemented for controlling the gripper: Open, close, rotate left and rotate right. The development was divided into two stages. In training stage, we use Mel Frequency Cepstral Coefficients (MFCCs) and Gaussian Mixture Models (GMMs) to generate a representation of each defined command. Then, in testing stage, those models are used to identify the speaker´s utterance and send the command to the actuator. Finally, we present test results that show a performance of 95.09% for our system, and then we compare it with similar works.
  • Keywords
    "Mel frequency cepstral coefficient","Speech recognition","Testing","Training","Computational modeling","Grippers","Feature extraction"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Images and Computer Vision (STSIVA), 2015 20th Symposium on
  • Type

    conf

  • DOI
    10.1109/STSIVA.2015.7330391
  • Filename
    7330391