• DocumentCode
    1660713
  • Title

    Acoustic-to-articulatory inversion using Particle Swarm Optimization

  • Author

    Fairee, Suthida ; Sirinaovakul, Booncharoen ; Prom-on, Santitham

  • Author_Institution
    Dept. of Comput. Eng., King Mongkut´s Univ. of Technol. Thonburi, Bangkok, Thailand
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper proposes an acoustic-to-articulatory inversion using the Particle Swarm Optimization (PSO). We present a schematic and a detailed design of the acoustic-to-articulatory inversion system. The system is implemented by using Praat script and Java, with VocalTractLab as a speech synthesizer. The target data of our synthetic utterance are 5 disyllabic utterances consisting of 9 Thai monophthongs. For each syllable, the synthetic utterances are synthesized from 15 articulatory parameters by which their values are estimated using PSO with inertia weight. The fitness values of the system are evaluated in the term of the sum of the squared errors (SSEs) of these articulatory parameter values. To assess the results, the original and the synthetic utterances are compared in the forms of spectrograms, and F1-F3 formant frequency contours. For our system results, the good agreement between the original and synthetic utterance was achieved for F1 and F2.
  • Keywords
    bioacoustics; biomechanics; biomedical measurement; error analysis; inverse problems; medical signal processing; particle swarm optimisation; speech; speech processing; speech synthesis; F1-F3 formant frequency contour; Java; PSO inertia weight; Praat script; SSE; Thai monophthongs; VocalTractLab; acoustic-to-articulatory inversion system; articulatory parameter value estimation; disyllabic utterance; fitness value; original utterance-synthetic utterance comparison; particle swarm optimization; spectrogram comparison; speech synthesizer; sum of the squared error; syllable; synthetic utterance synthesis; synthetic utterance target data; Acoustics; Heuristic algorithms; Optimization; Particle swarm optimization; Speech; Tongue; XML; acoustic-to-articulatory inversion; particle swarm optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2015 12th International Conference on
  • Conference_Location
    Hua Hin
  • Type

    conf

  • DOI
    10.1109/ECTICon.2015.7206999
  • Filename
    7206999