• DocumentCode
    3431984
  • Title

    A study of applying subspace based pronunciation modeling in verifying pronunciation accuracy

  • Author

    Yin, Shou-Chun ; Rose, Richard ; Tang, Yun

  • Author_Institution
    Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    59
  • Lastpage
    64
  • Abstract
    This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers´ utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.
  • Keywords
    Gaussian processes; handicapped aids; speech recognition; EER; PV approach; SGMM; equal error rate; impaired children; neuromuscular disorders; phoneme level mispronunciations; pronunciation accuracy; pronunciation verification; state level projection vectors; subspace based Gaussian mixture model; subspace based pronunciation modeling; utterance; Equations; Hidden Markov models; Mathematical model; Medical treatment; Speech; Statistics; Vectors; automated speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310622
  • Filename
    6310622