• DocumentCode
    2133599
  • Title

    A new i-vector approach and its application to irrelevant variability normalization based acoustic model training

  • Author

    Zhang, Yu ; Yan, Zhi-Jie ; Huo, Qiang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a new approach to extracting a low-dimensional i-vector from a speech segment to represent acoustic information irrelevant to phonetic classification. Compared with the traditional i-vector approach, a full factor analysis model with a residual term is used. New procedures for hyperparameter estimation and i-vector extraction are derived and presented. The proposed i-vector approach is applied to acoustic sniffing for irrelevant variability normalization based acoustic model training in large vocabulary continuous speech recognition. Its effectiveness is confirmed by experimental results on Switchboard-1 conversational telephone speech transcription task.
  • Keywords
    speech processing; speech recognition; acoustic model training; acoustic sniffing; factor analysis model; hyperparameter estimation; i-vector extraction; irrelevant variability normalization; low-dimensional i-vector; phonetic classification; residual term; speech segment; switchboard-1 conversational telephone speech transcription task; vocabulary continuous speech recognition; Acoustic measurements; Acoustics; Hidden Markov models; Speech; Training; Transforms; Vectors; LVCSR; acoustic model; i-vector; irrelevant variability normalization; unsupervised adaption;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on
  • Conference_Location
    Santander
  • ISSN
    1551-2541
  • Print_ISBN
    978-1-4577-1621-8
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2011.6064637
  • Filename
    6064637