• DocumentCode
    134181
  • Title

    A low complexity cluster model interpolation based on-line adaptation technique for spoken query systems

  • Author

    Shahnawazuddin, S. ; Sinha, Roopak

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    437
  • Lastpage
    441
  • Abstract
    The work presented in this paper describes the issues of on-line adaption in context of spoken query systems. In such systems, the available adaptation data is extremely small (≤ 3 seconds). Consequently, adapting such systems becomes extremely challenging. Moreover, since these systems are meant for real-time applications, the employed adaptation technique should not add much latency to the system response. To address these issues, a simple cluster model interpolation based approach for on-line adaptation is presented in this work. The proposed approach employs an OMP based search scheme to select a set of acoustically close models from a set of pre-trained cluster models. The selected cluster models are then linearly interpolated to derive the adapted model parameters. In this work, these interpolation weights are derived from the sparse coefficients in an approximate manner. Such an approximate approach helps in avoiding the iterative ML weight estimation usually employed in existing techniques. The proposed adaptation approach though not optimal, is found to be effective for on-line adaptation. The same has been verified in this work for an LVCSR task and also for an Assamese name recognition system which is a typical example of such query systems.
  • Keywords
    audio user interfaces; interpolation; pattern clustering; query processing; real-time systems; speech recognition; Assamese name recognition system; LVCSR task; OMP based search scheme; acoustic model interpolation; adaptation data; interpolation weights; iterative ML weight estimation; linear interpolation; low complexity cluster model interpolation; model parameters; online adaptation technique; real-time applications; sparse coefficients; spoken query systems; system response; Acoustics; Adaptation models; Complexity theory; Estimation; Hidden Markov models; Interpolation; Silicon; Spoken query system; acoustic model interpolation; fast adaptation; on-line adaptation; sparse representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936573
  • Filename
    6936573