• DocumentCode
    627216
  • Title

    Articulatory feature-based gender factor minimization in automatic speech recognition

  • Author

    Rahman, B. K. M. Mizanur ; Ahamed, Bulbul ; Islam, Rashed ; Huda, Mohammad Nurul

  • Author_Institution
    United Int. Univ., Dhaka, Bangladesh
  • fYear
    2013
  • fDate
    17-18 May 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper presents articulatory feature-based automatic speech recognition for Japanese spoken language. Automatic speech recognition system suffers from some hidden factors, such as speaking style, gender effects, and noisy acoustic environment. These hidden factors degrade the performance of automatic speech recognizer. Therefore, the effect of these factors should be minimized for achieving better recognition performance. In this study, we have incorporated articulatory feature based gender effects normalization technique, where male- and female-dependent DPF extractors are firstly used to map LFs onto two DPF spaces corresponding to the gender type. Two DPF vectors extracted by each DPF extractor are called DPF-male and DPF-female, respectively. These DPF extractors are trained individually with a male speech and a female speech data set. In addition, a gender-independent (GI) DPF extractor is used to compensate errors of a DPF selector. GI-DPF extractor is trained with both the male and the female speech data set. After evaluating the Tohoku University and Matsushita Spoken Word Database it is observed that the proposed method improves word correct rate and word accuracies by a certain limit.
  • Keywords
    feature extraction; minimisation; natural language processing; speech recognition; DPF selector; DPF spaces; DPF vectors; DPF-female; DPF-male; GI-DPF extractor; Japanese spoken language; LF; Tohoku University and Matsushita Spoken Word Database; articulatory feature based gender effect normalization technique; articulatory feature-based automatic speech recognition; articulatory feature-based gender factor minimization; error compensation; female speech data set; female-dependent DPF extractors; gender-independent DPF extractor; male speech data set; male-dependent DPF extractors; word accuracy improvement; word correct rate improvement; Acoustics; Data mining; Feature extraction; Hidden Markov models; Speech; Speech recognition; Vectors; Articulatory Features; Distinctive Phonetic Features; Hidden Markov Model; Multilayer Neural Network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics, Electronics & Vision (ICIEV), 2013 International Conference on
  • Conference_Location
    Dhaka
  • Print_ISBN
    978-1-4799-0397-9
  • Type

    conf

  • DOI
    10.1109/ICIEV.2013.6572567
  • Filename
    6572567