• DocumentCode
    578167
  • Title

    An improved VQ based algorithm for recognizing speaker-independent isolated words

  • Author

    Ma, Ding-ding ; Zeng, Xiao-qin

  • Author_Institution
    Inst. of Intell. Sci. & Technol., Hohai Univ., Nanjing, China
  • Volume
    2
  • fYear
    2012
  • fDate
    15-17 July 2012
  • Firstpage
    792
  • Lastpage
    796
  • Abstract
    In this paper, an improved codebook generation algorithm called SLVQ (Speaker Level Vector quantization) is proposed, which can improve the recognition accuracy of speaker independent isolated words. Linde-Buzo-Gary (LBG) algorithm is the most commonly used codebook design method. The idea behind LBG is to find an optimal codebook that minimizes the distortion between the training words and the codebook. But this does not guarantee that the testing words also have minimum distortion as training words. To address the problem of producing poor codebook for testing words in speaker independent speech recognition, the proposed method makes use of the diversity of different speakers by randomly selecting some speakers and their pronounced words in the codebook design procedure to optimize codebooks. An evaluation experiment has been conducted to compare the speech recognition performance of the codebooks produced by the LBG, the LVQ (learning vector quantization), and the SLVQ. It is clearly shown that the SLVQ method performs better than the other two methods.
  • Keywords
    learning (artificial intelligence); speaker recognition; vector quantisation; LBG algorithm; Linde-Buzo-Gary algorithm; SLVQ; VQ based algorithm; codebook design method; codebook generation algorithm; learning vector quantization; speaker independent speech recognition; speaker level vector quantization; speaker-independent isolated word recognition; testing words; training words; Abstracts; Convergence; Codebook generation; Speaker Independent; Speech recognition; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
  • Conference_Location
    Xian
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4673-1484-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2012.6359026
  • Filename
    6359026