• DocumentCode
    2423765
  • Title

    Speaker normalization using dynamic frequency warping

  • Author

    Huang, Zhenhua ; Hou, Limin

  • Author_Institution
    Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai
  • fYear
    2008
  • fDate
    7-9 July 2008
  • Firstpage
    1091
  • Lastpage
    1095
  • Abstract
    In an effort to reduce the degradation in a gender-independence isolated word recognition performance caused by variation character among different speaker, a dynamic frequency warping approach to speaker normalization is investigated. There are a lot of discrepancy in frequency domain which caused by vocal tract length difference among different speakers. Dynamic frequency warping (DFW) is an exact analog of dynamic time warping (DTW) which is used to reduce the discrepancy frequency scale of speech and normalize the frequency accurately. In this paper, the DFW method is to be introduced to normalize the frequency scale of speech and then applied it to a gender-independence isolated word recognition system. The results of experiments show a large improvement in average word error rate.
  • Keywords
    speaker recognition; speech synthesis; average word error rate; dynamic frequency warping; dynamic time warping; gender-independence isolated word recognition performance; speaker normalization; vocal tract length difference; Character recognition; Degradation; Error analysis; Frequency domain analysis; Frequency estimation; Loudspeakers; Piecewise linear techniques; Speech analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1723-0
  • Electronic_ISBN
    978-1-4244-1724-7
  • Type

    conf

  • DOI
    10.1109/ICALIP.2008.4590058
  • Filename
    4590058