DocumentCode
700063
Title
Signal processing based segmentation and hmm based acoustic clustering of syllable segments for low bit rate segment vocoder at 1.4 Kbps
Author
Chevireddy, Sadhana ; Murthy, Hema A. ; Chandra Sekhar, C.
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear
2008
fDate
25-29 Aug. 2008
Firstpage
1
Lastpage
5
Abstract
In this paper, we propose a novel approach for developing a segment-based vocoder at very low bit-rates. The segmental unit chosen for coding is a syllable. A signal processing technique called automatic group delay based segmentation is used to obtain syllable like segments. The segment codebook is prepared by acoustically clustering the syllable segments using a Hidden Markov Model (HMM) based unsupervised and incremental training algorithm. When the residual is modeled using MELP, a bit-rate of 1.4 Kbps is achieved. The synthesized speech quality is compared with that of the standard MELP codec at 2.4 Kbps using the objective evaluation measure, PESQ.
Keywords
acoustic signal processing; hidden Markov models; pattern clustering; speech coding; speech synthesis; unsupervised learning; vocoders; HMM based acoustic clustering; PESQ objective evaluation measure; automatic group delay based segmentation; bit rate 1.4 kbit/s; bit rate 2.4 kbit/s; hidden Markov model; incremental training algorithm; low bit rate segment vocoder; segment codebook; segmental unit; signal processing based segmentation; signal processing technique; standard MELP codec; syllable segments; synthesized speech quality; unsupervised training algorithm; Color; Delays; Hidden Markov models; Signal processing; Speech; Speech coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2008 16th European
Conference_Location
Lausanne
ISSN
2219-5491
Type
conf
Filename
7080595
Link To Document