Title : 
Sinusoidal speech coding at 2.4 kbps using an improved phase matching algorithm
         
        
            Author : 
Ahmadi, Sassan ; Spenias, A.S.
         
        
            Author_Institution : 
Nokia Mobile Phones Inc., San Diego, CA, USA
         
        
        
        
        
        
            Abstract : 
This paper addresses the design, development, evaluation, and implementation of efficient low bit rate speech coding algorithms based on the sinusoidal model. A series of algorithms have been developed for pitch frequency determination and voicing detection, simultaneous modeling of the sinusoidal amplitudes and phases, and mid-frame interpolation. An improved sinusoidal phase matching algorithm is presented, where short-time sinusoidal phases are approximated using an elaborate combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. A voicing-dependent perceptual split vector quantization scheme is used to encode the sinusoidal amplitudes. The perceptual properties of the human auditory system are effectively exploited in the developed algorithms. The algorithms have been successfully integrated into a 2.4 kbps sinusoidal coder. The performance of the 2.4 kbps coder has been evaluated in terms of subjective tests such as the mean opinion score and the diagnostic rhyme test, as well as some perceptually-motivated objective distortion measures. Performance analysis on a large speech database indicates that the use of the proposed algorithms resulted in considerable improvement in temporal and spectral signal matching, as well as improved subjective quality of the reproduced speech.
         
        
            Keywords : 
delays; hearing; interpolation; prediction theory; signal detection; signal sampling; spectral analysis; speech coding; vector quantisation; 2.4 kbit/s; delay compensation; diagnostic rhyme test; human auditory system; large speech database; linear prediction; low bit rate speech coding algorithms; mean opinion score; mid-frame interpolation; objective distortion measures; performance analysis; phase correction; pitch frequency determination; reproduced speech; short-time sinusoidal phases; sinusoidal amplitudes; sinusoidal coder; sinusoidal phase matching algorithm; sinusoidal speech coding; spectral sampling; spectral signal matching; subjective quality; subjective tests; temporal signal matching; vector quantization; voicing detection; voicing-dependent perceptual split VQ; Algorithm design and analysis; Bit rate; Distortion measurement; Frequency conversion; Interpolation; Phase detection; Phase frequency detector; Speech analysis; Speech coding; Testing;
         
        
        
        
            Conference_Titel : 
Signals, Systems & Computers, 1997. Conference Record of the Thirty-First Asilomar Conference on
         
        
            Conference_Location : 
Pacific Grove, CA, USA
         
        
        
            Print_ISBN : 
0-8186-8316-3
         
        
        
            DOI : 
10.1109/ACSSC.1997.679071