مرکز منطقه ای اطلاع رساني علوم و فناوري - Phonetically-based vector excitation coding of speech at 3.6 kbps

DocumentCode :

3520804

Title :

Phonetically-based vector excitation coding of speech at 3.6 kbps

Author :

Wang, Shihua ; Gersho, Allen

Author_Institution :

Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA

fYear :

1989

fDate :

23-26 May 1989

Firstpage :

Abstract :

A phonetically based segmentation of speech is performed to classify segments into five classes: onset, unvoiced low-pass voiced, steady-state voiced, and transient voiced. The segment lengths are constrained to an integer multiple of a unit-frame. For each segment class, a distinctive coding scheme based on vector excitation coding (VXC) is used. The maximum bit-rate is 3.6 kb/s, and a moderate coding delay of 45 ms is incurred. Performance is roughly comparable to conventional VXC/CELP (code-excited linear prediction) coding at 4.8 kb/s

Keywords :

encoding; filtering and prediction theory; speech analysis and processing; 3.6 kbit/s; VXC/CELP; low-pass voiced; onset; phonetically based segmentation; speech coding; steady-state voiced; transient voiced; unvoiced; vector excitation coding; Degradation; Delay; Distortion measurement; Filters; Linear predictive coding; Speech coding; Speech synthesis; Steady-state; Vectors; Vocoders;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Conference_Location :

Glasgow

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.1989.266360

Filename :

266360

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3520804