Title :
Enhancement of alaryngeal speech utilizing spectral subtraction and minimum statistics
Author :
Kabir, Raonaak ; Greenblatt, Aaron ; Panetta, Karen ; Agaian, Sos
Author_Institution :
Dept. of Electr. & Comput. Eng., Tufts Univ., Medford, MA
Abstract :
This paper proposes improvements to the electrolarynx device, which allows a patient to speak after the larynx is removed. Speech through existing electrolarynx devices is corrupted by high levels of noise and sounds unnatural. The proposed algorithm is based upon spectral subtraction techniques and modifies the magnitude of the speech signal in the frequency domain. Here, with the introduction of Discrete Cosine Transform (DCT) domain analysis using minimum statistics, the proposed algorithm effectively reduces high levels of noise generated by the electrolarynx. Unlike existing methods, the proposed algorithm does not require the use of a voice activity detector and the Discrete Cosine Transform domain is more proficient at isolating speech signal energy. The new algorithm presented in this paper is readily adaptable to hardware implementation and has the potential to be included in a handheld electrolarynx device in the future.
Keywords :
discrete cosine transforms; spectral analysis; speech enhancement; statistics; alaryngeal speech enhancement; discrete cosine transform domain analysis; electrolarynx device; larynx; minimum statistics; spectral subtraction; speech signal energy; voice activity detector; Acoustic noise; Algorithm design and analysis; Discrete cosine transforms; Frequency domain analysis; Larynx; Noise level; Speech enhancement; Statistical analysis; Statistics; Subtraction techniques; Spectral subtraction; alaryngeal speech; cosine transforms; dct; electrolarynx; minimum statistics; speech enhancement;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4621049