Sparsification of voice data using Discrete Rajan Transform and its applications in speaker recognition

Author

Prashanthi, G. ; Singh, Sushil ; Rajan, E.G. ; Krishnan, Prasad

Author_Institution

Pentagram Res. Centre Pvt Ltd., Hyderabad, India

fYear

2014

fDate

5-8 Oct. 2014

Firstpage

429

Lastpage

434

Abstract

This paper proposes a novel technique of sparsing speech data and compressing it in spectral domain. Discrete Rajan Transform is applied to voice data and the spectrum is sparsed by retaining the first component CPI (Cumulative Point Index) of the spectrum and forcing the other spectral components to zero. Thus the spectrum could be compressed to a maximum of 12.5% of the original data. As and when required the compressed spectrum could be synthesized using Inverse Discrete Rajan Transform and the reconstructed speech data analyzed for speaker recognition. Speaker recognition accuracy to a maximum of 93.5% has been obtained in this case.

Keywords

Hadamard transforms; data compression; discrete transforms; encoding; inverse transforms; speaker recognition; spectral analysis; CPI; compressed spectrum synthesis; cumulative point index; inverse discrete Rajan transform; reconstructed speech data analysis; speaker recognition accuracy; spectral components; spectral domain; spectrum sparsing; speech data compression; speech data sparsing technique; voice data sparsification; Accuracy; Databases; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Transforms; Discrete Rajan Transform; MFCC and Fuzzy Vector Quantization; Speech Processing and Speaker Recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on

Conference_Location

San Diego, CA

Type

conf

DOI

10.1109/SMC.2014.6973945

Filename

6973945