DocumentCode
173182
Title
Sparsification of voice data using Discrete Rajan Transform and its applications in speaker recognition
Author
Prashanthi, G. ; Singh, Sushil ; Rajan, E.G. ; Krishnan, Prasad
Author_Institution
Pentagram Res. Centre Pvt Ltd., Hyderabad, India
fYear
2014
fDate
5-8 Oct. 2014
Firstpage
429
Lastpage
434
Abstract
This paper proposes a novel technique of sparsing speech data and compressing it in spectral domain. Discrete Rajan Transform is applied to voice data and the spectrum is sparsed by retaining the first component CPI (Cumulative Point Index) of the spectrum and forcing the other spectral components to zero. Thus the spectrum could be compressed to a maximum of 12.5% of the original data. As and when required the compressed spectrum could be synthesized using Inverse Discrete Rajan Transform and the reconstructed speech data analyzed for speaker recognition. Speaker recognition accuracy to a maximum of 93.5% has been obtained in this case.
Keywords
Hadamard transforms; data compression; discrete transforms; encoding; inverse transforms; speaker recognition; spectral analysis; CPI; compressed spectrum synthesis; cumulative point index; inverse discrete Rajan transform; reconstructed speech data analysis; speaker recognition accuracy; spectral components; spectral domain; spectrum sparsing; speech data compression; speech data sparsing technique; voice data sparsification; Accuracy; Databases; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Transforms; Discrete Rajan Transform; MFCC and Fuzzy Vector Quantization; Speech Processing and Speaker Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
Conference_Location
San Diego, CA
Type
conf
DOI
10.1109/SMC.2014.6973945
Filename
6973945
Link To Document