• DocumentCode
    173182
  • Title

    Sparsification of voice data using Discrete Rajan Transform and its applications in speaker recognition

  • Author

    Prashanthi, G. ; Singh, Sushil ; Rajan, E.G. ; Krishnan, Prasad

  • Author_Institution
    Pentagram Res. Centre Pvt Ltd., Hyderabad, India
  • fYear
    2014
  • fDate
    5-8 Oct. 2014
  • Firstpage
    429
  • Lastpage
    434
  • Abstract
    This paper proposes a novel technique of sparsing speech data and compressing it in spectral domain. Discrete Rajan Transform is applied to voice data and the spectrum is sparsed by retaining the first component CPI (Cumulative Point Index) of the spectrum and forcing the other spectral components to zero. Thus the spectrum could be compressed to a maximum of 12.5% of the original data. As and when required the compressed spectrum could be synthesized using Inverse Discrete Rajan Transform and the reconstructed speech data analyzed for speaker recognition. Speaker recognition accuracy to a maximum of 93.5% has been obtained in this case.
  • Keywords
    Hadamard transforms; data compression; discrete transforms; encoding; inverse transforms; speaker recognition; spectral analysis; CPI; compressed spectrum synthesis; cumulative point index; inverse discrete Rajan transform; reconstructed speech data analysis; speaker recognition accuracy; spectral components; spectral domain; spectrum sparsing; speech data compression; speech data sparsing technique; voice data sparsification; Accuracy; Databases; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Transforms; Discrete Rajan Transform; MFCC and Fuzzy Vector Quantization; Speech Processing and Speaker Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • Type

    conf

  • DOI
    10.1109/SMC.2014.6973945
  • Filename
    6973945