DocumentCode :
672820
Title :
Estimation of instants of significant excitation using accumulated energy function of DCT
Author :
Sripriya, N. ; Nagarajan, T.
Author_Institution :
Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
fYear :
2013
fDate :
25-27 Nov. 2013
Firstpage :
1
Lastpage :
6
Abstract :
This correspondence proposes an effective algorithm for estimation of instants of significant excitation that are exclusively present in the voiced speech. The proposed method is based on the idea that the fundamental frequency signal can be generated by the reconstruction of the signal using the first few DCT coefficients. The critical window including the required number of DCT coefficients is identified using the accumulated energy function (AEF) of DCT. This AEF function is evaluated by computing the energies of the signals constructed by expanding the window each time to include the next DCT coefficient. Many work reported in the literature yield better estimates of these instants of excitation present in the voiced speech. However, their estimations are not restricted to voiced regions alone causing spurious instants in non-voiced regions. The major advantage of this method is its inbuilt extended ability to differentiate voiced/non-voiced speech preventing spurious instants in non-voiced regions. The well-known method for instants estimation, DYPSA, is reviewed, evaluated and, the new algorithm is compared with it using the CMU Arctic database. The results clearly show that the generation of 85% of the spurious instants in non-voiced region are avoided without any serious compromise in the identification rate(IDR). Moreover, this technique is time-effective and has a tuning parameter to control the miss rate(MR) and the false alarm rate(FAR).
Keywords :
discrete cosine transforms; signal reconstruction; speech processing; CMU Arctic database; DCT; DYPSA; accumulated energy function; discrete cosine transforms; false alarm rate; fundamental frequency signal; identification rate; instants estimation; miss rate; signal reconstruction; significant excitation; voiced regions; voiced speech; Algorithm design and analysis; Discrete cosine transforms; Estimation; Frequency estimation; Harmonic analysis; Speech; Tuning; DCT; instants of excitation; voiced/non-voiced discrimination;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
Type :
conf
DOI :
10.1109/ICSDA.2013.6709845
Filename :
6709845
Link To Document :
بازگشت