DocumentCode :
2407153
Title :
Unsupervised phone segmentation method using delta spectral function
Author :
Hoang, Dac-Thang ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear :
2011
fDate :
26-28 Oct. 2011
Firstpage :
152
Lastpage :
156
Abstract :
Unsupervised phone segmentation means that the phone boundaries in an utterance are detected without prior knowledge of the text content. Usually, a spectral change in the speech signal implies the existence of a phone boundary. In this paper, the Delta Spectral Function (DSF) is defined for each frame to represent the variation of band energy in a specific band. The bands that give the highest DSF values in a frame are then chosen to define a measure of spectral change. The chosen bands are not fixed; they are selected dynamically, frame by frame. The peaks of the spectral change curve are treated as candidate boundaries. A fine-tuning procedure is then applied to select the peaks that become the detected boundaries. The proposed method achieves an F-value of 75.3% under the condition of near-zero over-segmentation. In this situation the recall rate is 75.3%. This experimental result is better than many previously reported results. In addition, the computation is simple and the proposed method is easy to implement.
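The pipeline outlined in the abstract (per-band DSF, per-frame selection of the strongest bands, peak picking on the resulting curve) can be illustrated with a minimal Python sketch. The abstract does not give the exact formulas, so everything specific here is an assumption: the DSF is taken to be the absolute difference in log band energy between the following and preceding frame, the spectral change measure is the sum of the `top_k` largest DSF values, and a simple threshold with a minimum gap stands in for the paper's fine-tuning procedure. The names `spectral_change`, `pick_peaks`, `top_k`, `threshold`, and `min_gap` are hypothetical.

```python
import numpy as np


def spectral_change(log_band_energy, top_k=3):
    """Return a per-frame spectral change curve.

    log_band_energy: array of shape (n_frames, n_bands).
    Assumption: DSF is the absolute energy difference between the
    next and the previous frame, computed separately for each band.
    """
    e = np.asarray(log_band_energy, dtype=float)
    dsf = np.zeros_like(e)
    dsf[1:-1] = np.abs(e[2:] - e[:-2])
    # Bands are chosen frame by frame: keep the top_k largest DSF
    # values in each frame and sum them (assumed combination rule).
    strongest = np.sort(dsf, axis=1)[:, -top_k:]
    return strongest.sum(axis=1)


def pick_peaks(curve, threshold, min_gap=2):
    """Return indices of local maxima at or above threshold.

    Stand-in for the fine-tuning step: peaks closer than min_gap
    frames are merged, keeping the higher one.
    """
    peaks = []
    for t in range(1, len(curve) - 1):
        is_peak = curve[t] > curve[t - 1] and curve[t] >= curve[t + 1]
        if not is_peak or curve[t] < threshold:
            continue
        if peaks and t - peaks[-1] < min_gap:
            if curve[t] > curve[peaks[-1]]:
                peaks[-1] = t
        else:
            peaks.append(t)
    return peaks


if __name__ == "__main__":
    # Synthetic example: two bands jump in energy at frame 10.
    energy = np.zeros((20, 8))
    energy[10:, [2, 5]] = 5.0
    curve = spectral_change(energy, top_k=3)
    print(pick_peaks(curve, threshold=1.0))
```

In practice the input would come from a filter bank applied to short overlapping frames, and `top_k`, `threshold`, and `min_gap` would need tuning on held-out data.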
Keywords :
spectral analysis; speech processing; band energy; delta spectral function; phone boundary; spectral change curve; speech signal; tune procedure; unsupervised phone segmentation method; Accuracy; Cepstral analysis; Energy measurement; Filter banks; Hidden Markov models; Speech; Training; delta spectral function; phone boundary; spectral change; unsupervised phone segmentation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
Type :
conf
DOI :
10.1109/ICSDA.2011.6085998
Filename :
6085998