DocumentCode :
3124878
Title :
A phone segmentation method and its evaluation on Mandarin speech corpus
Author :
Dac-Thang Hoang ; Hsiao-Chuan Wang
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
373
Lastpage :
377
Abstract :
This paper presents a phone segmentation method without a prior knowledge about the text contents. The proposed method is an unsupervised phone boundary detection based on band-energy tracing technique. It demonstrates a better performance than those previous works when the method was applied to TIMIT corpus. But the performance degrades when the method is applied to a Mandarin Chinese speech database, TCC300 corpus. The evaluation on this Mandarin speech corpus reveals some interesting facts that may cause the difficulty in detecting phone boundaries. We have proposed some ideas that may be helpful in future study for improving the phone segmentation method.
Keywords :
natural language processing; speech recognition; speech synthesis; text analysis; Mandarin Chinese speech database; Mandarin speech corpus; TCC300 corpus; TIMIT corpus; band-energy tracing technique; phone segmentation method; text contents; unsupervised phone boundary detection; Accuracy; Cepstral analysis; Educational institutions; Indexes; Manuals; Speech; band-energy tracing; phone segmentation; unsupervised phone boundary detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423515
Filename :
6423515
Link To Document :
بازگشت