Title :
A phone segmentation method and its evaluation on Mandarin speech corpus
Author :
Dac-Thang Hoang ; Hsiao-Chuan Wang
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
This paper presents a phone segmentation method without a prior knowledge about the text contents. The proposed method is an unsupervised phone boundary detection based on band-energy tracing technique. It demonstrates a better performance than those previous works when the method was applied to TIMIT corpus. But the performance degrades when the method is applied to a Mandarin Chinese speech database, TCC300 corpus. The evaluation on this Mandarin speech corpus reveals some interesting facts that may cause the difficulty in detecting phone boundaries. We have proposed some ideas that may be helpful in future study for improving the phone segmentation method.
Keywords :
natural language processing; speech recognition; speech synthesis; text analysis; Mandarin Chinese speech database; Mandarin speech corpus; TCC300 corpus; TIMIT corpus; band-energy tracing technique; phone segmentation method; text contents; unsupervised phone boundary detection; Accuracy; Cepstral analysis; Educational institutions; Indexes; Manuals; Speech; band-energy tracing; phone segmentation; unsupervised phone boundary detection;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423515