DocumentCode
148597
Title
Voice segmentation system based on energy estimation
Author
Rocha, Raissa B. ; Freire, Virginio V. ; Alencar, Marcelo S.
Author_Institution
Inst. of Adv. Studies in Commun. (Iecom), Fed. Univ. of Campina Grande (UFCG), Campina Grande, Brazil
fYear
2014
fDate
1-5 Sept. 2014
Firstpage
860
Lastpage
864
Abstract
Voice segmentation is used in speech recognition and synthesis systems, as well as in phonetic voice encoders. This paper describes an implicit speech segmentation system that estimates the boundaries between phonemes in a locution. To find the segmentation marks, the proposed method first locates reference borders at the transitions from silent periods to phonemes, and from phonemes to silent periods, by measuring energy over short-duration periods. The phonetic boundaries are then found by means of energy encoding in the region delimited by these reference marks. To evaluate the performance of the proposed system, an objective evaluation using 50 locutions was performed. The system detected 72.41% of the segmentation marks; of these, 77.6% were detected with an error less than or equal to 10 ms, and 22.4% with an error between 10 and 20 ms.
Keywords
speech recognition; speech synthesis; energy estimation; implicit speech segmentation system; phonemes; phonetic voice encoders; silent periods; speech system synthesis; voice segmentation system; Acoustics; Databases; Hidden Markov models; Manuals; Speech; Speech processing; Speech recognition; Voice segmentation; energy detection; objective evaluation
fLanguage
English
Publisher
IEEE
Conference_Titel
2014 Proceedings of the 22nd European Signal Processing Conference (EUSIPCO)
Conference_Location
Lisbon
Type
conf
Filename
6952271