• DocumentCode
    148597
  • Title

    Voice segmentation system based on energy estimation

  • Author

    Rocha, Raissa B. ; Freire, Virginio V. ; Alencar, Marcelo S.

  • Author_Institution
    Inst. of Adv. Studies in Commun. (Iecom), Fed. Univ. of Campina Grande (UFCG), Campina Grande, Brazil
  • fYear
    2014
  • fDate
    1-5 Sept. 2014
  • Firstpage
    860
  • Lastpage
    864
  • Abstract
    Voice segmentation is used in speech recognition and system synthesis, as well as in phonetic voice encoders. This paper describes an implicit speech segmentation system, which aims to estimate the boundaries between phonemes in a locution. To find the segmentation marks, the proposed method initially locates reference borders between silent periods and phonemes, and vice versa measuring energy in short duration periods. The phonetic boundaries are found by means of energy encoding in the region delimited by the reference marks, which were initially detected. To evaluate the performance of the proposed system, an objective evaluation using 50 locutions was performed. The system detected 72.41% of the segmentation marks, in which, 77.6% were detected with an error less or equal to 10 ms and 22.4% of the boundaries were found with an error between 10 and 20 ms.
  • Keywords
    speech recognition; speech synthesis; energy estimation; implicit speech segmentation system; phonemes; phonetic voice encoders; silent periods; speech recognition; speech system synthesis; voice segmentation system; Acoustics; Databases; Hidden Markov models; Manuals; Speech; Speech processing; Speech recognition; Voice segmentation; energy detection; objective evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
  • Conference_Location
    Lisbon
  • Type

    conf

  • Filename
    6952271