• DocumentCode
    575629
  • Title

    Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks

  • Author

    Stanescu, Miruna ; Buzo, Andi ; Cucu, H. ; Burileanu, C.

  • Author_Institution
    University Politehnica of Bucharest, Bucharest, Romania
  • fYear
    2012
  • fDate
    12-14 Sept. 2012
  • Firstpage
    219
  • Lastpage
    222
  • Abstract
    This article provides a statistical phonetic analysis based on the largest Romanian text corpus collected so far for research purposes. Several types of phonetic events are analyzed: phones, diphones, triphones, and phone clusters based on the general classification of phones in the Romanian language. Some interesting conclusions are drawn, such as the fact that fewer than half of the diphones cover 99% of the whole text. The article also discusses some uses of these phonetic statistics in spoken language technology tasks.
  • Keywords
    speech processing; statistical analysis; Romanian language; Romanian text corpus; diphones; general classification; phone clusters; phonetic events; phonetic statistics; speech recognition; speech synthesis tasks; spoken language technology tasks; statistical phonetic analysis; triphones; Automatic speech recognition; Buildings; Databases; Speech; Speech processing; Training; Phonetic event; Spoken language technology; Text-to-speech
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    ELMAR, 2012 Proceedings
  • Conference_Location
    Zadar
  • ISSN
    1334-2630
  • Print_ISBN
    978-1-4673-1243-1
  • Type

    conf

  • Filename
    6338510