Title :
Data-driven phrasing for speech synthesis in low-resource languages
Author :
Parlikar, Alok ; Black, Alan W.
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
We present an approach to build phrase break prediction models when synthesizing text in low resource languages. This method allows building models without depending on the availability of part of speech taggers, or corpus with hand annotated breaks. We use the same speech data used for building a synthetic voice, to deduce acoustic phrase breaks. We perform unsupervised part of speech induction over a small text corpus in the language at hand. We use these tags and train a grammar based phrasing model. In this paper, we show results for the languages: English, Portuguese and Marathi, which suggest that we can quickly build very reasonable phrasing models for new languages using very little data.
Keywords :
speech synthesis; acoustic phrase break deduction; data-driven phrasing; grammar based phrasing model; hand annotated breaks; low-resource languages; phrase break prediction models; speech data; speech induction; speech synthesis; speech taggers; synthetic voice; text corpus; text synthesis; Data models; Educational institutions; Grammar; Histograms; Numerical models; Predictive models; Speech; Low Resource Languages; Phrase Break Prediction; Speech Synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288798