DocumentCode :
3161500
Title :
Data-driven phrasing for speech synthesis in low-resource languages
Author :
Parlikar, Alok ; Black, Alan W.
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4013
Lastpage :
4016
Abstract :
We present an approach to build phrase break prediction models when synthesizing text in low resource languages. This method allows building models without depending on the availability of part of speech taggers, or corpus with hand annotated breaks. We use the same speech data used for building a synthetic voice, to deduce acoustic phrase breaks. We perform unsupervised part of speech induction over a small text corpus in the language at hand. We use these tags and train a grammar based phrasing model. In this paper, we show results for the languages: English, Portuguese and Marathi, which suggest that we can quickly build very reasonable phrasing models for new languages using very little data.
Keywords :
speech synthesis; acoustic phrase break deduction; data-driven phrasing; grammar based phrasing model; hand annotated breaks; low-resource languages; phrase break prediction models; speech data; speech induction; speech synthesis; speech taggers; synthetic voice; text corpus; text synthesis; Data models; Educational institutions; Grammar; Histograms; Numerical models; Predictive models; Speech; Low Resource Languages; Phrase Break Prediction; Speech Synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288798
Filename :
6288798
Link To Document :
بازگشت