مرکز منطقه ای اطلاع رساني علوم و فناوري - Data-driven phrasing for speech synthesis in low-resource languages

DocumentCode :

3161500

Title :

Data-driven phrasing for speech synthesis in low-resource languages

Author :

Parlikar, Alok ; Black, Alan W.

Author_Institution :

Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4013

Lastpage :

4016

Abstract :

We present an approach to build phrase break prediction models when synthesizing text in low resource languages. This method allows building models without depending on the availability of part of speech taggers, or corpus with hand annotated breaks. We use the same speech data used for building a synthetic voice, to deduce acoustic phrase breaks. We perform unsupervised part of speech induction over a small text corpus in the language at hand. We use these tags and train a grammar based phrasing model. In this paper, we show results for the languages: English, Portuguese and Marathi, which suggest that we can quickly build very reasonable phrasing models for new languages using very little data.

Keywords :

speech synthesis; acoustic phrase break deduction; data-driven phrasing; grammar based phrasing model; hand annotated breaks; low-resource languages; phrase break prediction models; speech data; speech induction; speech synthesis; speech taggers; synthetic voice; text corpus; text synthesis; Data models; Educational institutions; Grammar; Histograms; Numerical models; Predictive models; Speech; Low Resource Languages; Phrase Break Prediction; Speech Synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288798

Filename :

6288798

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3161500