DocumentCode :
2179471
Title :
Improved pos tagging for text-to-speech synthesis
Author :
Sun, Ming ; Bellegarda, Jerome R.
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5384
Lastpage :
5387
Abstract :
One of the fundamental building blocks of text processing for text to-speech (TTS) synthesis is the assignment of a part-of-speech (POS) tag to each input word. POS tags are heavily relied upon for downstream natural language analysis and prosody rendering. Conventional TTS POS tagging tends to resort to detailed hand crafted rules that can accommodate TTS specificities such as pertinent prosodic features, while mainstream tagging increasingly relies on data-driven statistical models trained on large but fairly generic corpora. This paper proposes a new strategy, hybrid POS tagging, which integrates these two approaches in order to achieve higher tagging accuracy. The resulting framework combines the TTS-specific advantage of rule-based tagging with the inherent robustness of broadly-trained statistical tagging. Empirical evidence underscores the viability of this framework for improving TTS quality, e.g., in regard to phrase boundary placement and homograph selection.
Keywords :
speech synthesis; text analysis; word processing; POS tagging; TTS; broadly-trained statistical tagging; downstream natural language analysis; homograph selection; phrase boundary placement; text processing; text-to-speech synthesis; Accuracy; Context; Error analysis; Hidden Markov models; Natural languages; Tagging; Training; Speech synthesis; part-of-speech disambiguation; statistical/rule-based/hybrid tagging; syntactic analysis; text processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947575
Filename :
5947575
Link To Document :
بازگشت