مرکز منطقه ای اطلاع رساني علوم و فناوري - Considerations to Spoken Language Recognition for Text-to-Speech Applications

DocumentCode :

2998797

Title :

Considerations to Spoken Language Recognition for Text-to-Speech Applications

Author :

Raf, M. Saadeq ; Jafari, Somayeh ; Ahmadi, Hesamoddin Shahriari ; Jafari, Masumeh

fYear :

2011

fDate :

March 30 2011-April 1 2011

Firstpage :

304

Lastpage :

309

Abstract :

There have been a great deal of discussions throughout the past few years over whether or not Text-to-Speech (TTS) machines can synthesize natural synthetic speech from texts. Aiming to address this need, in this paper we have considered some significant viewpoints of Spoken Language Processing (SLP) on the phonetic transcription of each word for the preprocessing of text-to-speech synthesis. On the other hand, lack of researches on automatic language detection for text transcription in different languages by considering the phonology of that language, motivated us to make a text language identifier system for commonly-used contexts. It therefore encouraged us to conduct a novel research into semi-English texts that people from different nations often use in their conversational transcripts such as email or chat via the Internet or cell phone text messaging services. In this research, we have investigated the language of text sequences by employing phonotactics rules, chiefly on Finglish (a portmanteau term combining Farsi and English) which is an alternative writing format for the Farsi Language, which has its own orthographic system, by means of English letters. As this is the first paper regarding Finglish texts, it can be tremendously exciting to enhance the text-to-speech synthesis systems created by advanced Digital Signal Processing (DSP) algorithms to specifying the language of each sentence in the first place. Finally, we have also proposed highly recommended writing rules for Finglish for it to be easily understood, translated to English and converted to natural speech.

Keywords :

speech recognition; speech synthesis; text analysis; English letters; Farsi Language; Finglish; automatic language detection; digital signal processing; natural synthetic speech; orthographic system; phonetic transcription; phonotactics rules; portmanteau term; semi-English texts; spoken language processing; spoken language recognition; text language identifier system; text transcription; text-to-speech application; text-to-speech machines; text-to-speech synthesis systems; Dictionaries; Google; Spectrogram; Speech; Speech processing; Speech recognition; Stress; Digital Signal Processing (DSP); Finglish; Phonology; Semi-English Language Recognition for Text Sequences (SELRTS); Spoken Language Processing (SLP); Text-to-Speech (TTS);

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Modelling and Simulation (UKSim), 2011 UkSim 13th International Conference on

Conference_Location :

Cambridge

Print_ISBN :

978-1-61284-705-4

Electronic_ISBN :

978-0-7695-4376-5

Type :

conf

DOI :

10.1109/UKSIM.2011.64

Filename :

5754231

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2998797