Title :
Considerations to Spoken Language Recognition for Text-to-Speech Applications
Author :
Raf, M. Saadeq ; Jafari, Somayeh ; Ahmadi, Hesamoddin Shahriari ; Jafari, Masumeh
fDate :
March 30 2011-April 1 2011
Abstract :
There have been a great deal of discussions throughout the past few years over whether or not Text-to-Speech (TTS) machines can synthesize natural synthetic speech from texts. Aiming to address this need, in this paper we have considered some significant viewpoints of Spoken Language Processing (SLP) on the phonetic transcription of each word for the preprocessing of text-to-speech synthesis. On the other hand, lack of researches on automatic language detection for text transcription in different languages by considering the phonology of that language, motivated us to make a text language identifier system for commonly-used contexts. It therefore encouraged us to conduct a novel research into semi-English texts that people from different nations often use in their conversational transcripts such as email or chat via the Internet or cell phone text messaging services. In this research, we have investigated the language of text sequences by employing phonotactics rules, chiefly on Finglish (a portmanteau term combining Farsi and English) which is an alternative writing format for the Farsi Language, which has its own orthographic system, by means of English letters. As this is the first paper regarding Finglish texts, it can be tremendously exciting to enhance the text-to-speech synthesis systems created by advanced Digital Signal Processing (DSP) algorithms to specifying the language of each sentence in the first place. Finally, we have also proposed highly recommended writing rules for Finglish for it to be easily understood, translated to English and converted to natural speech.
Keywords :
speech recognition; speech synthesis; text analysis; English letters; Farsi Language; Finglish; automatic language detection; digital signal processing; natural synthetic speech; orthographic system; phonetic transcription; phonotactics rules; portmanteau term; semi-English texts; spoken language processing; spoken language recognition; text language identifier system; text transcription; text-to-speech application; text-to-speech machines; text-to-speech synthesis systems; Dictionaries; Google; Spectrogram; Speech; Speech processing; Speech recognition; Stress; Digital Signal Processing (DSP); Finglish; Phonology; Semi-English Language Recognition for Text Sequences (SELRTS); Spoken Language Processing (SLP); Text-to-Speech (TTS);
Conference_Titel :
Computer Modelling and Simulation (UKSim), 2011 UkSim 13th International Conference on
Conference_Location :
Cambridge
Print_ISBN :
978-1-61284-705-4
Electronic_ISBN :
978-0-7695-4376-5
DOI :
10.1109/UKSIM.2011.64