DocumentCode
2009221
Title
Human speech model based on information separation and its application to speech processing
Author
Minematsu, Nobuaki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
477
Lastpage
482
Abstract
This paper points out that no existing technically implemented speech model is adequate to describe one of the most fundamental and unique capacities of human speech processing. Infants acquire language through vocal imitation, but they do not impersonate their parents; they imitate only the linguistic and para-linguistic aspects of their parents' utterances. Vocal imitation is found in only a few animal species (birds, dolphins, and whales), and their imitation is basically acoustic. How can we represent exclusively what human infants imitate in utterances? An adequate speech model should be independent of extra-linguistic features and represent only the linguistic and para-linguistic aspects. We previously proposed a new speech model, called speech structure, which is mathematically proven to be invariant to any kind of transformation. Its extremely high independence of speaker differences was shown experimentally. In this paper, by reviewing studies in evolutionary anthropology and language disorders, we discuss the theoretical validity of the new model for describing this uniquely human capacity of speech processing.
Keywords
linguistics; speech processing; speech recognition; speech synthesis; human infant; human speech processing; information separation; language acquisition; linguistic; vocal imitation; Acoustics; Animals; Hidden Markov models; Humans; Mathematical model; Speech; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684477
Filename
5684477