Human speech model based on information separation and its application to speech processing

Author

Minematsu, Nobuaki

Author_Institution

Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan

fYear

2010

fDate

Nov. 29 2010-Dec. 3 2010

Firstpage

477

Lastpage

482

Abstract

This paper points out that no existing technically-implemented speech model is adequate enough to describe one of the most fundamental and unique capacities of human speech processing. Language acquisition of infants is based on vocal imitation but they don´t impersonate their parents and imitate only the linguistic and para-linguistic aspects of the parents´ utterances. The vocal imitation is found only in a few species of animals: birds, dolphins, and whales, but their imitation is basically acoustic imitation. How to represent exclusively what in the utterances human infants imitate? An adequate speech model should be independent of the extra-linguistic features and represents only the linguistic and para-linguistc aspects. We already proposed a new speech model, called speech structure, which is proved mathematically to be invariant with any kind of transformation. Its extremely high independence of speaker differences was shown experimentally. In this paper, by reviewing studies of evolutionary anthropology and language disorders, we discuss the theoretical validity of the new model to describe the human-unique capacity of speech processing.

Keywords

linguistics; speech processing; speech recognition; speech synthesis; human infant; human speech processing; information separation; language acquisition; linguistic; vocal imitation; Acoustics; Animals; Hidden Markov models; Humans; Mathematical model; Speech; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location

Tainan

Print_ISBN

978-1-4244-6244-5

Type

conf

DOI

10.1109/ISCSLP.2010.5684477

Filename

5684477