مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning auditory models of machine voices

DocumentCode :

2521025

Title :

Learning auditory models of machine voices

Author :

Dobson, Kelly ; Whitman, Brian ; Ellis, Daniel P W

Author_Institution :

MIT Media Lab, MIT, Cambridge, MA, USA

fYear :

2005

fDate :

16-19 Oct. 2005

Firstpage :

339

Lastpage :

342

Abstract :

Vocal imitation is often found useful in machine therapy sessions as it creates an emphatic relational bridge between human and machine. The feedback of the machine directly responding to the person´s imitation can strengthen the trust of this connection. However, vocal imitation of machines often bear little resemblance to the target due to physiological limitations. In practice, we need a way to detect human vocalization of machine sounds that can generalize to new machines. In this study we learn the relationship between vocal imitation of machine sounds and the target sounds to create a predictive model of vocalization of otherwise humanly impossible sounds. After training on a small set of machines and their imitations, we predict the correct target of a new set of imitations with high accuracy. The model outperforms distance metrics between human and machine sounds on the same task and takes into account auditory perception and constraints in vocal expression.

Keywords :

acoustic signal processing; audio signal processing; auditory perception; human vocalization; learning auditory models; machine sounds; machine therapy sessions; machine voices; physiological limitations; predictive model; vocal imitation; vocalization; Acoustic signal detection; Databases; Human voice; Instruments; Machine learning; Medical treatment; Multiple signal classification; Music; Predictive models; Psychology;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Applications of Signal Processing to Audio and Acoustics, 2005. IEEE Workshop on

Print_ISBN :

0-7803-9154-3

Type :

conf

DOI :

10.1109/ASPAA.2005.1540238

Filename :

1540238

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2521025