DocumentCode :
590847
Title :
Composite-DBN for recognition of environmental contexts
Author :
Chu, S. ; Narayanan, Shrikanth ; Kuo, C.-C Jay
Author_Institution :
Oregon State Univ., Corvallis, OR, USA
fYear :
2012
fDate :
3-6 Dec. 2012
Firstpage :
1
Lastpage :
4
Abstract :
People´s behaviors are usually dictated by their surroundings. The surrounding environment affects the character and disposition of the people within it. The goal of our work is to automatically recognize the type of environments one is in. In this paper, we introduce a hierarchical structure to recognize environments using the surrounding audio. We can use this structure to discover high-level representations for different acoustic environments in a data-driven fashion. Being able to perform such function would allow us to better understand how we could utilize such information to assist in predicting a person´s emotion or behavior. To accurately make an informative decision about behaviors or emotions, it is important to have the ability to differentiate between different types of environments. Environmental sound contains large variances even within a single environment and is constantly changing. These changes and events are dynamic and inconsistent. The goal is to come up with models that is robust enough to generalize to different situations. Learning a hierarchy of sound types would improve and clarify problems caused by the confusion between multiple acoustic environments with similar characteristics. We propose a framework for a composite of deep belief networks (composite-DBNs) as a way to represent various levels of representations and to recognize twelve different types of common everyday environments. Experimental results demonstrate promising performance in improving the state of art recognition for acoustic environments.
Keywords :
acoustic applications; belief networks; emotion recognition; acoustic environments; composite-DBN; deep belief networks; environmental contexts; environmental sound; hierarchical structure; high-level representations; informative decision; people behaviors; sound types; surrounding audio; surrounding environment; Acoustics; Hidden Markov models; Neural networks; Noise measurement; Speech; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8
Type :
conf
Filename :
6411994
Link To Document :
بازگشت