Title :
Multi-modal service operation estimation using DNN-based acoustic bag-of-features
Author :
Satoshi Tamura;Takuya Uno;Masanori Takehara;Satoru Hayamizu;Takeshi Kurata
Author_Institution :
Department of Information Science Gifu University, Japan
Abstract :
In service engineering it is important to estimate when and what a worker did, because they include crucial evidences to improve service quality and working environments. For Service Operation Estimation (SOE), acoustic information is one of useful and key modalities; particularly environmental or background sounds include effective cues. This paper focuses on two aspects: (1) extracting powerful and robust acoustic features by using stacked-denoising-autoencoder and bag-of-feature techniques, and (2) investigating a multi-modal SOE scheme by combining the audio features and the other sensor data as well as non-sensor information. We conducted evaluation experiments using multi-modal data recorded in a restaurant. We improved SOE performance in comparison to conventional acoustic features, and effectiveness of our multimodal SOE scheme is also clarified.
Keywords :
"Feature extraction","Speech","Mel frequency cepstral coefficient","Signal processing","Estimation","Support vector machines"
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
DOI :
10.1109/EUSIPCO.2015.7362793