مرکز منطقه ای اطلاع رساني علوم و فناوري - On the integration of time-frequency masking speech separation and recognition in underdetermined environments

DocumentCode :

1804217

Title :

On the integration of time-frequency masking speech separation and recognition in underdetermined environments

Author :

Jafari, Ingrid ; Haque, Showera ; Togneri, Roberto ; Nordholm, Sven Erik

Author_Institution :

Univ. of Western Australia, Crawley, WA, Australia

fYear :

2012

fDate :

4-7 Nov. 2012

Firstpage :

1613

Lastpage :

1617

Abstract :

The successful application of automatic speech recognition systems in the real world is conditional on its ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of inteference. Previous research has identified time-frequency masking based approaches to blind source separation as a viable approach for multisource reverberant source separation. It is proposed the use of such separation techniques as a front-end to speech recognition will encourage greater recognition accuracy. Experimental evaluations confirmed the hypothesis with an improvement in recognition accuracy of over 20% at a reverberation time of RT₆₀ = 300ms; this is indicative of the potential for future research in this field.

Keywords :

blind source separation; speech recognition; automatic speech recognition systems; blind source separation; time frequency masking speech recognition; time frequency masking speech separation; underdetermined environments;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on

Conference_Location :

Pacific Grove, CA

ISSN :

1058-6393

Print_ISBN :

978-1-4673-5050-1

Type :

conf

DOI :

10.1109/ACSSC.2012.6489303

Filename :

6489303

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1804217