مرکز منطقه ای اطلاع رساني علوم و فناوري - Analysis of effect of single-channel speech-music separation using NMF to automatic speech recognition

DocumentCode :

152909

Title :

Analysis of effect of single-channel speech-music separation using NMF to automatic speech recognition

Author :

Demir, Cemil ; Cemgil, A.T. ; Saraclar, Murat

Author_Institution :

BILGEM, TUBITAK, Kocaeli, Turkey

fYear :

2014

fDate :

23-25 April 2014

Firstpage :

1818

Lastpage :

1821

Abstract :

In this study, single-channel speech source separation is carried out to separate the speech from the background music, which degrades the speech recognition performance especially in broadcast news transcription systems. Since the separation is done using single observation of the source signals, the sources have to be previously modeled using training data. Non-negative Matrix Factorization (NMF) methods are used to model the sources. In order to model the source signals, different training data sets, which contain different music and speech data, are created and the effect of the training data sets are analyzed in this study. The performances of the methods are measured not only using separation performance measure but also with speech recognition performance measures.

Keywords :

broadcast channels; matrix decomposition; music; source separation; speech recognition; NMF method; automatic speech recognition; broadcast news transcription system; music data; nonnegative matrix factorization method; single channel speech source separation; source signal modelling; speech data; training data set; Conferences; Kuiper belt; Source separation; Sparse matrices; Speech; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing and Communications Applications Conference (SIU), 2014 22nd

Conference_Location :

Trabzon

Type :

conf

DOI :

10.1109/SIU.2014.6830605

Filename :

6830605

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=152909