مرکز منطقه ای اطلاع رساني علوم و فناوري - Maximum likelihood Linear Dimension Reduction of heteroscedastic feature for robust Speaker Recognition

DocumentCode :

3674396

Title :

Maximum likelihood Linear Dimension Reduction of heteroscedastic feature for robust Speaker Recognition

Author :

Suwon Shon;Seongkyu Mun;David K. Han;Hanseok Ko

Author_Institution :

School of Electrical Engineering, Korea University, Seoul, Korea

fYear :

2015

Firstpage :

Lastpage :

Abstract :

This paper analyzes heteroscedasticity in i-vector for robust forensics and surveillance speaker recognition system. Linear Discriminant Analysis (LDA), a widely-used linear dimension reduction technique, assumes that classes are homoscedastic within a same covariance. In this paper it is assumed that general speech utterances contain both homoscedastic and heteroscedastic elements. We show the validity of this assumption by employing several analyses and also demonstrate that dimension reduction using principal components is feasible. To effectively handle the presence of heteroscedastic and homoscedastic elements, we propose a fusion approach of applying both LDA and Heteroscedastic-LDA (HLDA). The experiments are conducted to show its effectiveness and compare to other methods using the telephone database of National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) 2010 extended.

Keywords :

"Speech","Switches","Analytical models","Transforms","Computational modeling","Speech processing","Algorithm design and analysis"

Publisher :

ieee

Conference_Titel :

Advanced Video and Signal Based Surveillance (AVSS), 2015 12th IEEE International Conference on

Type :

conf

DOI :

10.1109/AVSS.2015.7301784

Filename :

7301784

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3674396