مرکز منطقه ای اطلاع رساني علوم و فناوري - Segmentation-by-classification system based on factor analysis

DocumentCode :

1653768

Title :

Segmentation-by-classification system based on factor analysis

Author :

Castan, Diego ; Ortega, Antonio ; Villalba, Jesus ; Miguel, A. ; Lleida, Eduardo

Author_Institution :

ViVoLab I3A, Univ. of Zaragoza, Zaragoza, Spain

fYear :

2013

Firstpage :

783

Lastpage :

787

Abstract :

This paper proposes a novel audio segmentation-by-classification system based on Factor Analysis (FA) with a channel compensation matrix for each class and scoring the fixed-length segments as the log-likelihood ratio between class/no-class. The scores are smoothed and the most probable sequence is computed with a Viterbi algorithm. The system described here is designed to segment and classify the audio files coming from broadcast programs into five different classes: speech (SP), speech with noise (SN), speech with music (SM), music (MU) or others (OT). This task was proposed in the Albayzin 2010 evaluation campaign. The system is compared with the winning system of the evaluation achieving lower error rate in SP and SN. These classes represent 3/4 of the total amount of the data. Therefore, the FA segmentation system gets a reduction in the average segmentation error rate.

Keywords :

audio signal processing; matrix algebra; signal classification; speech processing; Albayzin 2010 evaluation campaign; FA segmentation system; Viterbi algorithm; audio file classification; audio file segmentation; audio segmentation-by-classification system; average segmentation error rate; broadcast programs; channel compensation matrix; factor analysis; fixed-length segments; log-likelihood ratio; speech class; speech with music class; speech with noise class; Acoustics; Databases; Hidden Markov models; Measurement; Speech; Tin; Vectors; Albayzin-2010 Evaluation; Audio Segmentation; Broadcast News (BN); Channel Compensation; Factor Analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6637755

Filename :

6637755

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1653768