Title :
Segmentation-by-classification system based on factor analysis
Author :
Castan, Diego ; Ortega, Antonio ; Villalba, Jesus ; Miguel, A. ; Lleida, Eduardo
Author_Institution :
ViVoLab I3A, Univ. of Zaragoza, Zaragoza, Spain
Abstract :
This paper proposes a novel audio segmentation-by-classification system based on Factor Analysis (FA) with a channel compensation matrix for each class and scoring the fixed-length segments as the log-likelihood ratio between class/no-class. The scores are smoothed and the most probable sequence is computed with a Viterbi algorithm. The system described here is designed to segment and classify the audio files coming from broadcast programs into five different classes: speech (SP), speech with noise (SN), speech with music (SM), music (MU) or others (OT). This task was proposed in the Albayzin 2010 evaluation campaign. The system is compared with the winning system of the evaluation achieving lower error rate in SP and SN. These classes represent 3/4 of the total amount of the data. Therefore, the FA segmentation system gets a reduction in the average segmentation error rate.
Keywords :
audio signal processing; matrix algebra; signal classification; speech processing; Albayzin 2010 evaluation campaign; FA segmentation system; Viterbi algorithm; audio file classification; audio file segmentation; audio segmentation-by-classification system; average segmentation error rate; broadcast programs; channel compensation matrix; factor analysis; fixed-length segments; log-likelihood ratio; speech class; speech with music class; speech with noise class; Acoustics; Databases; Hidden Markov models; Measurement; Speech; Tin; Vectors; Albayzin-2010 Evaluation; Audio Segmentation; Broadcast News (BN); Channel Compensation; Factor Analysis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6637755