DocumentCode
1797161
Title
Automatic cell phone recognition from speech recordings
Author
Ling Zou ; Jichen Yang ; Tangsen Huang
Author_Institution
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear
2014
fDate
9-13 July 2014
Firstpage
621
Lastpage
625
Abstract
Recording device recognition is an important research field of digital audio forensic. In this paper, we utilize Gaussian mixture model-universal background model (GMM-UBM) as the classifier to form a recording device recognition system. We examine the performance of Mel-frequency cepstral coefficients (MFCCs) and Power-normalized cepstral coefficients (PNCCs) to this problem. Experiments conducted on recordings come from 14 cell phones show that MFCCs are more effective than PNCCs in cell phone recognition. We find that the identification performance can be improved by stacking MFCCs and energy feature. We also investigate the effect of speaker mismatch and de-noising processing for acoustic feature to this problem. The highest identification accuracy achieved here is 97.71%.
Keywords
Gaussian processes; audio recording; mobile handsets; speech recognition; GMM-UBM; Gaussian mixture model-universal background model; MFCC; Mel-frequency cepstral coefficients; PNCCs; acoustic feature; automatic cell phone recognition; denoising processing; digital audio forensic; power normalized cepstral coefficients; recording device recognition; speaker mismatch; speech recordings; Accuracy; Cellular phones; Forensics; Object recognition; Speech; Speech recognition; Training; Cell phone identification; Gaussian mixture model-universal background model (GMM-UBM); Mel-frequency cepstral coefficients (MFCCs); Power-normalized cepstral coefficients (PNCCs);
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-4799-5401-8
Type
conf
DOI
10.1109/ChinaSIP.2014.6889318
Filename
6889318
Link To Document