Title :
Acoustic model selection using limited data for accent robust speech recognition
Author :
Najafian, Maryam ; Safavi, Saeid ; Hanani, Abualsoud ; Russell, Matthew
Author_Institution :
Sch. of EECE, Univ. of Birmingham, Birmingham, UK
Abstract :
This paper investigates techniques to compensate for the effects of regional accents of British English on automatic speech recognition (ASR) performance. Given a small amount of speech from a new speaker, is it better to apply speaker adaptation, or to use accent identification (AID) to identify the speaker´s accent followed by accent-dependent ASR? Three approaches to accent-dependent modelling are investigated: using the `correct´ accent model, choosing a model using supervised (ACCDIST-based) accent identification (AID), and building a model using data from neighbouring speakers in `AID space´. All of the methods outperform the accent-independent model, with relative reductions in ASR error rate of up to 44%. Using on average 43s of speech to identify an appropriate accent-dependent model outperforms using it for supervised speaker-adaptation, by 7%.
Keywords :
acoustic signal processing; learning (artificial intelligence); speaker recognition; AID space; ASR performance; British English; accent robust speech recognition; accent-dependent ASR; accent-dependent modelling; acoustic model selection; automatic speech recognition performance; correct accent model; limited data; neighbouring speakers; regional accent effect; supervised ACCDIST-based accent identification model; supervised speaker-adaptation; Acoustics; Adaptation models; Data models; Error analysis; Hidden Markov models; Speech; Speech recognition; accent identification; acoustic data selection; speech recognition;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon