DocumentCode :
672831
Title :
The voice bank corpus: Design, collection and data analysis of a large regional accent speech database
Author :
Veaux, Christophe ; Yamagishi, Junichi ; King, Simon
Author_Institution :
Centre for Speech Technol. Res. (CSTR), Univ. of Edinburgh, Edinburgh, UK
fYear :
2013
fDate :
25-27 Nov. 2013
Firstpage :
1
Lastpage :
4
Abstract :
The University of Edinburgh has started the development of a new speech database, the Voice Bank corpus, specifically designed for the creation of personalised synthetic voices for individuals with speech disorders. This corpus already constitutes the largest corpora of British English currently in existence, with more than 300 hours of recordings from approximately 500 healthy speakers. New recordings are continuously being made in order to get the best coverage of the different combinations of regional accents, social classes, age and gender across Britain. This paper describes the motivation and the processes involved in the design and recording of this corpus as well as some analysis of its content. The paper concludes with our future plans to further extend this corpus and to overcome its current limitations.
Keywords :
audio databases; data analysis; handicapped aids; medical disorders; natural language processing; speech synthesis; text analysis; British English; University of Edinburgh; Voice Bank collection; Voice Bank corpus design; age issue; data analysis; gender; large regional accent speech database; social classes; speech disorders; Databases; Educational institutions; Hidden Markov models; Optimization; Recruitment; Speech; Speech synthesis; Corpus Design; Speech Synthesis; Text Selection; Voice Banking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
Type :
conf
DOI :
10.1109/ICSDA.2013.6709856
Filename :
6709856
Link To Document :
بازگشت