Title :
AusTalk — The Australian speech database: Design framework, recording experience and localisation
Author :
Alghowinem, Sharifa ; Wagner, Michael ; Goecke, Roland
Author_Institution :
Australian Nat. Univ., Canberra, ACT, Australia
Abstract :
To create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists, each contributing their disciplinary expertise. Three standardised one-hour audio-visual sessions were recorded for each of 1000 speakers around Australia, with diverse components suited to different research areas. The design of this database provides a good framework for any speech data corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as the problems faced and lessons learned. We also discuss localisation of this protocol and its potential customisation to other countries' specifications. We encourage the collection of such speech databases, including accent groups, to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.
Keywords :
audio databases; audio recording; protocols; speech processing; AusTalk design; AusTalk project; AusTalk recording protocol; Australian speech database; auditory-visual speech processing; country specifications; design framework; forensic voice comparison; one-hour audio-visual sessions; protocol localisation; recording experience; speaker recognition; speech data corpus collection; speech recognition; speech scientists; Australia; Databases; Protocols; Servers; Software; Speech; Speech recognition; Australian English; Speech corpus; audio-visual data; generalisation;
Conference_Title :
Information Technology in Asia (CITA), 2013 8th International Conference on
Conference_Location :
Kota Samarahan
Print_ISBN :
978-1-4799-1091-5
DOI :
10.1109/CITA.2013.6637567