DocumentCode
1649139
Title
AusTalk — The Australian speech database: Design framework, recording experience and localisation
Author
Alghowinem, Sharifa ; Wagner, Michael ; Goecke, Roland
Author_Institution
Australian Nat. Univ., Canberra, ACT, Australia
fYear
2013
Firstpage
1
Lastpage
7
Abstract
Aiming to create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists contributing their disciplinary expertise. Three standardised one-hour audio-visual sessions were recorded for each of 1000 speakers around Australia, with diverse components suited to different research areas. The design of this database provides a good framework for any speech data corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as the problems faced and lessons learned. We also discuss localisation of this protocol and its potential customisation to other countries' specifications. We encourage the collection of such speech databases, including accent groups, to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.
Keywords
audio databases; audio recording; protocols; speech processing; AusTalk design; AusTalk project; AusTalk recording protocol; Australian speech database; auditory-visual speech processing; country specifications; design framework; forensic voice comparison; one-hour audio-visual sessions; protocol localisation; recording experience; speaker recognition; speech data corpus collection; speech recognition; speech scientists; Australia; Databases; Protocols; Servers; Software; Speech; Speech recognition; Australian English; Speech corpus; audio-visual data; generalisation;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Technology in Asia (CITA), 2013 8th International Conference on
Conference_Location
Kota Samarahan
Print_ISBN
978-1-4799-1091-5
Type
conf
DOI
10.1109/CITA.2013.6637567
Filename
6637567