• DocumentCode
    1649139
  • Title

    AusTalk — The Australian speech database: Design framework, recording experience and localisation

  • Author

    Alghowinem, Sharifa ; Wagner, Michael ; Goecke, Roland

  • Author_Institution
    Australian Nat. Univ., Canberra, ACT, Australia
  • fYear
    2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Aiming to create a comprehensive Australian speech database, the “AusTalk” project was carefully designed by 30 speech scientists contributing their disciplinary expertise. Standardised three one-hour audio-visual sessions for each of 1000 speakers around Australia were recorded having diverse components suitable for different research areas. The design of this database provides a good framework for any speech data corpus collection. In this paper, we present the AusTalk design and recording protocol, as well as problems faced and lessons learned. Localisation of this protocol and the potential customisation based on other countries' specifications are discussed. Collecting such speech databases including accent groups is encouraged to boost speech research in areas such as linguistics, speech and speaker recognition, forensic voice comparison, auditory-visual speech processing and many more.
  • Keywords
    audio databases; audio recording; protocols; speech processing; AusTalk design; AusTalk project; AusTalk recording protocol; Australian speech database; auditory-visual speech processing; country specifications; design framework; forensic voice comparison; one-hour audio-visual sessions; protocol localisation; recording experience; speaker recognition; speech data corpus collection; speech recognition; speech scientists; Australia; Databases; Protocols; Servers; Software; Speech; Speech recognition; Australian English; Speech corpus; audio-visual data; generalisation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Information Technology in Asia (CITA), 2013 8th International Conference on
  • Conference_Location
    Kota Samarahan
  • Print_ISBN
    978-1-4799-1091-5
  • Type

    conf

  • DOI
    10.1109/CITA.2013.6637567
  • Filename
    6637567