• DocumentCode
    1652197
  • Title

    An undergraduate Mandarin speech database for speaker recognition research

  • Author

    Hong Wang ; Jin´gui, P.

  • Author_Institution
    State Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
  • fYear
    2009
  • Firstpage
    94
  • Lastpage
    99
  • Abstract
    This paper describes the development of a new speech database for speaker recognition research, UMSD (undergraduate Mandarin speech database). In UMSD, there are total 12 sessions of utterances for each of the selected 24 undergraduate students, while all recordings are conducted in different session intervals. The phonetically balanced corpus content include isolated digits (0~9), digit strings (5 phone numbers and 2 postal codes), words and phrases with different length from 1 to 10 characters (10 for each given length), the Chinese Phonetic Alphabet Table (21 Initials and 35 Finals), 2 ancient poems and a 200 words paragraph extracted from a well-known essay. Additionally, in order to effectively extract and process the interesting speech segments from UMSD, a speech database management system has been proposed on the base of MATLAB and MS-ACCESS. Results of preliminary evaluation show that the performance attained with UMSD is good, it not only meets the needs of our own recent effort in text-dependent and text-independent speaker recognition, but also allows the further research of the long term intra-speaker variability thanks to its multi-session records with different session intervals.
  • Keywords
    database management systems; speech recognition; Chinese phonetic alphabet table; MATLAB; MS-ACCESS; multisession records; phonetically balanced corpus content; speaker recognition research; speech database management system; undergraduate Mandarin speech database; Application software; Appropriate technology; Database systems; Laboratories; MATLAB; Natural languages; Speaker recognition; Speech analysis; Speech processing; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
  • Conference_Location
    Urumqi
  • Print_ISBN
    978-1-4244-4400-7
  • Electronic_ISBN
    978-1-4244-4400-7
  • Type

    conf

  • DOI
    10.1109/ICSDA.2009.5278370
  • Filename
    5278370