• DocumentCode
    2197919
  • Title

    Automatic Segmentation and Labeling for Spontaneous Standard Malay Speech Recognition

  • Author

    Seman, Noraini ; Jusoff, Kamaruzaman

  • Author_Institution
    Fac. of Inf. Technol. & Quantitative Sci., MARA Univ. of Technol., Shah Alam, Malaysia
  • fYear
    2008
  • fDate
    20-22 Dec. 2008
  • Firstpage
    59
  • Lastpage
    63
  • Abstract
    In this paper, we proposed an automatically segmenting and transcribing spontaneous speech signal without the use of manually annotated speech database. The spontaneous speech signal is first segmented into syllable-like units by considering short-term energy as a magnitude spectrum of some arbitrary signal. Similar syllable segments are then grouped together using an unsupervised incremental clustering technique. Separate models are generated for each cluster of syllable segments. At this stage, labels are assigned for each group of syllable segments manually. The syllable models of these clusters are then used to transcribe or recognize the spontaneous speech signal of closed-set speakers´ data as well open-set speaker data. As a syllable recognizer, our initial results on Standard Malay television (TV3) news bulletins of the native and non-native speakers shows that the performance is 42.53% and 30.8% respectively.
  • Keywords
    natural languages; pattern clustering; speaker recognition; speech processing; unsupervised learning; automatic speech labeling; automatic speech segmentation; speaker recognition; spontaneous standard Malay speech recognition; syllable segment; unsupervised incremental clustering technique; Automatic speech recognition; Convergence; Data engineering; Databases; Delay; Humans; Information technology; Labeling; Speech recognition; Training data; Segmentation; incremental clustering; labeling; spontaneous speech.; syllable-like units;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on
  • Conference_Location
    Phuket
  • Print_ISBN
    978-0-7695-3489-3
  • Type

    conf

  • DOI
    10.1109/ICACTE.2008.150
  • Filename
    4736922