• DocumentCode
    2818536
  • Title

    Automatic transcription and speech recognition of Romanian corpus RO-GRID

  • Author

    Giurgiu, Mircea ; Kabir, Ahsanul

  • Author_Institution
    Telecommun. Dept., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
  • fYear
    2012
  • fDate
    3-4 July 2012
  • Firstpage
    465
  • Lastpage
    468
  • Abstract
    The results reported in this paper assess the ability of Hidden Markov Model (HMM) based method to generate accurate and reliable automatic phone-level transcriptions for a small vocabulary speech corpus such as RO-GRID. The system requires only orthographic transcription of the target corpus, and can be bootstrapped from models trained just on few amount of data in the transcribed corpus. For this purpose, an automatic time-aligned phone transcription toolbox has been developed and tested on the Romanian corpus and also validated on an English corpus. The quality of transcriptions is judged by evaluating the statistical parameters of the error between the automatic and manual transcription. The transcriptions generated from the most reliable system deviate from the average manual transcription by an average of 20 ms. The system is also able to convert the generated transcription from HTK format into PRAAT format for further manipulation of the speech signal.
  • Keywords
    hidden Markov models; natural language processing; speech processing; speech recognition; English corpus; HMM based method; HTK format; PRAAT format; RO-GRID; Romanian corpus; automatic phone-level transcriptions; automatic time-aligned phone transcription toolbox; bootstrapping; hidden Markov model; manual transcription; orthographic transcription; speech recognition; speech signal manipulation; statistical error parameter evaluation; transcription quality; vocabulary speech corpus; Adaptation models; Hidden Markov models; Manuals; Speech; Speech recognition; Standards; Training; Automatic speech transcription; Hidden Markov Models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications and Signal Processing (TSP), 2012 35th International Conference on
  • Conference_Location
    Prague
  • Print_ISBN
    978-1-4673-1117-5
  • Type

    conf

  • DOI
    10.1109/TSP.2012.6256337
  • Filename
    6256337