• DocumentCode
    1891418
  • Title

    Analysis of the Effect of Training and Test Data on the Performance of Speech Recognition Systems

  • Author

    Wang, Xiangdong ; Xie, Feng ; Lin, Shouxun ; Qian, Yueliang ; Liu, Qun

  • Author_Institution
    Chinese Acad. of Sci., Beijing
  • fYear
    2007
  • fDate
    26-27 July 2007
  • Firstpage
    106
  • Lastpage
    110
  • Abstract
    In the field of speech recognition, performance varies much when the system is trained or tested with different data. In this paper, we explore the effect of training and test data on the performance of automatic speech recognition systems. Unlike other researchers who analyze the effect of training and testing as pattern learning and recognition of vectors, the effect of data is investigated as effect of data properties, such as SNR and kind of environmental noise. For a data property, a statistical model based on ANOVA was proposed to decompose the effect on system performance into three parts - effect of training data, test data and their interaction, and each part is considered dependent on the level of data properties. Experiments were conducted on a LVCSR system for the data properties of kind of noise and SNR, and results and analysis are presented to explain how they influence the performance by training and test.
  • Keywords
    speech recognition; statistical analysis; ANOVA model; LVCSR system; automatic speech recognition system; data property; large vocabulary continuous speech recognition system; statistical model; system performance; training effect; Analysis of variance; Automatic speech recognition; Automatic testing; Pattern analysis; Pattern recognition; Performance analysis; Signal to noise ratio; Speech analysis; Speech recognition; System testing; ANOVA; LVCSR; speech recognition; test data; training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pervasive Computing and Applications, 2007. ICPCA 2007. 2nd International Conference on
  • Conference_Location
    Birmingham
  • Print_ISBN
    978-1-4244-0971-6
  • Electronic_ISBN
    978-1-4244-0971-6
  • Type

    conf

  • DOI
    10.1109/ICPCA.2007.4365421
  • Filename
    4365421