Author/Authors :
Qingwei، ZHAO نويسنده , , Xi، XIAO نويسنده , , Zuoying، WANG نويسنده , , Dajin، LU نويسنده ,
Abstract :
DDBHMM (Duration Distribution Based Hidden Markov Model) solved the defects of traditional HMM (Hidden Markov Model). Based on DDBHMM, the problem of how to effectively utilize the duration information is studied in detail. The approach on estimating the duration distribution is introduced first, then the data file is classified according to the speak rate. The recognition experiment shows that, the duration information behaves best on the data of low speak rate, behaves normal on the data of medfum speak rate and has little effect on the data of fast speak rate. Therefore, the nost importance of duration is that by it the more accurate state segmentation point could be obtained and then the recognition rate can be improved. At the same time, the robustness of the system to speaking rate is improved with the employment of the duration information. Furthermore, the method of classified duration and normalized duration is also put forward and studied in detail, it shows that both of the two method can improve the performance. In order to study the dependency between the duration, the method of using the Bigrain of the duration is proposed and analyzed. At last, the approach of post processing duration is studied, it shows that only based on DDBHMM, and utilizing the duration information synchronously in the recognition process, then the performance can be improved greatly.