Title :
Stochastic perceptual speech models with durational dependence
Author :
Bilmes, Jeff ; Morgan, Nelson ; Wu, Su-Lin ; Bourlard, Hervé
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA, USA
Abstract :
In (Morgan et al., 1994), we developed a statistical model of speech recognition where emphasis was placed on the perceptually-relevant and information-rich portion of the speech signal. In that model, speech is viewed as a sequence of elementary decisions or auditory events (avents) that are made in response to loci of significant spectral change. These decision points are interleaved with periods during which insufficient information has been accumulated to make the next decision. We have called this a stochastic perceptual avent model, or SPAM. In the work reported, we have extended our initial experimental implementation to include other probabilistic dependencies specified in the original theory, particularly the dependence on the time from the current frame back to the previous hypothesized avent
Keywords :
hidden Markov models; probability; spectral analysis; speech recognition; statistical analysis; stochastic processes; SPAM recognition model; auditory events; durational dependence; hidden Markov model; hypothesized avent; information-rich; perceptually-relevant; probabilistic dependencies; spectral change; speech recognition; speech signal; statistical model; stochastic perceptual avent model; stochastic perceptual speech models; time; Acoustics; Computer science; Databases; Hidden Markov models; Power system modeling; Random processes; Speech recognition; Stochastic processes; Stochastic systems; Unsolicited electronic mail;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607851