Title :
Fast browsing of speech material for digital library and distance learning
Author :
Wong, Peter H W ; Au, Oscar C.
Author_Institution :
Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., Kowloon, Hong Kong
Abstract :
In digital library and distance learning applications, one usually needs to search through lots of speech material. While content-based retrieval techniques can help to identify possible matching items, the person would usually need to browse through the items quickly before making decisions on whether the items are useful or not. As a result, fast speech browsing techniques are highly desirable. In this paper we discuss problems of fast playback of speech materials and overview some existing time scale modification (TSM) techniques. We propose some novel modifications of TSM to make it much more effective in fast browsing of speech materials, especially those with irregular speech tempo. The proposed algorithm includes silent period removal, gain equalization and locally adaptive TSM. Simulation results show that the proposed algorithm can increase the intelligibility of the fast playback speech materials significantly
Keywords :
Internet; computer aided instruction; speech intelligibility; speech processing; digital library; distance learning; gain equalization; intelligibility; irregular speech tempo; locally adaptive TSM; silent period removal; speech material browsing; time scale modification techniques; Computer aided instruction; Content based retrieval; Frequency; Lapping; Recursive estimation; Signal analysis; Signal synthesis; Software libraries; Speech; Video compression;
Conference_Titel :
Circuits and Systems, 1998. ISCAS '98. Proceedings of the 1998 IEEE International Symposium on
Conference_Location :
Monterey, CA
Print_ISBN :
0-7803-4455-3
DOI :
10.1109/ISCAS.1998.704087