DocumentCode :
478283
Title :
Using Discriminative Training Techniques in Practical Intelligent Music Retrieval System
Author :
Xu, Ran ; Pan, Jielin ; Yan, Yonghong
Author_Institution :
Inst. of Acoust., Chinese Acad. of Sci., Beijing
Volume :
4
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
286
Lastpage :
290
Abstract :
The development of speech recognition technology has made it possible for some intelligent query systems to use a voice interface. In this paper, we developed a pop-song music retrieval system for telecom carriers to facilitate the interactions between the end users and the music database. When trying to improve the system performance, however, it was found that some typical recognizing optimization techniques for large vocabulary continuous speech recognition (LVCSR) is not practicable for such a real-time application, in which accuracy and speed are both highly stressed. Thus, model optimization techniques are considered. Feature discriminative analysis and minimum phone error discriminative training techniques proposed in recent years have obtained great success in LVCSR, however, there are few reports about their practical applications on online grammar-constrained recognition tasks. In this paper, these techniques are employed and evaluated on such a real-time recognition task. The experimental result shows that these techniques can be effectively implemented in our practical application system with a remarkable error rate reduction of 13.3%.
Keywords :
information retrieval systems; music; optimisation; speech recognition; voice communication; discriminative training techniques; feature discriminative analysis; intelligent query systems; large vocabulary continuous speech recognition; online grammar-constrained recognition; optimization; practical intelligent music retrieval system; speech recognition technology; telecom carriers; voice interface; Error analysis; Hidden Markov models; Intelligent systems; Lattices; Music information retrieval; Real time systems; Speech recognition; System performance; Telecommunications; Telephony; discriminative training; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
Type :
conf
DOI :
10.1109/ICNC.2008.985
Filename :
4667291
Link To Document :
بازگشت