Title :
Open domain continuous filipino speech recognition with code-switching
Author :
Ang, Federico ; Miyanaga, Yoshikazu ; Guevara, Rowena Cristina ; Cajote, Rhandley ; Bayona, Michael Gringo Angelo
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
Abstract :
It is widely known that database quality has a huge impact on speech recognition system performance, most especially when the expected domain is well represented. In this paper, we use this idea as leverage for a data-driven solution to the problem of code-switching in Filipino. Practical Filipino conversations often contain English and other loan words in varying frequencies, demanding better training of parameters and models for its speech recognition system. We alleviate the underrepresentation of loan words through the development of a new speech database for training, and applied appropriate data analysis to make reliable evaluation results. The best system was searched via lattice rescoring from a cross-validation set containing almost three hours of unknown speech data. The description and results of our experiments serve as a new and competent baseline model for succeeding future developments.
Keywords :
audio databases; natural language processing; speech recognition; code switching; competent baseline model; data analysis; data driven solution; database quality; open domain continuous Filipino speech recognition; practical Filipino conversations; speech database; speech recognition system performance; unknown speech data; Acoustics; Databases; Decoding; Hidden Markov models; Speech; Speech recognition; Training; Filipino speech; HMM; automatic speech recognition; code-switching;
Conference_Titel :
Circuits and Systems (ISCAS), 2014 IEEE International Symposium on
Conference_Location :
Melbourne VIC
Print_ISBN :
978-1-4799-3431-7
DOI :
10.1109/ISCAS.2014.6865631