DocumentCode :
2652542
Title :
Isolated word, speaker dependent recognition under the presence noise, based on an audio retrieval algorithm
Author :
Vasiloglou, Nikolaos ; Schafer, Ronald W. ; Hans, Mat C.
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
2
fYear :
2004
fDate :
7-10 Nov. 2004
Firstpage :
1809
Abstract :
With rapidly increasing storage and computational capacity, a common PC can store and index hundreds of hours of speech. This suggests that new approaches based on database techniques might be useful in speech recognition and speech indexing. This paper presents a first step in such a direction. The algorithm developed relies on an indexed single-speaker database. The database consists of spoken utterances transcribed into text. The waveforms of these utterances are converted off-line to binary symbols called fingerprints through a nonlinear frequency-domain transform. The fingerprints are associated with the transcribed text. Given the fingerprint of a new waveform, the best word match from the database can be retrieved. A 3255 word database is used as a test bed. All the words from this database are mixed with white noise and time-scale modified to provide test data. The database is queried with the fingerprint of the test words and the best match is retrieved. The results of the experiments conducted are promising, showing a 99.5% recognition rate for a 20 dB signal to noise ratio (SNR).
Keywords :
audio databases; noise; speaker recognition; speech processing; 20 dB; audio retrieval algorithm; computational capacity; database techniques; isolated word; nonlinear frequency-domain transform; presence noise; signal to noise ratio; speaker dependent recognition; speech indexing; speech recognition; Databases; Fingerprint recognition; Frequency; Hidden Markov models; Image storage; Indexing; Information retrieval; Signal to noise ratio; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
Print_ISBN :
0-7803-8622-1
Type :
conf
DOI :
10.1109/ACSSC.2004.1399475
Filename :
1399475
Link To Document :
بازگشت