DocumentCode :
3023370
Title :
Fast Vocabulary-Independent Audio Search Based on Syllable Confusion Network Indexing in Mandarin Spontaneous Speech
Author :
Jian Shao ; Pengyuan Zhang ; Zhaojie Liu ; Qingwei Zhao ; Yonghong Yan
Author_Institution :
Chinese Acad. of Sci., Beijing
fYear :
2007
fDate :
1-5 July 2007
Firstpage :
8
Lastpage :
8
Abstract :
This paper presents a fast vocabulary-independent audio search method in Mandarin spontaneous speech which is based on syllable confusion network (SCN) indexing. Confusion network is linear and naturally suitable for indexing. The feasibility of using syllable confusion network as lattice representation is firstly investigated. Since direct syllabic decoding may not have a very high accuracy, long- span decoding units such as syllable collocation and word are also explored respectively. To handle spontaneous speech with high error rate and irregular prosody, syllable confusion matrix (SCM) is applied here to calculate the relevance score. Experiments carried out on conversational corpora for the keyword spotting task in the Chinese 2005 863 Evaluation show that this method can not only yield highly compact SCN lattices with syllable graph density (SGD) of 3.83, but also achieve an equal error rate (EER) of 32.45%, which is about 33% relatively reduction when comparing with the baseline top-1-based search method with an EER of 48.72%.
Keywords :
indexing; natural languages; search problems; speech recognition; Mandarin spontaneous speech; direct syllabic decoding; fast vocabulary-independent audio search; lattice representation; syllable collocation; syllable confusion matrix; syllable confusion network indexing; syllable graph density; Acoustics; Automatic speech recognition; Decoding; Error analysis; Frequency; Lattices; Machine assisted indexing; Natural languages; Search methods; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Telecommunications, 2007. ICDT '07. Second International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-2910-0
Electronic_ISBN :
0-7695-2910-0
Type :
conf
DOI :
10.1109/ICDT.2007.17
Filename :
4270574
Link To Document :
بازگشت