Title :
Reduced sets of subword units for continuous speech recognition of Portuguese
Author :
Cerqueira Bisp dos Santos, S. ; Alcaim, A.
Author_Institution :
CETUC-PUC, Rio de Janeiro, Brazil
fDate :
3/16/2000 12:00:00 AM
Abstract :
An investigation is presented concerning two sets of subword units for continuous speech recognition, which are based on the characteristics of the Portuguese language. In the first set, with 149 units, it is considered that syllables which contain an epenthetic vowel can be formed by two CV (consonant-vowel) units. In the second set, with 254 units, these syllables are regarded as CCV (consonant-consonant-vowel) units. The recognition performance obtained with the two unit sets are comparable if a bigram model is used at the unit level. In this case, the first set is definitely preferable because it enables the complexity to be significantly reduced
Keywords :
computational complexity; speech recognition; Portuguese language; bigram model; complexity reduction; consonant-consonant-vowel units; consonant-vowel units; continuous speech recognition; epenthetic vowel; recognition performance; subword units; syllables;
Journal_Title :
Electronics Letters
DOI :
10.1049/el:20000446