مرکز منطقه ای اطلاع رساني علوم و فناوري - Vowel duration measurement using deep neural networks

DocumentCode :

3688610

Title :

Vowel duration measurement using deep neural networks

Author :

Yossi Adi;Joseph Keshet;Matthew Goldrick

Author_Institution :

Dept. of Computer Science, Bar-Ilan University, Ramat-Gan, Israel

fYear :

2015

Firstpage :

Lastpage :

Abstract :

Vowel durations are most often utilized in studies addressing specific issues in phonetics. Thus far this has been hampered by a reliance on subjective, labor-intensive manual annotation. Our goal is to build an algorithm for automatic accurate measurement of vowel duration, where the input to the algorithm is a speech segment contains one vowel preceded and followed by consonants (CVC). Our algorithm is based on a deep neural network trained at the frame level on manually annotated data from a phonetic study. Specifically, we try two deep-network architectures: convolutional neural network (CNN), and deep belief network (DBN), and compare their accuracy to an HMM-based forced aligner. Results suggest that CNN is better than DBN, and both CNN and HMM-based forced aligner are comparable in their results, but neither of them yielded the same predictions as models fit to manually annotated data.

Keywords :

"Hidden Markov models","Speech","Manuals","Context","Data models","Predictive models","Neural networks"

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing (MLSP), 2015 IEEE 25th International Workshop on

Type :

conf

DOI :

10.1109/MLSP.2015.7324331

Filename :

7324331

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3688610