Title :
Persian speech sentence segmentation without speech recognition
Author :
Jafari, Hoda Sadat ; Homayounpour, Mohammad Mehdi
Author_Institution :
Comput. Eng. & Inf. Technol. Dept., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
In this paper, we propose a method for detection of Persian speech sentence boundaries using a set of prosodic features and spectral centroid. No speech recognizer is used in our proposed method. Silent regions are first detected using four features including spectral centroid, zero crossing rate, energy and pitch. Then, twelve prosodic features are extracted from each silent region. Silent regions may correspond to a sentence boundary or other regions inside a sentence. Features of Silence regions of speech data from some speakers are extracted and labeled as silence in the boundary or inside the sentences. These feature vectors and a nonlinear support vector machine (SVM) classifier, is trained and then evaluated for detection of Persian speech sentence boundaries. The proposed algorithm was evaluated on six speakers from Large FARSDAT data set. A performance of 82.4% F-measure was achieved on test set from all speakers in training data and 73.02% F-measure on speakers outside the training data.
Keywords :
feature extraction; natural language processing; pattern classification; spectral analysis; support vector machines; F-measure; Persian speech sentence boundary detection; Persian speech sentence segmentation; SVM classifier training; energy; feature vectors; large FARSDAT data set; nonlinear support vector machine classifier; pitch; prosodic feature extraction; silent region detection; spectral centroid; speech data; training data; zero-crossing rate; Computers; Feature extraction; Speech; Speech recognition; Support vector machines; Training; Training data; Persian; SVM classifier; prosodic features; sentence detection;
Conference_Titel :
Intelligent Systems (ICIS), 2014 Iranian Conference on
Conference_Location :
Bam
Print_ISBN :
978-1-4799-3350-1
DOI :
10.1109/IranianCIS.2014.6802564