Title :
Sentence segmentation for speech processing
Author :
Anu, J.P. ; Karjigi, Veena
Author_Institution :
Electron. & Commun, Siddaganga Inst. of Technol., Tumkur, India
Abstract :
Automatic sentence segmentation of speech is a process of identifying the end of a sentence. It is used for improving the output after speech recognition and helps in making the recognition output more readable. It is generally a two-class problem which involves the identification of a boundary characterizing the sentence part and the non-sentence part. An Automatic Speech Recognition (ASR) system receives a stream of speech signal and produces a un-annotated stream of words. The regions of silences in the inter word boundaries of the output of ASR are detected. The prosodic features of the input speech signal that are given to the ASR are extracted for a particular duration of time which includes pause, rhyme, slope, minimum, maximum and mean features. Once the features are extracted, SVM classifier uses all the features and discriminates each word boundary as sentence or non-sentence boundary.
Keywords :
speech processing; speech recognition; support vector machines; ASR system; SVM classifier; automatic sentence segmentation; automatic speech recognition system; input speech signal; inter word boundary; nonsentence part characterization; prosodic features; sentence part characterization; speech processing; time duration; word unannotated stream; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Support vector machines; Training; Automatic Speech Recognition (ASR); Hidden Markov model tool kit (HTK); Prosody;
Conference_Titel :
Communication, Signal Processing and Networking (NCCSN), 2014 National Conference on
Conference_Location :
Palakkad
DOI :
10.1109/NCCSN.2014.7001148