• DocumentCode
    2833935
  • Title

    Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances

  • Author

    Gangashetty, Suryakanth V. ; Sekhar, Chandra C. ; Yegnanarayana, B.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Chennai, India
  • fYear
    2004
  • fDate
    2004
  • Firstpage
    159
  • Lastpage
    164
  • Abstract
    Classification models based on multilayer perceptron (MLP) or support vector machine (SVM) have been commonly used for complex pattern classification tasks. These models are suitable for classification of fixed dimension patterns. However, durations of consonant-vowel (CV) utterances vary not only for different classes, but also for a particular CV class. It is necessary to develop a method for representing the CV utterances by patterns of fixed dimension. For CV utterances, vowel onset point (VOP) is the instant at which the consonant part ends and the vowel part begins. Important information necessary for classification of CV utterances is present in the region around the VOP. A segment of fixed duration around the VOP can be processed to extract a pattern of fixed dimension to represent a CV utterance. Accurate detection of vowel onset points is important for recognition of CV utterances of speech. In this paper, we propose an approach for detection of VOP, based on dynamic time alignment between a reference pattern of a CV class and the pattern of an utterance of that class. The results of studies show that the hypothesised VOPs using the proposed approach have less deviation from their actual locations.
  • Keywords
    feature extraction; multilayer perceptrons; pattern classification; speech recognition; support vector machines; consonant vowel utterance; dynamic time alignment; fixed dimension patterns classification; fixed dimension patterns extraction; multilayer perceptron; support vector machine; vowel onset point; Computer science; Laboratories; Multilayer perceptrons; Neural networks; Pattern classification; Production; Speech recognition; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
  • Print_ISBN
    0-7803-8243-9
  • Type

    conf

  • DOI
    10.1109/ICISIP.2004.1287644
  • Filename
    1287644