• DocumentCode
    2790264
  • Title

    Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering

  • Author

    Yegnanarayana, B. ; Prasanna, S. R Mahadeva

  • Author_Institution
    Int. Inst. of Inf. Technol. Hyderabad, Hyderabad, India
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5074
  • Lastpage
    5077
  • Abstract
    Instantaneous fundamental frequency (F0) in voiced speech can be obtained from the sequence of epochs corresponding to the instants of significant excitation. The epoch sequence can be derived using the recently proposed epoch extraction method based on zero frequency filtering. The epoch extraction method is robust against additive noise degradation. But in a multispeaker mixed signal, the degradation is caused due to overlapping impulse-like excitations of two or more speakers. The feasibility of extracting the instantaneous F0 contours from the two speaker mixed signal using zero frequency filtering is studied in this paper. The present study is based on deriving speaker-specific Hilbert Envelope (HE) signal which emphasizes peaks due to impulse-like excitation of one speaker and suppresses peaks due to other speaker. The epochs from this speaker-specific signal are obtained using the approach based on zero frequency filtering. The results of the proposed method is demonstrated for three different cases of mixed signals of two speakers data.
  • Keywords
    acoustic filters; acoustic noise; acoustic signal processing; speech; speech processing; additive noise degradation; epoch extraction method; impulse-like excitation; instantaneous contours; instantaneous fundamental frequency; multispeaker mixed signal; speaker-specific Hilbert envelope signal; voiced speech; zero frequency filtering; Additive noise; Autocorrelation; Data mining; Degradation; Filtering; Frequency; Helium; Microphones; Noise robustness; Speech analysis; Two speakers; epochs; instantaneous F0; zero frequency filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495049
  • Filename
    5495049