• DocumentCode
    395219
  • Title

    Estimating number of speakers by the modulation characteristics of speech

  • Author

    Arai, Takuyuki

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Sophia Univ., Tokyo, Japan
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    A method for estimating number of speakers of mixed speech signals was proposed. The algorithm was based on the modulation characteristics of speech, specifically that a single speech utterance typically has a distinct modulation pattern with a peak around 4-5 Hz. Having observed that the modulation peak decreases as number of speakers increases, our estimation algorithm used the region of the modulation frequency between 2 and 8 Hz. We obtained a novel parameter we called "equivalent number of speakers" to estimate the number of simultaneous speakers when speech signals contain multiple speakers.
  • Keywords
    parameter estimation; spectral analysis; speech processing; 2 to 8 Hz; equivalent number of speakers; mixed speech signals; modulation characteristics; modulation pattern; simultaneous speakers; speaker number estimation; speech utterance; Ambient intelligence; Amplitude modulation; Auditory system; Frequency estimation; Frequency modulation; Humans; Loudspeakers; Signal analysis; Speech analysis; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1202328
  • Filename
    1202328