  • DocumentCode
    3594331
  • Title
    The relation between speech segment selectivity and source localization accuracy
  • Author
    Aarabi, Parham; Mahdavi, Alborz
  • Author_Institution
    The Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, 10 King's College Road, Toronto, Ontario, Canada, M5S 3G4
  • Volume
    1
  • fYear
    2002
  • Abstract
    An experimental analysis of the relation between speech signal segment power and source direction-of-arrival estimation accuracy is conducted. Approximately 2 hours of speech from 10 different speakers, both male and female, are used to analyze the performance of the Phase Transform, the Maximum Likelihood, and the Unfiltered Cross Correlation time-delay estimation techniques. For female speakers, it is determined that the Phase Transform technique has a lower percentage of anomalies and a lower direction-of-arrival root-mean-square error (DOA RMSE). Conversely, for male speakers, it is determined that the Unfiltered Cross Correlation has a lower percentage of anomalies, although the Phase Transform has a lower DOA RMSE. The spatial distribution of the errors and the relation between speech segment power and the errors are also presented.
  • Keywords
    Artificial neural networks; Signal to noise ratio
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.2002.5743707
  • Filename
    5743707