• DocumentCode
    2862514
  • Title

    The Meta-Pi network: connectionist rapid adaptation for high-performance multi-speaker phoneme recognition

  • Author

    Hampshire, John B., II ; Waibel, Alex H.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    1990
  • fDate
    3-6 Apr 1990
  • Firstpage
    165
  • Abstract
    A multinetwork time-delay-neural-network (TDNN)-based connectionist architecture that allows multispeaker phoneme discrimination (/b,d,g/) to be performed at the speaker-dependent recognition rate of 98.4% is presented. The overall network gates the phonemic decisions of modules trained on individual speakers to form its overall classification decision. By dynamically adapting to the input speech and focusing on a combination of speaker-specific modules, the network outperforms a single TDNN trained on the speech of all six speakers (95.9%). To train this network, a form of multiplicative connection called the Meta-Pi connection is developed. It is illustrated how the Meta-Pi paradigm implements a dynamically adaptive Bayesian MAP classifier. The network learns, without supervision, to recognize the speech of one particular speaker (99.8%) using a dynamic combination of internal models of other speakers exclusively. The Meta-Pi model is a viable basis for a connectionist speech recognition system that can rapidly adapt to new speakers and varying speaker dialects.
  • Keywords
    Bayes methods; adaptive systems; computer architecture; computerised signal processing; learning systems; neural nets; speech recognition; Meta-Pi network; classification decision; connectionist rapid adaptation; dynamically adaptive Bayesian MAP classifier; multi-speaker phoneme recognition; multispeaker phoneme discrimination; speaker-dependent recognition rate; speaker-specific modules; time-delay-neural-network; Adaptive systems; Bayesian methods; Character recognition; Computer architecture; Computer science; Databases; Neural networks; Robustness; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
  • Conference_Location
    Albuquerque, NM
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1990.115564
  • Filename
    115564
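
The gating described in the abstract, where the overall decision is formed from the phonemic decisions of speaker-specific modules, can be sketched as a weighted sum of module outputs. This is a minimal illustration only, not the authors' implementation: the softmax normalization of the gate scores, the function names, and all numeric values are assumptions chosen for the example.

```python
import math

PHONEMES = ["b", "d", "g"]


def softmax(scores):
    """Normalize raw gate scores into positive weights that sum to 1 (assumed form)."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def meta_pi_combine(gate_scores, module_outputs):
    """Multiply each module's phoneme outputs by its gate weight and sum over modules."""
    weights = softmax(gate_scores)
    n_classes = len(module_outputs[0])
    return [
        sum(w * out[c] for w, out in zip(weights, module_outputs))
        for c in range(n_classes)
    ]


# Hypothetical outputs of three speaker-specific modules for one input token.
module_outputs = [
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.3, 0.3, 0.4],
]
# Hypothetical gate scores: the gate favors the first speaker's module.
gate_scores = [2.0, 0.0, 0.0]

combined = meta_pi_combine(gate_scores, module_outputs)
decision = PHONEMES[combined.index(max(combined))]
print(decision, combined)
```

Because the gate weights multiply the module outputs, the combination shifts toward whichever speaker models best match the input, which is the kind of dynamic focusing the abstract describes.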