• DocumentCode
    3103572
  • Title
    Thinking about the present and future of the complex speech recognition
  • Author
    Vicsi, Klara
  • Author_Institution
    Laboratory of Speech Acoustics, Department of Telecommunication and Mediainformatics, Budapest University of Technology and Economics, Hungary
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    371
  • Lastpage
    376
  • Abstract
    A critical point for most cognitive info-communication systems is the state of development of speech recognition technology. The paper gives a short introduction to the principles of current speech recognition technology. It highlights the fact that the systems on the market are only speech-to-text transformers that output nothing more than a word chain, leaving out speech prosody, speech emotion, speech style, and other information. Many uncertainties exist in such operational systems. Some up-to-date research trends, mainly parallel processing, are introduced that aim to increase recognition efficiency. Finally, the META NET research agenda for Multilingual Europe in 2020 is briefly introduced.
  • Keywords
    multi-modal speech processing; multi-stream modelling; speech recognition; speech to text transformation system;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cognitive Infocommunications (CogInfoCom), 2012 IEEE 3rd International Conference on
  • Conference_Location
    Kosice, Slovakia
  • Print_ISBN
    978-1-4673-5187-4
  • Electronic_ISBN
    978-1-4673-5186-7
  • Type
    conf
  • DOI
    10.1109/CogInfoCom.2012.6422008
  • Filename
    6422008