• DocumentCode
    3428665
  • Title

    Embedded speech recognition applications in mobile phones: Status, trends, and challenges

  • Author

    Cohen, Jordan

  • Author_Institution
    SRI Int., Menlo Park, CA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    5352
  • Lastpage
    5355
  • Abstract
    Voice centric interfaces are widely available in modern mobile phones, including low-cost versions. The applications have evolved from speaker-dependent name dialing, which require user enrollment of frequently dialed names, to speaker-independent capabilities including continuous digit dialing, command and control of phone functions, and name dialing directly from the phone´s contacts directory. Recently available advances include capabilities like voice-enabled SMS, e-mail, and even mobile search with voice. This evolution has been enabled by advances in speech recognition robustness, network capabilities, and increased computational power in small devices. Systems may now be used in hands-busy/eyes-busy conditions including speakerphone and bluetooth scenarios. In this paper, we will provide an overview of embedded speech recognition centric applications in mobile phones, specifically focusing on current status, industry trends, and challenges in customer acceptance. Although voice interfaces are natural and attractive in theory, a majority of users do not use the voice-enabled features available in their mobile phones. We will discuss some of the reasons for this user behavior and recommend actions to be taken.
  • Keywords
    mobile handsets; speaker recognition; user interfaces; mobile phones; speaker-dependent name dialing; speech recognition; voice centric interfaces; Application software; Cellular phones; Hidden Markov models; Laboratories; Linear predictive coding; Mobile handsets; Signal processing algorithms; Space technology; Speech coding; Speech recognition; Applications; Mobile; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518869
  • Filename
    4518869