• DocumentCode
    2971660
  • Title

    Delay computation for real-time synchronization of speech and its converted text

  • Author

    Ali, Hamida Qunber ; Ahmed, Jameel ; Siyal, Mohammed Yakoob

  • Author_Institution
    Iqra Univ., Karachi
  • fYear
    2007
  • fDate
    10-13 Dec. 2007
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Transmission of real-time text data integrated with other multimedia applications such as audio and video has raised the issues of compatibility and synchronization among these applications since a stringent quality of service (QoS) guarantee is especially critical for real-time traffic. In order to meet the real-time properties, text must be produced efficiently to integrate its transmission with other multimedia applications. Literature survey shows that the text for the real-time transmission can be produced by different input sources such as from handwriting recognition, voice recognition or it can be entered by human users from a keyboard or any other input method [1]. This paper presents an efficient way of producing text which to the best of our knowledge has not been previously explored. We propose to generate text from the recognition of the real-time voice in the source machine. We calculate the delay in the speech-recognition or speech-to-text conversion. Based on these statistics we suggest a buffer size to store the voice data until its respective text is generated. This enables us to transmit both voice and its converted text synchronously. We find, and show it graphically, that this delay is almost negligible and there is almost no queue formation in the buffer. Hence both of the applications can be transmitted instantly that is as they are available. This research is a reasonable advancement in the subject area.
  • Keywords
    speech processing; speech recognition; delay computation; quality of service; real-time synchronization; speech-recognition; speech-to-text conversion; Buffer storage; Delay; Handwriting recognition; Humans; Keyboards; Quality of service; Speech recognition; Statistics; Text recognition; Traffic control; component; formatting; insert; style; styling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Communications & Signal Processing, 2007 6th International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-0982-2
  • Electronic_ISBN
    978-1-4244-0983-9
  • Type

    conf

  • DOI
    10.1109/ICICS.2007.4449582
  • Filename
    4449582