• DocumentCode
    1468870
  • Title

    Intelligent voice smoother for silence-suppressed voice over Internet

  • Author

    Tien, Po L. ; Yuang, Maria C.

  • Author_Institution
    Inst. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    17
  • Issue
    1
  • fYear
    1999
  • fDate
    1/1/1999 12:00:00 AM
  • Firstpage
    29
  • Lastpage
    41
  • Abstract
    When transporting voice data with silence suppression over the Internet, the problem of jitter introduced from the network often renders the speech unintelligible. It is thus indispensable to offer intramedia synchronization to remove jitter while retaining minimal playout delay (PD). We propose a neural network (NN)-based intravoice synchronization mechanism, called the intelligent voice smoother (IVoS). The IVoS is composed of three components: (1) the smoother buffer; (2) the NN traffic predictor; and (3) the constant bit rate (CBR) enforcer. Newly arriving frames, assumed to follow a generic Markov modulated Bernoulli process (MMBP), are queued in the smoother buffer. The NN traffic predictor employs an online-trained back propagation NN (BPNN) to predict three traffic characteristics of every newly encountered talkspurt period. Based on the predicted characteristics, the CBR enforcer derives an adaptive buffering delay (ABD) by means of a near-optimal simple closed-form formula. It then imposes the delay on the playout of the first frame in the talkspurt period. The CBR enforcer in turn regulates CBR-based departures for the remaining frames of the talkspurt, aiming at assuring minimal mean and variance of distortion of talkspurts (DOT) and mean PD. Simulation results reveal that, compared to three other playout approaches, the IVoS achieves superior playout, yielding negligible DOT and PD, irrespective of traffic variation
  • Keywords
    Internet; Markov processes; backpropagation; jitter; modulation; neural nets; smoothing methods; speech processing; synchronisation; telecommunication computing; telecommunication traffic; voice communication; Internet; MMBP; Markov modulated Bernoulli process; adaptive buffering delay; constant bit rate enforcer; distortion of talkspurts; intelligent voice smoother; intramedia synchronization; jitter; mean; near-optimal closed-form formula; network-based intravoice synchronization; online-trained back propagation neural network; playout delay; silence-suppressed voice; simulation results; smoother buffer; speech intelligibility; talkspurt period; traffic characteristics prediction; traffic predictor; variance; voice data; Bit rate; Delay; IP networks; Intelligent networks; Jitter; Neural networks; Speech; Telecommunication traffic; Traffic control; US Department of Transportation;
  • fLanguage
    English
  • Journal_Title
    Selected Areas in Communications, IEEE Journal on
  • Publisher
    ieee
  • ISSN
    0733-8716
  • Type

    jour

  • DOI
    10.1109/49.743694
  • Filename
    743694