Regional Information Center for Science and Technology - A system for automatic alignment of broadcast media captions using weighted finite-state transducers

DocumentCode :

3744911

Title :

A system for automatic alignment of broadcast media captions using weighted finite-state transducers

Author :

Peter Bell;Steve Renals

Author_Institution :

Centre for Speech Technology Research, University of Edinburgh, Edinburgh EH8 9AB, UK

Year :

2015

Firstpage :

675

Lastpage :

680

Abstract :

We describe our system for alignment of broadcast media captions in the 2015 MGB Challenge. A precise time alignment of previously generated subtitles to media data is important in the process of caption generation by broadcasters. However, this task is challenging due to the highly diverse, often noisy content of the audio, and because the subtitles are frequently not a verbatim representation of the actual words spoken. Our system employs a two-pass approach with appropriately constrained weighted finite-state transducers (WFSTs) to enable good alignment even when the audio quality would be challenging for conventional ASR. The system achieves an F-score of 0.8965 on the MGB Challenge development set.

Keywords :

"Transducers","Decoding","Acoustics","Training","Speech","Timing","Hidden Markov models"

Publisher :

IEEE

Conference_Title :

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

Type :

conf

DOI :

10.1109/ASRU.2015.7404861

Filename :

7404861

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744911