DocumentCode
2085763
Title
Aligning ASL for Statistical Translation Using a Discriminative Word Model
Author
Farhadi, Ali ; Forsyth, David
Author_Institution
University of Illinois at Urbana-Champai
Volume
2
fYear
2006
fDate
2006
Firstpage
1471
Lastpage
1476
Abstract
We describe a method to align ASL video subtitles with a closed-caption transcript. Our alignments are partial, based on spotting words within the video sequence, which consists of joined (rather than isolated) signs with unknown word boundaries. We start with windows known to contain an example of a word, but not limited to it. We estimate the start and end of the word in these examples using a voting method. This provides a small number of training examples (typically three per word). Since there is no shared structure, we use a discriminative rather than a generative word model. While our word spotters are not perfect, they are sufficient to establish an alignment. We demonstrate that quite small numbers of good word spotters results in an alignment good enough to produce simple English-ASL translations, both by phrase matching and using word substitution.
Keywords
Action Analysis and Recognition.; Applications of Vision; Image and video retrieval; Object recognition; Computer science; Handicapped aids; Hidden Markov models; Image analysis; Image retrieval; Natural languages; Object recognition; Video sequences; Vocabulary; Voting; Action Analysis and Recognition.; Applications of Vision; Image and video retrieval; Object recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on
ISSN
1063-6919
Print_ISBN
0-7695-2597-0
Type
conf
DOI
10.1109/CVPR.2006.51
Filename
1640930
Link To Document