Aligning ASL for statistical translation using a discriminative word model

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We describe a method to align ASL video subtitles with a closed-caption transcript. Our alignments are partial, based on spotting words within the video sequence, which consists of joined (rather than isolated) signs with unknown word boundaries. We start with windows known to contain an example of a word, but not limited to it. We estimate the start and end of the word in these examples using a voting method. This provides a small number of training examples (typically three per word). Since there is no shared structure, we use a discriminative rather than a generative word model. While our word spotters are not perfect, they are sufficient to establish an alignment. We demonstrate that quite small numbers of good word spotters result in an alignment good enough to produce simple English-ASL translations, both by phrase matching and by word substitution.
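The abstract does not spell out how spotted words become an alignment. The following is a minimal sketch of one plausible step, not the authors' procedure: it assumes each word-spotter detection is a hypothetical `(frame, transcript_index)` pair and keeps the largest subset in which transcript order agrees with video order, using a standard longest-increasing-subsequence pass. The function name `monotone_anchors` and the input format are illustrative assumptions.

```python
from bisect import bisect_left

def monotone_anchors(detections):
    """Select a largest order-consistent subset of spotter detections.

    detections: list of (frame, transcript_index) pairs (hypothetical format).
    Returns pairs sorted by frame whose transcript_index strictly increases.
    """
    # Sort by frame; break ties by descending transcript index so that two
    # detections at the same frame can never both be selected.
    ordered = sorted(detections, key=lambda d: (d[0], -d[1]))
    tails = []      # tails[k]: smallest transcript index ending a chain of length k+1
    tails_pos = []  # position in `ordered` of that chain end
    prev = [None] * len(ordered)  # back-pointers for reconstructing the chain

    for i, (_, idx) in enumerate(ordered):
        k = bisect_left(tails, idx)  # bisect_left enforces strict increase
        if k == len(tails):
            tails.append(idx)
            tails_pos.append(i)
        else:
            tails[k] = idx
            tails_pos[k] = i
        prev[i] = tails_pos[k - 1] if k > 0 else None

    # Walk the back-pointers from the end of the longest chain.
    chain = []
    i = tails_pos[-1] if tails_pos else None
    while i is not None:
        chain.append(ordered[i])
        i = prev[i]
    return chain[::-1]

# Example: one detection contradicts the transcript order and is dropped.
anchors = monotone_anchors([(10, 0), (50, 5), (30, 2), (40, 1), (90, 7)])
```

The surviving anchors give a partial alignment in the sense of the abstract: only spotted words are pinned to frames, and everything between two anchors remains unaligned.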

Original language: English (US)
Title of host publication: Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
Pages: 1471-1476
Number of pages: 6
DOIs: https://doi.org/10.1109/CVPR.2006.51
State: Published - Dec 22 2006
Event: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006 - New York, NY, United States
Duration: Jun 17 2006 → Jun 22 2006

Publication series

Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume: 2
ISSN (Print): 1063-6919

Other

Other: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
Country: United States
City: New York, NY
Period: 6/17/06 → 6/22/06

Keywords

  • Action analysis and recognition
  • Applications of vision
  • Image and video retrieval
  • Object recognition

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition


  • Cite this

    Farhadi, A., & Forsyth, D. A. (2006). Aligning ASL for statistical translation using a discriminative word model. In Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006 (pp. 1471-1476). [1640930] (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 2). https://doi.org/10.1109/CVPR.2006.51