Joshua: An Open Source Toolkit for Parsing-based Machine Translation

Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G. Thornton, Jonathan Weese, Omar F. Zaidan

Research output: Contribution to conferencePaperpeer-review

Abstract

We describe Joshua, an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for synchronous context free grammars (SCFGs): chart-parsing, ngram language model integration, beamand cube-pruning, and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability. We demonstrate that the toolkit achieves state of the art translation performance on the WMT09 French-English translation task.

Original languageEnglish (US)
Pages135-139
Number of pages5
StatePublished - 2009
Event4th Workshop on Statistical Machine Translation, WMT 2009, immediately preceding the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009 - Athens, Greece
Duration: Mar 30 2009Mar 31 2009

Conference

Conference4th Workshop on Statistical Machine Translation, WMT 2009, immediately preceding the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009
Country/TerritoryGreece
CityAthens
Period3/30/093/31/09

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'Joshua: An Open Source Toolkit for Parsing-based Machine Translation'. Together they form a unique fingerprint.

Cite this