Abstract
We describe Joshua, an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for synchronous context free grammars (SCFGs): chart-parsing, ngram language model integration, beamand cube-pruning, and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability. We demonstrate that the toolkit achieves state of the art translation performance on the WMT09 French-English translation task.
Original language | English (US) |
---|---|
Pages | 135-139 |
Number of pages | 5 |
State | Published - 2009 |
Event | 4th Workshop on Statistical Machine Translation, WMT 2009, immediately preceding the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009 - Athens, Greece Duration: Mar 30 2009 → Mar 31 2009 |
Conference
Conference | 4th Workshop on Statistical Machine Translation, WMT 2009, immediately preceding the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009 |
---|---|
Country/Territory | Greece |
City | Athens |
Period | 3/30/09 → 3/31/09 |
ASJC Scopus subject areas
- Software
- Language and Linguistics
- Human-Computer Interaction