A Machine Learning Approach to Evaluating Translation Quality

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We explored supervised machine learning (ML) techniques to understand and predict the adequacy and fluency of English-Spanish machine translation. Five experiments were conducted using three classifiers in Weka, an open-source ML tool. We found that the highest performance was achieved by applying a dimensionality reduction approach to the classification task, which included collapsing a numeric scale of quality to two categories: high quality and low quality. Our results showed that the Support Vector Machine classifier performed the best at predicting the adequacy (65.65%) and fluency (65.77%) of the translations. More research is needed to explore the methodologies of applying ML to translation evaluation.

Original languageEnglish (US)
Title of host publication2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781538638613
DOIs
StatePublished - Jul 25 2017
Externally publishedYes
Event17th ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017 - Toronto, Canada
Duration: Jun 19 2017Jun 23 2017

Publication series

NameProceedings of the ACM/IEEE Joint Conference on Digital Libraries
ISSN (Print)1552-5996

Other

Other17th ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017
Country/TerritoryCanada
CityToronto
Period6/19/176/23/17

Keywords

  • machine learning
  • Machine translation evaluation
  • Weka

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'A Machine Learning Approach to Evaluating Translation Quality'. Together they form a unique fingerprint.

Cite this