TY - GEN
T1 - Cross-lingual models of word embeddings
T2 - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
AU - Upadhyay, Shyam
AU - Faruqui, Manaal
AU - Dyer, Chris
AU - Roth, Dan
N1 - Funding Information:
This material is based on research sponsored by DARPA under agreement number FA8750-13-2-0008 and Contract HR0011-15-2-0025. Approved for Public Release, Distribution Unlimited. The views expressed are those of the authors and do not reflect the official policy or position of the Department of Defense or the U.S. Government.
PY - 2016
Y1 - 2016
N2 - Despite interest in using cross-lingual knowledge to learn word embeddings for various tasks, a systematic comparison of the possible approaches is lacking in the literature. We perform an extensive evaluation of four popular approaches of inducing cross-lingual embeddings, each requiring a different form of supervision, on four typologically different language pairs. Our evaluation setup spans four different tasks, including intrinsic evaluation on mono-lingual and cross-lingual similarity, and extrinsic evaluation on downstream semantic and syntactic applications. We show that models which require expensive cross-lingual knowledge almost always perform better, but cheaply supervised models often prove competitive on certain tasks.
AB - Despite interest in using cross-lingual knowledge to learn word embeddings for various tasks, a systematic comparison of the possible approaches is lacking in the literature. We perform an extensive evaluation of four popular approaches of inducing cross-lingual embeddings, each requiring a different form of supervision, on four typologically different language pairs. Our evaluation setup spans four different tasks, including intrinsic evaluation on mono-lingual and cross-lingual similarity, and extrinsic evaluation on downstream semantic and syntactic applications. We show that models which require expensive cross-lingual knowledge almost always perform better, but cheaply supervised models often prove competitive on certain tasks.
UR - http://www.scopus.com/inward/record.url?scp=85011798855&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85011798855&partnerID=8YFLogxK
U2 - 10.18653/v1/p16-1157
DO - 10.18653/v1/p16-1157
M3 - Conference contribution
AN - SCOPUS:85011798855
T3 - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers
SP - 1661
EP - 1670
BT - 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers
PB - Association for Computational Linguistics (ACL)
Y2 - 7 August 2016 through 12 August 2016
ER -