TY - GEN
T1 - What is wrong with you?
T2 - 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022
AU - Ghazarian, Sarik
AU - Hedayatnia, Behnam
AU - Papangelis, Alexandros
AU - Liu, Yang
AU - Hakkani-Tur, Dilek
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022
Y1 - 2022
N2 - Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as a proxy to measure the quality of the previous system response. This allows us to train on a massive set of dialogs with weak supervision, without requiring manual system turn quality annotations. Experiments show that our model is comparable to models trained on human annotated data. Furthermore, our model generalizes across both spoken and written open-domain dialog corpora collected from real and paid users.
AB - Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as a proxy to measure the quality of the previous system response. This allows us to train on a massive set of dialogs with weak supervision, without requiring manual system turn quality annotations. Experiments show that our model is comparable to models trained on human annotated data. Furthermore, our model generalizes across both spoken and written open-domain dialog corpora collected from real and paid users.
UR - http://www.scopus.com/inward/record.url?scp=85148765981&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85148765981&partnerID=8YFLogxK
U2 - 10.18653/v1/2022.findings-acl.331
DO - 10.18653/v1/2022.findings-acl.331
M3 - Conference contribution
AN - SCOPUS:85148765981
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 4194
EP - 4204
BT - ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022
A2 - Muresan, Smaranda
A2 - Nakov, Preslav
A2 - Villavicencio, Aline
PB - Association for Computational Linguistics (ACL)
Y2 - 22 May 2022 through 27 May 2022
ER -