TY - GEN
T1 - Dialog Acts for Task-Driven Embodied Agents
AU - Gella, Spandana
AU - Padmakumar, Aishwarya
AU - Lange, Patrick
AU - Hakkani-Tur, Dilek
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022
Y1 - 2022
N2 - Embodied agents need to be able to interact in natural language - understanding task descriptions and asking appropriate follow up questions to obtain necessary information to be effective at successfully accomplishing tasks for a wide range of users. In this work, we propose a set of dialog acts for modelling such dialogs and annotate the TEACh dataset that includes over 3,000 situated, task oriented conversations (consisting of 39.5k utterances in total) with dialog acts. TEACh-DA is one of the first large scale dataset of dialog act annotations for embodied task completion. Furthermore, we demonstrate the use of this annotated dataset in training models for tagging the dialog acts of a given utterance, predicting the dialog act of the next response given a dialog history, and use the dialog acts to guide agent's non-dialog behaviour. In particular, our experiments on the TEACh Execution from Dialog History task where the model predicts the sequence of low level actions to be executed in the environment for embodied task completion, demonstrate that dialog acts can improve end task success rate by up to 2 points compared to the system without dialog acts.
AB - Embodied agents need to be able to interact in natural language - understanding task descriptions and asking appropriate follow up questions to obtain necessary information to be effective at successfully accomplishing tasks for a wide range of users. In this work, we propose a set of dialog acts for modelling such dialogs and annotate the TEACh dataset that includes over 3,000 situated, task oriented conversations (consisting of 39.5k utterances in total) with dialog acts. TEACh-DA is one of the first large scale dataset of dialog act annotations for embodied task completion. Furthermore, we demonstrate the use of this annotated dataset in training models for tagging the dialog acts of a given utterance, predicting the dialog act of the next response given a dialog history, and use the dialog acts to guide agent's non-dialog behaviour. In particular, our experiments on the TEACh Execution from Dialog History task where the model predicts the sequence of low level actions to be executed in the environment for embodied task completion, demonstrate that dialog acts can improve end task success rate by up to 2 points compared to the system without dialog acts.
UR - http://www.scopus.com/inward/record.url?scp=85150604615&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85150604615&partnerID=8YFLogxK
U2 - 10.18653/v1/2022.sigdial-1.13
DO - 10.18653/v1/2022.sigdial-1.13
M3 - Conference contribution
AN - SCOPUS:85150604615
T3 - SIGDIAL 2022 - 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
SP - 111
EP - 123
BT - SIGDIAL 2022 - 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2022
Y2 - 7 September 2022 through 9 September 2022
ER -