TY - JOUR
T1 - Overview of the Tenth Dialog System Technology Challenge
T2 - DSTC10
AU - Yoshino, Koichiro
AU - Chen, Yun Nung
AU - Crook, Paul
AU - Kottur, Satwik
AU - Li, Jinchao
AU - Hedayatnia, Behnam
AU - Moon, Seungwhan
AU - Fei, Zhengcong
AU - Li, Zekang
AU - Zhang, Jinchao
AU - Feng, Yang
AU - Zhou, Jie
AU - Kim, Seokhwan
AU - Liu, Yang
AU - Jin, Di
AU - Papangelis, Alexandros
AU - Gopalakrishnan, Karthik
AU - Hakkani-Tur, Dilek
AU - Damavandi, Babak
AU - Geramifard, Alborz
AU - Hori, Chiori
AU - Shah, Ankit
AU - Zhang, Chen
AU - Li, Haizhou
AU - Sedoc, Joao
AU - D'haro, Luis F.
AU - Banchs, Rafael
AU - Rudnicky, Alexander
N1 - Publisher Copyright:
© 2023 The Authors.
PY - 2024
Y1 - 2024
N2 - This article introduces the Tenth Dialog System Technology Challenge (DSTC-10). This edition of the DSTC focuses on applying end-to-end dialog technologies for five distinct tasks in dialog systems, namely 1. Incorporation of Meme images into open domain dialogs, 2. Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations, 3. Situated Interactive Multimodal dialogs, 4. Reasoning for Audio Visual Scene-Aware Dialog, and 5. Automatic Evaluation and Moderation of Open-domain Dialogue Systems. This article describes the task definition, provided datasets, baselines, and evaluation setup for each track. We also summarize the results of the submitted systems to highlight the general trends of the state-of-the-art technologies for the tasks.
AB - This article introduces the Tenth Dialog System Technology Challenge (DSTC-10). This edition of the DSTC focuses on applying end-to-end dialog technologies for five distinct tasks in dialog systems, namely 1. Incorporation of Meme images into open domain dialogs, 2. Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations, 3. Situated Interactive Multimodal dialogs, 4. Reasoning for Audio Visual Scene-Aware Dialog, and 5. Automatic Evaluation and Moderation of Open-domain Dialogue Systems. This article describes the task definition, provided datasets, baselines, and evaluation setup for each track. We also summarize the results of the submitted systems to highlight the general trends of the state-of-the-art technologies for the tasks.
KW - Dialog systems
KW - multimodal sensors
KW - natural language processing
KW - speech processing
UR - http://www.scopus.com/inward/record.url?scp=85164398846&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85164398846&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2023.3293030
DO - 10.1109/TASLP.2023.3293030
M3 - Article
AN - SCOPUS:85164398846
SN - 2329-9290
VL - 32
SP - 765
EP - 778
JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing
JF - IEEE/ACM Transactions on Audio, Speech, and Language Processing
ER -