Overview of the Tenth Dialog System Technology Challenge: DSTC10

Koichiro Yoshino, Yun Nung Chen, Paul Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, Joao Sedoc, Luis F. D'haro, Rafael Banchs, Alexander Rudnicky

Research output: Contribution to journal › Article › peer-review

Abstract

This article introduces the Tenth Dialog System Technology Challenge (DSTC-10). This edition of the DSTC focuses on applying end-to-end dialog technologies to five distinct tasks in dialog systems, namely 1. Incorporation of Meme images into open domain dialogs, 2. Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations, 3. Situated Interactive Multimodal dialogs, 4. Reasoning for Audio Visual Scene-Aware Dialog, and 5. Automatic Evaluation and Moderation of Open-domain Dialogue Systems. This article describes the task definition, provided datasets, baselines, and evaluation setup for each track. We also summarize the results of the submitted systems to highlight the general trends of the state-of-the-art technologies for the tasks.

Original language: English (US)
Pages (from-to): 765-778
Number of pages: 14
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Volume: 32
State: Published - 2024
Externally published: Yes

Keywords

  • Dialog systems
  • multimodal sensors
  • natural language processing
  • speech processing

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Acoustics and Ultrasonics
  • Computational Mathematics
  • Electrical and Electronic Engineering
