TY - GEN
T1 - Let's DISCOH
T2 - 2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006
AU - Andreani, G.
AU - Di Fabbrizio, G.
AU - Gilbert, M.
AU - Gillick, D.
AU - Hakkani-Tür, D.
AU - Lemon, O.
PY - 2006
Y1 - 2006
N2 - We motivate and explain the DISCOH project1, which uses a publicly deployed spoken dialogue system for conference services to collect a richly annotated corpus of mixed-initiative human- machine spoken dialogues. System users are able to call a phone number and learn about a conference, including paper submission, program, venue, accommodation options and costs, etc. The collected corpus is (1) usable for training, evaluating and comparing statistical models, (2) naturally spoken and task oriented, (3) extendible / generalizable, (4) collected using state-of-the-art research and commercial technology, (5) freely available to researchers. We explain the principles behind the dialogue context representations and reward signals collected by the system, as well as the overall system design, Call Types, and Call Flow. We also present results regarding the initial ASR models and spoken language understanding models. We expect the resulting corpora to be used in advanced dialogue research over the coming years.
AB - We motivate and explain the DISCOH project1, which uses a publicly deployed spoken dialogue system for conference services to collect a richly annotated corpus of mixed-initiative human- machine spoken dialogues. System users are able to call a phone number and learn about a conference, including paper submission, program, venue, accommodation options and costs, etc. The collected corpus is (1) usable for training, evaluating and comparing statistical models, (2) naturally spoken and task oriented, (3) extendible / generalizable, (4) collected using state-of-the-art research and commercial technology, (5) freely available to researchers. We explain the principles behind the dialogue context representations and reward signals collected by the system, as well as the overall system design, Call Types, and Call Flow. We also present results regarding the initial ASR models and spoken language understanding models. We expect the resulting corpora to be used in advanced dialogue research over the coming years.
KW - Learning systems
KW - Natural language interfaces
KW - Speech communication
KW - User interfaces
UR - http://www.scopus.com/inward/record.url?scp=44049104329&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=44049104329&partnerID=8YFLogxK
U2 - 10.1109/SLT.2006.326794
DO - 10.1109/SLT.2006.326794
M3 - Conference contribution
AN - SCOPUS:44049104329
SN - 1424408733
SN - 9781424408733
T3 - 2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006, Proceedings
SP - 218
EP - 221
BT - 2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006, Proceedings
Y2 - 10 December 2006 through 13 December 2006
ER -