TY - GEN
T1 - Statistical semantic interpretation modeling for spoken language understanding with enriched semantic features
AU - Celikyilmaz, Asli
AU - Hakkani-Tur, Dilek
AU - Tur, Gokhan
PY - 2012
Y1 - 2012
N2 - In natural language human-machine statistical dialog systems, semantic interpretation is a key task typically performed following semantic parsing, and aims to extract canonical meaning representations of semantic components. In the literature, usually manually built rules are used for this task, even for implicitly mentioned non-named semantic components (like genre of a movie or price range of a restaurant). In this study, we present statistical methods for modeling interpretation, which can also benefit from semantic features extracted from large in-domain knowledge sources. We extract features from user utterances using a semantic parser and additional semantic features from textual sources (online reviews, synopses, etc.) using a novel tree clustering approach, to represent unstructured information that correspond to implicit semantic components related to targeted slots in the user's utterances. We evaluate our models on a virtual personal assistance system and demonstrate that our interpreter is effective in that it does not only improve the utterance interpretation in spoken dialog systems (reducing the interpretation error rate by 36% relative compared to a language model baseline), but also unveils hidden semantic units that are otherwise nearly impossible to extract from purely manual lexical features that are typically used in utterance interpretation.
AB - In natural language human-machine statistical dialog systems, semantic interpretation is a key task typically performed following semantic parsing, and aims to extract canonical meaning representations of semantic components. In the literature, usually manually built rules are used for this task, even for implicitly mentioned non-named semantic components (like genre of a movie or price range of a restaurant). In this study, we present statistical methods for modeling interpretation, which can also benefit from semantic features extracted from large in-domain knowledge sources. We extract features from user utterances using a semantic parser and additional semantic features from textual sources (online reviews, synopses, etc.) using a novel tree clustering approach, to represent unstructured information that correspond to implicit semantic components related to targeted slots in the user's utterances. We evaluate our models on a virtual personal assistance system and demonstrate that our interpreter is effective in that it does not only improve the utterance interpretation in spoken dialog systems (reducing the interpretation error rate by 36% relative compared to a language model baseline), but also unveils hidden semantic units that are otherwise nearly impossible to extract from purely manual lexical features that are typically used in utterance interpretation.
KW - graphical models
KW - semantic interpretation
KW - semi-supervised clustering
KW - spoken language understanding
UR - http://www.scopus.com/inward/record.url?scp=84874255045&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84874255045&partnerID=8YFLogxK
U2 - 10.1109/SLT.2012.6424225
DO - 10.1109/SLT.2012.6424225
M3 - Conference contribution
AN - SCOPUS:84874255045
SN - 9781467351263
T3 - 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings
SP - 216
EP - 221
BT - 2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings
T2 - 2012 IEEE Workshop on Spoken Language Technology, SLT 2012
Y2 - 2 December 2012 through 5 December 2012
ER -