TY - GEN
T1 - Neural NLG for Methodius
T2 - 13th International Conference on Natural Language Generation, INLG 2020
AU - Stevens-Guille, Symon Jory
AU - Maskharashvili, Aleksandre
AU - Isard, Amy
AU - Li, Xintong
AU - White, Michael
N1 - This research was supported by a collaborative open science research agreement between Face-book and The Ohio State University.
PY - 2020
Y1 - 2020
N2 - While classic NLG systems typically made use of hierarchically structured content plans that included discourse relations as central components, more recent neural approaches have mostly mapped simple, flat inputs to texts without representing discourse relations explicitly. In this paper, we investigate whether it is beneficial to include discourse relations in the input to neural data-to-text generators for texts where discourse relations play an important role. To do so, we reimplement the sentence planning and realization components of a classic NLG system, Methodius, using LSTM sequence-to-sequence (seq2seq) models. We find that although seq2seq models can learn to generate fluent and grammatical texts remarkably well with sufficiently representative Methodius training data, they cannot learn to correctly express Methodius's SIMILARITY and CONTRAST comparisons unless the corresponding RST relations are included in the inputs. Additionally, we experiment with using self-training and reverse model reranking to better handle train/test data mismatches, and find that while these methods help reduce content errors, it remains essential to include discourse relations in the input to obtain optimal performance.
AB - While classic NLG systems typically made use of hierarchically structured content plans that included discourse relations as central components, more recent neural approaches have mostly mapped simple, flat inputs to texts without representing discourse relations explicitly. In this paper, we investigate whether it is beneficial to include discourse relations in the input to neural data-to-text generators for texts where discourse relations play an important role. To do so, we reimplement the sentence planning and realization components of a classic NLG system, Methodius, using LSTM sequence-to-sequence (seq2seq) models. We find that although seq2seq models can learn to generate fluent and grammatical texts remarkably well with sufficiently representative Methodius training data, they cannot learn to correctly express Methodius's SIMILARITY and CONTRAST comparisons unless the corresponding RST relations are included in the inputs. Additionally, we experiment with using self-training and reverse model reranking to better handle train/test data mismatches, and find that while these methods help reduce content errors, it remains essential to include discourse relations in the input to obtain optimal performance.
UR - https://www.scopus.com/pages/publications/85113829115
UR - https://www.scopus.com/pages/publications/85113829115#tab=citedBy
U2 - 10.18653/v1/2020.inlg-1.37
DO - 10.18653/v1/2020.inlg-1.37
M3 - Conference contribution
AN - SCOPUS:85113829115
T3 - INLG 2020 - 13th International Conference on Natural Language Generation, Proceedings
SP - 306
EP - 315
BT - INLG 2020 - 13th International Conference on Natural Language Generation, Proceedings
A2 - Davis, Brian
A2 - Graham, Yvette
A2 - Kelleher, John
A2 - Sripada, Yaji
PB - Association for Computational Linguistics (ACL)
Y2 - 15 December 2020 through 18 December 2020
ER -