Abstract
The transformation that natural language parsing has undergone since the nineties would have been impossible without the availability of syntactically annotated corpora such as the Penn Treebank and similar resources for other languages. By now, it has become increasingly difficult to increase parsing accuracy on our standard data sets. But as we move to other domains of text, or aim to recover richer representations that are required for natural language understanding, it is also clear that parsing is far from being a solved task. In this panel, I would like to initiate a discussion about the kind of language resources needed to advance natural language parsing. I will also reflect on what the translation of existing resources into other grammatical representations has taught us about treebank design.
Original language | English (US) |
---|---|
Title of host publication | PACLIC 24 - Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation |
Pages | 13 |
Number of pages | 1 |
State | Published - 2010 |
Event | 24th Pacific Asia Conference on Language, Information and Computation, PACLIC 24 - Sendai, Japan Duration: Nov 4 2010 → Nov 7 2010 |
Other
Other | 24th Pacific Asia Conference on Language, Information and Computation, PACLIC 24 |
---|---|
Country/Territory | Japan |
City | Sendai |
Period | 11/4/10 → 11/7/10 |
ASJC Scopus subject areas
- Language and Linguistics
- Computer Science (miscellaneous)