Modeling the Complexity and Descriptive Adequacy of Construction Grammars

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper uses the Minimum Description Length paradigm to model the complexity of Construction Grammars (CxGs), operationalized as the encoding size of a grammar, alongside their descriptive adequacy, operationalized as the encoding size of a corpus given a grammar. These two quantities are combined to measure the quality of potential CxGs against unannotated corpora, supporting discovery-device CxGs for English, Spanish, French, German, and Italian. The results show (i) that these grammars provide significant generalizations as measured using compression and (ii) that more complex CxGs with access to multiple levels of representation provide greater generalizations than single-representation CxGs.
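The two-part measure described in the abstract can be illustrated with a minimal sketch. The code below is not the paper's encoding scheme, which this record does not specify. It assumes a toy uniform code in which constructions are fixed token sequences, the corpus is parsed greedily by longest match, and every grammar slot or literal token costs log2 of the vocabulary size in bits. The function names `grammar_bits`, `corpus_bits`, and `mdl` are hypothetical, and the multiple levels of representation mentioned in the abstract are not modeled.

```python
import math

def grammar_bits(grammar, vocab_size):
    """L(G): bits needed to write down every slot of every construction."""
    return sum(len(cx) for cx in grammar) * math.log2(vocab_size)

def corpus_bits(corpus, grammar, vocab_size):
    """L(D|G): bits needed to encode the corpus given the grammar."""
    # One code per construction plus one escape code (assumed uniform).
    pointer = math.log2(len(grammar) + 1)
    # An unmatched token costs the escape code plus a raw vocabulary index.
    literal = pointer + math.log2(vocab_size)
    by_length = sorted(grammar, key=len, reverse=True)  # prefer longest match
    total = 0.0
    for sentence in corpus:
        i = 0
        while i < len(sentence):
            match = next(
                (cx for cx in by_length
                 if cx and tuple(sentence[i:i + len(cx)]) == cx),
                None,
            )
            if match is not None:
                total += pointer
                i += len(match)
            else:
                total += literal
                i += 1
    return total

def mdl(corpus, grammar):
    """Total description length: L(G) + L(D|G). Lower is better."""
    vocab_size = max(1, len({tok for sent in corpus for tok in sent}))
    return (grammar_bits(grammar, vocab_size)
            + corpus_bits(corpus, grammar, vocab_size))

# Toy comparison: a grammar with one recurring construction vs. no grammar.
corpus = [["give", "me", "the", "book"], ["give", "me", "the", "pen"]] * 4
with_cx = [("give", "me", "the")]
print(mdl(corpus, with_cx) < mdl(corpus, []))
```

Under these assumptions, the grammar with the recurring construction costs a few bits to state but shortens the corpus encoding by more, so its total is lower. That trade-off between grammar size and corpus size given the grammar is the one the abstract describes.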
Original language: English (US)
Title of host publication: Proceedings of the Society for Computation in Linguistics (SCiL) 2018
Editors: Gaja Jarosz, Brendan O'Connor, Joe Pater
Publisher: Society for Computation in Linguistics
Pages: 81-90
State: Published - 2018
Externally published: Yes

Publication series

Name: Proceedings of the Society for Computation in Linguistics
ISSN (Print): 2834-1007

Keywords

  • construction grammar
  • discovery-device grammar
  • minimum description length
  • grammar induction
  • cxg
