Creating a CCGbank and a wide-coverage CCG lexicon for German

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We present an algorithm which creates a German CCGbank by translating the syntax graphs in the German Tiger corpus into CCG derivation trees. The resulting corpus contains 46,628 derivations, covering 95% of all complete sentences in Tiger. Lexicons extracted from this corpus contain correct lexical entries for 94% of all known tokens in unseen text.

Original language: English (US)
Title of host publication: COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Publisher: Association for Computational Linguistics (ACL)
Pages: 505-512
Number of pages: 8
ISBN (Print): 1932432655, 9781932432657
State: Published - 2006
Externally published: Yes
Event: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, COLING/ACL 2006 - Sydney, NSW, Australia
Duration: Jul 17 2006 → Jul 21 2006

Publication series

Name: COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Volume: 1

Other

Other: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, COLING/ACL 2006
Country/Territory: Australia
City: Sydney, NSW
Period: 7/17/06 → 7/21/06

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
