Terminology extraction for global content management

Arendse Bernth, Michael McCord, Kara Warburton

Research output: Contribution to journal › Article › peer-review


The role of terminology in content management has often been underrated. Term extraction has been identified by the information industry as an area requiring focus. Term extraction benefits both the content authoring and the translation process. Supplying key product terms to translation services several weeks before the actual translation begins reduces translation time, improves translation quality, and saves effort (and thus money) by reducing duplication of work. Getting the key terms ready in a timely manner can be difficult without some automation. This paper describes the process of proposing, designing, developing, and deploying a terminology extraction tool. The tool extracts nouns and noun groups, excludes non-translatable terms and known product terms, and displays a context for each extracted item. This is done based on full parsing of the text with a broad-coverage parser. The tool is made available to users on a Web server.
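The extract, filter, and show-context steps described in the abstract can be illustrated with a small sketch. This is not the authors' tool: the paper relies on full parsing with a broad-coverage parser, whereas the sketch below substitutes a crude stopword heuristic to delimit candidate noun groups. The function name `extract_candidates`, the `STOPWORDS` set, and the sample product text are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical stand-in for a parser: function words and a few common verbs
# that end a candidate noun group. A real system would use syntactic analysis.
STOPWORDS = {
    "the", "a", "an", "of", "to", "in", "on", "and", "or", "is", "are",
    "for", "with", "then", "click", "open", "select",
}


def extract_candidates(text, known_terms=frozenset()):
    """Map each candidate term to the first sentence in which it occurs.

    Terms listed in known_terms (for example, already-managed product
    terms) are excluded from the result.
    """
    results = {}
    # Naive sentence split on whitespace that follows ., ! or ?
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        words = re.findall(r"[A-Za-z][A-Za-z-]*", sentence)
        group = []
        # The empty-string sentinel flushes the last group of the sentence.
        for word in words + [""]:
            if word and word.lower() not in STOPWORDS:
                group.append(word.lower())
                continue
            if group:
                term = " ".join(group)
                if term not in known_terms and term not in results:
                    results[term] = sentence  # keep the context sentence
                group = []
    return results


if __name__ == "__main__":
    sample = "Open the print queue. Select the default printer driver."
    for term, context in extract_candidates(sample, {"print queue"}).items():
        print(f"{term}\t{context}")
```

A heuristic like this over-generates badly on real text (it cannot tell nouns from adjectives or verbs outside its stopword list), which is one reason a parser-based approach such as the one the paper describes is attractive.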

Original language: English (US)
Pages (from-to): 51-69
Number of pages: 19
Issue number: 1
State: Published - 2003
Externally published: Yes


  • Computational-linguistic tools
  • Localization
  • Multiword terms
  • Term recognition
  • Terminology extraction
  • Terminology management

ASJC Scopus subject areas

  • Language and Linguistics
  • Communication
  • Library and Information Sciences

