A decision-tree-based symbolic rule induction system for text categorization

David E. Johnson, Frank J. Oles, Tong Zhang, Thilo Goetz

Research output: Contribution to journalArticlepeer-review

Abstract

We present a decision-tree-based symbolic rule induction system for categorizing text documents automatically. Our method for rule induction involves the novel combination of (1) a fast decision tree induction algorithm especially suited to text data and (2) a new method for converting a decision tree to a rule set that is simplified, but still logically equivalent to, the original tree. We report experimental results on the use of this system on some practical problems.

Original languageEnglish (US)
Pages (from-to)428-437
Number of pages10
JournalIBM Systems Journal
Volume41
Issue number3
DOIs
StatePublished - 2002
Externally publishedYes

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • General Computer Science
  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'A decision-tree-based symbolic rule induction system for text categorization'. Together they form a unique fingerprint.

Cite this