Co-selection of features and instances for unsupervised rare category analysis

Jingrui He, Jaime Carbonell

Research output: Contribution to conferencePaper

Abstract

Rare category analysis is of key importance both in theory and in practice. Previous research work focuses on supervised rare category analysis, such as rare category detection and rare category classification. In this paper, for the first time, we address the challenge of unsupervised rare category analysis, including feature selection and rare category selection. We propose to jointly deal with the two correlated tasks so that they can benefit from each other. To this end, we design an optimization framework which is able to co-select the relevant features and the examples from the rare category (a.k.a. the minority class). It is well justified theoretically. Furthermore, we develop the Partial Augmented Lagrangian Method (PALM) to solve the optimization problem. Experimental results on both synthetic and real data sets show the effectiveness of the proposed method.

Original languageEnglish (US)
Pages525-536
Number of pages12
DOIs
StatePublished - Jan 1 2010
Event10th SIAM International Conference on Data Mining, SDM 2010 - Columbus, OH, United States
Duration: Apr 29 2010May 1 2010

Conference

Conference10th SIAM International Conference on Data Mining, SDM 2010
CountryUnited States
CityColumbus, OH
Period4/29/105/1/10

ASJC Scopus subject areas

  • Software

Fingerprint Dive into the research topics of 'Co-selection of features and instances for unsupervised rare category analysis'. Together they form a unique fingerprint.

  • Cite this

    He, J., & Carbonell, J. (2010). Co-selection of features and instances for unsupervised rare category analysis. 525-536. Paper presented at 10th SIAM International Conference on Data Mining, SDM 2010, Columbus, OH, United States. https://doi.org/10.1137/1.9781611972801.46