Abstract
Rare category analysis is of key importance both in theory and in practice. Previous research work focuses on supervised rare category analysis, such as rare category detection and rare category classification. In this paper, for the first time, we address the challenge of unsupervised rare category analysis, including feature selection and rare category selection. We propose to jointly deal with the two correlated tasks so that they can benefit from each other. To this end, we design an optimization framework which is able to co-select the relevant features and the examples from the rare category (a.k.a. the minority class). It is well justified theoretically. Furthermore, we develop the Partial Augmented Lagrangian Method (PALM) to solve the optimization problem. Experimental results on both synthetic and real data sets show the effectiveness of the proposed method.
Original language | English (US) |
---|---|
Pages | 525-536 |
Number of pages | 12 |
DOIs | |
State | Published - 2010 |
Externally published | Yes |
Event | 10th SIAM International Conference on Data Mining, SDM 2010 - Columbus, OH, United States Duration: Apr 29 2010 → May 1 2010 |
Conference
Conference | 10th SIAM International Conference on Data Mining, SDM 2010 |
---|---|
Country/Territory | United States |
City | Columbus, OH |
Period | 4/29/10 → 5/1/10 |
ASJC Scopus subject areas
- Software