TY - GEN
T1 - Discovering compact and highly discriminative features or combinations of drug activities using support vector machines
AU - Yu, H.
AU - Yang, J.
AU - Wang, W.
AU - Han, J.
N1 - Publisher Copyright:
© 2003 IEEE.
PY - 2003
Y1 - 2003
N2 - Nowadays, high throughput experimental techniques make it feasible to examine and collect massive data at the molecular level. These data, typically mapped to a very high dimensional feature space, carry rich information about functionalities of certain chemical or biological entities and can be used to infer valuable knowledge for the purposes of classification and prediction. Typically, a small number of features or feature combinations may play determinant roles in functional discrimination. The identification of such features or feature combinations is of great importance. In this paper, we study the problem of discovering compact and highly discriminative features or feature combinations from a rich feature collection. We employ the support vector machine as the classification means and aim at finding compact feature combinations. Comparing to previous methods on feature selection, which identify features solely based on their individual roles in the classification, our method is able to identify minimal feature combinations that ultimately have determinant roles in a systematic fashion. Experimental study on drug activity data shows that our method can discover descriptors that are not necessarily significant individually but are most significant collectively.
AB - Nowadays, high throughput experimental techniques make it feasible to examine and collect massive data at the molecular level. These data, typically mapped to a very high dimensional feature space, carry rich information about functionalities of certain chemical or biological entities and can be used to infer valuable knowledge for the purposes of classification and prediction. Typically, a small number of features or feature combinations may play determinant roles in functional discrimination. The identification of such features or feature combinations is of great importance. In this paper, we study the problem of discovering compact and highly discriminative features or feature combinations from a rich feature collection. We employ the support vector machine as the classification means and aim at finding compact feature combinations. Comparing to previous methods on feature selection, which identify features solely based on their individual roles in the classification, our method is able to identify minimal feature combinations that ultimately have determinant roles in a systematic fashion. Experimental study on drug activity data shows that our method can discover descriptors that are not necessarily significant individually but are most significant collectively.
KW - Drug Activity
KW - Feature Selection
KW - Support Vector Machine
UR - https://www.scopus.com/pages/publications/44749083335
UR - https://www.scopus.com/pages/publications/44749083335#tab=citedBy
U2 - 10.1109/CSB.2003.1227321
DO - 10.1109/CSB.2003.1227321
M3 - Conference contribution
C2 - 16452796
AN - SCOPUS:44749083335
T3 - Proceedings of the 2003 IEEE Bioinformatics Conference, CSB 2003
SP - 220
EP - 228
BT - Proceedings of the 2003 IEEE Bioinformatics Conference, CSB 2003
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International IEEE Computer Society Computational Systems Bioinformatics Conference, CSB 2003
Y2 - 11 August 2003 through 14 August 2003
ER -