TY - JOUR
T1 - A genetic-algorithm-based selective principal component analysis (GA-SPCA) method for high-dimensional data feature extraction
AU - Yao, Haibo
AU - Tian, Lei
N1 - Funding Information:
Manuscript received May 17, 2002; revised February 8, 2003. This research has been supported by the Illinois Council of Food and Agricultural Research (C-FAR) Project IDACF 01-DS-3-1 AE and by the University of Illinois. The authors are with the Department of Agricultural Engineering, University of Illinois at Urbana–Champaign, Urbana, IL 61801 USA (e-mail: haiboyao@uiuc.edu; lei-tian@uiuc.edu). Digital Object Identifier 10.1109/TGRS.2003.811691
PY - 2003/6
Y1 - 2003/6
AB - In this paper, a genetic-algorithm-based selective principal component analysis (GA-SPCA) method is proposed and tested using hyperspectral remote sensing data and ground reference data collected within an agricultural field. The proposed method uses a global optimizer, a genetic algorithm, to select a subset of the original image bands, which first reduces the data dimension. A principal component transformation is subsequently applied to the selected bands. By extracting features from the resulting eigenimage, the remote sensing data, originally high in dimension, are further reduced to a feature space with one to several principal component bands. Subsequent image processing on the reduced feature space can thus be performed with improved accuracy. Experiments were conducted using three sets of ground reference data: corn chlorophyll content, corn plant population, and various corn hybrids. The results showed that with GA-SPCA, the number of original bands used for principal component analysis (PCA) could be reduced from the 60 bands of the hyperspectral image to 17, 26, and 25, respectively. In all cases, the correlation coefficients between image and ground reference data were greater with GA-SPCA than with PCA on all original bands. This indicates that bands with no contribution to a specific application were removed prior to PCA. The variance related to a specific application within the image was transformed with more emphasis by using bands sensitive to that application. The selected bands can also provide useful information for future imaging sensor development.
KW - Feature extraction
KW - Genetic algorithm
KW - Hyper-spectral image
KW - Selective principal component analysis
KW - Supervised dimension reduction
UR - http://www.scopus.com/inward/record.url?scp=0041380888&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0041380888&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2003.811691
DO - 10.1109/TGRS.2003.811691
M3 - Article
AN - SCOPUS:0041380888
SN - 0196-2892
VL - 41
SP - 1469
EP - 1478
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
IS - 6 PART I
ER -