TY - JOUR
T1 - Relating chromatographic data to measurements of wheat quality
T2 - Case studies in dimension reduction
AU - Simpson, D. G.
AU - Guo, S.
AU - Sacks, J.
AU - Bietz, J. A.
AU - Huebner, F.
AU - Nelsen, T.
N1 - Funding Information:
The authors thank J. Alho, H.D. Petersen and the referees for helpful comments and additional references.T his research was supported by United States Department of Agriculture Contract USDA-58-5114-8-1026, National Security Agency Grant NSA-MDA904-89-H-2011, and Air Force Office of Scientific Research Grant AFOSR-87-0041.
PY - 1991/2
Y1 - 1991/2
N2 - Fractionating wheat proteins by reversed-phase high-performance liquid chromatography yields extremely complex chromatograms. The data they contain may relate to many characteristics of milled wheat such as the volume of a loaf of bread or the texture of the dough produced, but such relationships are not readily apparent from the raw data. We report our experiences with two dimension-reduction techniques that are widely cited in the chemometrics literature: principal component analysis and partial least squares (PLS). Each of these methods replaces the original observation vectors by weighted averages of their components, where the weights are selected according to a data-dependent criterion. The analysis proceeds by operating on these weighted averages rather than the original, high-dimensional data. In order to elucidate properties of significance tests and other inferences, we focus on the special case where only one factor is selected. We show how to use simulation to compute the appropriate significance level of the regression on the PLS scores. The common technique of using the F distribution to compute significance levels for PLS regression can be an extremely liberal procedure. The interpretation of PLS weights requires considerable care.
AB - Fractionating wheat proteins by reversed-phase high-performance liquid chromatography yields extremely complex chromatograms. The data they contain may relate to many characteristics of milled wheat such as the volume of a loaf of bread or the texture of the dough produced, but such relationships are not readily apparent from the raw data. We report our experiences with two dimension-reduction techniques that are widely cited in the chemometrics literature: principal component analysis and partial least squares (PLS). Each of these methods replaces the original observation vectors by weighted averages of their components, where the weights are selected according to a data-dependent criterion. The analysis proceeds by operating on these weighted averages rather than the original, high-dimensional data. In order to elucidate properties of significance tests and other inferences, we focus on the special case where only one factor is selected. We show how to use simulation to compute the appropriate significance level of the regression on the PLS scores. The common technique of using the F distribution to compute significance levels for PLS regression can be an extremely liberal procedure. The interpretation of PLS weights requires considerable care.
UR - http://www.scopus.com/inward/record.url?scp=0026032120&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0026032120&partnerID=8YFLogxK
U2 - 10.1016/0169-7439(91)80045-R
DO - 10.1016/0169-7439(91)80045-R
M3 - Article
AN - SCOPUS:0026032120
SN - 0169-7439
VL - 10
SP - 155
EP - 167
JO - Chemometrics and Intelligent Laboratory Systems
JF - Chemometrics and Intelligent Laboratory Systems
IS - 1-2
ER -