Robust multivariate mixture regression models with incomplete data

Hwa Kyung Lim, Naveen N. Narisetty, Sooyoung Cheon

Research output: Contribution to journalArticlepeer-review

Abstract

Multivariate mixture regression models can be used to investigate the relationships between two or more response variables and a set of predictor variables by taking into consideration unobserved population heterogeneity. It is common to take multivariate normal distributions as mixing components, but this mixing model is sensitive to heavy-tailed errors and outliers. Although normal mixture models can approximate any distribution in principle, the number of components needed to account for heavy-tailed distributions can be very large. Mixture regression models based on the multivariate t distributions can be considered as a robust alternative approach. Missing data are inevitable in many situations and parameter estimates could be biased if the missing values are not handled properly. In this paper, we propose a multivariate t mixture regression model with missing information to model heterogeneity in regression function in the presence of outliers and missing values. Along with the robust parameter estimation, our proposed method can be used for (i) visualization of the partial correlation between response variables across latent classes and heterogeneous regressions, and (ii) outlier detection and robust clustering even under the presence of missing values. We also propose a multivariate t mixture regression model using MM-estimation with missing information that is robust to high-leverage outliers. The proposed methodologies are illustrated through simulation studies and real data analysis.

Original languageEnglish (US)
Pages (from-to)328-347
Number of pages20
JournalJournal of Statistical Computation and Simulation
Volume87
Issue number2
DOIs
StatePublished - Jan 22 2017

Keywords

  • Incomplete data
  • multivariate mixture regression models
  • multivariate t-distribution
  • outlier
  • robust clustering

ASJC Scopus subject areas

  • Statistics and Probability
  • Modeling and Simulation
  • Statistics, Probability and Uncertainty
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Robust multivariate mixture regression models with incomplete data'. Together they form a unique fingerprint.

Cite this