Automated machine learning in insurance

Panyi Dong, Zhiyu Quan

Research output: Contribution to journalArticlepeer-review

Abstract

Machine Learning (ML) has gained popularity in actuarial research and insurance industrial applications. However, the performance of most ML tasks heavily depends on data preprocessing, model selection, and hyperparameter optimization, which are considered to be intensive in terms of domain knowledge, experience, and manual labor. Automated Machine Learning (AutoML) aims to automatically complete the full life-cycle of ML tasks and provides state-of-the-art ML models without human intervention or supervision. This paper introduces an AutoML workflow that allows users without domain knowledge or prior experience to achieve robust and effortless ML deployment by writing only a few lines of code. This proposed AutoML is specifically tailored for the insurance application, with features like the balancing step in data preprocessing, ensemble pipelines, and customized loss functions. These features are designed to address the unique challenges of the insurance domain, including the imbalanced nature of common insurance datasets. The full code and documentation are available on the GitHub repository.1

Original languageEnglish (US)
Pages (from-to)17-41
Number of pages25
JournalInsurance: Mathematics and Economics
Volume120
DOIs
StatePublished - Jan 2025

Keywords

  • AI education
  • AutoML
  • Imbalance learning
  • Insurance data analytics

ASJC Scopus subject areas

  • Statistics and Probability
  • Economics and Econometrics
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Automated machine learning in insurance'. Together they form a unique fingerprint.

Cite this