TY - GEN
T1 - Learning Curves for Analysis of Deep Networks
AU - Hoiem, Derek
AU - Gupta, Tanmay
AU - Li, Zhizhong
AU - Shlapentokh-Rothman, Michal M.
N1 - Publisher Copyright:
Copyright © 2021 by the author(s)
PY - 2021
Y1 - 2021
N2 - Learning curves model a classifier's test error as a function of the number of training samples. Prior works show that learning curves can be used to select model parameters and extrapolate performance. We investigate how to use learning curves to evaluate design choices, such as pretraining, architecture, and data augmentation. We propose a method to robustly estimate learning curves, abstract their parameters into error and data-reliance, and evaluate the effectiveness of different parameterizations. Our experiments exemplify use of learning curves for analysis and yield several interesting observations.
AB - Learning curves model a classifier's test error as a function of the number of training samples. Prior works show that learning curves can be used to select model parameters and extrapolate performance. We investigate how to use learning curves to evaluate design choices, such as pretraining, architecture, and data augmentation. We propose a method to robustly estimate learning curves, abstract their parameters into error and data-reliance, and evaluate the effectiveness of different parameterizations. Our experiments exemplify use of learning curves for analysis and yield several interesting observations.
UR - http://www.scopus.com/inward/record.url?scp=85161333480&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85161333480&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85161333480
T3 - Proceedings of Machine Learning Research
SP - 4287
EP - 4296
BT - Proceedings of the 38th International Conference on Machine Learning, ICML 2021
PB - ML Research Press
T2 - 38th International Conference on Machine Learning, ICML 2021
Y2 - 18 July 2021 through 24 July 2021
ER -