Abstract
Learning accurate classifiers for novel categories from very few examples, known as few-shot image classification, is a challenging task in statistical machine learning and computer vision. The performance in few-shot classification suffers from the bias in the estimation of classifier parameters; however, an effective underlying bias reduction technique that could alleviate this issue in training few-shot classifiers has been overlooked. In this work, we demonstrate the effectiveness of Firth bias reduction in few-shot classification. Theoretically, Firth bias reduction removes the O(N −1) first order term from the small-sample bias of the Maximum Likelihood Estimator. Here we show that the general Firth bias reduction technique simplifies to encouraging uniform class assignment probabilities for multinomial logistic classification, and almost has the same effect in cosine classifiers. We derive an easy-to-implement optimization objective for Firth penalized multinomial logistic and cosine classifiers, which is equivalent to penalizing the cross-entropy loss with a KL-divergence between the uniform label distribution and the predictions. Then, we empirically evaluate that it is consistently effective across the board for few-shot image classification, regardless of (1) the feature representations from different backbones, (2) the number of samples per class, and (3) the number of classes. Furthermore, we demonstrate the effectiveness of Firth bias reduction on cross-domain and imbalanced data settings. Our implementation is available at https://github.com/ehsansaleh/firth_bias_reduction.
Original language | English (US) |
---|---|
Title of host publication | International Conference on Learning Representations |
State | Published - 2022 |
Fingerprint
Dive into the research topics of 'On the Importance of Firth Bias Reduction in Few-Shot Classification'. Together they form a unique fingerprint.Datasets
-
Dataset for On the Importance of Firth Bias Reduction in Few-Shot Classification
Saleh, E. (Creator), Ghaffari, S. (Creator), Forsyth, D. A. (Creator) & Wang, Y. (Creator), University of Illinois Urbana-Champaign, Apr 19 2022
DOI: 10.13012/B2IDB-1016367_V1
Dataset