TY - GEN
T1 - Optimal classification with multivariate losses
AU - Natarajan, Nagarajan
AU - Koyejo, Oluwasanmi
AU - Ravikumar, Pradeep
AU - Dhillon, Inderjit S.
PY - 2016
Y1 - 2016
N2 - Multivariate loss functions are extensively employed in several prediction tasks arising in Information Retrieval. Often, the goal in the tasks is to minimize expected loss when retrieving relevant items from a presented set of items, where the expectation is with respect to the joint distribution over item sets. Our key result is that for most multivariate losses, the expected loss is provably optimized by sorting the items by the conditional probability of label being positive and then selecting top k items. Such a result was previously known only for the F-measure. Leveraging on the optimality characterization, we give an algorithm for estimating optimal predictions in prac-tice with runtime quadratic in size of item sets for many losses. We provide empirical results on benchmark datasets, comparing the proposed algorithm to state-of-the-art methods for optimiz-ing multivariate losses.
AB - Multivariate loss functions are extensively employed in several prediction tasks arising in Information Retrieval. Often, the goal in the tasks is to minimize expected loss when retrieving relevant items from a presented set of items, where the expectation is with respect to the joint distribution over item sets. Our key result is that for most multivariate losses, the expected loss is provably optimized by sorting the items by the conditional probability of label being positive and then selecting top k items. Such a result was previously known only for the F-measure. Leveraging on the optimality characterization, we give an algorithm for estimating optimal predictions in prac-tice with runtime quadratic in size of item sets for many losses. We provide empirical results on benchmark datasets, comparing the proposed algorithm to state-of-the-art methods for optimiz-ing multivariate losses.
UR - http://www.scopus.com/inward/record.url?scp=84998887141&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84998887141&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84998887141
T3 - 33rd International Conference on Machine Learning, ICML 2016
SP - 2283
EP - 2295
BT - 33rd International Conference on Machine Learning, ICML 2016
A2 - Weinberger, Kilian Q.
A2 - Balcan, Maria Florina
PB - International Machine Learning Society (IMLS)
T2 - 33rd International Conference on Machine Learning, ICML 2016
Y2 - 19 June 2016 through 24 June 2016
ER -