TY - GEN
T1 - Selective Labeling in Learning with Expert Advice
AU - Truong, Anh
AU - Etesami, S. Rasoul
AU - Kiyavash, Negar
N1 - Funding Information:
Anh Truong is with Capital One. This work was done at Department of Industrial and Systems Engineering, University of Illinois at Urbana-Champaign, Illinois 61820, USA (ducanhtt@gmail.com). S. Rasoul Etesami is with Department of Industrial and Systems Engineering, University of Illinois at Urbana-Champaign, Illinois, 61820, USA (etesami1@illinois.edu). Negar Kiyavash with École Polytechnique Fédérale de Lausanne, College of Management of Technology, Switzerland (negar.kiyavash@epfl.ch). This work is supported by the NSF under Grant No. EPCN-1944403.
Publisher Copyright:
© 2021 American Automatic Control Council.
PY - 2021/5/25
Y1 - 2021/5/25
N2 - An online active learning mechanism using the expert advice framework is considered where the goal is to learn the correct labels of a sequence of revealed items. The learning scheme's efficiency is measured in terms of the regret bound and reduced data labeling queries based on experts' predictions. Two efficient randomized algorithms EPSL and EPAL are proposed in which the opinion ranges of experts are examined in order to decide whether to acquire a label from users for a given instance. It is shown that both algorithms obtain nearly optimal regret bounds and up to a constant factor depending on the characteristics of experts' predictions. While EPSL yields a better regret bound than EPAL, it requires extra prior knowledge of experts' predictions. Relaxing this assumption, EPAL provides a more practical scheme by implying an adaptive time-varying learning rate whose regret is at worst \sqrt{2} times of that for EPSL. Experimental results justify the outperformance of the proposed algorithms compared to the existing ones in this setting.
AB - An online active learning mechanism using the expert advice framework is considered where the goal is to learn the correct labels of a sequence of revealed items. The learning scheme's efficiency is measured in terms of the regret bound and reduced data labeling queries based on experts' predictions. Two efficient randomized algorithms EPSL and EPAL are proposed in which the opinion ranges of experts are examined in order to decide whether to acquire a label from users for a given instance. It is shown that both algorithms obtain nearly optimal regret bounds and up to a constant factor depending on the characteristics of experts' predictions. While EPSL yields a better regret bound than EPAL, it requires extra prior knowledge of experts' predictions. Relaxing this assumption, EPAL provides a more practical scheme by implying an adaptive time-varying learning rate whose regret is at worst \sqrt{2} times of that for EPSL. Experimental results justify the outperformance of the proposed algorithms compared to the existing ones in this setting.
UR - http://www.scopus.com/inward/record.url?scp=85111901717&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85111901717&partnerID=8YFLogxK
U2 - 10.23919/ACC50511.2021.9482705
DO - 10.23919/ACC50511.2021.9482705
M3 - Conference contribution
AN - SCOPUS:85111901717
T3 - Proceedings of the American Control Conference
SP - 2537
EP - 2542
BT - 2021 American Control Conference, ACC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 American Control Conference, ACC 2021
Y2 - 25 May 2021 through 28 May 2021
ER -