TY - GEN
T1 - Approximate dynamic programming using fluid and diffusion approximations with applications to power management
AU - Chen, Wei
AU - Huang, Dayu
AU - Kulkarni, Ankur A.
AU - Unnikrishnan, Jayakrishnan
AU - Zhu, Quanyan
AU - Mehta, Prashant
AU - Meyn, Sean
AU - Wierman, Adam
PY - 2009
Y1 - 2009
N2 - TD learning and its refinements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only within a prescribed finite-dimensional function class. Thus, the question that always arises is how should the function class be chosen? The goal of this paper is to propose an approach for TD learning based on choosing the function class using the solutions to associated fluid and diffusion approximations. In order to illustrate this new approach, the paper focuses on an application to dynamic speed scaling for power management.
AB - TD learning and its refinements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only within a prescribed finite-dimensional function class. Thus, the question that always arises is how should the function class be chosen? The goal of this paper is to propose an approach for TD learning based on choosing the function class using the solutions to associated fluid and diffusion approximations. In order to illustrate this new approach, the paper focuses on an application to dynamic speed scaling for power management.
UR - http://www.scopus.com/inward/record.url?scp=77950814003&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77950814003&partnerID=8YFLogxK
U2 - 10.1109/CDC.2009.5399685
DO - 10.1109/CDC.2009.5399685
M3 - Conference contribution
AN - SCOPUS:77950814003
SN - 9781424438716
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 3575
EP - 3580
BT - Proceedings of the 48th IEEE Conference on Decision and Control held jointly with 2009 28th Chinese Control Conference, CDC/CCC 2009
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 48th IEEE Conference on Decision and Control held jointly with 2009 28th Chinese Control Conference, CDC/CCC 2009
Y2 - 15 December 2009 through 18 December 2009
ER -