### Abstract

In this paper, the problem of the smart grid energy management under stochastic dynamics is investigated. In the considered model, at the demand side, it is assumed that customers can act as prosumers who own renewable energy sources and can both produce and consume energy. Due to the coupling between the prosumers' decisions and the stochastic nature of renewable energy, the interaction among prosumers is formulated as a stochastic game, in which each prosumer seeks to maximize its payoff, in terms of revenues, by controlling its energy consumption and demand. In particular, the subjective behavior of prosumers is explicitly reflected into their payoff functions using the prospect theory, a powerful framework that allows modeling real-life human choices, rather than objective, user-agnostic decisions, as normative models do. For this prospect-based stochastic game, it is shown that there always exists a stationary Nash equilibrium where the prosumers' trading policies in the equilibrium are independent of the time and their histories of the play. Moreover, to obtain one of such equilibrium policies, a novel distributed algorithm with no information sharing among prosumers is proposed and shown to converge to an \epsilon-Nash equilibrium in which each prosumer is able to achieve its optimal payoff in an equilibrium up to a small additive error \epsilon. On the other hand, at the supply side, the interaction between the utility company and the prosumers is formulated as an online optimization problem in which the utility company's goal is to learn its optimal energy allocation rules. For this case, it is shown that such an optimization problem admits a no-regret algorithm meaning that regardless of the actual outcome of the game among the prosumers, the utility company can follow a strategy that mitigates its allocation costs as if it knew the entire demand market a priori. Simulation results justify the convergence of the proposed algorithms and present new insights toward more efficient energy management in the smart grids.

Original language | English (US) |
---|---|

Pages (from-to) | 2327-2342 |

Number of pages | 16 |

Journal | IEEE Transactions on Automatic Control |

Volume | 63 |

Issue number | 8 |

DOIs | |

State | Published - Aug 2018 |

### Fingerprint

### Keywords

- Distributed learning
- energy management
- prospect theory
- smart grid
- stationary Nash equilibrium
- stochastic game

### ASJC Scopus subject areas

- Control and Systems Engineering
- Computer Science Applications
- Electrical and Electronic Engineering

### Cite this

*IEEE Transactions on Automatic Control*,

*63*(8), 2327-2342. https://doi.org/10.1109/TAC.2018.2797217

**Stochastic games for the smart grid energy management with prospect prosumers.** / Etesami, Seyed Rasoul; Saad, Walid; Mandayam, Narayan B.; Poor, H. Vincent.

Research output: Contribution to journal › Article

*IEEE Transactions on Automatic Control*, vol. 63, no. 8, pp. 2327-2342. https://doi.org/10.1109/TAC.2018.2797217

}

TY - JOUR

T1 - Stochastic games for the smart grid energy management with prospect prosumers

AU - Etesami, Seyed Rasoul

AU - Saad, Walid

AU - Mandayam, Narayan B.

AU - Poor, H. Vincent

PY - 2018/8

Y1 - 2018/8

N2 - In this paper, the problem of the smart grid energy management under stochastic dynamics is investigated. In the considered model, at the demand side, it is assumed that customers can act as prosumers who own renewable energy sources and can both produce and consume energy. Due to the coupling between the prosumers' decisions and the stochastic nature of renewable energy, the interaction among prosumers is formulated as a stochastic game, in which each prosumer seeks to maximize its payoff, in terms of revenues, by controlling its energy consumption and demand. In particular, the subjective behavior of prosumers is explicitly reflected into their payoff functions using the prospect theory, a powerful framework that allows modeling real-life human choices, rather than objective, user-agnostic decisions, as normative models do. For this prospect-based stochastic game, it is shown that there always exists a stationary Nash equilibrium where the prosumers' trading policies in the equilibrium are independent of the time and their histories of the play. Moreover, to obtain one of such equilibrium policies, a novel distributed algorithm with no information sharing among prosumers is proposed and shown to converge to an \epsilon-Nash equilibrium in which each prosumer is able to achieve its optimal payoff in an equilibrium up to a small additive error \epsilon. On the other hand, at the supply side, the interaction between the utility company and the prosumers is formulated as an online optimization problem in which the utility company's goal is to learn its optimal energy allocation rules. For this case, it is shown that such an optimization problem admits a no-regret algorithm meaning that regardless of the actual outcome of the game among the prosumers, the utility company can follow a strategy that mitigates its allocation costs as if it knew the entire demand market a priori. Simulation results justify the convergence of the proposed algorithms and present new insights toward more efficient energy management in the smart grids.

AB - In this paper, the problem of the smart grid energy management under stochastic dynamics is investigated. In the considered model, at the demand side, it is assumed that customers can act as prosumers who own renewable energy sources and can both produce and consume energy. Due to the coupling between the prosumers' decisions and the stochastic nature of renewable energy, the interaction among prosumers is formulated as a stochastic game, in which each prosumer seeks to maximize its payoff, in terms of revenues, by controlling its energy consumption and demand. In particular, the subjective behavior of prosumers is explicitly reflected into their payoff functions using the prospect theory, a powerful framework that allows modeling real-life human choices, rather than objective, user-agnostic decisions, as normative models do. For this prospect-based stochastic game, it is shown that there always exists a stationary Nash equilibrium where the prosumers' trading policies in the equilibrium are independent of the time and their histories of the play. Moreover, to obtain one of such equilibrium policies, a novel distributed algorithm with no information sharing among prosumers is proposed and shown to converge to an \epsilon-Nash equilibrium in which each prosumer is able to achieve its optimal payoff in an equilibrium up to a small additive error \epsilon. On the other hand, at the supply side, the interaction between the utility company and the prosumers is formulated as an online optimization problem in which the utility company's goal is to learn its optimal energy allocation rules. For this case, it is shown that such an optimization problem admits a no-regret algorithm meaning that regardless of the actual outcome of the game among the prosumers, the utility company can follow a strategy that mitigates its allocation costs as if it knew the entire demand market a priori. Simulation results justify the convergence of the proposed algorithms and present new insights toward more efficient energy management in the smart grids.

KW - Distributed learning

KW - energy management

KW - prospect theory

KW - smart grid

KW - stationary Nash equilibrium

KW - stochastic game

UR - http://www.scopus.com/inward/record.url?scp=85040985844&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85040985844&partnerID=8YFLogxK

U2 - 10.1109/TAC.2018.2797217

DO - 10.1109/TAC.2018.2797217

M3 - Article

AN - SCOPUS:85040985844

VL - 63

SP - 2327

EP - 2342

JO - IEEE Transactions on Automatic Control

JF - IEEE Transactions on Automatic Control

SN - 0018-9286

IS - 8

ER -