Journal of Modern Power Systems and Clean Energy

ISSN 2196-5625 CN 32-1884/TK

Approximating Nash Equilibrium in Day-ahead Electricity Market Bidding with Multi-agent Deep Reinforcement Learning
Affiliation:

1.University of Tennessee, Knoxville, USA;2.Oak Ridge National Laboratory, Oak Ridge, USA

Fund Project:

This work was supported in part by the US Department of Energy (DOE), Office of Electricity and Office of Energy Efficiency and Renewable Energy under contract DE-AC05-00OR22725, in part by CURENT, an Engineering Research Center funded by US National Science Foundation (NSF) and DOE under NSF award EEC-1041877, and in part by NSF award ECCS-1809458.

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
    Abstract:

    In this paper, a day-ahead electricity market bidding problem with multiple strategic generation company (GENCO) bidders is studied. The problem is formulated as a Markov game model, where GENCO bidders interact with each other to develop their optimal day-ahead bidding strategies. Considering unobservable information in the problem, a model-free and data-driven approach, known as multi-agent deep deterministic policy gradient (MADDPG), is applied for approximating the Nash equilibrium (NE) in the above Markov game. The MADDPG algorithm has the advantage of generalization due to the automatic feature extraction ability of the deep neural networks. The algorithm is tested on an IEEE 30-bus system with three competitive GENCO bidders in both an uncongested case and a congested case. Comparisons with a truthful bidding strategy and state-of-the-art deep reinforcement learning methods including deep Q network and deep deterministic policy gradient (DDPG) demonstrate that the applied MADDPG algorithm can find a superior bidding strategy for all the market participants with increased profit gains. In addition, the comparison with a conventional-model-based method shows that the MADDPG algorithm has higher computational efficiency, which is feasible for real-world applications.

    表 6 Table 6
    表 5 Table 5
    图1 多通道烘缸结构示意图Fig.1 Multi-channel cylinder dryer structure
    图2 实验装置示意图Fig.2 Experimental setup
    图3 测试段结构示意图Fig.3 Test section
    图5 凝结换热系数随G的变化Fig.5 Variation of condensation heat transfer coefficient with steam mass flux (G)
    图6 温差与热流密度随G的变化Fig.6 Variation of temperature difference and heat flux with steam mass flux (G)
    图7 凝结换热系数随Rec的变化Fig.7 Variation trend of condensation heat transfer coefficient with Rec of cooling water
    图8 温差与热流密度随Rec的变化Fig.8 Variation of temperature difference and heat flux with Rec of cooling water
    图1 MADDPG algorithm for solving Markov game in day-ahead electricity electricity market bidding.Fig.1
    图2 Topology of IEEE 30-bus system.Fig.2
    图3 Load profiles of training days and test days in June and July of year 2019.Fig.3
    图4 Convergence of MADDPG in uncongested case. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.4
    图5 Bidding strategy of three GENCOs with MADDPG in uncongested case. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.5
    图6 Convergence of MADDPG in congested case. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.6
    图7 Bidding strategy of three GENCOs with MADDPG in congested case. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.7
    图8 Hourly market clearing price with MADDPG bidding strategy. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.8
    图9 Comparison of learning performance of different RL methods. (a) GENCO 1. (b) GENCO 2. (c) GENCO 3.Fig.9
    表 4 Table 4
    表 2 Table 2
    表 1 Table 1
    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:July 21,2020
  • Online: May 19,2021