Optimal Control of Microgrids with Multi-stage Mixed-integer Nonlinear Programming Guided Q-learning Algorithm

doi:10.35833/MPCE.2020.000506

Home > Archive>Volume 8, Issue 6, 2020 >1151-1159. DOI:10.35833/MPCE.2020.000506

PDF HTML Export

Optimal Control of Microgrids with Multi-stage Mixed-integer Nonlinear Programming Guided Q-learning Algorithm

DOI:

10.35833/MPCE.2020.000506

Author:

Yeliz Yoldas ^¹
Yeliz Yoldas
Department of Electrical and Electronics Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Selcuk Goren ^²
Selcuk Goren
Department of Industrial Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
Ahmet Onen ^¹
Ahmet Onen
Department of Electrical and Electronics Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

Affiliation:

1.Department of Electrical and Electronics Engineering, Abdullah Gul University, Kayseri, 38080, Turkey;2.Department of Industrial Engineering, Abdullah Gul University, Kayseri, 38080, Turkey

Fund Project:

This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) (No. 215E373), Malta Council for Science and Technology (MCST) (No. ENM-2016-002a), Jordan The Higher Council for Science and Technology (HCST), Cyprus Research Promotion Foundation (RPF), Greece General Secretariat for Research and Technology (GRST), Spain Ministerio de Economía, Industria y Competitividad (MINECO), Germany and Algeria through the ERANETMED Initiative of Member States, Associated Countries and Mediterranean Partner Countries (3DMgrid Project ID eranetmed_energy-11-286).

Article

Figures

Metrics

Reference

Cited by

Materials

Abstract:

This paper proposes an energy management system (EMS) for the real-time operation of a pilot stochastic and dynamic microgrid on a university campus in Malta consisting of a diesel generator, photovoltaic panels, and batteries. The objective is to minimize the total daily operation costs, which include the degradation cost of batteries, the cost of energy bought from the main grid, the fuel cost of the diesel generator, and the emission cost. The optimization problem is modeled as a finite Markov decision process (MDP) by combining network and technical constraints, and Q-learning algorithm is adopted to solve the sequential decision subproblems. The proposed algorithm decomposes a multi-stage mixed-integer nonlinear programming (MINLP) problem into a series of single-stage problems so that each subproblem can be solved by using Bellman’s equation. To prove the effectiveness of the proposed algorithm, three case studies are taken into consideration: ① minimizing the daily energy cost; ② minimizing the emission cost; ③ minimizing the daily energy cost and emission cost simultaneously. Moreover, each case is operated under different battery operation conditions to investigate the battery lifetime. Finally, performance comparisons are carried out with a conventional Q-learning algorithm.

Key words:Cost minimization ; energy management system ; microgrid ; real-time optimization ; reinforcement learning

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:July 22,2020
Revised:
Adopted:
Online: December 03,2020
Published:

Home

Introduction

Editorial Board

For Author

Call For Papers

APC

Sponsor & Publisher

Get Citation

Share

Article Metrics

History