Abstract
This paper presents a data-driven variable reduction approach to accelerate the computation of large-scale transmission-constrained unit commitment (TCUC). Lagrangian relaxation (LR) and mixed-integer linear programming (MILP) are popular approaches to solving TCUC. However, with many binary unit commitment variables, LR suffers from slow convergence and MILP presents heavy computation burden. The proposed data-driven variable reduction approach consists of offline and online calculations to accelerate computational performance of the MILP-based large-scale TCUC problems. A database including multiple nodal net load intervals and the corresponding TCUC solutions is first built offline via the data-driven and all-scenario-feasible (ASF) approaches, which is then leveraged to efficiently solve new TCUC instances online. On/off statuses of considerable units can be fixed in the online calculation according to the database, which would reduce the computation burden while guaranteeing good solution quality for new TCUC instances. A feasibility proposition is proposed to promptly check the feasibility of the new TCUC instances with fixed binary variables, which can be used to dynamically tune parameters of binary variable fixing strategies and guarantee the existence of feasible UC solutions even when system structure changes. Numerical tests illustrate the efficiency of the proposed approach.
UNIT commitment (UC), as a key building block in power system planning and operation and electricity market clearing problems, has long-standing challenges in computational performance and solution quality [
Lagrangian relaxation (LR) [
With the recent advance of commercial MILP solvers such as CPLEX and Gurobi, the MILP-based approach becomes the mainstream approach in the power industry. In this direction, advanced modeling and decomposition techniques have been explored to resolve the computational concerns of MILP-based approaches for large-scale UC problems. Tight formulations and dynamic global cutting plane approaches are presented in [
In observing the recent advance in data-driven approaches and the volume of available data, data-driven approaches are explored to solve the UC problem [
From the above literature review, we note that the LR-based approaches [
In comparison, the data-driven approaches show potentials to fundamentally mitigate computational complexity of the UC problem. However, in the existing data-driven studies, [
This paper discusses a new data-driven variable reduction approach to efficiently reduce the number of binary variables for large-scale TCUC problems and render the fast calculation. The main idea is depicted in

Fig. 1 Main idea of proposed data-driven variable reduction approach for fast TCUC.
The key features are summarized as follows.
1) The proposed data-driven variable reduction approach for fast TCUC includes two parts of offline and online calculations. Specifically, a database including multiple nodal net load, i.e., the load minus renewable energy output of each node, intervals and corresponding TCUC solutions is established offline. This knowledge is then leveraged to assist in fixing certain on/off binary variables in new TCUC instances, reducing the online computational burden of TCUC.
2) Different from the existing data-driven approaches such as [
3) In the online calculation, several customized schemes are proposed to achieve the adaptability to the differences between online and offline TCUC instances such as initial on/off statuses and system parameter changes. Specifically, a feasibility check proposition is proposed to strategically adjust the number of on/off statuses for generators to be fixed, guaranteeing the feasibility against the new nodal net load instances and system parameter changes.
4) Numerical tests on the IEEE 6-bus, IEEE 118-bus, and Polish 2383-bus systems show that, with the comprehensive offline database established according to sufficient and high-quality historical and simulated nodal net load data, considerable on/off statuses of generators in the new TCUC instances can be strategically fixed via the customized schemes. Therefore, the computational efficiency with acceptable solution quality is improved.
The rest of this paper is organized as follows. Section II presents the framework of the proposed data-driven variable reduction approach for fast TCUC. The offline database is established in Section III, and the online fast TCUC calculation is introduced in Section IV. Numerical results are analyzed in Section V and Section VI concludes this paper.
This paper aims at accelerating the computational performance of TCUC for large-scale power systems. The main idea is to first use the data-driven approach for exploring the relationship between nodal net load profiles and feasibility/optimality of UC solutions, which is then leveraged to guide binary variable fixing strategies and render fast online calculation of new TCUC instances. The flowchart of proposed data-driven variable reduction approach for fast TCUC is shown in

Fig. 2 Flowchart of proposed data-driven variable reduction approach for fast TCUC.
The first step is an offline procedure, which leverages historical and simulated nodal net load data, K-means clustering approach, and ASF UC approach to establish a database including a suite of classified nodal net load intervals and the corresponding unique TCUC solutions. Each record in the database describes the nodal net load intervals, and the corresponding unique TCUC solution supports the system operations when individual nodal net load varies within the intervals. The second step is an online procedure, which compares nodal net load profiles of new TCUC instances against individual classified nodal net load intervals to strategically fix on/off statuses of certain units. It will reduce the number of binary variables and promptly derive TCUC solutions. The establishment of offline database and the calculation of fast online TCUC will be detailed in Sections III and IV.
Two aspects of the proposed data-driven variable reduction approach for fast TCUC are discussed as follows.
1) The offline database is established based on the nodal net loads. In the actual operation of complex power systems, the TCUC solutions would depend on many features such as loads, generation bids, and system structure. However, if all features are considered to build the offline database, the database would be considerable large, impacting its establishment and application. Therefore, since nodal net loads contain the information on loads, renewable energy outputs, and partial system structure, i.e., locations of loads and renewables, and have a great influence on scheduling results, the offline database is established by leveraging nodal net loads, which is essentially a tradeoff between data scale and calculation efficacy.
2) The proposed data-driven variable reduction approach for fast TCUC could handle the differences in parameters of offline and online TCUC instances such as initial on/off statuses, generation bids, and system structures. In the online calculation of the proposed approach, several customized schemes are proposed to achieve the adaptability for such differences. Specifically, the online calculation includes a feasibility check pass, which can dynamically tune the number of on/off statuses for the generators to be fixed, guaranteeing the feasibility against the new nodal net load instances and changes of system parameters. In the extreme situation, when the online system parameters significantly differ from the offline database, only a limited number of on/off statuses will be fixed to guarantee the solution feasibility.
We first leverage multiple historical and simulated nodal net load data, with respect to the fixed grid structure of a given power system, to establish the offline database. To guarantee the solution quality of online TCUC calculation, the established offline database shall follow two principles.
1) Feasibility: each classified nodal net load interval corresponds to a unique UC solution. This solution remains feasible for any load realization contained in this interval.
2) Optimality: for any nodal net load profile contained in a classified nodal net load interval, the objective value with respect to the corresponding unique UC solution is within a prespecified threshold of the true optimal TCUC solution of the nodal net load profile.
As the database covering a wider range of nodal net load intervals would provide more information for the online TCUC calculation, besides historical nodal net load data, we also generate additional nodal net load data to establish the database offline. The simulated data intend to compensate the historical data for covering representative nodal net load situations. In this paper, the simulated nodal net load data are generated by the proper forecasting approaches [
The proposed classification algorithm of nodal net load includes three steps. ① Classify net load data samples into K clusters, i.e., K nodal net load intervals. ② Check TCUC feasibility and optimality of each classified nodal net load interval. If both feasibility and optimality of a nodal net load interval are achieved, this classified nodal net load interval is completed. Otherwise, all nodal net load data in this interval will be reclassified. ③ Update K and go back to ① until all nodal net load data have been classified. Finally, multiple nodal net load intervals and their corresponding TCUC solutions that can simultaneously guarantee feasibility and optimality are obtained. The detailed nodal net load classification algorithm is described in
Algorithm 1 : nodal net load classification |
---|
Input and initialization: Step 1: Collect historical and simulated nodal net load data into set , set threshold , and initialize the solution set as null Step 2: Compute the optimal objective value of UC problem (1)-(3) for each nodal net load data in set Step 3: Analyze nodal net load data and their UC solutions to set a proper initial classification number K Classify data: Step 4: Classify nodal net load data in set into K categories via the K-means clustering approach Step 5: Calculate nodal net load interval of each category Assess feasibility and optimality: Step 6: Set Step 7: for Step 8: Calculate the unique UC solution corresponding to the nodal net load interval of category k by ASF UC approach (8)-(13) Step 9: If a feasible UC solution is obtained, go to Step 10; otherwise, set and go to Step 7 Step 10: Compute objective values of the ED problem (4), (5) for all nodal net load data in this category Step 11: Calculate the difference between ED objective from Step 10 and UC objective from Step 2 for all nodal net load data in this category Step 12: If the differences of all nodal net load data are smaller than , save the UC solution and nodal net load interval of this category to the solution set , and remove all nodal net load data in this category from set ; otherwise, set Step 13: End for Step 14: If , set and go to Step 4; otherwise, output |
The convergence of the proposed classification algorithm is naturally guaranteed, as K will be updated only when the feasibility/optimality check of TCUC in ② fails. That is, in the worst case, each category includes a single sample of historical/simulated nodal net load data. Note that the final classified number of nodal net load categories is determined by continuously updating iterations of
As shown in
The UC model (1)-(3) and ED model (4), (5) are used to calculate the optimal reference of classification results in the proposed classification algorithm of nodal net load.
(1) |
s.t.
(2) |
(3) |
(4) |
s.t.
(5) |
where z and p are the UC and dispatch decision variables, respectively; and are the start-up/shut-down cost and fuel cost, respectively; matrices A, B, E and vector F are constant coefficients; d is the nodal net load vector, i.e., demands minus renewable energy outputs of individual nodes; is the set of ; and is a known UC solution derived from (1)-(3).
The objective (1) is to minimize the total operation cost. Prevailing constraints coupling z and p are described as in (2) such as the load balance, generation capacity, ramping, and network constraints. In (3), the set X describes constraints related to z such as the minimum on/off time limits and on/off time requirements.
K-means clustering approach is applied to classify nodal net load data into multiple categories according to their similarities. Each classified nodal net load category will be used later to identify a unique UC solution. The objective of the K-means clustering approach is to minimize the sum of squared errors over all K clusters, as shown in (6).
(6) |
where k is the index of clusters, ; is the mean vector of the
We emphasize several implementation issues of using the K-means to conduct the classification of nodal net load data, while additional details are in [
1) Initial number of clusters K: as shown in
2) Algorithm stability: although the K-means clustering approach is usually unstable, the stability of
3) Comparison with other clustering algorithms: K-means clustering approach, as the most widely applied approach in the industry, is applied in this paper because of its simple principle and fast convergence speed. It is also noted that, the proposed classification algorithm of nodal net load can be implemented via other clustering approaches such as hierarchical clustering [
For each classified nodal net load interval, the ASF UC approach [
Considering a clustered nodal net load category containing M samples of nodal net load profiles, the maximum and minimum hourly nodal net load levels are used to build the nodal net load intervals and for this load category, i.e., and (, ), where t and i are the indices of time periods and nodal net loads, respectively. Constraint (7) describes the box set for the ASF UC approach, ensuring that the unique UC solution is feasible for any realization of nodal net load containing the nodal net load interval.
(7) |
In the ASF UC approach, the key idea is to build a scenario set , composed of selective vertex scenarios, and adopt nonanticipative constraints to guarantee the feasibility of the UC solution against the box set (7). As for the selective vertex scenarios, if only one node is associated with the variability, i.e., containing loads and/or renewable energy, the scenarios with the minimum and maximum nodal net load levels constitute the selective vertex scenarios. While for the multi-node uncertainty situation, the selective vertex scenarios shall include the combinations of these minimum and maximum nodal net load levels. The detailed procedure to design vertex scenarios and the proof for the UC solution feasibility are given in [
The ASF UC formulation is established as in (8)-(12) and (3). The objective (8) is to minimize the weighted total costs of all scenarios in set . Constraint (9) includes power balance limits, transmission capacity limits, generation capacity limits, and so on. Nonanticipative constraints (10)-(12) guarantee the feasibility of UC solutions.
(8) |
s.t.
(9) |
(10) |
(11) |
(12) |
where the superscript denotes the scenario ; the superscripts max and min denote the maximum and minimum of allowable outputs of thermal units in the actual operation; the subscripts and denote the time periods; is the weighting factor of scenario s; and are the auxiliary variables; and and are the ramp-up and ramp-down capabilities, respectively.
In order to accelerate the calculation of large-scale ASF UC models, a time-decoupled decomposition (TDD) algorithm [

Fig. 3 Framework of TDD algorithm.
The ASF UC approach presents two major advantages: ① ASF UC can guarantee the feasibility of UC solutions against the box set (7), and meet the actual operation needs on the robustness and nonanticipativity of UC solutions against nodal net load variabilities. Indeed, it is clearly pointed out in [
(13) |
s.t.
(14) |
(15) |
(16) |
where ; is the violation gap between MP and SP; and
Some classified nodal net load intervals out of
The unique UC solution of a certain classified nodal net load interval could potentially handle a larger interval than the one identified as in (7). To this end, an interval expansion model is established to calculate the largest possible nodal net load interval, as shown in (17)-(20).
(17) |
s.t.
(18) |
(10)-(12) with respect to known
(19) |
(20) |
where and are the decision variables describing the newly expanded nodal net load interval; and are the original nodal net load intervals calculated from
The objective (17) is to maximize the difference between the newly expanded nodal net load intervals and the original ones, i.e., identifying the largest nodal net load interval that can handle. Constraint (18) guarantees the feasibility of against the expanded nodal net load intervals. The scenario set S is the same as that described in the ASF UC model. Constraint (19) describes the relationship between the original and expanded nodal net load intervals. Constraint (20) restricts the economic efficiency of UC solution against the expanded nodal net load intervals, which is evaluated via set . As too many scenarios would lead to higher computational burden of the interval expansion problem (17)-(20), could only include typical scenarios such as those with the maximum and minimum nodal net loads.
After expanding the nodal net load interval via (17)-(20), certain nodal net load intervals may essentially belong to the same category and thus can be merged. For instance, when one interval is fully contained in another, it does not need to be kept in the database. That is, for two nodal net load intervals and derived from (17)-(20), if and , only and its UC solution remain in the database.
Essentially, the offline algorithm of nodal net load classification establishes a comprehensive database that describes the relationship of nodal net load intervals and corresponding TCUC solutions. Thus, the database can be used to guide binary variable fixing strategies, i.e., certain on/off statuses of thermal units, of new TCUC instances, reducing the number of binary variables and improving the efficiency of online calculations.
The online fast TCUC calculation includes three main steps as follows.
Step 1: identify a trail UC solution for the new TCUC instance based on the offline database. The deviation degree defined in (22) is used to identify the nodal net load interval that could potentially best cover the nodal net load profile of the new TCUC instance. It quantifies the distance between and the median value of classified nodal net load intervals. The UC solution corresponding to the nodal net load interval with the smallest deviation degree is taken as the trail solution of this new TCUC instance.
(21) |
Step 2: strategically fix certain on/off statuses of the units based on the trail UC solution. Because parameters of the new TCUC instance, e.g., initial on/off statuses of units and transmission capacity limits, could differ from those used in building the database, the full trail UC solution may be infeasible to the new TCUC instance. To this end, we only fix certain on/off decisions of units according to the trail UC solution, while others remain as decision variables to be optimized. The strategy for selecting on/off status variables to be fixed is detailed in Section IV-B, in order to reduce computational complexity while guaranteeing solution quality of the new TCUC instance.
Step 3: check the feasibility of the TCUC problem with the fixed on/off statuses. If it is feasible, solve this TCUC problem with the remaining binary variables. Otherwise, go back to Step 2 and update the fixed on/off statuses of units. The checking approach is detailed in Section IV-C.
Units are split into two categories based on their on/off statuses in the trail UC solution: ① the first one satisfies the initial minimum on/off time constraints; and ② the second one violates these constraints.
The units in the first category will be arranged in a descending order of their minimum on/off time limits. Then, we select the top units according to the pre-determined radio (PDR) and fix their on/off statuses based on the trail UC solution. PDR, as calculated in (22), determines the ratio of units whose statuses are to be fixed. PDR is a preset parameter that can be adjusted according to the needs of power system operators. Specifically, and are the pre-specified lower and upper bounds of PDR, respectively. is the correlation coefficient, which could be set according to the requirements of system operators. Larger and mean more binary variables are fixed. The deviation degree represents the similarity between new UC instances and the database. PDR is negatively correlated with deviation degree. Furthermore, PDR could be further dynamically tuned as discussed in Section IV-C, ensuring that the binary variable fixing strategy will not compromise the feasibility of new TCUC instances.
(22) |
The units in the second category are also arranged in a descending order of their minimum on/off time limits, and we select the top units according to PDR, i.e., the same as in (22), but with different values of and . The effects of and of this category are the same as those of the first category, and are set to be 0.1 and 0.025 in the numerical tests of this paper, respectively. Enough time periods are reserved for these units according to their initial on/off statuses, allowing them to flexibly adjust on/off statuses for satisfying the initial minimum on/off time requirements. The fixing strategy is depicted in

Fig. 4 Fixing strategy for on/off statuses.
The on/off statuses during the period [1, G/O] will be determined according to the initial minimum on/off time requirements; the on/off statuses during the period or will be optimized to offer flexible adjustability of UC solutions; and on/off statuses of the remaining periods will be fixed based on the trail UC solution.
With the above fixing strategy, the TCUC model with a reduced number of binary variables can be solved to accelerate the calculation of the new TCUC instance. Particularly, when the initial minimum on/off time constraints are fully satisfied and the nodal net load profile of a new UC instance is completely contained in a certain classified nodal net load interval, i.e., , the offline UC solution for this nodal net load interval is a good feasible solution for the new TCUC instance, i.e., it is within difference from the true optimal solution. That is, a good-enough UC solution of the new UC instance could be directly obtained without solving the new MILP problem.
Considering potential differences of system parameters between the offline database and online calculations, the settings of PDR may cause infeasibility to the new TCUC instances, if certain on/off status variables are fixed improperly. One solution is to run multiple instances in parallel with different PDR values, and adopt the solution with the best optimality gap.
In addition, a feasibility proposition is proposed to promptly check the feasibility of the new TCUC instances with fixed binary variables against the changes of system parameters. This fast feasibility-check ability allows us to efficiently tune PDR (or lower/upper bounds of PDR) online and dynamically adjust the on/off status variables to be fixed, guaranteeing the existence of a feasible UC solution, even under system structure changes.
We first define the whole on/off statuses of units as , where denotes the on/off statuses of units that are fixed according to the strategies discussed in Section IV-B; and denotes the on/off statuses of units that remain unfixed. We relax the original TCUC problem and establish a constraint satisfaction problem (23)-(25).
(23) |
(24) |
(25) |
where , , , , , and are the new TCUC instances; and is the upper bound of thermal output.
Proposition 1: the necessary condition for the existence of a feasible solution to the original TCUC problem (1)-(3) with new system parameters and nodal net loads is that the constraint satisfaction problem (23)-(25) with is feasible.
Proof: with , the constraint satisfaction problem (23)-(25) is obtained by relaxing the original inequality to constraint (24), where is the lower bound of thermal output. Therefore, it is obvious that the existence of a solution to the relaxation problem is a necessary condition for the existence of a solution to the original problem.
In the online calculation of a new TCUC instance with given system parameters such as network constraints, one can tune PDR iteratively to adjust to be fixed, guaranteeing that a feasible UC solution exists. Specifically, because fixing more on/off status variables would impact the feasibility of the new TCUC problem more significantly, we first set an initial PDR and check the feasibility according to Proposition 1. If the constraint satisfaction problem (23)-(25) is infeasible, we reduce PDR according to (26) and check the feasibility again until a feasible PDR is found.
(26) |
where is the reduction ratio.
Specially, if the system parameter difference between the daily new UC instances is not significant, the PDR value from the previous day could be used. This tuning process with Proposition 1 can be efficiently conducted, as the constraint satisfaction problem (23)-(25) does not involve binary variables and thus can be solved quickly. Proposition 1 can also be used in the offline part to set a proper initial PDR value, reducing the updating iterations in the online part.
After solving the new TCUC instance online, the new nodal net load profile and its TCUC solution can also be used to dynamically update the database, providing more accurate information for future online TCUC calculations. Two updating strategies can be adopted as follows: ① when a new nodal net load profile is covered by the upper bound of one nodal net load interval and the lower bound of another, nodal net load samples in these two intervals and the new nodal net load data are reclassified to obtain new nodal net load intervals and UC solutions; and ② when a new nodal net load profile is not contained in any classified nodal net load interval of the database, net load data samples contained in low probability intervals and the new nodal net load profile are reclassified. Low probability intervals refer to the nodal net load intervals which contain significantly fewer net load data samples than others, i.e., smaller than a prespecified threshold of the overall number of net load data samples.
The proposed data-driven variable reduction approach for fast TCUC is tested on the IEEE 6-bus, IEEE 118-bus, and Polish 2383-bus systems. Based on the data of ERCOT, 1825 (i.e., 365×5) historical and simulated samples of nodal net loads are used to establish the database offline, and the nodal net load data of one year are used to test the efficiency of the proposed approach for online UC calculation. All tests are implemented via MATLAB R2014a and Gurobi 7.5.2 on a desktop with Intel i7-7700 3.60 GHz CPU and 16 GB RAM.
The characteristic analysis of the proposed approach is conducted on the IEEE 6-bus system, with 3 thermal units and 3 nodal net loads [
K is initially set to be 10 in the K-means clustering approach, and the threshold of low probability intervals is set as . The final number of nodal net load categories is 37, in which 8 categories are low probability intervals, i.e., the number of net load data samples in each of these 8 categories is less than 10 (i.e., 365×5×0.5%).

Fig. 5 Results of four classified load category examples after the first K-means clustering calculation. (a) Results of classification No. 1. (b) Results of classification No. 2. (c) Results of classification No. 3. (d) Results of classification No. 4.

Fig. 6 Final classification results via Algorithm 1. (a) Results of classification No. 1. (b) Results of classification No. 2. (c) Results of classification No. 3. (d) Results of classification No. 4.

Fig. 7 Four expanded nodal net load intervals. (a) Interval of classification No. 1. (b) Interval of classification No. 2. (c) Interval of classification No. 3. (d) Interval of classification No. 4.
Figures
The numbers of net load data samples in
The interval expansion is to find the maximum nodal net load intervals that the corresponding UC solutions could handle without the feasibility and optimality of comprising solution. Four expanded nodal net load intervals are shown in
The expanded nodal net load intervals in
The IEEE 118-bus and Polish 2383-bus systems with 1825 net load data samples [
The computational time for IEEE 118-bus and Polish 2383-bus systems is analyzed. Testing results are shown in Figs.

Fig. 8 Computational time of TCUC for IEEE 118-bus system.

Fig. 9 Computational time of TCUC for Polish 2383-bus system.
Approach | Average time (s) | STD of average time (s) | Reduction ratio (%) |
---|---|---|---|
Original MILP-based approach | 1.59 | 0.32 | |
Proposed approach | 0.79 | 0.16 | 50.31 |
Approach | Average time (s) | STD of average time (s) | Reduction ratio (%) |
---|---|---|---|
Original MILP-based approach | 28.44 | 3.59 | |
Proposed approach | 7.48 | 0.41 | 73.70 |
Figures
Besides, the proposed approach performs more stably, i.e., standard deviation (STD) of computational time among different TCUC instances, than directly solving the original TCUC problem. Tables
In addition, the comparison between the IEEE 118-bus system and the Polish 2383-bus system indicates that computational benefits of the proposed approach are more significant on larger systems. The reason is that the locations of units with larger minimum on/off time limits could influence the computational performance, when certain on/off decisions of these units are fixed during the online calculation. Units with larger minimum on/off time limits are more concentrated on the IEEE 118-bus system. Thus, when on/off statuses of these units are first fixed, power flow constraints of certain transmission lines are very likely to be binding, which affects the computational efficiency. For instance, the computational time of the proposed approach for several new TCUC instances is even longer than the original UC problem, as shown in
System | The maximum relative error(%) | Average relative error (%) | Average reduction number | Reduction ratio (%) |
---|---|---|---|---|
118-bus | 1.32 | 0.47 | 743 | 57.33 |
2383-bus | 2.40 | 0.67 | 2527 | 58.82 |

Fig. 10 UC solutions of IEEE 118-bus system. (a) Proposed approach. (b) Original MILP-based approach.
Bidding price change is among the most significant factors that influence economic performance. Different levels of bidding price changes in the new TCUC instance against the values used in the database are compared to show the impacts on the proposed approach. The testing results are described in
Price change (%) | Average time (s) | The maximum time (s) | Average error (%) | The maximum error (%) |
---|---|---|---|---|
10 | 0.80 | 1.52 | 0.50 | 1.68 |
15 | 0.80 | 1.64 | 0.57 | 2.02 |
20 | 0.79 | 1.45 | 0.67 | 2.05 |
25 | 0.79 | 1.46 | 0.88 | 2.09 |
30 | 0.80 | 1.33 | 1.13 | 3.55 |
The data scale refers to the amount of historical and simulated data used to build the database offline, which could influence classification results and in turn computational performance of the online calculation. Five data scales, i.e., 1 year to 5 years, are compared to analyze the impact. The tests are implemented on the IEEE 118-bus system and results are reported in
Data scale (year) | Average time (s) | Average error (%) |
---|---|---|
1 | 0.79 | 0.55 |
2 | 0.76 | 0.50 |
3 | 0.78 | 0.49 |
4 | 0.79 | 0.50 |
5 | 0.79 | 0.47 |
Since different data-driven approaches would bring varying computational performance, we further compare the proposed approach in this paper with the data-driven approaches in [
Approach | Relative error (%) | Relative reduction time (%) | Infeasible rate (%) | Reduction variable (%) | System scale |
---|---|---|---|---|---|
Approach in [ | 0.02 | 95.4 | 0.06 | 2000-bus | |
Approach in [ | 0.10 | 56.4 | 0.00 | 68.7 | 2848-bus |
Proposed approach | 0.67 | 73.7 | 0.00 | 58.8 | 2383-bus |
The settings of PDR influence the fixed on/off statuses of thermal units, and then influence the computational efficiency and economic performance for new TCUC instances. To this end, the influence of PDR on computational performance is investigated in
PDR setting (%) | Relative error (%) | Relative reduction time (%) | Relative reduction number of binary variable (%) |
---|---|---|---|
30 | 0.32 | 59.26 | 30.00 |
40 | 0.49 | 68.26 | 40.00 |
50 | 0.67 | 73.30 | 50.00 |
60 | 0.67 | 72.76 | 60.00 |
70 | 0.89 | 75.10 | 69.94 |
80 | 1.04 | 76.21 | 79.63 |
90 | 1.13 | 76.43 | 88.69 |
In
This paper presents a new data-driven variable reduction approach to accelerate the calculation of TCUC problems for large-scale systems. It includes an offline data-driven approach to build a database that explores the relationship of nodal net load intervals and corresponding UC solutions, and an online approach to promptly calculate UC instances by strategically fixing on/off statuses of considerable units via the information of the database. By making full utilization of historical and simulated nodal net load information, it can essentially reduce the number of binary variables and improve the computational efficiency of TCUC problems.
Numerical tests illustrate the effectiveness of the proposed data-driven variable reduction approach, which could greatly reduce the computational time and present high computational stability. Future works could extend the fast solution approach to solve stochastic TCUC problems with heterogeneous uncertainty factors such as renewable energy and demand response assets. The fast solution approach may also be extended to include other key parameters relevant to the optimal TCUC solutions such as generation bids, for further improving the computational performance and solution quality.
References
M. Shahidehpour, H. Yamin, and Z. Li, Market Operations in Electric Power Systems. New York: Wiley, 2002. [Baidu Scholar]
Y. Chen, A. Casto, F. Wang et al., “Improving large scale day-ahead security constrained unit commitment performance,” IEEE Transactions on Power Systems, vol. 31, no. 6, pp. 4732-4743, Nov. 2016. [Baidu Scholar]
National Academics of Sciences, Engineering, and Medicine, Analytic research Foundations for the Next-generation Electric Grid. Washington D. C.: National Academies Press, 2016. [Baidu Scholar]
H. Wu, X. Guan, Q. Zhai et al., “A systematic method for constructing feasible solution to SCUC problem with analytical feasibility, conditions,” IEEE Transactions on Power Systems, vol. 27, no. 1, pp. 526-534, Feb. 2012. [Baidu Scholar]
Z. Ma, H. Zhong, Q. Xia et al., “A unit commitment algorithm with relaxation-based neighborhood search and improved relaxation inducement,” IEEE Transactions on Power Systems, vol. 35, no. 5, pp. 3800-3809, Sept. 2020. [Baidu Scholar]
X. Sun, P. Luh, M. Bragin et al., “A novel decomposition and coordination approach for large day-ahead unit commitment with combined cycle units,” IEEE Transactions on Power Systems, vol. 33, no. 5, pp. 5297-5308, Sept. 2018. [Baidu Scholar]
X. Li, Q. Zhai, J. Zhou et al., “A variable reduction method for large-scale unit commitment,” IEEE Transactions on Power Systems, vol. 35, no. 1, pp. 261-272, Jan. 2020. [Baidu Scholar]
L. Wu, “Accelerating NCUC via binary variable-based locally ideal formulation and dynamic global cuts,” IEEE Transactions on Power Systems, vol. 31, no. 5, pp. 4097-4107, Sept. 2016. [Baidu Scholar]
L. Yang, J. Jian, Z. Dong et al., “Multi-cuts outer approximation method for unit commitment,” IEEE Transactions on Power Systems, vol. 32, no. 2, pp. 1587-1588, Mar. 2017. [Baidu Scholar]
Y. Chen, F. Liu, B. Liu et al., “An efficient MILP approximation for the hydro-thermal unit commitment,” IEEE Transactions on Power Systems, vol. 31, no. 4, pp. 3318-3319, Jul. 2016. [Baidu Scholar]
G. Morales-España, J. M. Latorre, and A. Ramos, “Tight and compact MILP formulation of start-up and shut-down ramping in unit commitment,” IEEE Transactions on Power Systems, vol. 28, no. 2, pp. 1288-1296, May 2013. [Baidu Scholar]
K. Pan, Y. Guan, J. Watson et al., “Strengthened MILP formulation for certain gas turbine unit commitment problems,” IEEE Transactions on Power Systems, vol. 31, no. 2, pp. 1440-1448, Mar. 2016. [Baidu Scholar]
J. Meus, K. Poncelet, and E. Delarue, “Applicability of a clustered unit commitment model in power system modeling,” IEEE Transactions on Power Systems, vol. 33, no. 2, pp. 2195-2204, Mar. 2018. [Baidu Scholar]
G. Morales-España and D. A. Tejada-Arango, “Modeling the hidden flexibility of clustered unit commitment,” IEEE Transactions on Power Systems, vol. 34, no. 4, pp. 3294-3296, Jul. 2019. [Baidu Scholar]
W. Zheng, Z. Li, and H. Zhou, “Efficient robust look-ahead dispatch incorporating critical region preparation in gap time,” IEEE Transactions on Power Systems, vol. 36, no. 5, pp. 4840-4843, Sept. 2021. [Baidu Scholar]
Y. Yang, X. Lu, and L Wu. “Integrated data-driven framework for fast SCUC calculation,” IET Generation, Transmission & Distribution, vol. 14, no. 24, pp. 5728-5738, Dec. 2020. [Baidu Scholar]
S. Pineda, J. M. Morales, and A. Jiménez-Cordero, “Data-driven screening of network constraints for unit commitment,” IEEE Transactions on Power Systems, vol. 35, no. 5, pp. 3695-3705, Sept. 2020. [Baidu Scholar]
Á. S. Xavier, F. Qiu, and S. Ahmed, “Learning to solve large-scale security-constrained unit commitment problems,” Informs Journal on Computing, vol. 33, no. 2, pp. 739-756, Feb. 2020. [Baidu Scholar]
G. Dalal, E. Gilboa, S. Mannor et al., “Unit commitment using nearest neighbor as a short-term proxy,” in Proceedings of 2018 Power Systems Computation Conference, Dublin, Ireland, Jun. 2018, pp. 1-7. [Baidu Scholar]
M. Li, W. Wei, Y. Chen et al., “Learning the optimal strategy of power system operation with varying renewable generations,” IEEE Transactions on Sustainable Energy. doi: 10.1109/TSTE.2021.3088951 [Baidu Scholar]
Q. Tu, S. Miao, F. Yao et al., “Forecasting scenario generation for multiple wind farms considering time-series characteristics and spatial-temporal correlation,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 4, pp. 837-848, Jul. 2021. [Baidu Scholar]
J. Wang, X. Chen, F. Zhang et al., “Building load forecasting using deep neural network with efficient feature fusion,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 1, pp. 160-169, Jan. 2021. [Baidu Scholar]
Y. Wang, N. Zhang, Q. Chen et al., “Data-driven probabilistic net load forecasting with high penetration of behind-the-meter PV,” IEEE Transactions on Power Systems, vol. 33, no. 3, pp. 3255-3264, May 2018. [Baidu Scholar]
A. K. Jain, “Data clustering: 50 years beyond K-means,” Pattern Recognition Letters, vol. 31, no. 8, pp. 651-666, Aug. 2010. [Baidu Scholar]
C. Si, S. Xu, C. Wan et al., “Electric load clustering in smart grid: methodologies, applications, and future trends,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 2, pp. 237-252, Mar. 2021. [Baidu Scholar]
G. Karypis, E. Han, and V. Kumar, “Chameleon: hierarchical clustering using dynamic modeling,” Computer, vol. 32, no. 8, pp. 68-75, Aug. 2002. [Baidu Scholar]
M. Ester, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proceedings of 1996 Process International Conference Knowledge on Discovery & Data Mining, Portland, United States, Aug. 1996, pp. 226-231. [Baidu Scholar]
Y. Zhou, Q. Zhai, and L. Wu, “Optimal operation of regional microgrids with renewable and energy storage: solution robustness and nonanticipativity against uncertainties,” IEEE Transactions on Smart Grid, vol. 13, no. 6, pp. 4218-4230, Nov. 2022. [Baidu Scholar]
Y. Zhou, Q. Zhai, M. Zhou et al., “Generation scheduling of self-generation power plant in enterprise microgrid with wind power and gateway power bound limits,” IEEE Transactions on Sustainable Energy, vol. 11, no. 2, pp. 758-770, Apr. 2020. [Baidu Scholar]
Y. Zhou, Q. Zhai, and L. Wu, “Multistage transmission-constrained unit commitment with renewable energy and energy storage: implicit and explicit decision methods,” IEEE Transactions on Sustainable Energy, vol. 12, no. 2, pp. 1032-1043, Apr. 2021. [Baidu Scholar]
X. Li and Q. Zhai, “Multi-stage robust transmission constrained unit commitment: a decomposition framework with implicit decision rules,” International Journal of Electrical Power & Energy, vol. 108, pp. 372-381, Jan. 2019. [Baidu Scholar]
A. Lorca and X. Sun, “Multistage robust unit commitment with dynamic uncertainty sets and energy storage,” IEEE Transactions on Power Systems, vol. 32, no. 3, pp. 1678-1688, Mar. 2016. [Baidu Scholar]
Y. Zhou. (2021, Mar.). Historical and simulated data. [Online]. Available: https://www.researchgate.net/publication/350373720_historical_and_simulated_data [Baidu Scholar]
AI-Roomi International. (2021, Mar.). Electric power system analysis & nature-inspired optimization algorithms, test system. [Online]. Available: https://al-roomi.org/component/tags/tag/test-system. [Baidu Scholar]
R. Zimmerman, C. Murillo-Sánchez, and R. Thomas, “MATPOWER: steady-state operations, planning, and analysis tools for power systems research and education,” IEEE Transactions on Power Systems, vol. 26, no. 1, pp. 12-19, Jan. 2011. [Baidu Scholar]