Abstract
Recently, we demonstrated the success of a time-synchronized state estimator using deep neural networks (DNNs) for real-time unobservable distribution systems. In this paper, we provide analytical bounds on the performance of the state estimator as a function of perturbations in the input measurements. It has already been shown that evaluating performance based only on the test dataset might not effectively indicate the ability of a trained DNN to handle input perturbations. As such, we analytically verify the robustness and trustworthiness of DNNs to input perturbations by treating them as mixed-integer linear programming (MILP) problems. The ability of batch normalization in addressing the scalability limitations of the MILP formulation is also highlighted. The framework is validated by performing time-synchronized distribution system state estimation for a modified IEEE 34-node system and a real-world large distribution system, both of which are incompletely observed by micro-phasor measurement units.
DISTRIBUTION system state estimation (DSSE) utilizing micro-phasor measurement units (µPMUs) and deep neural networks (DNNs) is currently a topic of active research interest in the power system community. This is because their combination can provide high-speed situational awareness in real-time unobservable distribution systems, as already demonstrated in [
Providing performance guarantees to machine learning (ML) algorithms (DNN being a type of ML algorithm) is particularly important for power system problems as the electric power grid is a mission-critical system. In line with this realization, [
In this paper, we exploit the piecewise linear nature of the rectified linear unit (ReLU) activation function, which is one of the most commonly used activation functions, to analytically examine robustness and trustworthiness of a trained DNN for performing time-synchronized DSSE in µPMU-unobservable distribution systems. We first express the ReLU operation integrated with batch normalization (BN) as a mixed-integer linear programming (MILP) problem. We then introduce two sets of verification formulations. For robustness verification, we show that for a prespecified range of perturbation in the input, the deviation in the output from its reference value is guaranteed to lie within a bounded region. For trustworthiness verification, we find the minimum perturbation required in the input to generate a given error in the output. Extensive simulations carried out using a modified IEEE 34-node distribution system demonstrate that the proposed formulations ensure robustness and trustworthiness of DNNs for time-synchronized DSSE. We have also tested our verification formulations on a real-world large distribution system to demonstrate their scalability and widespread applicability.
The salient contributions of this letter are as follows.
1) Providing bounds on the estimation error of a DNN- based time-synchronized distribution system state estimator given a bounded perturbation in the input measurements (robustness verification).
2) Quantifying the minimum perturbation in the input measurements required to create a given amount of error in the state estimates (trustworthiness verification).
3) Integrating BN with the verification formulations to improve scalability.
The rest of the paper is structured as follows. Section II presents the time-synchronized DSSE using DNNs. The proposed formulations are developed in Section III. The results and discussion are presented in Section IV, while the conclusion is presented in Section V.
In [
(1) |
where is the optimal answer to the optimization problem; and denotes the total number of states to be estimated. The conditional expectation of (1) can be expressed in terms of the joint probability density of and , , as shown in (2).
(2) |
For µPMU-unobservable distribution systems, the probability density function (PDF) between µPMU data and all the voltage phasors (states) is unknown or impossible to specify, making direct computation of intractable. Even if the underlying joint PDF is known, finding a closed-form solution to (2) can be hard. The role of DNN is to approximate the MMSE estimator as it has excellent approximation capabilities [
A well-trained DNN that gives satisfactory response for unseen test data cannot necessarily ensure similar performance for all possible combinations of its inputs. This brings into question the rationale of using DNNs for decision-making in mission-critical systems. The goals of this paper are to address this concern through verification-based methodologies and build credibility of DNNs for time-synchronized DSSE.
The DNN used in this analysis is a fully-connected feed-forward neural network with hidden layers integrated with BN, each having neurons, as shown in

Fig. 1 DNN architecture with hidden layers.
The hidden layers, denoted by , are equipped with ReLU activation function. The input to the ReLU activation function is a linear transformation of the output of the previous layer denoted by . The output of each neuron in every hidden layer is sent to a BN operator. Hence, for each layer, we have:
(3) |
(4) |
(5) |
where is the weight matrix; is the bias vector; is a small configurable constant; and are the moving average and variance of the batches observed during the training process, respectively; and and are the scaling and offset factors, respectively. The values of these hyperparameters are obtained during the training process. Note that in (3), . and are non-trainable variables that are updated each time the layer is called during the training process based on the given batch.
(6) |
(7) |
where is a configurable constant called momentum. In accordance with [
(8) |
where indicates whether the corresponding ReLU neuron is active (equals 1) or inactive (equals 0); and and are the upper and lower bounds of the ReLU activation function, respectively. The two bounds are calculated using the following equations:
(9a) |
where and are vectors that contain the maximum and minimum input values for all the neurons in layer k, respectively.
For example, for the first hidden layer, and will correspond to the normalized inputs, implying that and . This means that the bounds on the first hidden layer will be and , respectively. The bounds for the remaining layers can be obtained by applying (9a) and (9b) sequentially. Lastly, is calculated based on the linear transformation of the output of the BN operator in the last hidden layer, .
In the proposed formulation, one integer variable is assigned to each neuron for the linearization of the ReLU activation function. Hence, the number of integer variables quickly increases as deeper and wider DNNs are used. BN enables us to significantly reduce the number of integer variables while maintaining the high accuracy of the DNN. This is due to three main reasons [
In this subsection, we examine the robustness of DNNs for time-synchronized DSSE. Given a prespecified bounded perturbation in the input that deviates it from the actual value (reference), a DNN will be deemed robust if the output deviation is guaranteed to be within an acceptable threshold. This is pictorially depicted in

Fig. 2 Robustness analysis for an operating condition described by .
The distortion-free input measurement, , can be perturbed in either or both dimensions, with the maximum perturbation limit indicated by the black rectangle. For the time-synchronized DSSE application, the limit is specified in terms of permissible error in µPMU measurements (denoted by ). Now, for any randomly selected perturbed sample obtained during training/validation (purple dot), there can be a perturbed adversarial sample (black dot) encountered during testing that causes the maximum error in the output (red oval). The goal of the robustness analysis is to quantify this maximum output error given the prespecified input perturbation limit, . The following formulations are proposed to this end as:
(10a) |
s.t.
(10b) |
(10c) |
where is a known output (e.g., true value of the state); and is an appropriate norm. Equations (
To verify the robustness of the trained DNN for all possible input combinations, we choose to be the infinity norm as it ensures that the DNN error is bounded throughout. For infinity norm maximization, we rewrite the objective function of (10a) as: . Next, we convert the overall maximization problem to one maximization problem and one minimization problem for each state. Finally, we pick the maximum absolute value between the two as:
(11a) |
s.t.
(11b) |
(11c) |
In this subsection, we present a formulation for analyzing trustworthiness of a DNN trained for regression problems. If a perturbation in the input vector is denoted by , the objective of trustworthiness analysis is to determine the smallest input perturbation (i.e., ) that can create erroneous results exceeding a threshold, , in the output. Afterwards, we compare the resulting with the actual level of perturbation allowed in the given application. For the time-synchronized DSSE problem, this will be the permissible error in µPMU measurements, specified by . If consistently surpasses , we can have trust in the ability of our trained DNN to provide erroneous estimates that are always within .
The above-mentioned logic is pictorially depicted in

Fig. 3 Trustworthiness analysis for an operating condition described by .
In
(12a) |
(12b) |
s.t.
(12c) |
(12d) |
The DNN described in Section II is trained on historical smart meter data in the offline learning stage and tested using µPMU measurements in the online execution stage [
To implement the proposed formulation, the following steps were performed.
Step 1: cleaned smart meter data were used to create a big dataset by solving many power flows. The voltage phasors obtained from the power flow solution were saved as . The voltage and current phasors corresponding to µPMU locations were saved as . was obtained from by using an appropriate perturbation limit, .
Step 2: the big dataset was split into training and testing datasets, and the former was used to train a DNN that finds a mapping function that relates and . Cleaned µPMU data were then fed into the trained DNN during the test to determine , and calculate the maximum testing dataset error.
Step 3: the robustness verification formulation given by (11) was solved. The solution gave the maximum error that the trained DNN would have for an input perturbation that was bounded by . If this solution is greater than the testing error found in Step 2, it means that the robustness formulation has found an input perturbation for which the trained DNN performs worse than what the testing accuracy indicates.
Step 4: the trustworthiness verification formulation given by (12) was solved for every node to determine that was needed to create an error greater than in any of the state estimates. If no perturbation is found for a given node or the smallest perturbation found is greater than used in Step 1, the DNN is deemed trustworthy in the sense that it will not give an error greater than for any input perturbation that is bounded by .
The DNNs created based on the logic proposed in [
A. System S1
1) Robustness Results
System S1 has three distributed generation units with ratings of 135 kW, 60 kW, and 60 kW placed on nodes 822, 848, and 860, respectively. To train a DNN for DSSE, we created a database comprising input, , and output, . The database was then split into training and testing datasets. Note that according to the sensor placement algorithm of [
In the first set of simulations, we compared the output of (11) obtained from the trained DNN using the test dataset with the estimation errors produced by the same DNN for the same (test) dataset. Due to space limitation, we only present the results for phase A voltage magnitude estimation. However, similar observations are made when analysis of magnitudes of other phases as well as angles are conducted. The maximum absolute error of all the test samples is shown in

Fig. 4 Comparison of DNN-based voltage magnitude estimation error and DNN robustness analysis for phase A of System S1.
Next, we tested the sensitivity of the proposed formulation to different sample sizes. The results presented in

Fig. 5 Robustness analysis for System S1 with different data sizes.
To identify the minimum perturbation in µPMU measurements capable of inducing a prespecified error in the state estimates, (12) was employed. We assumed a maximum allowable error of 1% in voltage magnitude estimation, i.e., . Then, we determined specific to each operating condition that resulted in . To ensure trust in our trained DNN for DSSE, we must verify whether consistently exceeds (%). The results obtained are shown in
Node No. | (%) | Node No. | (%) | Node No. | (%) |
---|---|---|---|---|---|
800 | 822 | 4.6729 | 864 | 0.9582 | |
802 | 824 | 834 | 0.9667 | ||
806 | 828 | 842 | 0.9668 | ||
808 | 10.5745 | 830 | 844 | 0.9671 | |
812 | 6.2330 | 854 | 846 | 0.9665 | |
814 | 4.7759 | 852 | 6.8931 | 848 | 0.9664 |
850 | 832 | 0.9553 | 860 | 0.9684 | |
816 | 888 | 0.9445 | 836 | 0.9691 | |
818 | 890 | 0.8169 | 840 | 0.9692 | |
820 | 3.5278 | 858 | 0.9591 | 862 | 0.9692 |
The findings presented in
B. System S2
1) Robustness Results
In this test system, µPMU measurements were only possible at the feeder-head (see

Fig. 6 System S2 with one µPMU available at feeder-head.
Note that having real-time measurements only at the feeder-head is common for most distribution systems. Therefore, it is of interest to evaluate the DNN performance of time-synchronized DSSE in situations where additional µPMU placement cannot be done due to budget constraints. There are 642, 665, and 637 nodes in phase A, phase B, and phase C of this system, respectively, whose voltages must be estimated under different operating conditions. Additionally, this feeder has 766 house-hold/commercial roof-top solar photovoltaic units, implying that it has a high penetration of renewable energy resources. Thus, this was an ideal test system for investigating the scalability as well as the handling capability of the renewable-rich system.
Due to the sheer number of nodes in this system, we show the difference between robustness analysis and DNN testing dataset for phase A voltage magnitude estimation as a histogram, as shown in

Fig. 7 Difference in maximum absolute errors obtained using and for all 642 nodes.
The trustworthiness result analysis for System S2 is presented in the form of a histogram in

Fig. 8 Trustworthiness results for phase A voltage magnitude estimation for 115 nodes of System S2.
C. Discussion
1) Strategies to Address Computational Burden
The proposed formulation is built on an MILP-based formulation whose worst-case run-time complexity is exponential. For example, the computational burden of the verification formulations developed in Section III is of the order of , where denotes the total number of samples. Since optimization formulations with exponential time complexity face scalability issues when applied to the problems involving large numbers of variables, we employed three strategies to lower the severity of this issue for the proposed formulation, namely, time-synchronized DSSE in µPMU-unobservable distribution systems.
1) Strategy 1: incorporation of BN layer. It was observed that by incorporating BN layers within the DNN, a smaller-sized DNN could give similar validation accuracy as a larger-sized DNN that did not have BN layers. For example, in the absence of BN, a DNN with eight hidden layers and 500 neurons/layer was needed for System S1 for achieving similar accuracy as the DNN with BN described in Section IV-A. This considerable reduction in the size of the DNN also reduced the number of integer variables in the proposed formulation by a significant amount, resulting in faster convergence of the optimization process.
2) Strategy 2: identifying always-dead and always-active neurons. During the training process, the output of each neuron was monitored. It was observed that some neurons were always-active, while others became always-dead, outputting zero. For these neurons, the corresponding binary integer variable, , was fixed to 1 and 0, respectively, reducing the number of integer variables required for post-training robustness and trustworthiness analyses. This strategy improves the efficacy of the proposed formulation even for large DNNs.
3) Strategy 3: effective parallelization. This was done in two ways. First, the verification formulations were implemented parallelly for the three phases (since DSSE is performed on a per-phase basis). Second, the verification formulations were specific to a given power system node and could be performed independently of any other node. Therefore, different nodes of the test system were grouped together into clusters (e.g., 100 nodes for System S2), and these clusters were solved in parallel. Since both of these ways are agnostic of the DNN size, they can be easily applied to large DNNs for DSSE.
Trained ML models are prone to poor performance due to the presence of adversarial examples that can be present in the input space domain but not seen during training and testing stages. In the context of the DSSE using DNN application, this can be due to the presence of non-Gaussian noise in the µPMU measurements. Most studies have modeled the noise as a zero-mean Gaussian distribution, but in reality, the noise model could be non-Gaussian [
This is precisely the scenario where the proposed formulations become crucial. Consider the robustness verification formulation that calculates the maximum error in the state estimation caused by a perturbation bounded by in the input measurements, once this maximum error is found, one can say with certainty that as long as the input is corrupted by a perturbation bounded by (irrespective of the distribution that the perturbation may have), the DNN-based DSSE error will be less than or equal to this calculated maximum error value. This is a powerful result that clearly indicates the practical significance of the proposed formulation for mission-critical systems such as the electric power grid.
The black box nature of a DNN often makes power system operators question the validity of the obtained results. This is because although a well-trained DNN can make accurate predictions, it might lack requisite robustness to (adversarial) input perturbations. Therefore, providing formal guarantees of DNN performance is necessary for ensuring their acceptability in the power system. To this end, we formulated two verification problems, namely, robustness and trustworthiness, for DNN-based time-synchronized DSSE using MILP. The robustness formulation finds the maximum error in the output for a given bounded perturbation in the input, while the trustworthiness formulation finds the minimum perturbation in the input that is required to produce a given error in the output. The proposed formulations are also applicable to DNN-based regression problems in other domains.
The analytical verification of DNN-based time-synchronized DSSE was first performed on a modified IEEE 34-node system. It was confirmed that the robustness analysis conducted using the testing data on a DNN resulted in a higher error than what was observed when the same data were fed as an input into that DNN. This implied that relying on the outputs of the testing data alone (i.e., without a robustness analysis) might result in a sense of false security, which is dangerous for mission-critical systems such as power systems. Through trustworthiness analysis, it was observed that we could verify the adherence of the estimation error to a prespecified threshold that was based on the characteristics of the inputs (e.g., permissible error in µPMU measurements). Lastly, the applicability of the proposed formulation to a real-world, large-scale, and renewable-rich distribution system was demonstrated, confirming its practical utility. A future scope of this work will be to address the exponential run-time complexity of the proposed formulations by creating verification problems that do not involve MILP.
Disclaimer
This paper was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.
References
B. Azimian, R. S. Biswas, S. Moshtagh et al., “State and topology estimation for unobservable distribution systems using deep neural networks,” IEEE Transactions on Instrumentation and Measurement, vol. 71, p. 9003514, Jan. 2022. [Baidu Scholar]
B. Zargar, A. Angioni, F. Ponci et al., “Multiarea parallel data-driven three-phase distribution system state estimation using synchrophasor measurements,” IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 9, pp. 6186-6202, Sept. 2020. [Baidu Scholar]
H. Xu, Y. Ma, H. Liu et al., “Adversarial attacks and defenses in images, graphs and text: a review,” International Journal of Automation and Computing, vol. 17, no. 2, pp. 151-178, Feb. 2020. [Baidu Scholar]
Y. Chen, Y. Tan, and D. Deka, “Is machine learning in power systems vulnerable?” in Proceedings of 2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Aalborg, Denmark, Oct. 2018, pp. 1-6. [Baidu Scholar]
Z. Zhang, M. Sun, R. Deng et al., “Physics-constrained robustness evaluation of intelligent security assessment for power systems,” IEEE Transactions on Power Systems, vol. 38, no. 1, pp. 872-884, Jan. 2023. [Baidu Scholar]
Z. Zhang, K. Zuo, R. Deng et al., “Cybersecurity analysis of data-driven power system stability assessment,” IEEE Internet of Things Journal, vol. 10, no. 17, pp. 15723-15735, Nov. 2023. [Baidu Scholar]
Y. Liu, B. Xu, A. Botterud et al., “Bounding regression errors in data-driven power grid steady-state models,” IEEE Transactions on Power Systems, vol. 36, no. 2, pp. 1023-1033, Mar. 2021. [Baidu Scholar]
V. Tjeng, K. Xiao, and R. Tedrake, “Evaluating robustness of neural networks with mixed integer programming,” in Proceedings of 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019, pp. 1-21. [Baidu Scholar]
R. Anderson, J. Huchette, W. Ma et al., “Strong mixed-integer programming formulations for trained neural networks,” Mathematical Programming, vol. 183, no. 1, pp. 3-39, Jan. 2020. [Baidu Scholar]
M. I. Khedher, H. I. Khedher, and M. Hadji, “Dynamic and scalable deep neural network verification algorithm,” in Proceedings of the 13th International Conference on Agents & Artificial Intelligence, Vienna, Austria, Feb. 2021, pp. 1122-1130. [Baidu Scholar]
E. Wong, T. Schneider, J. Schmitt et al., “Neural network virtual sensors for fuel injection quantities with provable performance specifications,” in Proceedings of 2020 IEEE Intelligent Vehicles Symposium, Las Vegas, USA, Oct. 2020, pp. 1753-1758. [Baidu Scholar]
A. Venzke and S. Chatzivasileiadis, “Verification of neural network behaviour: formal guarantees for power system applications,” IEEE Transactions on Smart Grid, vol. 12, no. 1, pp. 383-397, Jan. 2021. [Baidu Scholar]
A. Venzke, G. Qu, S. Low et al., “Learning optimal power flow: worst-case guarantees for neural networks,” in Proceedings of 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Tempe, USA, Nov. 2020, pp. 1-7. [Baidu Scholar]
S. Sonoda and N. Murata, “Neural network with unbounded activation functions is universal approximator,” Applied and Computational Harmonic Analysis, vol. 43, no. 2, pp. 233-268, Sept. 2017. [Baidu Scholar]
S. Ioffe and C. Szegedy. (2015, Feb.). Batch normalization: accelerating deep network training by reducing internal covariate shift. [Online]. Available: http://arxiv.org/abs/1502.03167 [Baidu Scholar]
J. Bjorck, C. Gomes, B. Selman et al., “Understanding batch normalization,” in Proceedings of 32nd Conference Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada, Nov. 2018, pp. 1-12. [Baidu Scholar]
B. Azimian, R. S. Biswas, and A. Pal, “Application of AI and machine learning algorithms in power system state estimation,” in Cyber-physical Power Systems: Challenges and Solutions by AI/ML, Big Data, Blockchain, IoT, and Information Theory Paradigms, New Jersey: Wiley-IEEE Press. [Baidu Scholar]
K. R. Mestav, J. Luengo-Rozas, and L. Tong, “State estimation for unobservable distribution systems via deep neural networks,” in Proceedings of 2018 IEEE PES General Meeting (PESGM), Portland, USA, Aug. 2018, pp. 1-5. [Baidu Scholar]
K. Montano-Martinez, S. Thakar, S. Ma et al., “Detailed primary and secondary distribution system model enhancement using AMI data,” IEEE Open Access Journal of Power and Energy, vol. 9, pp. 2-15, Jun. 1883. [Baidu Scholar]
G. Cavraro and R. Arghandeh, “Power distribution network topology detection with time-series signature verification method,” IEEE Transactions on Power Systems, vol. 33, no. 4, pp. 3500-3509, Jul. 2018. [Baidu Scholar]
A. C. Varghese, A. Pal, and G. Dasarathy, “Transmission line parameter estimation under non-gaussian measurement noise,” IEEE Transactions on Power Systems, vol. 38, no. 4, pp. 3147-3162, Jul. 2023. [Baidu Scholar]