Abstract
This paper develops a physics-guided graph network to enhance the robustness of distribution system state estimation (DSSE) against anomalous real-time measurements, as well as a deep auto-encoder (DAE)-based detector and a Gaussian process-aided residual learning (GARL) to deal with challenges arising from topology changes. A global-scanning jumping knowledge network (GSJKN) is first designed to establish the regression rule between the measurement data and state variables. The structural information of distribution system (DS) and a global-scanning module are incorporated to guide the propagation of scarce measurements in the graph topology, contributing to valid estimation precision in sparsely measured DSs. To monitor the topology changes of the network, a DAE network is employed to learn an efficient representation of the measurements of the system under a certain topology, which can achieve online monitoring of the network structure by observing the variation tendency of the reconstruction error. When the topology change occurs, a Gaussian process with a composite kernel is applied to the modeling of the pre-trained GSJKN residual to adapt to the new topology. The embedding of the physical structural knowledge enables the proposed GSJKN method to restore the missing/noisy values utilizing the adjacent measurements, which enhances the robustness to typical data acquisition errors. The adopted DAE network and special GARL-based transfer method further allow the DSSE method to rapidly detect and adapt to the topology change, as well as achieve effective quantification of the estimation uncertainties. Comparative tests on balanced and unbalanced systems demonstrate the accuracy, robustness, and adaptability of the proposed DSSE method.
THE widespread integration of distributed generation (DG) [
DSSE methods are typically classified into two categories: optimization-based [
Advancements in distribution automation (DA) systems and machine learning (ML) have introduced learning-based state estimation methods [
While these methods incorporate DS topology to improve model performance and generalization, they have mostly been tested under conditions of abundant real-time measurements. In practice, DSs often lack sufficient measurements, leaving many nodes without critical data and resulting in ineffective graph aggregation due to insufficient neighbor node features. Although increasing graph aggregation layers [
1) The proposed global-scanning jumping knowledge network (GSJKN) method expands the application of physics-guided methods under scarce real-time measurements. It is realized by adaptively selecting the range of graph aggregation and designing a global-scanning module based on recurrent neural network (RNN) structures to obtain the feasible node representation, which allows it to achieve satisfying estimation precision under scarce measurements, as well as restore the missing/noisy measurement data utilizing the adjacent information during the propagation of nodal features.
2) The proposed DSSE method can achieve online detection of the topology change events according to real-time and pseudo-measurement data. The DAE takes the real and pseudo-measurements as inputs and reconstructs them via multiple layers of transformation. The encoding and decoding process enables the DAE network to learn the intrinsic structure of the measurements under a certain topology in an unsupervised manner. This allows us to identify the topology change by observing the trend of reconstruction error in an online manner.
3) The GARL-based transfer method allows the proposed model to realize fast adaption to a new topology utilizing sparse online measurement data. Instead of modeling the DSSE under a new topology as a new task, the proposed method employs the GP with a composite kernel to model the residual of the pre-trained GSJKN under the original topology. The adopted composite kernel allows the proposed method to realize inductive reasoning about the differences between the DSSE tasks under different topologies and rapidly adapt to the new topology using a limited amount of data. This differentiates from the traditional parametric transfer methods that still require a certain amount of historical data to adapt to new topology conditions.
4) The Bayesian characteristic of the GARL enables the proposed method to effectively quantify the uncertainties of the DSSE results. This is beneficial for the operators when making uncertainty-aware decisions.
The rest of this paper is organized as follows. Section II presents the problem statement. Section III describes the proposed DSSE and fast transfer framework. Section IV presents the case study. Finally, Section V concludes this paper.
Consider the system state variables as and the system measurement variables as , where n and m are the numbers of nodes and system measurements, respectively. The DSSE model is a measurement equation based on the DS structure, line parameters, state variables , and measurement variables .
(1) |
where v is the measurement error. State estimation solves the estimated state variable so that the measured is most likely to be observed, which can be illustrated as:
(2) |
where is the probability distribution density function. The essence of the WLS-based method is to solve the following mathematical problem:
(3) |
where is the measurements weight matrix, and , and is the weight factor of the
When learning-based methods are utilized to deal with DSSE tasks, the task is generally transformed into a supervised regression learning process from the historical data. Consider a large amount of historical data collected by DA as . The DSSE task can be illustrated as , where is the mapping function constructed by NN or ML models. However, due to the neglect of structure information in the typical learning-based methods, the abnormal data will significantly impact their estimation results. Researchers have proposed physics-guided NN to incorporate structure information, which is represented as:
(4) |
where T is the prior topology information; and is the graph aggregation with n layers to obtain the information from
The proposed GSJKN-based estimator consists of two main components: ① graph jump connections through multi-layer graph aggregation to obtain the neighbor information in a certain range; and ② a subsequent adaptive global-scanning module based on RNN cells. Firstly, the graph jump connections are constructed by aggregating the node embeddings from multiple graph layers. Consider the collected historical data as and the prior topology information as , where N is the node number in DS. The graph jump connections can be represented as:
(5) |
(6) |
where is the node representation after the graph layer; is the node representation after graph jump connections; is the splicing operation; and is the parameterized graph aggregation layer. Specifically, is calculated by splicing M graph embedding heads:
(7) |
where is the node embedding from the
(8) |
(9) |
(10) |
(11) |
where is the feature of the node in ; and are the learnable parameters at the graph embedding head in the graph layer to obtain the node embeddings ; is the normalized attention value between nodes i and j; is the structure connection between nodes i and j; is the splicing operation; is the feature at the edge between nodes i and j; is the leaky rectified linear unit function; and is the selected activation function. The embeddings from nodes i and j are spliced and transformed to the edge feature between them under . Then, these edge features around node i are normalized to obtain the edge attention value to indicate the importance of neighbor nodes. Subsequently, the node embeddings and attention value are weighted to obtain the new representation for node i through .
Secondly, a global-scanning module based on the bidirectional RNN is designed to adaptively select the range of graph aggregation from the jump connections and propagate critical features in the whole topology. The new node representation is calculated by:
(12) |
where and are the forward and reversed RNN calculations, respectively; and is the aggregated node embeddings from the jump connections. The forward RNN calculation in the global-scanning module is represented as:
(13) |
(14) |
(15) |
(16) |
where is the feature of the
When new topologies emerge, the original DSSE models may encounter significant estimation errors. Topology change is critical information in the operation of DS. Researchers have investigated various methods to tackle bus-branch and node-breaker topology issues [

Fig. 1 Structure of DAE-based detector.
During the training process, given the measurement data for the known topology, we construct the encoder mapping to transmit into a code , which is formulated by:
(17) |
The code retains key information from measurements due to the unique bottleneck structure of DAE. We use a decoder to reconstruct from , which is formulated by:
(18) |
We want to get close to the original measurement data . The training process aims to derive the parameters that minimize the reconstruction error, which is formulated by:
(19) |
where and are the learnable parameters in DAE; and is the reconstruction error for the sample. By minimizing the reconstruction error, the encoder and decoder extract and preserve crucial feature information, particularly related to topology structure, from the input measurements . However, when a topology change occurs, the power flow equation of DS undergoes modifications, resulting in measurements of new topology that deviates from the previous measurement . Consequently, the decoder struggles to accurately reconstruct the new measurements close to , leading to a significant increase in the reconstruction error . To detect the topology change, we can monitor the fluctuation of the reconstruction error over time. A noticeable increase in after a certain moment indicates the occurrence of topology change events for new measurements . This DAE-based detector enables us to detect the topology changes in the DS, followed by the proposed GARL-based transfer method constructing the DSSE model for the new topologies.
Typically, obtaining historical data for a new topology is challenging, resulting in a scarcity of available data. To address this limitation and build a DSSE model for a new topology, a specially designed compound Gaussian kernel, denoted as , is proposed. This kernel fully incorporates information from the known topology to overcome the lack of historical data under the new topology conditions. For the limited data of the new topology , the DSSE model is expressed as:
(20) |
where is the completed DSSE model for . The limited data are used to train , which may consist of a few dozen samples. The GARL achieves transfer learning by finding a mapping , to model the residuals between and . This is accomplished by employing a GP with a composite kernel. here is the pseudo-estimation result from the original model , and is the actual state of the new topology. The GARL-based transfer method consists of the training phase and the deployment phase [
(21) |
Let denote the vector of all residuals and denote the vector of all pseudo estimation results . A GP with a composite kernel is trained assuming , where is the identity matrix, and denotes an covariance matrix at all pairs of training points based on a composite kernel as:
(22) |
where ; is the kernel to process ; and is the kernel to process . Suppose a linear kernel is used for both and . Then, the composite kernel can be expressed as:
(23) |
The training process of GP learns the hyperparameters , , by maximizing the marginal likelihood . In the deployment phase, a test point is input to the DSSE model to get an output . The well-trained GP can establish the distribution of the residual as , where , and indicates the variance. Here, is the vector of kernel-based covariances. The predicted residuals will modify the output of so that it can be applied to the new topology. In addition, the final estimation results with uncertainty information is given as:
(24) |
(25) |
GARL enables output reconstruction of the original DSSE model to adapt to new topologies while providing uncertainty estimates for state variables under these conditions. Without altering the architecture or retraining the DSSE model, GARL leverages only the model output, enabling efficient and low-cost deployment. Unlike conventional methods, GARL does not rely on explicit new topology information. Instead, it constructs the DSSE model for the new topology through residual learning, streamlining the transfer process without requiring detailed topology knowledge.
The Algorithm SA1 of Supplementary Material A illustrates the training and deployment process of the proposed GSJKN method, DAE-based detector, and GARL-based transfer method. The inputs for training are the historical dataset , the known topology , and the limited dataset at the new topology. The outputs are for , and , and a transferred DSSE model for the new topology. After the training process, the parameters in and are fixed and ready for the real-time DSSE on known and new topologies. The proposed DSSE framework is shown in Algorithm SA1.
1) Data preparation. The IEEE 33-bus test system with photovoltaic (PV) location, modeled for DSSE tasks, is illustrated in

Fig. 2 IEEE 33-bus test system with PV location.
2) The mean absolute error (MAE) is used for the performance evaluation of deterministic DSSE results.
(26) |
where and are the actual state and estimated state of the sample, respectively; and V is the total number of state variables in the test dataset. For interval estimation, key factors are reliability, sharpness, and calibration [
3) The GSJKN-based estimator for the known topology is constructed using the graph attention with 2 attention heads. The hyper-parameters of the DSSE model are presented in
Method | Voltage magnitude error ( p.u.) | |||||
---|---|---|---|---|---|---|
Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | |
WLS-R(MAE) | 3.73 | 8.48 | 12.00 | 8.75 | 8.72 | 9.65 |
WLS-R(MAX) | 63.00 | 113.00 | 146.00 | 97.40 | 119.00 | 101.60 |
WLS-L(MAE) | 3.73 | 8.48 | 12.00 | 39.00 | 75.50 | 49.20 |
WLS-L(MAX) | 63.00 | 113.00 | 146.00 | 407.00 | 316.00 | 362.00 |
BPN [ | 4.29 | 11.70 | 25.30 | 42.30 | 190.60 | 77.90 |
BPN [ | 52.00 | 93.20 | 248.00 | 331.00 | 1021.00 | 463.00 |
CNN [ | 9.29 | 15.60 | 20.90 | 30.30 | 33.60 | 38.20 |
CNN [ | 95.40 | 129.00 | 142.00 | 178.00 | 222.00 | 289.00 |
GP [ | 2.88 | 8.93 | 14.60 | 31.30 | 60.20 | 37.20 |
GP [ | 67.40 | 178.00 | 187.00 | 183.00 | 258.00 | 232.00 |
PAWNN [ | 20.00 | 22.00 | 24.20 | 24.90 | 51.00 | 34.50 |
PAWNN [ | 413.00 | 407.00 | 365.00 | 407.00 | 407.00 | 453.00 |
PAMLP [ | 4.00 | 6.61 | 10.60 | 14.60 | 67.90 | 27.20 |
PAMLP [ | 57.40 | 139.00 | 221.00 | 172.00 | 222.00 | 184.00 |
GCNII [ | 18.50 | 19.90 | 22.30 | 25.20 | 33.50 | 34.50 |
GCNII [ | 167.00 | 197.00 | 269.00 | 455.00 | 252.00 | 453.00 |
GSJKN (MAE) | 3.35 | 6.25 | 9.50 | 7.72 | 7.84 | 8.84 |
GSJKN (MAX) | 70.20 | 82.90 | 98.50 | 96.20 | 93.40 | 88.60 |
Layer type | Layer task | Layer parameter |
---|---|---|
Input layer | Accepting measurement | |
Fully connected layer (FCL) | Node feature embedding | |
FCL | Attention calculation | |
Graph | Node information aggregation | |
RNN (forward) | Global scanning | |
RNN (reversed) | Node feature embedding | |
MLP | Node feature embedding | |
Output layer | Estimation result |
To evaluate the performance of the proposed DSSE model under missing or noisy data conditions, six cases simulating typical data acquisition errors are conducted.
1) Case 1: available real-time measurements are collected correctly (normal condition).
2) Case 2: randomly selecting 2 real-time measurements and adding 30% uniform noise.
3) Case 3: randomly selecting 5 real-time measurements and adding 30% uniform noise.
4) Case 4: missing the real-time measurements at branch 1-2 while randomly selecting 2 real-time measurements and adding 30% uniform noise.
5) Case 5: missing the real-time measurements at branches 2-3 and 7-8 while randomly selecting 2 real-time measurements and adding 30% uniform noise.
6) Case 6: missing the real-time measurements at branches 1-2, 7-8, and 10-11 while randomly selecting 2 real-time measurements and adding 30% uniform noise.
For comparison, we assess three standard learning-based methods: ① back-propagation network (BPN) methods [
Tables
Method | Voltage angle error | |||||
---|---|---|---|---|---|---|
Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | |
WLS-R(MAE) | 18.40 | 37.70 | 55.60 | 37.20 | 39.90 | 41.30 |
WLS-R(MAX) | 343.00 | 463.00 | 690.00 | 648.00 | 603.00 | 469.00 |
WLS-L(MAE) | 18.40 | 37.70 | 55.60 | 70.80 | 96.30 | 88.60 |
WLS-L(MAX) | 343.00 | 463.00 | 690.00 | 951.00 | 773.00 | 930.00 |
BPN [ | 9.49 | 58.50 | 66.00 | 62.50 | 121.00 | 87.60 |
BPN [ | 190.00 | 469.00 | 391.00 | 950.00 | 1437.00 | 1135.00 |
CNN [ | 11.20 | 35.00 | 54.40 | 42.10 | 49.50 | 58.10 |
CNN [ | 149.00 | 274.00 | 454.00 | 365.00 | 495.00 | 490.00 |
GP [ | 11.80 | 20.90 | 43.40 | 67.00 | 105.00 | 82.20 |
GP [ | 235.00 | 529.00 | 1026.00 | 467.00 | 561.00 | 483.00 |
PAWNN [ | 39.30 | 45.30 | 58.30 | 47.20 | 74.30 | 67.50 |
PAWNN [ | 863.00 | 1017.00 | 1102.00 | 1017.00 | 1017.00 | 1017.00 |
PAMLP [ | 9.10 | 21.70 | 43.90 | 36.00 | 66.20 | 45.20 |
PAMLP [ | 183.00 | 538.00 | 1042.00 | 633.00 | 503.00 | 612.00 |
GCNII [ | 18.00 | 26.40 | 40.30 | 32.60 | 39.80 | 50.70 |
GCNII [ | 322.00 | 649.00 | 698.00 | 659.00 | 889.00 | 806.00 |
GSJKN(MAE) | 9.43 | 19.10 | 37.70 | 22.10 | 22.30 | 24.40 |
GSJKN(MAX) | 209.00 | 407.00 | 676.00 | 419.00 | 475.00 | 429.00 |
In Cases 2-6, where measurements are anomalous, BPN and CNN show notable performance degradation, underscoring the limitations of traditional learning-based DSSE methods in addressing missing real-time data due to a lack of physical insights. Although PAMLP achieves feasible accuracy with normal noise, it exhibits sharper performance declines compared with other physics-guided methods, revealing challenges in directly integrating global information. In contrast, the proposed GSJKN method leverages structural insights and a global scanning module to effectively fill missing values using adjacent data, sustaining accuracy even with three missing nodes in Case 6. These results demonstrate the robustness of the proposed GSJKN method in managing anomalous data.
For the WLS-based method, undetected missing measurements cause significant accuracy degradation in WLS-L. Although WLS-R, with missing data detection, achieves similar accuracy to the proposed WLS-based method for voltage magnitudes, its voltage angle results are notably worse in Case 6. Additionally, WLS-based method relies on precise line parameters, which are often difficult to obtain. The proposed GSJKN method alleviates this dependency by directly learning the regression relationship between measurements and state variables.
For more tests of the proposed GSJKN method under various noise conditions and fewer real-time measurements, please refer to Supplementary Material C Tables SCI-SCIII, respectively. For ablation studies of the proposed GSJKN method, please refer to Table SCIV of Supplementary Material C.
Additional tests are conducted to assess the topology change detection capability of the proposed DAE-based detector. Four scenarios are considered, where the network topology transitions from the original condition T1 to new configurations NT1, NT2, NT3, and NT4 as follows.
1) NT1: opening switches of branches 7-8, 28-29, and 14-15 while closing switches of branches 21-8, 25-29, and 9-15.
2) NT2: opening switches of branches 7-8 and 11-12 while closing switches of branches 21-8 and 22-12.
3) NT3: opening switches of branches 7-8, 28-29, and 11-12 while closing switches of branches 21-8, 25-29, and 12-22.
4) NT4: opening switches of branches 11-12 and 28-29 while closing switches of branches 22-12 and 25-29.
Unplanned switch actions causing these topology changes occur at the hour.

Fig. 3 Reconstruction error of DAE before and after topology changes for NT1-NT4.
To further evaluate the detection capability of the proposed DAE-based detector, four substation configurations, are tested as follows.
1) SC1: moving PV3 and PV2 to bus 31.
2) SC2: adding static var compensators at bus 29 with the capacity of 5 kvar.
3) SC3: cutting off the load at buses 28 and 29.
4) SC4: removing PV3 and cutting off the load at bus 29.

Fig. 4 Variations in reconstruction error with original condition T1 for SC1-SC4.
To evaluate the effectiveness of the proposed GARL-based transfer method in handling topology changes, four new topologies are considered. The base model is the DSSE model of the original topology, and the GARL-based transfer method is trained on 48 sample sets. Additionally, for the typical NN-based method, we explore BPN [
For the WLS-based methods, we investigate the WLS (right) method with topology identification, using correct new topology structures; and the WLS (error) method without topology identification, using incorrect structures with one switch error. The BPN-Finetuned is trained for 200 epochs with a learning rate of .
Method | Voltage magnitude error ( p.u.) | Voltage angle error ( degree) | ||||||
---|---|---|---|---|---|---|---|---|
NT1 | NT2 | NT3 | NT4 | NT1 | NT2 | NT3 | NT4 | |
BPN [ | 37.10 | 35.90 | 37.40 | 32.00 | 33.60 | 48.30 | 55.80 | 54.00 |
CNN [ | 41.10 | 31.00 | 30.30 | 31.50 | 27.10 | 33.70 | 26.80 | 32.20 |
GP [ | 20.20 | 21.10 | 20.30 | 19.10 | 61.00 | 66.00 | 62.80 | 57.70 |
BPN-CT | 24.30 | 22.60 | 27.90 | 24.30 | 30.00 | 37.00 | 36.30 | 37.10 |
CNN-CT | 27.80 | 23.80 | 25.20 | 23.50 | 25.20 | 29.40 | 31.20 | 39.30 |
GP-CT | 14.80 | 13.70 | 15.70 | 14.00 | 43.00 | 41.50 | 48.10 | 43.70 |
WLS (right) | 4.96 | 6.66 | 6.92 | 6.05 | 24.70 | 34.60 | 35.40 | 31.00 |
WLS (error) | 34.00 | 11.00 | 40.00 | 34.00 | 110.00 | 55.30 | 93.60 | 81.40 |
EELM [ | 27.80 | 33.50 | 27.10 | 26.40 | 32.70 | 37.40 | 36.80 | 41.90 |
BPN-Finetuned [ | 20.90 | 17.70 | 20.70 | 18.90 | 28.10 | 30.90 | 34.30 | 32.10 |
CDAR [ | 24.90 | 20.40 | 25.50 | 19.90 | 21.80 | 26.80 | 31.20 | 31.50 |
DNN+ [ | 13.70 | 17.60 | 14.90 | 13.10 | 18.00 | 34.90 | 29.40 | 29.50 |
BAR [ | 11.30 | 9.95 | 11.90 | 8.99 | 16.80 | 33.70 | 30.50 | 22.00 |
Proposed | 4.22 | 4.31 | 4.84 | 4.23 | 9.94 | 13.80 | 12.70 | 12.10 |
The proposed GARL-based transfer method converts deterministic DSSE results into probabilistic estimates via the GP-based transfer process. Using NT3 as a test case, we assess estimation intervals with the proposed GARL-based transfer method trained on 48 samples. Additionally, for the typical non-parametric method, we explore GP [
For the NN-based method, we investigate the quantile regression neural network (QRNN)-CT method, using the same data as GP-CT. For the transfer learning-based methods, we explore BAR [
Method | Pinball loss | Winkler loss | |||
---|---|---|---|---|---|
Proposed | 1.89 | 32.7 | 25.6 | 21.6 | 18.8 |
BAR [ | 4.94 | 116.0 | 76.7 | 59.9 | 49.8 |
GP [ | 8.22 | 154.0 | 117.0 | 98.0 | 84.4 |
GP-CT | 6.65 | 162.0 | 105.0 | 81.5 | 67.4 |
QRNN-CT | 7.83 | 157.0 | 125.0 | 101.0 | 85.5 |
QRNN-Finetuned | 6.70 | 196.0 | 121.0 | 100.0 | 81.0 |
Method | Pinball loss | Winkler loss | |||
---|---|---|---|---|---|
Proposed | 5.16 | 96.7 | 74.0 | 61.7 | 53.0 |
BAR [ | 13.30 | 379.0 | 225.0 | 166.0 | 134.0 |
GP [ | 26.20 | 499.0 | 380.0 | 316.0 | 271.0 |
GP-CT | 20.30 | 498.0 | 323.0 | 250.0 | 206.0 |
QRNN-CT | 21.40 | 413.0 | 314.0 | 258.0 | 220.0 |
QRNN-Finetuned | 16.40 | 446.0 | 2463.0 | 194.0 | 164.0 |
Tables
Method | PICP (%) | MPIW ( p.u.) | ||
---|---|---|---|---|
Proposed | 0.9045 | 0.8448 | 22.73 | 17.72 |
BAR [ | 0.7034 | 0.6386 | 29.03 | 22.61 |
GP [ | 0.8712 | 0.8171 | 85.66 | 66.74 |
GP-CT | 0.7066 | 0.6384 | 36.87 | 28.73 |
QRNN-CT | 0.7827 | 0.6428 | 79.71 | 60.24 |
QRNN-Finetuned | 0.6324 | 0.5221 | 52.55 | 34.27 |
Method | PICP (%) | MPIW ( degree) | ||
---|---|---|---|---|
Proposed | 0.8757 | 0.8168 | 58.67 | 45.71 |
BAR [ | 0.5974 | 0.5324 | 46.83 | 36.49 |
GP [ | 0.8703 | 0.8106 | 280.20 | 218.30 |
GP-CT | 0.7395 | 0.9375 | 120.30 | 93.75 |
QRNN-CT | 0.7248 | 0.6134 | 140.00 | 103.20 |
QRNN-Finetuned | 0.6028 | 0.4890 | 77.47 | 59.15 |
Although the GP achieves better coverage with limited samples, its intervals for magnitude and angle estimates remain wide due to insufficient training data. For example, MPIW of GP is 276.9% larger than that of the proposed GARL-based transfer method for the voltage magnitude task when. Adding training data and training tasks simultaneously reduces MPIW for GP-CT, but naive aggregation across topologies hinders the performance, as can be observed in PICP values of GP-CT. While the BAR achieves narrower intervals, its PICP does not satisfy DSSE requirements. In contrast, the proposed GARL-based transfer method adapts to new topologies with limited data, while its Bayesian characteristics enable uncertainty quantification, achieving superior coverage with narrower intervals. For more displays about the probabilistic results of various methods, please refer to Fig. SC2 of Supplementary Material C.
To evaluate the adaptability of the proposed GARL-based transfer metthod, more tests are carried out on the IEEE 119-bus test system [
1) HT1: original topology.
2) HNT1: opening switches of branches 34-35, 72-73, and 107-108 while closing switches of branches 25-35, 91-73, and 83-108.
3) HNT2: opening switches of branches 23-24, 34-35, 72-73, and 107-108 while closing switches of branches 8-24, 25-35, 91-73, and 83-108.
Six cases are considered to simulate data acquisition errors as follows.
1) Case 1: normal condition.
2) Case 2: randomly selecting 20% real-time measurements and adding 30% uniform noise.
3) Case 3: randomly selecting 50% real-time measurements and adding 30% uniform noise.
4) Case 4: missing the real-time measurements at branches 18-19, 29-30, 64-65, and 102-103 while randomly selecting 20% real-time measurements and adding 30% uniform noise.
5) Case 5: missing the real-time measurements at branches 18-19, 22-23, 29-30, 33-34, 64-65, 68-69, 102-103, and 106-107 while randomly selecting 20% real-time measurements and adding 30% uniform noise.
6) Case 6: missing the real-time measurements at branches 18-19, 21-22, 23-24, 29-30, 32-33, 34-35, 64-65, 67-68, 69-70, 102-103, 105-106, and 107-108 while randomly selecting 20% real-time measurements and adding 30% uniform noise.
Method | Voltage magnitude error ( p.u.) | Voltage angle error ( degrees) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | |
WLS-L | 3.10 | 4.79 | 6.10 | 26.90 | 30.50 | 34.0 | 8.10 | 15.6 | 21.2 | 40.4 | 43.8 | 51.3 |
BPN [ | 5.60 | 7.13 | 9.22 | 29.60 | 34.90 | 38.3 | 5.23 | 12.6 | 23.2 | 53.3 | 60.2 | 61.3 |
GCNII [ | 22.20 | 24.70 | 25.20 | 25.70 | 26.60 | 27.5 | 19.50 | 21.1 | 22.4 | 30.3 | 33.4 | 35.2 |
PAWNN [ | 15.80 | 16.70 | 17.80 | 25.60 | 27.20 | 30.0 | 12.10 | 15.9 | 21.2 | 39.9 | 42.7 | 51.0 |
GSJKN | 3.80 | 4.52 | 5.97 | 8.13 | 9.02 | 10.1 | 7.51 | 11.3 | 16.9 | 15.5 | 16.4 | 17.9 |
Transfer tests in HNT1 and HNT2 are conducted to evaluate the effectiveness of the proposed GARL-based transfer method.
Method | Voltage magnitude error (1 | Voltage angle error (1 | ||
---|---|---|---|---|
HNT1 | HNT2 | HNT1 | HNT2 | |
BPN [ | 73.40 | 92.10 | 34.00 | 29.70 |
BPN-CT | 40.90 | 56.00 | 26.10 | 24.70 |
BPN-Finetuned [ | 39.60 | 34.70 | 27.40 | 24.20 |
BAR [ | 22.30 | 19.90 | 29.40 | 28.20 |
WLS (right) | 3.76 | 3.08 | 14.60 | 12.20 |
WLS (error) | 28.60 | 17.00 | 83.70 | 51.00 |
Proposed | 9.13 | 8.52 | 23.50 | 21.40 |
For tests of the proposed GARL-based transfer method at the post-transfer stage, please refer to Table SCVI of Supplementary Material C.
To evaluate the adaptability of the proposed GSJKN method and GARL-based transfer method, tests are conducted on the IEEE 342-node test system, which represenets low-voltage, three-phase unbalanced networks widely used in North America [
1) GT1: original topology.
2) GNT1: opening switches of branches P134-135 and S148-S27.
3) GNT2: opening switches of branches S148-S27.
Six cases are simulated to replicate typical data acquisition errors as follows.
1) Case 1: normal condition.
2) Case 2: randomly selecting 20% real-time measurements and adding 30% uniform noise.
3) Case 3: randomly selecting 50% real-time measurements and adding 30% uniform noise.
4) Case 4: missing the real-time measurements at branches P82 and P83.
5) Case 5: missing the real-time measurements at branches P122, P123, S52, and S53.
6) Case 6: missing the real-time measurements at branches P122, P123, S82, S83, S202, and S203.
Method | Voltage magnitude error ( p.u.) | Voltage angle error ( degrees) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | Case 1 | Case 2 | Case 3 | Case 4 | Case 5 | Case 6 | |
BPN [ | 37.70 | 37.90 | 41.00 | 39.00 | 36.80 | 40.50 | 96.10 | 100.00 | 104.00 | 110.00 | 99.90 | 172.00 |
CNN [ | 20.80 | 23.70 | 27.20 | 223.00 | 180.00 | 193.00 | 48.50 | 76.50 | 99.90 | 84.70 | 67.20 | 154.00 |
GCNII [ | 34.10 | 34.60 | 35.10 | 36.30 | 39.20 | 44.80 | 139.00 | 141.00 | 142.00 | 144.00 | 151.00 | 159.00 |
PAWNN [ | 30.60 | 31.20 | 32.00 | 38.40 | 39.30 | 41.60 | 151.00 | 154.00 | 162.00 | 182.00 | 231.00 | 236.00 |
GSJKN | 8.61 | 9.63 | 10.80 | 12.40 | 12.70 | 16.60 | 48.10 | 51.40 | 57.50 | 66.80 | 58.50 | 78.70 |
In
Method | Voltage magnitude error ( p.u.) | Voltage angle error ( degrees) | ||
---|---|---|---|---|
GNT1 | GNT2 | GNT1 | GNT2 | |
BPN [ | 1871.0 | 1664.0 | 155.0 | 128.0 |
CNN [ | 1363.0 | 821.0 | 110.0 | 120.0 |
BPN-CT | 45.1 | 64.2 | 116.0 | 124.0 |
CNN-CT | 27.1 | 32.8 | 73.6 | 72.6 |
BPN-Finetuned [ | 34.2 | 29.8 | 115.0 | 104.0 |
BAR [ | 16.9 | 16.2 | 133.0 | 199.0 |
Proposed | 14.4 | 13.4 | 65.9 | 61.1 |
For probabilistic results of various methods at the IEEE 342-node test system, please refer to Table SCVII and Table SCVIII of Supplementary Material C.
We introduce a robust DSSE method based on a physics-guided GSJKN method, a DAE-based detector and a GARL-based transfer method, aiming at tackling anomalous real-time measurements and potential topology changes, respectively. Specifically, the proposed GSJKN method establishes a complex mapping between measurement data and system state variables, followed by a DAE-based detector to detect topology changes and a GARL-based transfer method to capture residuals after topology changes occur. Comparative tests with benchmark methods show that: embedding physical structural information within the GSJKN improves robustness against missing/noisy measurements; the DAE-based detector monitors topology changes online by tracking reconstruction errors; the GARL-based transfer method enables rapid adaptation to new topologies with minimal online data and effectively quantifies estimation uncertainty, producing probabilistic DSSE results with higher reliability, sharpness, and resolution than other methods.
As system scale and measurement diversity increase, fusing multi-rate multi-sensor data becomes critical for enhancing state estimation precision and efficiency. Future research will focus on developing a multi-source information fusion module to effectively integrate diverse measurement data into the estimation process. Additionally, more complex node-breaker substation models will be incorporated into the topology change detector to enhance the monitoring of substation configurations. Advanced semi-supervised and meta-learning frameworks will also be explored to reduce model dependency on extensive training data, broadening the applicability of learning-based DSSE methods.
References
Z. Liu, P. Li, C. Wang et al., “Robust state estimation of active distribution networks with multi-source measurements,” Journal of Modern Power Systems and Clean Energy, vol. 11, no. 5, pp. 1540-1552, Sept. 2023. [Baidu Scholar]
V. Gundu, S. P. Simon, V. Kasi et al., “Priority-based residential demand response for alleviating crowding in distribution systems,” Journal of Modern Power Systems and Clean Energy, vol. 11, no. 2, pp. 502-510, Mar. 2023. [Baidu Scholar]
W. Wang and N. Yu, “Estimate three-phase distribution line parameters with physics-informed graphical learning method,” IEEE Transactions on Power Systems, vol. 37, no. 5, pp. 3577-3591, Sept. 2022. [Baidu Scholar]
J. Hu, W. Hu, D. Cao et al., “Robust multiarea distribution system state estimation based on structure-informed graphic network and multitask Gaussian process,” IEEE Transactions on Industrial Informatics, vol. 20, no. 8, pp. 10599-10612, Aug. 2024. [Baidu Scholar]
D. Cao, J. Zhao, J. Hu et al., “Physics-informed graphical representation-enabled deep reinforcement learning for robust distribution system voltage control,” IEEE Transactions on Smart Grid, vol. 15, no. 1, pp. 233-246, Jan. 2024. [Baidu Scholar]
A. Primadianto and C. Lu, “A review on distribution system state estimation,” IEEE Transactions on Power Systems, vol. 32, no. 5, pp. 3875-3883, Sept. 2017. [Baidu Scholar]
X. Zhou, Z. Liu, Y. Guo et al., “Gradient-based multi-area distribution system state estimation,” IEEE Transactions on Smart Grid, vol. 11, no. 6, pp. 5325-5338, Nov. 2020. [Baidu Scholar]
G. Cheng, Y. Lin, Y. Chen et al., “Adaptive state estimation for power systems measured by PMUs with unknown and time-varying error statistics,” IEEE Transactions on Power Systems, vol. 36, no. 5, pp. 4482-4491, Sept. 2021. [Baidu Scholar]
B. Rout, S. Dahale, and B. Natarajan, “Dynamic matrix completion based state estimation in distribution grids,” IEEE Transactions on Industrial Informatics, vol. 18, no. 11, pp. 7504-7511, Nov. 2022. [Baidu Scholar]
M. Mao, J. Xu, Z. Wu et al., “A multiarea state estimation for distribution networks under mixed measurement environment,” IEEE Transactions on Industrial Informatics, vol. 18, no. 6, pp. 3620-3629, Jun. 2022. [Baidu Scholar]
R. Madbhavi, B. Natarajan, and B. Srinivasan, “Enhanced tensor completion based approaches for state estimation in distribution systems,” IEEE Transactions on Industrial Informatics, vol. 17, no. 9, pp. 5938-5947, Sept. 2021. [Baidu Scholar]
P. A. Pegoraro and S. Sulis, “Robustness-oriented meter placement for distribution system state estimation in presence of network parameter uncertainty,” IEEE Transactions on Instrumentation and Measurement, vol. 62, no. 5, pp. 954-962, May 2013. [Baidu Scholar]
T. Chen, L. Sun, K. Ling et al., “Robust power system state estimation using t-distribution noise model,” IEEE Systems Journal, vol. 14, no. 1, pp. 771-781, Mar. 2020. [Baidu Scholar]
T. Wu, W. Xue, H. Wang et al., “Extreme learning machine-based state reconstruction for automatic attack filtering in cyber physical power system,” IEEE Transactions on Industrial Informatics, vol. 17, no. 3, pp. 1892-1904, Mar. 2021. [Baidu Scholar]
M. Netto and L. Mili, “A robust data-driven Koopman Kalman filter for power systems dynamic state estimation,” IEEE Transactions on Power Systems, vol. 33, no. 6, pp. 7228-7237, Nov. 2018. [Baidu Scholar]
B. Zargar, A. Angioni, F. Ponci et al., “Multiarea parallel data-driven three-phase distribution system state estimation using synchrophasor measurements,” IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 9, pp. 6186-6202, Sept. 2020. [Baidu Scholar]
D. Cao, J. Zhao, W. Hu et al., “Topology change aware data-driven probabilistic distribution state estimation based on Gaussian process,” IEEE Transactions on Smart Grid, vol. 14, no. 2, pp. 1317-1320, Mar. 2023. [Baidu Scholar]
G. Tian, Y. Gu, D. Shi et al., “Neural-network-based power system state estimation with extended observability,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 5, pp. 1043-1053, Sept. 2021. [Baidu Scholar]
Y. Chen, H. Chen, Y. Jiao et al., “Data-driven robust state estimation through off-line learning and on-line matching,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 4, pp. 897-909, Jul. 2021. [Baidu Scholar]
A. S. Zamzam and N. D. Sidiropoulos, “Physics-aware neural networks for distribution system state estimation,” IEEE Transactions on Power Systems, vol. 35, no. 6, pp. 4347-4356, Nov. 2020. [Baidu Scholar]
T. Su, J. Zhao, Y. Pei et al., “Probabilistic physics-informed graph convolutional network for active distribution system voltage prediction,” IEEE Transactions on Power Systems, vol. 38, no. 6, pp. 5969-5972, Nov. 2023. [Baidu Scholar]
J. Hu, W. Hu, D. Cao et al., “Feature graph-enabled graphical learning for robust DSSE with inaccurate topology information,” IEEE Transactions on Power Systems, vol. 39, no. 4, pp. 6091-6094, Jul. 2024. [Baidu Scholar]
H. Wu, Z. Xu, and M. Wang, “Unrolled spatiotemporal graph convolutional network for distribution system state estimation and forecasting,” IEEE Transactions on Sustainable Energy, vol. 14, no. 1, pp. 297-308, Jan. 2023. [Baidu Scholar]
B. Huang and J. Wang, “Applications of physics-informed neural networks in power systems-a review,” IEEE Transactions on Power Systems, vol. 38, no. 1, pp. 572-588, Jan. 2023. [Baidu Scholar]
L. Wang, Q. Zhou, and S. Jin, “Physics-guided deep learning for power system state estimation,” Journal of Modern Power Systems and Clean Energy, vol. 8, no. 4, pp. 607-615, Jul. 2020. [Baidu Scholar]
J. Hu, W. Hu, J. Chen et al., “Fault location and classification for distribution systems based on deep graph learning methods,” Journal of Modern Power Systems and Clean Energy, vol. 11, no. 1, pp. 35-51, Jan. 2023. [Baidu Scholar]
D. Cao, J. Zhao, W. Hu et al., “Physics-informed graphical learning and Bayesian averaging for robust distribution state estimation,” IEEE Transactions on Power Systems, vol. 39, no. 2, pp. 2879-2892, Mar. 2024. [Baidu Scholar]
A. Abur, H. Kim, and M. K. Celik, “Identifying the unknown circuit breaker statuses in power networks,” IEEE Transactions on Power Systems, vol. 10, no. 4, pp. 2029-2037, Nov. 1995. [Baidu Scholar]
A. de la V. Jaen and A. Gomez-Exposito, “Implicitly constrained substation model for state estimation,” IEEE Transactions on Power Systems, vol. 17, no. 3, pp. 850-856, Aug. 2002. [Baidu Scholar]
X. Qiu, E. Meyerson, and R. Miikkulainen, “Quantifying point-prediction uncertainty in neural networks via residual estimation with an I/O kernel,” in Proceeding of The International Conference on Learning Representations (ICLR), online, Jun. 2020, pp. 1-35. [Baidu Scholar]
K. Dehghanpour and Z. Wang. (2019, Jan.). A Real 240-Bus Distribution System with One-Year Smart Meter Data. [Online]. Available: http://wzy.ece.iastate.edu/Testsystem.html. [Baidu Scholar]
J. Hu, W. Hu, D. Cao et al., “Probabilistic net load forecasting based on transformer network and Gaussian process-enabled residual modeling learning method,” Renewable Energy, vol. 225, pp. 120253, Apr. 2024. [Baidu Scholar]
J. Hu, W. Hu, D. Cao et al., “Bayesian averaging-enabled transfer learning method for probabilistic wind power forecasting of newly built wind farms,” Applied Energy, vol. 355, p. 122185, Aug. 2024. [Baidu Scholar]
H. Zang, Y. Guo, M. Huang et al., “State estimation for power systems with time-varying topology based on deep transfer learning,” Automation of Electric Power Systems, vol. 45, no. 24, pp. 49-56, Jan. 2021. [Baidu Scholar]
Y. Xu, Z. Dong, J. Zhao et al., “A reliable intelligent system for real-time dynamic security assessment of power systems,” IEEE Transactions on Power Systems, vol. 27, no. 3, pp. 1253-1263, Aug. 2012. [Baidu Scholar]
Q. Hu, R. Zhang, and Y. Zhou, “Transfer learning for short-term wind speed prediction with deep neural networks,” Renewable Energy, vol. 85, pp. 83-95, Jan. 2016. [Baidu Scholar]
X. Liu, Y. Li, Qi. Meng et al., “Deep transfer learning for conditional shift in regression,” Knowledge-Based Systems, vol. 227, p. 107216, Jan. 2021. [Baidu Scholar]
H. Wang, W. Zhang, and Y. Liu, “A robust measurement placement method for active distribution system state estimation considering network reconfiguration,” IEEE Transactions on Smart Grid, vol. 9, no. 3, pp. 2108-2117, May 2018. [Baidu Scholar]
J. A. D. Massignan, J. B. A. London, M. Bessani et al., “Bayesian inference approach for information fusion in distribution system state estimation,” IEEE Transactions on Smart Grid, vol. 13, no. 1, pp. 526-540, Jan. 2022. [Baidu Scholar]