Abstract
To overcome the shortcomings of model-driven state estimation methods, this paper proposes a data-driven robust state estimation (DDSE) method through off-line learning and on-line matching. At the off-line learning stage, a linear regression equation is presented by clustering historical data from supervisory control and data acquisition (SCADA), which provides a guarantee for solving the over-learning problem of the existing DDSE methods; then a novel robust state estimation method that can be transformed into quadratic programming (QP) models is proposed to obtain the mapping relationship between the measurements and the state variables (MRBMS). The proposed QP models can well solve the problem of collinearity in historical data. Furthermore, the off-line learning stage is greatly accelerated from three aspects including reducing historical categories, constructing tree retrieval structure for known topologies, and using sensitivity analysis when solving QP models. At the on-line matching stage, by quickly matching the current snapshot with the historical ones, the corresponding MRBMS can be obtained, and then the estimation values of the state variables can be obtained. Simulations demonstrate that the proposed DDSE method has obvious advantages in terms of suppressing over-learning problems, dealing with collinearity problems, robustness, and computation efficiency.
SMART grids make the huge amounts of system status data available; however, at the same time, the requirements for the data reliability are higher than ever before [
The performance of MDSE methods has been greatly improved during the past fifty years. However, in the emerging smart grid environments, MDSE methods have the following shortcomings [
1) MDSE methods usually use only the current measurement snapshot while ignoring the massive historical data collected in smart grids. When the gross errors and the parameter errors exist simultaneously, the identification ability of MDSE methods drops [
2) The grid parameters themselves have a certain degree of uncertainty [
3) In smart grids, the power generation and loads often become intermittent and much more uncertain [
4) Using large amounts of historical data, the malicious attackers are more likely to accurately know the state of power grids and launch a malicious data injection attack that cannot be identified by MDSE [
Recently, data-driven SE (DDSE) methods have been proposed to address the above problems. In [
In summary, compared with MDSE methods, the general DDSE methods include the following characteristics.
1) A large amount of historical data is stored in the historical database of SE, and these data include measurement vectors and the corresponding estimation values of state vectors given by historical MDSE methods. DDSE methods try to use the massive historical data to overcome the shortcomings of MDSE methods.
2) DDSE methods do not need to know the measurement equations of the current snapshot exactly like MDSE methods, nor do they need to build an optimal estimation model of the state vector of the current snapshot based on the measurement equations like MDSE methods.
3) DDSE methods generally need to use a large amount of historical data or simulation data as the sample data, and construct a learning model (such as a regression model, etc.) to find the internal mapping relationship between the measurements and the state variables (MRBMS); and then the corresponding MRBMS is used to calculate the state vector of the current snapshot.
Although the development of DDSE methods has been in steady progress, the over-learning problem and the low-learning efficiency are the main shortcomings or even big obstacles [
In addition to the above general characteristics, the proposed DDSE method also has the following unique characteristics.
1) It includes off-line learning stage and on-line matching stage. The former performs off-line analysis and processing of historical data, while the latter performs on-line calculation based on the current measurement snapshot.
2) The off-line learning stage only needs to run once, but it can be used multiple times; whereas the on-line matching stage requires online real-time periodic operation.
3) At the off-line learning stage, an LRE suitable for DDSE is formed by clustering the historical data; then based on this LRE, a novel robust estimation method is proposed to filter the historical data to obtain the MRBMS.
4) At the on-line matching stage, by quickly matching the current snapshot with the historical data (QMCH), the corresponding MRBMS can be obtained quickly, and then the estimated state variables of the current snapshot can be obtained quickly based on this MRBMS.
The contributions of this paper mainly include four aspects.
1) By clustering historical data of SE, an LRE is presented, which provides a guarantee for solving the over-learning problem of the existing DDSE methods.
2) A novel robust estimation method that can be transformed into QP models is proposed to obtain the MRBMS at the off-line learning stage, and the proposed QP-based method can solve the collinearity problem in historical data.
3) The computational efficiency of the off-line learning stage is sped up by the reduction of historical categories (RHC), the establishment of a tree search structure for known historical topologies, and the usage of sensitivity algorithm for QP models.
4) A method of QMCH is proposed, thereby greatly improving the computation efficiency of the on-line matching stage, which is beneficial to the on-line application of the proposed DDSE method.
The remainder of this paper is organized as follows. The LRE for the DDSE method is presented in Section II. To obtain the MRBMS, Section III proposes a robust estimation method that can be transformed into QP models based on the proposed LRE. In Section IV, the off-line learning stage is sped up from three aspects. In Section V, the on-line matching stage is presented. The performance of the proposed DDSE method is tested in Section VI. Conclusions are presented in Section VII.
Reference [
(1) |
where is the auxiliary measurement vector, and j are the node numbers, is the square of the voltage amplitude measurement, Pi and Qi are the measurements of active and reactive injection power at node i, respectively, Pij and Qij are the measurements of active and reactive power flow from bus i to bus j, is the square of the line current magnitude measurement from bus i to bus j, is the total number of measurements; is the auxiliary state vector, , N and b are the numbers of nodes and branches, respectively, , , li and lj are the terminal buses of the branch l, and are the voltage magnitudes of buses li and lj, respectively, and are the voltage angles of buses li and lj, respectively, is the angle difference between buses li and lj; is the m-dimensional vector of measurement error with variance (an diagonal matrix); and is a constant matrix, in which all elements are determined by the network topology and network parameters. As for the details of (1), please refer to [
The general modeling method of DDSE is shown in

Fig. 1 General modeling method of DDSE.
A very important observation that can be made from (1) is that the constant Jacobian matrix of the ELMEs changes only with respect to the changes in topological structure. Therefore, the clustering of the historical snapshots can be based on their topologies, and those snapshots with the same topology should be in the same category. Since the above process only needs to be performed off-line, we call it off-line historical data clustering (OHDC).
Note that when the DDSE learning model is built, the available historical data generally include the historical measurement vectors and the corresponding estimation values of historical state vectors given by historical MDSE. The corresponding historical topology may be known or unknown, and correspondingly, different OHDC methods need to be constructed, which are presented in the following subsections.
In general, the vast majority (almost all) of different operation modes of the studied power grid are stored in the historical database of SE, i.e., the historical data include almost all the possible topologies with a high probability. As a result, when the historical topologies are known, it is theoretically possible to directly cluster the historical data according to the topologies. However, the drawbacks of direct clustering according to the original topologies are as follows.
1) Since each topology leads to a category, the direct clustering method will result in too many categories caused by too many different topologies for the studied power grid, which will affect the computation efficiency.
2) A large number of historical snapshots are needed to form the datasets for the corresponding topologies, which affects the practicability of the algorithm.
3) In extreme cases, it may be difficult to find multiple historical snapshots with the same topology as the current snapshot in the historical database of SE. To this end, a spanning tree method is proposed to solve the above problems through the following seven steps.
Step 1: for the studied distribution network, assuming that all the branches are put into operation, the network has nodes and edges (here, multiple edges connected in parallel between two nodes are treated as one edge). The number of all spanning trees of this network is assumed to be , the value of which can be determined by Kirchhoff’s matrix tree theorem (KMTT) and will be reduced in Section IV.
Step 2: for each spanning tree, create a corresponding structure including three fields. The first field stores all the links corresponding to this spanning tree (in ascending order according to link numbers), which is the flag of the corresponding spanning tree. The number of elements in each is . The second field stores the multiple historical snapshots (the measurement vectors and the corresponding estimation values of state vectors) with the same spanning tree. And the third field is used to store the corresponding two MRBMSs, which will be introduced in detail below. When each spanning tree is formed, the corresponding links are determined, so the first field in the corresponding structure is easy to be determined. The formation of the second field requires processing large numbers of historical snapshots, and the processing method is given in Steps 3-7. The formation of the third field corresponding to each spanning tree will be given in Section III.
Step 3: assuming that there are historical snapshots available, take out the
Step 4: for the
Step 5: let . If , go to Step 4; else, go to Step 6.
Step 6: if all the second fields in Step 2 store at least historical snapshots (the value of will be analyzed and given in Section III), then go to Step 7; otherwise, take new historical data and return to Step 3.
Step 7: for each spanning tree, there are historical snapshots in , which have the same unknown constant Jacobian matrix according to (1).
For each snapshot , mark the historical auxiliary measurement vectors in as and the historical auxiliary state vectors associated with the spanning tree as , (for the spanning tree, b is equal to ). Further, the auxiliary measurement vectors and the auxiliary state vectors associated with the spanning tree of all the historical snapshots are aggregated into the following matrix forms.
(2) |
(3) |
In most cases, multiple historical snapshots with the same topology as the current snapshot are available in the historical database of SE. These historical snapshots and the current snapshot have the same MRBMS. At this time, and can be used as the sample input and the sample output to construct the DDSE learning model, respectively. In this circumstance, all the measurements in each historical measurement snapshot are used in the DDSE learning model.
In a few extreme cases, historical snapshots with the same topology as the current snapshot are not available in the historical database of SE. Therefore, there are no historical snapshots with the same MRBMS as the current snapshot. However, we can always find multiple historical snapshots with the same spanning tree as the current snapshot; these historical/current auxiliary measurement vectors associated with the spanning tree have the same MRBMS. At this time, the sample output is still , but the sample input of the DDSE learning model should be historical auxiliary measurement vectors associated with the spanning tree, and the matrix form is formulated as:
(4) |
where () is the historical auxiliary measurement vector associated with the spanning tree for the
At the off-line learning stage, two MRBMSs should be formed for each spanning tree. For the first MRBMS, the input and output of the corresponding DDSE learning model are and , respectively, which are suitable for most cases where the same topologies of historical snapshots as the current snapshot are available. For the second MRBMS, the input and output of the corresponding DDSE learning model are and , respectively, which are suitable for a few extreme cases where the same topologies of historical snapshots as the current snapshot are not available. For the convenience of expression, the following analysis only takes the formation of the first MRBMS as an example. By using the same method, it is easy to get the second MRBMS.
Note that these historical auxiliary measurement vectors associated with the spanning tree for the second MRBMS include the measurements of all node voltage amplitudes, power flow measurements on the twigs of the spanning tree, and injection power measurements of those nodes that are not connected to any links.
When the historical topologies are known, the advantages of the proposed OHDC method based on the spanning tree include four aspects.
1) In theory, the number of spanning trees is less than that of all possible original topologies of the studied power grid. Therefore, the total number of modes that need to be processed at the off-line learning stage can be reduced by using the spanning tree method.
2) Any operational topology of the studied power grid is included in all structures of the field , and the corresponding historical snapshots needed are stored in the field , which lays the data foundation for DDSE.
3) The proposed spanning tree method has no special requirements for the distribution of historical measurements and all the historical measurements can be used in the auxiliary measurement vectors in most cases, whereas the number of auxiliary state variables associated with the spanning tree is , which is smaller than the number of auxiliary state variables corresponding to a mesh network (). The accuracy of the DDSE model is improved by using the proposed spanning tree method.
4) It should be emphasized that the off-line learning stage of the proposed DDSE method is to learn the MRBMSs (i.e., the mapping matrix H) corresponding to all the different spanning trees when the historical topologies are known. As a result, even if a new topology appears in the current snapshot in extreme cases, that is to say, there are no historical snapshots with the same topology as the current snapshot, we can still find historical snapshots with the same spanning tree as the current snapshot. These historical/current auxiliary measurement vectors associated with the spanning tree have the same MRBMS, thus the proposed DDSE method can still work.
When the historical topologies are unknown, the corresponding topologies can only be inferred according to the measurement vectors.
Taking out any two historical snapshots, the corresponding auxiliary measurement vectors and can be calculated according to (1). If and have a strong linear correlation, they can be considered to have the same topology [
(5) |
where is the difference between the two ranks of and , represents the consecutively ranking number from small (starting from 1) to large, and and are the
The criterion is that, if , then it is considered that and have a strong linear correlation, and their corresponding topologies can be considered the same. Obviously, the choice of the threshold is very important. In a large number of simulation experiments, we have found that even if is 0.9, the above method can still be used to correctly identify the historical measurement snapshots with the exact same topology. It should be pointed out that the reason why we use RCC instead of Pearson correlation coefficient (PCC) is that RCC does not require the auxiliary measurement vectors to conform to the normal distribution, and RCC is robust when gross errors exist in historical data.
As an analogue to the method proposed in Section II-B, all historical snapshots can be clustered according to the RCC between any two historical snapshots. Here, the three fields stored in the structure corresponding to each topological structure should be RCC, , and . With sufficient historical snapshots, theoretically, all the topologies of the studied power grid can be obtained. For each topology, assuming that there are s snapshots, the aggregation method of the auxiliary measurement vectors and the auxiliary state vectors is the same as (2)-(4).
According to (1), the true value of must be a linear function of , so we have
(6) |
where is the unknown mapping matrix from the true value of to the true value of ; and is the error vector.
According to (6), must also be a linear function of .
(7) |
where is the error matrix.
According to (1) and (6), as long as the local topologies of different snapshots are the same, these local topologies have the same matrix J and matrix H. This is the reason why different topologies sharing the same spanning tree can be clustered into one category.
Obviously, the unknown matrix corresponds to the third field in OHDC. If can be estimated according to (7) by the given and , the MRBMS can be obtained. Further, when the current measurement snapshot is given, by matching the current snapshot with the historical snapshots, we can get the MRBMS corresponding to the current snapshot, and then get the estimation value of the state variables, thereby constructing a DDSE method.
The task of the off-line learning stage of the proposed DDSE method is to estimate H based on (7). Obviously, the most direct method is to use the WLS method. At this time, a unique estimation value of can be obtained only when the matrix is column full rank. Therefore, for each historical category, it is preferable to ensure that historical snapshots are available when the WLS method is used, which is obviously a necessary condition. The necessary and sufficient condition is ensuring that s auxiliary measurement vectors are linearly independent.
In practical systems, we may not be able to obtain so many historical snapshots, and worse, there may be collinearity problems among different historical snapshots, so it is not ideal to use the WLS method to estimate H directly. This issue will be addressed in Section III.
Considering that the variables to be solved in mathematical planning are often vectors, two methods can be adopted to transform the matrix into a vector.
The first method is to stack the elements of the matrix variable into a vector, then we have
(8) |
where and are the column vectors with elements, , and represents the vectorization of the matrix by the column vectors; is the column vector to be solved with elements, ; and is the block diagonal matrix composed of matrices , .
The estimation value of can be obtained based on (8) by using the historical data, and then can be compressed into the matrix . However, in this method, is a diagonal matrix with a very large order, and is highly sparse, thus the memory required in the estimation process might be very large, which may affect the practicability of the algorithm.
The second method is to use the historical snapshots to solve each column of . The regression equation for solving the
(9) |
where ; and , , and are the
Solving models based on (9) by using the historical snapshots could give each column of . Obviously, the computer memory required to solve each model is very small. Although this method needs to solve (9) times, the total off-line calculation takes much less time than the first method, which supports the practical application of the algorithm.
Compared with the existing nonlinear regression equations and the approximate LREs of existing DDSE methods, the establishment of the LRE in this paper avoids the over-fitting and ill-conditioned problems in the existing DDSE methods, and lays a foundation for the establishment of new DDSE methods with good robustness.
The most important task at the off-line learning stage of the proposed DDSE method is to estimate based on the LRE, which will be given in this section.
For each , it is necessary to estimate the value of based on (9). An intuitive idea is using WLS, which is very simple in principle. When the noise conforms to the normal distribution, the WLS method is the optimal estimation; however, it is not a robust method and cannot handle ill-conditioned situations (e.g., there is a collinearity problem in the historical snapshots).
Considering that the least absolute value (LAV) estimation has good robustness [
Based on (9), the estimation value of , denoted as , can be obtained by solving the following LSAVRR model.
(10) |
where is the
When both and are equal to 0, model (10) is the LS method; when is equal to 0, model (10) is the RR method; when is sufficiently large, model (10) approaches the LAV method. Based on the extensive simulations, we recommend taking and , respectively. Note that the proposed model (10) is different from the elastic net regression method [
Note that the objective function of the proposed model (10) is non-differentiable, so model (10) cannot be solved directly using the gradient-based method. According to the same method in [
(11) |
where and are two auxiliary vectors, and and are the
Model (11) can be further transformed into a standard form of QP model as:
(12) |
where is a column vector with elements; is the estimation value of ; is the semi-definite block diagonal matrix, , represents the identity matrix (order is ) multiplied by ; , is the 2s-dimensional row vector, whose elements are all , is the 0-dimensional row vector, whose elements are all 0; is the
The standard QP model (12) can be solved by mature commercial software, such as GROUBI to obtain the estimation value . and are respectively introduced into model (11), then the estimation values of each column of can be obtained, thereby obtaining the MRBMS corresponding to each historical category. That is, the third field is obtained.
The complete off-line learning stage of the proposed DDSE method has been given above. Further research reveals that the off-line learning stage can be accelerated through the following three aspects.
As shown in Section II, when the historical topologies are known, all possible spanning trees of the studied power network have been stored in T fields , so the spanning tree of the current snapshot needs to be matched with T fields . This may cause the following problems.
1) Although some topologies exist in theory, the probability that they appear in the actual operation of the power network is very low. Considering all possible spanning trees will cause the value of T too large (for example, T is 3909 for IEEE 14-bus system), leading to a very heavy computational load for off-line learning.
2) Likewise, it will also cause too much calculation by considering all spanning trees at the on-line matching stage.
To this end, we propose a specific method for the RHC: ① calculate the probability of each spanning tree in the actual operation of the power grid; ② consider only those spanning trees with a relatively high probability of occurrence, so the value of T can be greatly reduced, and the reduced value is set to be Tcut.
When historical topologies are known, for Tcut spanning trees, a tree retrieval structure is established based on the following steps to improve the efficiency of the on-line matching stage.
Step 1: for each , establish a tree retrieval structure by taking elements (link numbers) in as the nodes, where the smallest number in is the root node, and each node in the tree structure (TS) corresponds to a link number in . Since the elements in have been arranged in the ascending order, the child node is always larger than its parent node. At this time, each node has only one child node (except for the leaf node); the number of layers in the TS is . A total of Tcut TSs can be obtained corresponding to all the ; let .
Step 2: if there are TSs with the same value of the
Step 3: ; if , go to Step 2; otherwise, go to Step 4.
Step 4: denote Tmerge as the number of TSs; let .
Step 5: for the
Step 6: sort the
Step 7: ; if , go to Step 6; otherwise, go to Step 8.
Step 8: ; if , go to Step 5; otherwise, go to Step 9.
Step 9: end.
To get MRBMSs, the estimation values of the columns of can be obtained by solving QP models n times. It is apparent that only the values of are different in these n QP models; therefore, the sensitivity algorithm for QP models in [
The task of the on-line matching stage of the proposed DDSE method is the QMCH to obtain the corresponding H matrix. That is to say, if the current snapshot and historical snapshots have the same spanning tree (when historical topologies are known) or the same original topology (when historical topologies are unknown), then the current snapshot and historical snapshots have the same H matrix; and further, the original state variables of the current snapshot can be estimated based on H.
When the current topology is known, the method of QMCH is as follows.
Step 1: choose a spanning tree arbitrarily in the network of the current snapshot; store all links in a collection with an ascending order by link numbers. Note that the open or overhauled branches in the current snapshot should also be considered as links. When selecting the spanning tree, avoid selecting branches that appear in one tree retrieval structure as twigs at the same time.
Step 2: take the first element in , and denote it as ; find the TS whose root node equals , and denote it as TSc.
Step 3: according to the depth-first search (DFS) algorithm, find a path from the root node to the leaf node in TSc, and the nodes on this path need to be the same as all the elements in . The corresponding to this path stores the historical topology that matches the current topology. If the original topologies and the spanning tree of the historical snapshots are the same as those of the current snapshot, the first MRBMS in the corresponding stores the target matrix H; if only the spanning tree of the historical snapshots is the same as that of the current snapshot, and their original topologies are not the same, the second MRBMS in the corresponding stores the target matrix H.
When the current topology is unknown, the auxiliary measurement vector of the current snapshot can be used to match historical snapshots based on (5). Once the RCC between the historical measurement vectors and the current measurement vector satisfies , the corresponding MRBMS, i.e., the target matrix H, is obtained.
After H is obtained, the estimation value of the auxiliary state vector for the current snapshot, , can be obtained by:
(13) |
where is the auxiliary measurement vector of the current snapshot.
If the current topology is unknown, the power flow can be further obtained and the estimation process ends; if the current topology is known, the estimation value of the original state variables (the voltage amplitudes and angles of all nodes) of the current snapshot can be further obtained [

Fig. 2 Structural framework of DDSE method.
The advantages of the proposed DDSE method are as follows: ① the MRBMSs are obtained based on the LRE, which avoids the over-learning problems; ②the proposed LSAVRR method can well solve the problem of collinearity in historical data, and it also has good robustness; ③ the proposed DDSE method does not require nonlinear iterations, and therefore does not require an initial guess, so that the proposed method has a strong adaptability to the uncertainty of power generation and load in smart grids; ④ the proposed DDSE method does not need to know any network parameters, so the uncertainty of network parameters does not have any impacts on the proposed method; meanwhile, the problems of convergence and leverage gross errors caused by the network parameters also have no effects on the proposed method; ⑤ the proposed DDSE method can be implemented with both known and unknown topologies, so it is adaptive to the frequent changes in the topology of smart grids; ⑥ in extreme cases, it may be difficult to find multiple historical snapshots with the same topology as the current snapshot in the historical database of SE, but the proposed DDSE method can still work by using the spanning tree method.
This section tests the performance of the proposed DDSE method on IEEE benchmark systems. In the tests, the load used in the simulation gradually changes from 90% to 110% of the base case. One measurement snapshot is generated every 10 s to mimic the sampling period of SCADA and MDSE runs every 3 minutes to generate the sample data.
The generation method of historical sample data is as follows: first calculate the power flow, and then superimpose the normal distribution random errors (the standard deviation is 1
The IEEE 4-bus system is firstly used to show the calculation process of the proposed DDSE method in detail.
The topology and measurement configuration of the IEEE 4-bus system are shown in

Fig. 3 IEEE 4-bus system with , , and .
According to the KMTT, can be obtained. According to the proposed OHDC method, all the can be obtained as: {③, ⑤}, {③, ④},{①, ③}, {①, ④},{①, ⑤}, {②, ③}, {②, ④}, and {②, ⑤}. The corresponding tree retrieval structures before the RHC are shown as

Fig. 4 Tree retrieval structures. (a) Before RHC. (b) After RHC.
For the first MRBMS, the numbers of auxiliary measurements and auxiliary state variables in this test system are and , respectively. Then, the first three matrices corresponding to
For the second MRBMS, the number of auxiliary state variables in this test system are ; whereas the number of auxiliary measurements corresponding to TS1, TS2, and TS3 in
Suppose there are two topologies of the current snapshot, as shown in

Fig. 5 Topology B of current snapshot.
When the topology of the current snapshot is the same as
When the topology of the current snapshot is the same as
(14) |
where includes the auxiliary measurements associated with the selected spanning tree.
Since the twigs of the selected spanning tree (①, ②, and ⑤) do not include ④, and are not associated with this spanning tree, and therefore, and are not included in .
As shown in
When the historical topologies of the IEEE 4-bus system are unknown, the performance of the proposed DDSE method is also tested. It is assumed that the measurement vectors and the estimation values of state vectors given by historical MDSE in the historical database of SE correspond to the three topological operation modes of the IEEE 4-bus system. In the test, 1000 historical measurement snapshots are taken out. The RCCs are calculated and the results show that these historical measurement snapshots can be clustered into 3 categories, and the intra-cluster RCC values for these 3 categories are all between 0.9975 and 1; while the average values of inter-cluster RCC are less than 0.8.
Take ten of the historical auxiliary measurement vectors, and the RCCs among them are shown in

Fig. 6 RCC correlation matrix of ten historical measurement snapshots.
1) When Historical Topologies Are Known
1) The number of spanning trees before and after RHC
Before and after the RHC, the number of spanning trees in each IEEE benchmark system is shown in
2) Estimation accuracy with different numbers of historical snapshots
Obviously, the number of historical snapshots s will affect the estimation accuracy of the matrix H at the off-line learning stage, which, in turn, will affect the estimation accuracy of the proposed DDSE method.
To test the influence of the number of historical snapshots on the estimation accuracy of the proposed DDSE method, we denote . Here, the measurement redundancy is taken to be 2.5 for each test system. With different Ratio (from 0.1 to 1.2), the mean absolute error of voltage magnitude (denoted as ) and the mean absolute error of phase angle (denoted as ) between the true values and the estimation values of the proposed DDSE method are given in Figs.

Fig. 7 with different Ratio for each IEEE benchmark system.

Fig. 8 with different Ratio for each IEEE benchmark system.
3) Estimation accuracy with different measurement redundancy
As we all know, the measurement redundancy affects the estimation accuracy of SE.
To test the influence of the measurement redundancy on the estimation accuracy of the proposed DDSE method, let . With different measurement redundancy from 1.5 to 3, and between the true values and the estimation values of the proposed DDSE method are given in Figs.

Fig. 9 with different redundancy for each IEEE benchmark system.

Fig. 10 with different redundancy for each IEEE benchmark system.
4) Comparison of estimation accuracy with other SE methods
When s is equal to m () and the measurement redundancy is 2.5, the estimation accuracies of WLS, the existing DDSE method in [
5) Robustness of proposed DDSE method
The robustness of the proposed DDSE method is also tested. For each IEEE benchmark system, the percentage of bad data (PBD) from 0% to 10% in each snapshot is randomly selected, and then added with 10% relative error. At this time, the number of the selected historical measurement snapshots meets (). In 100 tests, the changes of the average values of and with the changes of PBD obtained by the proposed DDSE method are shown in Figs.

Fig. 11 with different PBD for each IEEE benchmark system.

Fig. 12 with different PBD for each IEEE benchmark system.
6) Computation efficiency test
The computation efficiency of the on-line matching stage of the proposed DDSE method directly determines its engineering usability.
In order to measure the efficiency of the proposed DDSE method, the on-line calculation time of the proposed DDSE method is compared with that of WLS, as shown in

Fig. 13 Comparison of calculation efficiency.
2) When Historical Topologies Are Unknown
For the IEEE benchmark systems, when historical topologies are unknown, a large number of historical measurement snapshots are clustered based on the RCC. The clustering results show that the historical measurement snapshots have obvious clustering phenomena, and most of the measurement snapshots belong to the most common topological categories. And even if there are gross errors in the current snapshot, the matching method based on the proposed RCC also achieves the correct matching results. This proves the rationality of clustering using the RCC and the necessity of RHC. The final estimation error is less than 1
Aiming at resolving the shortcomings of the traditional MDSE methods, this paper proposes a DDSE method which includes off-line learning stage and on-line matching stage. The off-line learning stage targets to cluster historical data and develop the linear MRBMS; while the on-line matching stage obtains the current MRBMS by QMCH, and further quickly obtains the estimation values of the state variables of the current snapshot. The proposed DDSE method does not need to know the parameters of the network, and has good robustness and very high computation efficiency, making it very suitable for the on-line applications of large-scale systems.
In low-voltage distribution networks, the number of measurements is very limited (often not enough to ensure observability), and the topology information is difficult to obtain accurately. Next, we will study the DDSE method for low-voltage distribution networks. Also, the proposed DDSE method can be extended to the integrated energy systems (IESs) so as to realize the comprehensive, real-time and accurate perception of IES in uncertainty circumstances.
References
Nature. (2008, Sept.). Big data (specials). [Online]. Available: http://www.nature.com/news/specials/bigdata/index.html [Baidu Scholar]
R. Qiu and P. Antonik, Smart Grid and Big Data. Hoboken: Wiley, 2014. [Baidu Scholar]
X. He, Q. Ai, R. C. Qiu et al., “A big data architecture design for smart grids based on random matrix theory,” IEEE Transactions on Smart Grid, vol. 8, no. 2, pp. 674-686, Mar. 2017. [Baidu Scholar]
F. C. Schweppe and J. Wildes, “Power system static state estimation, part I: exact model,” IEEE Transactions on Power Apparatus and Systems, vol. PAS-89, no. 1, pp. 120-125, Jan. 1970. [Baidu Scholar]
F. C. Schweppe and D. B. Rom, “Power system static state estimation, part II: approximate model,” IEEE Transactions on Power Apparatus and Systems, vol. PAS-89, no. 1, pp. 125-130, Jan. 1970. [Baidu Scholar]
F. C. Schweppe, “Power system static state estimation, part III: implementation,” IEEE Transactions on Power Apparatus and Systems, vol.PAS-89, no. 1, pp. 130-135, Jan. 1970. [Baidu Scholar]
A. Abur and A. G. Expósito, Power System State Estimation: Theory and Implementation, New York: Marcel Dekker, 2004, pp. 157-184. [Baidu Scholar]
K. Dehghanpour, Z. Wang, and J. Wang, “A survey on state estimation techniques and challenges in smart distribution systems,” IEEE Transactions on Smart Grid, vol. 10, no. 2, pp. 2312-2322, Mar. 2019. [Baidu Scholar]
Y. Chen, F. Liu, G. He et al., “A Seidel-type recursive Bayesian approach and its applications to power systems,” IEEE Transactions on Power Systems, vol. 27, no. 3, pp. 1710-1711, Aug. 2012. [Baidu Scholar]
Y. Chen, F. Liu, S. Mei et al., “An improved recursive Bayesian approach for transformer tap position estimation,” IEEE Transactions on Power Systems, vol. 28, no. 3, pp. 2830-2841, Aug. 2013. [Baidu Scholar]
G. Sivanagaraju, S. Chakrabarti, S. C. Srivastava et al., “Uncertainty in transmission line parameters: estimation and impact on line current differential protection,” IEEE Transactions on Instrumentation and Measurement, vol. 63, no. 6, pp. 1496-1504, Jun. 2014. [Baidu Scholar]
Y. Xiang, J. Liu, Y. Liu et al., “Robust energy management of microgrid with uncertain renewable generation and load,” IEEE Transactions on Smart Grid, vol. 7, no. 2, pp. 1034-1043, Mar. 2016. [Baidu Scholar]
Y. Chen, F. Liu, S. Mei et al., “A robust WLAV state estimation using optimal transformations,” IEEE Transactions on Power Systems, vol. 30, no. 4, pp. 2190-2191, Jul. 2015. [Baidu Scholar]
Y. Chen, J. Ma, P. Zhang et al., “Robust state estimator based on maximum exponential absolute value,” IEEE Transactions on Smart Grid, vol. 8, no. 4, pp. 1537-1544, Jul. 2017. [Baidu Scholar]
Y. Chen, Z. Zhang, H. Fang et al., “Generalised-fast decoupled state estimator,” IET Generation, Transmission & Distribution, vol. 12, no. 22, pp. 5928-5938, Sept. 2018. [Baidu Scholar]
Y. Weng, R. Negi, C. Faloutsos et al., “Robust data-driven state estimation for smart grid,” IEEE Transactions on Smart Grid, vol. 8, no. 4, pp. 1956-1967, Jul. 2017. [Baidu Scholar]
J. Kim, L. Tong, R. J. Thomas et al., “Subspace methods for data attack on state estimation: a data driven approach,” IEEE Transactions on Signal Processing, vol. 63, no. 5, pp. 1102-1114, Mar. 2015. [Baidu Scholar]
J. Zhang, Z. Chu, L. Sankar et al., “Can attackers with limited information exploit historical data to mount successful false data injection attacks on power systems?” IEEE Transactions on Power Systems, vol. 33, no. 5, pp. 4775-4786, Sept. 2018. [Baidu Scholar]
M. Ferdowsi, A. Benigni, and A. Löwen, “a scalable data-driven monitoring approach for distribution systems,” IEEE Transactions on Instrumentation and Measurement, vol. 64, no. 5, pp. 1292-1305, May 2015. [Baidu Scholar]
M. Netto and L. Mili, “A robust data-driven koopman kalman filter for power systems dynamic state estimation,” IEEE Transactions on Power Systems, vol. 33, no. 6, pp. 7228-7237, Nov. 2018. [Baidu Scholar]
W. S. Rosenthal, A. M. Tartakovsky, and Z. Huang, “Ensemble kalman filter for dynamic state estimation of power grids stochastically driven by time-correlated mechanical input power,” IEEE Transactions on Power Systems, vol. 33, no. 4, pp. 3701-3710, Jul. 2018. [Baidu Scholar]
J. Yu, Y. Weng, and R. Rajagopal, “PaToPaEM: a data-driven parameter and topology joint estimation framework for time-varying system in distribution grids,” IEEE Transactions on Power Systems, vol. 34, no. 3, pp. 1682-1692, May 2019. [Baidu Scholar]
K. Dehghanpour, Y. Yuan, Z. Wang et al., “A game-theoretic data-driven approach for pseudo-measurement generation in distribution system state estimation,” IEEE Transactions on Smart Grid, vol. 10, no. 6, pp. 5942-5951, Nov. 2019. [Baidu Scholar]
Y. Yuan, K. Dehghanpour, and F. Bu, “A multi-timescale data-driven approach to enhance distribution system observability,” IEEE Transactions on Power Systems, vol. 34, no. 4, pp. 3168-3177, Jul. 2019. [Baidu Scholar]
A. S. Zamzam, X. Fu, and N. D. Sidiropoulos, “Data-driven learning-based optimization for distribution system state estimation,” IEEE Transactions on Power Systems, vol. 34, no. 6, pp. 4796-4805, Nov. 2019. [Baidu Scholar]
R. A. Jabr, “Radial distribution load flow using conic programming,” IEEE Transactions on Power Systems, vol. 21, no. 3, pp. 1458-1459, Aug. 2006. [Baidu Scholar]
H. Zou and T. Hastie, “Regularization and variable selection via the elastic net,” Journal of the Royal Statistical Society, vol. 67, no. 5, pp. 301-320, Mar. 2005. [Baidu Scholar]
A. G. Hadigheh, “Sensitivity analysis in convex quadratic optimization: simultaneous perturbation of the objective and right-hand-side vectors,” Algorithmic Operations Research: Series B, vol. 2, pp. 94-111, May 2007. [Baidu Scholar]
C. Gomez-Quiles, A. Villa Jaen, and A. Gomez-Exposito, “A factorized approach to WLS state estimation,” IEEE Transactions on Power Systems, vol. 26, no. 3, pp. 1724-1732, Aug. 2011. [Baidu Scholar]
Y. Chen, J. Ma, F. Liu et al., “A bilinear robust state estimator,” International Transactions on Electrical Energy Systems, vol. 26, no. 7, pp. 1476-1492, Jul. 2016. [Baidu Scholar]
Y. Chen, Y. Yao, and Y. Zhang, “A robust state estimation method based on SOCP for integrated electricity-heat system,” IEEE Transactions on Smart Grid, vol. 12, no. 1, pp. 810-820, Jan. 2021. [Baidu Scholar]