Journal of Modern Power Systems and Clean Energy

ISSN 2196-5625 CN 32-1884/TK

网刊加载中。。。

使用Chrome浏览器效果最佳,继续浏览,你可能不会看到最佳的展示效果,

确定继续浏览么?

复制成功,请在其他浏览器进行阅读

Power Factor Estimation of Distributed Energy Resources Using Voltage Magnitude Measurements  PDF

  • Samuel Talkington
  • Santiago Grijalva
  • Matthew J. Reno
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30318, USA; Electric Power Systems Research Department, Sandia National Laboratories, Albuquerque, NM 87123, USA

Updated:2021-08-02

DOI:10.35833/MPCE.2021.000086

  • Full Text
  • Figs & Tabs
  • References
  • Authors
  • About
CITE
OUTLINE

Abstract

This paper presents a new method for the estimation of the injection state and power factor of distributed energy resources (DERs) using voltage magnitude measurements only. A physics-based linear model is used to develop estimation heuristics for net injections of real and reactive power at a set of buses under study, allowing a distribution engineer to form a robust estimate for the operating state and the power factor of the DER at those buses. The method demonstrates and exploits a mathematical distinction between the voltage sensitivity signatures of real and reactive power injections for a fixed power system model. Case studies on various test feeders for a model of the distribution circuit and statistical analyses are presented to demonstrate the validity of the estimation method. The results of this paper can be used to improve the limited information about inverter parameters and operating state during renewable planning, which helps mitigate the uncertainty inherent in their integration.

I. Introduction

THE success of the ongoing global energy transition is contingent upon the development of accurate models of distributed energy resources (DERs) to be used for the planning and operation of power systems. The rapid deployment of these DERs consequently leads to the cases of limited data availability and uncertainty regarding unobservable impacts to the distribution system.

One of the remaining limiting factors for high penetrations of DERs is the risk of unforeseen violations of engineering constraints due to unacceptable voltage rises from volatile power injections inherent in DERs. Direct curtailment is typically one of the solutions to this issue. However, this is unfavorable due to the loss of clean generation and revenue for the DER owner.

Advanced inverter technologies have emerged as central elements of the solution to these issues, and research works on their impact on distribution networks have appeared swiftly in [

1]-[4]. Additionally, the IEEE 1547-2018 standard updates now require all inverters to have reactive power and voltage regulation capabilities [5]. However, utilities and independent system operators (ISOs) may have limited information about these inverters, and therefore may be unable to accurately predict their effects on a distribution network. Frequently, these organizations may only have access to voltage measurements from metering devices and may have limited information about the overall operating conditions of inverter-based resources, including the power factor, curtailment, and voltage control parameters of the system. If utilities have access to this information, it may be incorrect or out-of-date due to the changes over time [6]. The inverters used in grid-connected photovoltaic (PV) systems have the capability to curtail active power output as well as inject and absorb reactive power, according to the behavior defined by a control curve. Frequently, non-unity power factor control settings are applied to inverter installations to reduce voltage problems that may be caused by the DER.

The primary contributions of this paper are the solutions to the inverse problem of estimating an unknown fixed power factor control setting for a DER. The methods presented here have remarkably low data input requirements, in that only voltage magnitude measurements are required. Although the deployment of advanced metering infrastructure (AMI) continues to expand, most utilities only have access to net energy measurements and do not have access to reactive power measurements, making the calculation of a DER power factor setting impossible. Through the methods developed in this paper, we show that it is possible to recover this setting with highly incomplete measurement data. This is achieved through a physics-based linearization of the AC power flow manifold, through which we demonstrate a novel result that the voltage magnitude sensitivities to real and reactive power injections are linearly independent for a fixed distribution network model. The result allows for a simultaneous estimation of the real and reactive power injection states at a set of DER interconnection points under study, which is used to construct a robust estimate for the power factor.

Furthermore, we enhance the predictive power and applicability of the method by deriving both a regularized regression model and a sparse approximation model, recasting the estimation as convex optimization problems. These reformulations improve the robustness of the estimation method in cases of a poor or inaccurate distribution system model or user uncertainty of the DER’s location, respectively.

Ⅱ. Background

The theorem in [

7] proves that for a fixed power network with 𝒮 slack buses, 𝒩 PQ buses, and nodal admittance matrix Y, the complex voltage sensitivity coefficients at each bus i𝒩 to real and reactive power injections Pl, Ql at each bus l𝒩 are unique solutions to the following two systems of differential equations:

1{i=l}=V¯iPlj𝒮𝒩Yi,jV¯j*+V¯ij𝒩Yi,jV¯j*Pl (1)
-j1{i=l}=V¯iQlj𝒮𝒩Yi,j*V¯j*+V¯ij𝒩Yi,j*V¯j*Ql (2)

where V¯i and V¯j are the complex nodal voltages; and V¯i* and V¯j* are the complex conjugate nodal voltages.

These equations and the research work in [

7] use complex voltage values, but this information is not available for a distribution system without access to phasor measurement unit (PMU) data. Our work exploits the use of voltage magnitude sensitivities without angle measurements, which are more common in realistic sensor data.

Researchers have taken the interest in data-driven modeling of advanced inverters, but little work has been done in terms of the estimation of their operational parameters and behaviors. Reference [

2] develops both linear and nonlinear state estimators for voltage magnitudes of the point of common coupling (PCC) using a Kalman filter. Intermittent measurements and packet loss are considered as a source of measurement error, but their linear model relies on complete knowledge of the power flow Jacobian, generation, and load profiles. In practice, the availability of such data may be challenging. Reference [8] attempts a similar problem using limited load and PV profile data to form representative scenarios [8].

Research works such as in [

4] use a variation of the optimal power flow to estimate optimal control settings for advanced inverters, while [9] and [10] develop a decentralized control approach to estimate optimal smart inverter settings based on volt-var, volt-watt, or power factor control. The feeder is analyzed using time-series simulation and various inverter control parameters, ranking their optimality based on the number of violations of voltage constraints.

An interest has grown in analyzing how distributed inverter-based systems behave with fixed control parameters [

1], [11], [12]. Reference [1] presents a fast quasi-static time-series algorithm for computing the impacts of dynamic var control, providing an estimate for the states of all controllers with a candidate control curve. In [12], a robust estimate for solar PV energy curtailment is provided for a distributed generation system operating with a fixed volt-watt curve.

The literature has only recently explored inverse problems in this domain. Approaches such as those in [

13], [14] use voltage sensitivities as training data to detect the presence of a solar PV installation on a distribution system using voltage sensitivities. Additional methods for the detection of PV locations and size, and the estimation of the overall distribution hosting capacity of system are developed using a Monte Carlo approach in [15].

However, the literature lacks the exploration into the inverse problems of estimating DER settings and operational parameters such as the power factor using data-driven methods.

ⅡI. Methodology

Throughout this paper, we consider a distribution network with nodes and possible points of interconnection for a DER system, where .

A. Definitions

1) Voltage Magnitude Sensitivities

The sensitivity matrix of node voltages to both real and reactive power injections is well understood [

7], [13]. These entries of the matrix are typically defined as the changes in voltage at bus i with respect to a real or reactive power injection at bus l. In this paper, voltage sensitivities refer to changes in voltage magnitude since distribution sensors do not provide voltage angle measurements without highly time-synchronized data like PMUs.

Assuming circuit parameters are fixed and injection magnitudes are normalized, the voltage sensitivities to real and reactive power injections, while linearly related, are intrinsically different functions. This distinction has been noted [

7] and is illustrated here as an example.

We define the voltage sensitivity matrix of a circuit as the change in voltage magnitude at any node i due to the presence of a real or reactive power injection at another “candidate” injection bus l. The formulation we will use in this paper alternates the columns between real and reactive injections, as shown in (3).

SPQVi[Pl,Ql]    i[1,], l[1,] (3)

where Vi is the voltage magnitude measurement. This gives us an ×(2) sensitivity matrix that appears as follows:

SPQViPlViQlViPViQVNPlVNQlVNPVNQ×(2) (4)

This matrix effectively captures two linearly independent signatures of the power system and describes the response of the node voltages to real and reactive power injections.

In practice, these matrices can be formed using perturb and observe methods [

7], [13], [14], our method is described in Algorithm 1 in the subsequent section.

We will denote each column of the interleaved SPQ matrix as sl, where l=1,2,,2. Note that this matrix has full column rank when formulated using a power system model without any voltage regulation equipment, and that pairs of columns for electrically identical buses such as those connected by switches or fuses are excluded. This verifies that the columns are linearly independent, and implies that the signatures of voltage sensitivities to real and reactive power are inherently different from the perspective of signal processing.

This distinction can also be motivated through a graphical example using simulated results from the IEEE 13-bus test feeder. If the submatrices of the SPQ matrix associated with real and reactive power injections are compared graphically, the functions have highly different behaviors. Submatrices of SPQ containing the alternating sensitivity columns corresponding to real or reactive power is shown in Fig. 1. Each matrix column sl captures the normalized changes in Vi to a real or reactive power injection on bus l of the IEEE 13-bus test feeder. The intrinsic difference in the signatures of the system response for different injection types becomes apparent. For simplicity, the suffix(i) to denote phase i of each bus for the remainder of this paper, where i=1, 2, and 3 represents phases A, B, and C, respectively.

Fig. 1 Submatrices of SPQ containing alternating sensitivity columns corresponding to real or reactive power. (a) Sensitivity of node voltage to real power. (b) Sensitivity of node voltage to reactive power.

The real and reactive power injection columns of SPQ, while linearly independent, may still exhibit some multicollinearity. We will later show how this contributes to the variance of the least-squares estimator in some use cases.

2) Measurement Data

When working with time-series data, we assume that networked voltage meters will give the user access to voltage profile data vi(t) at bus i for nodes across time horizon T. By taking the simple difference between the voltage samples across time, we can form a difference matrix D'×(T-1) as follows:

D'(Δvi,τ)    t[1,T-1],i[1,] (5)
Δvi,τ=vi(t)-vi(t-1) (6)

where Δvi,τ is the voltage deviation.

Furthermore, we define a measurement vector d¯ as a column vector containing samples of each row of the difference matrix at a specific time differential point of interest τ*:

d¯(Δvi,τ*)    i[1,] (7)
Δvi,τ*=vi(t*)-vi(t*-1) (8)

When considering solar PV, time intervals with low irradiance such as nighttime or cloudy days will have near-zero changes in voltage due to the real and reactive power injections of the PV and its corresponding inverter. Thus, it can be challenging to select τ* in practice for each bus i.

One choice is to simply select τ* during midday, during which the irradiance, power magnitude as well as the variance of the injection between t* and t*-1 are likely to be the highest [

13], [14]. This behavior is shown in Fig. 2. A 12-hour, 15-min granularity solution yields a total time horizon of T=287. Voltage deviations due to volatile real or reactive power injections from the system are more likely to occur during midday and are near-zero during the time of low irradiance.

Fig. 2 A subset of voltage deviation entries in D'×(T-1) from 08:00 to 16:20 for 100 kW solar PV installation with power factor of 0.8 on 633(1) of the IEEE 13-bus feeder.

Generally, the measurement vector d¯ is most likely to best approximate a linear combination of columns of SPQ during times of maximum variance in the voltage profile between time points. d¯ captures the change in voltage magnitude during the time differential τ* due to a PV injection change at a location l. When the vector d¯ is normalized by the size of the PV system, the contributions of the columns of SPQ corresponding to real and reactive power injections from a PV and var-dispatching inverter located at bus l can be estimated with linear regression models.

Since sensor data noise will primarily be driven by additive white Gaussian noise (AWGN), we assume that all model errors are distributed according to a standard normal ϵ𝒩(0,σϵ2).

B. Estimation Methodology

1) Least-squares Regression

The measurements obtained from the sensors will contain noise due to manufacturing errors, sensor class, or model inaccuracies. Hence, when projecting the d¯ vector on the subspace of SPQ, we obtain close but not exactly equal injection contributions. Using the measurement vector d¯ and the voltage sensitivity matrix SPQ, we can form a multiple linear regression approximation to estimate the corresponding injection type, location, and quantity as:

d¯x0VPl+x1VQl++x2-1VP+x2VQ+ϵ (9)

With many parameter estimation problems, we are estimating a vector x̂ when we multiply by our design matrix. We yield an approximation of the measurement targets as:

d̂=SPQx̂ (10)

where x̂ is the coefficient vector that results in the minimization of the sum of squared errors [

16]. These problems are known to have a unique least-squares solution as shown in (11). Later, we will denote the least-squares solution as x̂lsq to differentiate the solution from the other methods that we will discuss.

minx2||d¯-SPQx||22x̂=((SPQ)TSPQ)-1(SPQ)Td¯ (11)

2) Shrinkage Methods

A properly formed SPQ matrix with >2 will be full rank, allowing for simultaneous estimation of real and reactive power using voltage data only and the distinct real and reactive power sensitivity signatures. However, there may be multicollinearity between the columns of the interleaved SPQ. Therefore, the least-squares estimator, while very unbiased, may potentially have high variance [

16]. Therefore, the least-squares solution outlined above may not always be an ideal solution in practice. Shrinkage estimators, which reduce the variance in exchange for the introduction of more bias [17], apply well to this problem in the following two scenarios: ① an ill-posed estimation; ② a more realistic large feeder.

There are several instances where an ill-posed estimation could be faced. As the number of candidate injection columns in SPQ approaches the number of measurement buses, i.e., 2, we are exposed to a higher risk of overfitting and multicollinearity in the sensitivities as SPQ. When >/2, the estimation will become underdetermined.

If the distribution model used to derive the sensitivity matrix is significantly incorrect or out of date, some of the sensitivity signatures may be a poor basis for the model, and there may be instability in the fit against the observed measurement data. In these instances, the high variance of the least-square estimator can be combated by applying ridge regression, which allows the user to place a penalty to the L2 norm of the solution, thus biasing the model toward the solutions that are more regular. This may significantly improve the predictive accuracy of the model.

Forming an underdetermined sensitivity matrix (more columns than rows) may also be necessary if the distribution engineer has the uncertainty about the DER location, but still wishes to estimate the power factor settings. In this case, a higher-dimensional underdetermined SPQ will result in few coefficients of the injection estimation vector x̂ being relevant to the estimation of the DER’s power factor. This can be cast as a sparse approximation problem. In this case, to form our model, we apply an L1 norm penalty to the least-squares model that we developed previously and optimize for the weight of the penalty that yields the strongest predictive power. This technique is also known as the least absolute shrinkage and selection operator (LASSO) and has the convenient ability to perform simultaneous model selection and feature extraction [

17], yielding a very sparse x̂. This will help address the instability in the estimate as approaches /2. In the case of uncertain DER location, we propose the sparse approximation model (12), which is an augmented form of the problem in (11):

x̂lasso=argminx2||d¯-SPQx||22+λ||x||1 (12)

where λ>0 is the Lagrangian penalty factor set by the user. This method, also known as the basis pursuit, promotes sparsity. This means that the resulting estimation vector x̂ will have a small number of nonzero components. Unlike the least-squares formulation, there is no closed-form solution [

17], and it consists of solving the following constrained optimization problem:

x̂lasso=argminx2||d¯-SPQx||22s.t.  ||x||1λ       λ>0 (13)

The equivalent Lagrangian form can also be expressed as:

x̂lasso=argminx2m=1(d¯m-smTx)2+λl=12|xl| (14)

where d¯m is an element of the measurement vector observed; and smT is a particular row of SPQ.

For cases where there is a high degree of multicollinearity in the sensitivity matrix, e.g., a poor or inaccurate distribution model, we propose the use of ridge regression [

17], and the estimation vector becomes the solution to:

x̂ridge=argminx2||d¯-SPQx||22+λ||x||22 (15)

This can be converted to the standard least-squares solution in (11) by concatenating λI to the bottom of SPQ and zeros to the bottom of d¯, allowing for a closed-form solution.

This technique smooths out singularities in the sensitivity data. Similar to least-squares, it is assumed that the coefficients of x̂lsq and the residuals of this estimate will also be normally distributed [

18].

For both of the regularized regression models described in (12) and (15), the norm penalty λ can easily be chosen in practice using modern cross-validation algorithms such as those in [

19].

C. Statistical Analysis

1) Chi-squared Goodness-of-fit

A chi-squared goodness-of-fit test can be used for the least-squares procedure by noting that the estimated voltage magnitude deviations d̂ can be given by SPQx̂. By the central limit theorem, the normalized residuals of these voltage deviations are assumed to be distributed according to a standard normal, i.e.,

ri=d̂i-d¯iσi𝒩(0,1)    i[1,] (16)

where d̂i is the element of d̂; and ri is the residual.

Voltage meters typically have an error of at most less than one percent, so it is assumed that σi=(0.01),  i=1,2,,. Note that this is a worst-case scenario, and most meters in practice have errors of less than half a percent.

The least-squares solution x̂lsq minimizes the sum of squares of the residuals ri according to the definitions in (11). Hence, we can compute the chi-squared test statistic as:

||d¯-SPQx||22=χ2ξ||d¯-SPQx̂||22 (17)

The value of ξ can be written succinctly as:

ξ(x̂)=(SPQx̂-d¯)TΩ-1(SPQx̂-d¯) (18)

where Ω-1 is defined as:

Ω-11σi0001σi0001σi (19)

Using the observations above, we can then perform a chi-squared goodness-of-fit test with ν=-(2+1) degrees of freedom:

[χ2ξ]=1-[ξ,ν] (20)

where [χ2ξ] is the goodness-of-fit heuristic of the injection estimates used as the basis for the power factor estimation given the sensitivity model.

However, the validity of (17) does not hold for the shrinkage estimates x̂lasso and x̂ridge, as these estimates by definition do not always reach the global minimum sum of squared errors. Goodness-of-fit tests for LASSO, ridge, and other regularized regression estimates are actually only a recent topic, see [

20], and the application of these tests to this problem will be left for future work.

2) Confidence Intervals

Reference [

16] also describes a 100×(1-α)% confidence interval for each of the multiple linear regression coefficients x̂l:

x̂l±tα/2,-(2+1)Kll (21)

where Kll is the diagonal element of the variance-covariance matrix K for the estimated regression coefficients, defined as:

K=σ̂2((SPQ)TSPQ)-1 (22)

where σ̂2 is the mean squared error (MSE) of the model.

Similar to the chi-squared goodness-of-fit test, this analytic confidence interval can only be obtained for the least- squares estimator. However, a well-known method for achieving a confidence interval that is applicable to all of the aforementioned methods, including least-squares, is the bootstrap method [

17].

Note that the entries of our parameter estimate x̂[x1,x2,,x2] are the projections of d¯ onto the subspace SPQ, as described in (9). This interpretation of the model holds for all estimators developed. To obtain confidence intervals for each injection coefficient, we can draw Q subsets of a random N˜ rows of SPQ and d¯ entries, yielding augmented regression data denoted as S˜PQ and d˜. At each iteration, we then estimate a new x˜ from projecting d˜ onto the N˜ rows of SPQ. This method captures the variance in the estimate attributable to the data [

17], [21].

We can obtain a maximum likelihood estimate of the expected value for the injection estimates x˜¯ by finding the sample mean of the bootstrapped statistic:

x˜¯=𝔼̂[x̂]=b=1Qx˜bQ (23)

For least-squares regression models, it holds that bootstrap confidence intervals are exactly equivalent to the analytic confidence intervals described in (21) as Q [

17]. Therefore, we use this method to compare the confidence of each of the proposed estimation methods.

We propose the use of a nonparametric bootstrap confidence interval, specifically the percentile methodology, which is frequently used for the estimators we have selected. Through the resampling process described above, we can obtain an approximate distribution f(x˜l) for the x˜l coefficient of interest using this method. Finally, a 100×(1-α)% confidence interval for the coefficient can be obtained using the percentiles. In symbols, this would be the (x˜lα/2,x˜l1-α/2) such that:

-x˜lα/2f(x˜l)dx˜l,-x˜l1-α/2f(x˜l)dx˜l=α2,1-α2 (24)

IV. Algorithm Implementation

The sensitivity matrix can be understood as a static, model-based quantity fixed intertemporally for the power system under analysis. The construction of this matrix is straightforward and is outlined in Algorithm 1.

Figure 3 shows the visualization of sensitivity columns in SPQ formed with 100 kW or 100 kvar injections on IEEE 13-bus test feeder.

Algorithm 1: sensitivity matrix construction

Result: returns the SPQ matrix for the model

Initialization

Lock voltage regulating devices

Solve base case powerflow

Retrieve Vbase=(vi), iM

Define counter k=0

for lL do

if mod(k,2)0 then

Place reactive power injection

Solve power flow

Retrieve V=(vi),iM

SlPQ=V-Vbase

else

Place real power injection

Solve power flow

Retrieve V=(vi),iM

SlPQ=V-Vbase

end

k=k+1

end

As discussed previously, SPQ must be full rank to perform the estimation. Depending on the power system modeled, some columns may have linear dependencies, because the nodes that they represent may be electrically identical. For instance, the IEEE 13-bus best feeder has a switch between buses 671 and 692 [

22], [23], which is closed by default. Thus, when constructing SPQ with static injections as in Algorithm 1, the columns associated with real and reactive power injections for these two buses will be linearly dependent and SPQ will become rank-deficient. A simple solution is to use a distance metric e.g., the L1 norm of the residuals between each column as shown in (25), as a method for measuring the similarity between SPQ columns sl1 and sl2 l1,l2. The user can then discard the rows and columns corresponding to one of the indices from each pairwise comparison to solve the rank deficiency.

dL1(sl1,sl2)=||sl1-sl2||1=m=1|sl1,m-sl2,m| (25)

For the IEEE 13-bus test feeder, we discard the rows and columns associated with bus 692, yielding a full rank matrix. We also use the similarity metric (25) for a hierarchical clustering of the sensitivities, using the single linkage clustering algorithm described in [

24].

The visualization of sensitivity columns in an example SPQ matrix formed with 100 kW or 100 kvar injections on the IEEE 13-bus test feeder is shown in Fig. 3. The columns are clustered according to their L1 distances, as described in (25). The columns corresponding to real and reactive power sensitivities are labeled as green and purple, respectively. The numbers of clusters are increased iteratively until they are equal to the 2 injection candidates. If six clusters are chosen, the clusters correspond to common injection type and injection phase. This further verifies the result that the voltage magnitude sensitivity signatures are distinct.

Fig. 3 Visualization of sensitivity columns in SPQ formed with 100 kW or 100 kvar injections on IEEE 13-bus test feeder.

Using the injection estimation methods, we form Algorithm 2 to estimate the power factor of the nodes of interest. Many real-world inverter systems are three-phase. Thus, generality for three-phase estimations is included. With any of the estimation methods described in the paper, the basic relationships between the complex apparent power and power factor are used to form an estimate for the power factor from the estimated injections.

Algorithm 2  : power factor estimation procedure

Result: returns an estimation of the power factor of DER systems operating on nodes LM

Initialization

Load network model

Retrieve Vbase=(vi),iM

Compute SPQ (Algorithm 1)

Define injection buses LM

Load historical voltage measurements

Retrieve V=(vi),iM

Compute d¯ ((7) and (8))

Compute x̂ ((11), (12), or (15))

Define P, QR1×2L

if system is three phase then

for each x̂l, each phase do

if injection type is real then

Pl=x̂l

else

Ql=x̂l

end

end

else

for each x̂l do

if injection type is real then

Pl=x̂l

else

Ql=x̂l

end

end

end

for each Pl,Ql[P,Q] do

Sl=Pl2+Ql2

pf̂l=|Pl|Sl

end

V. Case Studies and Numerical Results

In this section, we present the results for two case studies using a small (IEEE 13-bus) feeder and a larger (IEEE 123-bus) feeder to demonstrate the performance of the algorithms. The sensitivity matrix models for the feeders are constructed according to (1) and Algorithm 1. As described previously, identical points are identified and filtered using (25), with an L1 tolerance of 1×10-12, and the removal decisions are verified by referencing [

23], [25]. We then estimate the power factor control settings using the process described in Algorithm 2.

Typically, a DER system outputs a mixture of real and reactive power at the interconnection point. Thus, we will show the results obtained when static injections of real and reactive power are placed on the buses of interest. This model represents a particular measurement instance where a grid-connected PV system simultaneously generates real power, and the PCC voltage is being regulated by a reactive power injection or absorption by its advanced inverter system.

A. Small Feeder and Ridge

For this experiment, we consider two static three-phase injections on buses 633 and 671 of the IEEE 13-bus feeder, as shown in Fig. 4.

Fig. 4 Graph plot of IEEE 13-bus feeder showing interconnection locations for static systems.

By using Algorithm 1, the sensitivity matrix is constructed and preprocessed as described previously using (25). Using the first control flow of Algorithm 2, the estimation vector x̂ can be formed, and the estimated injection states derived from the x̂ coefficients can easily be used to compute an estimate for the power factor of the PV systems according to the third control flow of Algorithm 2.

When the estimation problem is well posed, i.e., >>>, least-squares regression can be used without issue, yielding accurate estimates for the injections and power factor shown in Figs. 5 and 6. In Fig. 5, P̂1 and Q̂1 are the entries of x̂ that corresponding to real and reactive power injections, respectively. The inherent difference in the voltage sensitivity signatures is realized, and accurate numerical results and mean errors from the theoretical values are shown in Table I. χ2 goodness-of-fit statistics also show favorable results for the accuracy, as shown in Table Ⅱ.

Fig. 5 Well-posed (=26>2=16) least-squares injection estimate for three-phase injections.

Fig. 6 Well-posed least-squares power factor estimation for three-phase injections.

Table I Well-posed Least-squares Estimation Results
SenarioNode (phase)Injection (kVA)Power factor
Actual 633(1) 150+j100 0.83205
633(2) 150+j100 0.83205
633(3) 150+j100 0.83205
671(1) 25+j45 0.48564
671(2) 25+j45 0.48564
671(3) 25+j45 0.48564
633(1) 150.66+j100.43 0.83207
633(2) 148.24+j101.07 0.82621
Estimated 633(3) 150.36+j99.15 0.83484
671(1) 25.15+j44.57 0.49137
671(2) 25.95+j44.66 0.50234
671(3) 25.38+j45.10 0.49082
Mean error (%) 0.0156 0.8870
Table II χ2 Goodness-of-fit Statistics for IEEE 13-bus Least-squares Estimates
PCCξP[χ2ξ]
633 2.57×10-22 0.9999
671 3.71×10-21 0.9999
670 5.69×10-18 0.9999
632 3.56×10-19 0.9999

The least-squares injection estimate is highly unbiased, as is the case with all least-squares estimators. However, these estimators typically have a high variance.

To illustrate this tradeoff, Fig. 7 shows bootstrap sampling distributions for a subset of least-squares injection estimate coefficients of the IEEE 13-bus model: x̂0 (633(1), real power), x̂1 (633(1), reactive power), x̂6 (671(1), real power), and x̂7 (671(1), reactive power). In Fig. 7, f̂(x̂) is the sample frequency. Dashed lines indicate the means of the distributions, which are equivalent to the estimates in Fig. 4. High accuracy is observed, but the estimator has high variance.

Fig. 7 Bootstrap sampling distributions for a subset of least-squares injection estimate coefficients of IEEE 13-bus model.

With least-squares, while the mean of the coefficient distribution in principle represents a highly accurate estimate of the true parameter, the precision, clearly, may be low for small feeders. This is because for small feeder models such as the IEEE 13-bus case, the SPQ matrix has limited samples (=26), which can be attributed to the variance in the coefficient distribution.

As we expand the number of candidate injections in SPQ, the problem becomes ill-posed, and the risk of overfitting to the sensitivities increases dramatically. In our experiments, a large amount of instability in the least-squares solution x̂lsq is observed when candidate injection columns 2=25 are used in SPQ. However, using the shrinkage estimator x̂ridge, a much more accurate solution is obtained.

Figure 8 shows the ill-posed ridge regression estimation vector for three-phase injections. If we use least-squares, the estimate would be highly unstable, and we would effectively fit to the noise of the sensitivity data because is close to 2. By applying ridge regression, we can consider more candidate buses in exchange for more bias in the estimate.

Fig. 8 Ill-posed ridge regression injection estimation vector for three-phase injections.

B. Large Feeder and LASSO

Distribution engineers may be interested in a wider range of candidate buses when considering feeder models such as the IEEE 123-bus case. Furthermore, in large feeder models, it may be more difficult to preprocess the SPQ matrix so that electrically identical rows and columns are removed due to incomplete information regarding the state of the switches. Figure 9 shows the results of LASSO estimation methods outlined in (12). In this experiment, a 10-fold cross-validation is used to select the Lagrange multiplier λ.

Fig. 9 LASSO regression coefficients for single-phase injections.

Figure 9 demonstrates the simultaneous feature selection and model fitting capabilities of the method. The LASSO estimate is biased toward sparse estimates in favor of lowering the variance of the estimator. Since this example exhibits single-phase injections, this model is ideal. LASSO is useful when the distribution engineer knows with high confidence that there are few distributed generators on the candidate buses measured, meaning that few corresponding columns of SPQ are relevant. The penalty on the L1 norm of x̂ favors sparsity for the solution and improves the interpretability of the model in these application scenarios.

Like ridge, LASSO coefficients also exhibit significantly less variance, as shown in Fig. 10 and Table III. When using regularized estimates in the right context, significantly better precision and interpretability of the power factor estimation can be achieved.

Fig. 10 Example of bootstrap sampling distributions for a subset of LASSO injection estimate coefficients for IEEE 123-bus model.

Table III Confidence Intervals for IEEE 123-bus LASSO Estimates
αx̂7x̂6
x˜α/2x˜1-α/2x˜α/2x˜1-α/2
0.050 (95%) 32.819 57.919 283.908 314.041
0.320 (68%) 47.262 55.264 289.787 302.131

Further details of the behavior of the optimization algorithm are shown in Fig. 11, showing the effect of the Lagrangian penalty factor λ on the model coefficients and the MSE with each fold of the cross-validation procedure, where a new λ value is selected.

Fig. 11 LASSO injection estimate coefficients and MSE trace plots versus -log(λ). (a) Coefficients weights versus -log(λ). (b) MSE versus -log(λ).

In Fig. 11, we also show one solution for selecting the hyperparameters of the regularized estimators. Figure 11(a) shows the values of the parameters in the estimation vector x̂lasso, and Fig. 11(b) shows the MSE of the estimate, both as functions of the negative natural logarithm of the penalty factor λ.

To generate the trace in Fig. 11(a), we used 10-fold grid-search CV [

17], [19], which iteratively splits the rows of SPQ into nine test groups and seeks to find the value of λ that generates the lowest MSE values for predicting the remaining fold. The range of -log(λ) values considered is shown on the x-axis. In Fig. 11(b), we show the MSE at each fold. The final value of -log(λ) is indicated by the dashed vertical line.

VI. Discussion

There are several key limitations to the results in this paper. The preprocessing of time-series data in forming the d¯ vector has not yet been fully solved. If the user only has access to this type of data, determining which subset of D' to sample as d¯ may be challenging. Figure 2 illustrates this challenge by showing the values of D' as defined in (5), for all values of i and tT for an injection scenario with power factor of 0.8.

A practical solution is to select a window of d¯ values in the midday period of the time-series. Recent developments in the field of statistical learning could be used to perform feature extraction or selection to identify the timepoints of interest [

26], [27]. In time-series modeling, the estimation becomes less accurate when there is less difference in the voltage from power injections during nighttime and cloudy days.

As for many physics-based inverse problems, the estimation methods provided in this paper may have a high degree of variance. Additional improvements could be studied to determine the proper amount of bias to add to the estimators to enable more interpretable models, particularly when working with non-midday time-series data.

An additional limitation of these methods is that the rough location of the distributed generator is assumed to be known. However, [

13], [14], [28] could be used in tandem with this method to first estimate the location of an unknown distributed generator, and then the algorithms proposed in this paper could be applied to a constrained set of candidate injection locations to estimate the power factor of the DER. Another potential solution to this problem would be statistical methods such as best subset selection [17] or other regularization methods such as the LASSO method presented in this paper. If the user does not have confidence that the selected candidate injection locations contain a generator, it cannot be guaranteed that the observed voltage magnitude deviations in the data will be attributable to a distributed generator. In these instances, the estimation may be less interpretable.

Future works will include a more robust time-series feature extraction method as well as the extension of these methods to the estimation of other inverter parameters such as control settings and curtailment.

VII. Conclusion

This research has presented a novel physics-based and data-driven method to estimate the injection state and power factor of inverter-based DERs. Based purely on voltage magnitude measurements and model-derived sensitivities, flexible estimation approaches have been developed for various realistic use cases.

Firstly, we have shown that for a fixed power system model with a properly constructed SPQ sensitivity matrix, the real and reactive power voltage sensitivity signatures for a set of candidate buses under study are linearly independent, in which case, SPQ is full rank. Notably, we have also shown that there exists a significant enough difference between these signatures to estimate the real and reactive power injection states of inverter-based DERs using purely voltage measurements and linear parameter estimation models.

Additionally, we have shown that regularization methods are highly effective at improving the precision of the linear models in certain use case scenarios. Often, the least squares model may not be a viable option from a practical standpoint. For models with poor goodness of fit as derived in Section II, or when the model characteristics meet those described in Section III, the regularized methods will often be necessary, and will improve the estimation performance.

As the demand and need for solar PV and other inverter-based DERs increase, it is vital to access the information that characterizes these oftentimes unobservable distributed systems for ensuring a smooth transition to a decarbonized grid. These algorithms can be used in power system planning, operation, and control applications to give utilities and ISOs with the ability to estimate important control parameters and the operating point of these distributed generation systems. This can be of service in bolstering sustainable power system planning and decarbonization efforts.

References

1

M. U. Qureshi, S. Grijalva, and M. J. Reno, “A fast quasi-static time series simulation method for PV smart inverters with VAR control using linear sensitivity model,” in Proceedings of 2018 IEEE 7th World Conference on Photovoltaic Energy Conversion (WCPEC), Waikoloa, USA, Jun. 2018, pp. 1614-1619. [Baidu Scholar

2

S. Deshmukh, B. Natarajan, and A. Pahwa, “State estimation and voltage/var control in distribution network with intermittent measurements,” IEEE Transactions on Smart Grid, vol. 5, no. 1, pp. 200-209, Jan. 2014. [Baidu Scholar

3

Q. Long, J. Wang, D. Lubkeman et al., “Volt-var optimization of distribution systems for coordinating utility voltage control with smart inverters,” in Proceedings of 2019 IEEE PES Innovative Smart Grid Technologies Conference (ISGT), Washington, USA, Feb. 2019, pp. 1-5. [Baidu Scholar

4

A. O’Connell and A. Keane, “Volt-var curves for photovoltaic inverters in distribution systems,” IET Generation, Transmission & Distribution, vol. 11, no. 3, pp. 730-739, Mar. 2017. [Baidu Scholar

5

IEEE Standard for Interconnection and Interoperability of Distributed Energy Resources with Associated Electric Power Systems Interfaces, IEEE Standard 1547-2018, 2018. [Baidu Scholar

6

L. Blakely, M. J. Reno, and J. Peppanen, “Identifying common errors in distribution system models,” in Proceedings of 2019 IEEE 46th Photovoltaic Specialists Conference (PVSC), Chicago, USA, Jun. 2019, pp. 3132-3139. [Baidu Scholar

7

K. Christakou, J. LeBoudec, M. Paolone et al., “Efficient computation of sensitivity coefficients of node voltages and line currents in unbalanced radial electrical distribution networks,” IEEE Transactions on Smart Grid, vol. 4, no. 2, pp. 741-750, Jun. 2013. [Baidu Scholar

8

C. McEntee, N. Lu, and D. Lubkeman. (2020, Oct.). A regression-based voltage estimation method for distribution volt-var control with limited data. [Online]. Available: https://arxiv.org/abs/2010.12456 [Baidu Scholar

9

M. Rylander, M. J. Reno, J. E. Quiroz et al., “Methods to determine recommended feeder-wide advanced inverter settings for improving distribution system performance,” in Proceedings of 2016 IEEE 43rd Photovoltaic Specialists Conference (PVSC), Portland, USA, Jun. 2016, pp. 1393-1398. [Baidu Scholar

10

M. Rylander, J. Smith, and H. Li, “Determination of smart inverter control settings to improve distribution system performance,” in Proceedings of CIGRE US National Committee 2014 Grid of the Future Symposium, Houston, USA, Oct. 2014, pp. 1-6. [Baidu Scholar

11

R. Dobbe, P. Hidalgo-Gonzalez, S. Karagiannopoulos et al., “Learning to control in power systems: design and analysis guidelines for concrete safety problems,” Electric Power Systems Research, vol. 189, p. 106615, Dec. 2020. [Baidu Scholar

12

M. Emmanuel, J. I. G. Miner, P. Gotseff et al., “Estimation of solar photovoltaic energy curtailment due to volt-watt control,” IET Renewable Power Generation, vol. 14, no. 4, p. 6, Jan. 2020. [Baidu Scholar

13

S. Grijalva, A. Khan, J. S. Mbeleg et al., “Estimation of PV location in distribution systems based on voltage sensitivities,” in Proceedings of IEEE North American Power Symposium (NAPS), Tempe, USA, Apr. 2021, p. 5. [Baidu Scholar

14

C. Gomez-Peces, S. Grijalva, M. J. Reno et al., “Estimation of PV location based on voltage sensitivities in distribution systems with discrete voltage regulation equipment,” in Proceedings of IEEE PowerTech Madrid, Madrid, Spain, Jun. 2021, pp. 1-6. [Baidu Scholar

15

A. Dubey and S. Santoso, “On estimation and sensitivity analysis of distribution circuit’s photovoltaic hosting capacity,” IEEE Transactions on Power Systems, vol. 32, no. 4, pp. 2779-2789, Jul. 2017. [Baidu Scholar

16

J. L. Devore and K. N. Berk, Modern Mathematical Statistics with Applications, New York: Springer, 2nd ed., 2012. [Baidu Scholar

17

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, New York: Springer, Jan. 2017. [Baidu Scholar

18

W. N. van Wieringen. (2020, Aug.). Lecture notes on ridge regression. [Online]. Available: https://arxiv.org/abs/1509.09169 [Baidu Scholar

19

F. Pedregosa, G. Varoquaux, A. Gramfort et al., “Scikit-learn: machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825-2830, Dec. 2011. [Baidu Scholar

20

R. Lockhart, J. Taylor, R. J. Tibshirani et al., “A significance test for the lasso,” The Annals of Statistics, vol. 42, no. 2, pp. 413-468, Apr. 2014. [Baidu Scholar

21

A. Davison and D. Hinkley, “Bootstrap methods and their application,” Journal of the American Statistical Association, vol. 94, p. 5, Jan. 1997. [Baidu Scholar

22

W. H. Kersting and G. Shirek, “Short circuit analysis of IEEE test feeders,” in Proceedings of 2012 IEEE PES Transmission and Distribution Conference and Exposition, New Orleans, USA, May 2012, pp. 1-9. [Baidu Scholar

23

W. H. Kersting, Distribution System Modeling and Analysis, Boca Raton: CRC Press, 2018. [Baidu Scholar

24

D. Müllner. (2011, Sept.). Modern hierarchical, agglomerative clustering algorithms. [Online]. Available: https://arxiv.org/abs/1109.2378 [Baidu Scholar

25

W. H. Kersting, “Radial distribution test feeders,” IEEE Transactions on Power Systems, vol. 6, no. 3, pp. 975-985, Aug. 1991. [Baidu Scholar

26

Y. Sun, J. Li, J. Liu et al., “Using causal discovery for feature selection in multivariate numerical time series,” Machine Learning, vol. 101, pp. 377-395, Oct. 2015. [Baidu Scholar

27

J. Taylor and R. J. Tibshirani, “Statistical learning and selective inference,” Proceedings of the National Academy of Sciences, vol. 112, pp. 7629-7634, Jun. 2015. [Baidu Scholar

28

K. Mason, M. J. Reno, L. Blakely et al., “A deep neural network approach for behind-the-meter residential PV size, tilt and azimuth estimation,” Solar Energy, vol. 196, pp. 260-269, Jan. 2020. [Baidu Scholar