Abstract
We propose a new way to develop non-parametric models of power curves using artificial intelligence tools. One parametric model and eight non-parametric models are developed to emulate the behavior described by the power curve of the wind farms. A comparison between the power curve models based on artificial neural networks (ANNs) and those based on fuzzy logic are also proposed. Some of the power curve models based on ANNs and fuzzy inference systems (FISs) are used as well as two new FISs with the proposed new heuristic. An initial pre-training is proposed, resulting from the characteristics derived from the expert inference followed by a transformation of a fuzzy Mamdani system into a fuzzy Sugeno system. Although the presented values by the error indicators are comparable, the results show that the new pre-trained FIS models have better precision compared with the ANN and FIS models. The comparative study is conducted in two wind farms located in northeastern Brazil. The proposed method is a relevant alternative to improve power curve approximation based on an FIS.
THE modelling of power curves is a crucial factor in wind power operation, which contributes to different aspects of the operation, e.g., control and performance improvement of a wind turbine or a wind farm [
In [
In [
In [
In [
In [
Recently, the substantial growth of the Brazilian load demand is inevitable. Wind energy is one of the solutions. However, wind speed variability introduces a random profile in the energy matrix. Thus, over the years, heuristics and applications that seek to achieve the optimality of the control and operation have been increasingly developed. The concepts and techniques of artificial intelligence are applied to wind forecasting in order to improve the adaptation of wind farms and wind turbines under varying conditions.
One important element of wind power forecasting is a model that attempts to reproduce the power curve of a wind farm or machine under a given load condition when connected to an electrical network. This is the motivation for developing the models of power curves that optimally match wind turbine performance and enhance the reliability of power system.
Power curves of wind turbines represent the physical relationship between the electric power generated by the wind turbine and the wind speed incidence at the height of the rotor hub. According to the International Energy Agency (IEA), a power curve is the functional identity of a wind turbine and is defined as a performance certificate guaranteed by the manufacturer.
A typical power curve is modeled based on three basic characteristics: the cut-in speed, the range of wind speed constituting the region of effective operation, and the cut-out speed. These aspects are defined for a given height of the rotor hub in the steady state without turbulence.
These three features delimit the power curve of any turbine. The operation range of the turbine is established and the minimum and maximum operation values of the average speed and average power are defined. A schematic diagram of a power curve can be found in Fig. SA1 of Supplement A [
According to [
Using the power curve to control a wind farm is another generalized method, which considers the limitations of machine construction. Generally, for a wind farm, the measurements are taken over a specific period. The average of the variables is then calculated and aggregated to the wind speed period. The average values should be appropriate according to the local or global control analysis of the wind incidence on a single wind turbine or on several wind turbines in the wind farm.

Fig. 1 Average speed versus average power for a wind farm.
The available wind energy that crosses the rotor of a wind turbine can be obtained as:
(1) |
where is the power linked to a wind speed v; A is the surface area of the turbine rotor; and is the air density (a typical value is 1225 kg/m³ [
(2) |
where is the non-dimensional power coefficient, which represents the theoretical amount of mechanical force that can be extracted by the turbine rotor; and is the efficiency of the wind turbine [
The Mamdani inference model is one of the first systems using fuzzy set theory [
Considering that a fuzzy system is composed of n rules with one of the rules represented as: if , , ..., , then , where Xi are the system inputs; are the linguistic variables defined by the input relevance function; Y1 is the output; and Bi is a linguistic variable defined by the output relevance function.
Fuzzy Takagi-Sugeno-Kang (TSK) inference [
We develop a new heuristic that uses the pre-established inference for training and parameterizing the models of power curves. The proposed heuristic consists of an initial training that uses fuzzy inference, which is a starting point for the parameterization of the model. This stage is performed on fuzzy Mamdani models and is called the pre-set. The second stage consists of the transformation of the Mamdani model into a Sugeno model with similar parameters as in the previous model, which is necessary to reinforce the learning process. In the third stage, an adaptive neuro-fuzzy inference system (ANFIS) is used to optimize the parameters of FIS Sugeno block with a less complex starting point than the initial point.
In the process of secondary learning, ANFIS is used to establish the best parameters of the resulting FIS through neural networks. Examples of parameters include the number of membership functions, the types of membership functions (Gaussian), the best arrangement at each specific interval, the range of inference, and the number of clusters used in the learning. The pre-trained FIS can be considered as a less entropic set, i.e., with a better-defined clustering, than the starting point of the system without defining the parameters. The proposed method significantly improves learning the models of power curves, which is verified by evaluation indices.
A comparative study is conducted on two wind farms in the coastal region of northeastern Brazil based on the historical data of supervisory control and data acquisition (SCADA). The data are provided by the Brazilian electric system operator.
Wind farm 1 has 28 Suzlon model S88 aerogenerators of 2100 kW and 60 Hz with a rotor diameter of 88 m. They are installed in the towers with a height of 80 m for a total installed capacity of 70.56 MW. Wind farm 2 has 60 Suzlon model S88 aerogenerators of 2100 kW and 60 Hz with a rotor diameter of 88 m. They are installed in towers with a height of 80 m for a total installed capacity of 126 MW.
The databases are inserted into the models through the construction of conditioning patterns with two basic sets. First, the learning set is used to train the models, and second, the simulation set is used to compare the efficiency between the models. The learning set is divided into smaller subsets called the training, testing and validation sets with 60%, 20%, and 20% of the data, respectively. The learning set covers approximately two years in wind farm 1 and three years in wind farm 2. The simulation is carried out over one year for both wind farms.
After defining the learning and simulation sets, the training phase starts to train the models in order to ensure a good approximation of the power curves for each wind farm. The effectiveness of the learning process is verified in the simulation phase. The dataset arrangement and number of patterns are provided in
Although the power curve provided by a manufacturer describes the relationship between the wind speed and power generated for a specific air density, it does not consider the installation site or wear of the wind turbine. Therefore, it is important to develop the models of power curves for wind farms in operation.
In [
In [
In [
Considering various parameters in power estimation by incident winds, we develop nine models of power curves forwind turbine: two with artificial neutral networks (ANNs), two with FIS (Mamdani), two with FIS (Sugeno), two with FIS (Sugeno) by ANFIS, and one with the latest fuzzy models. Note that the data range is chosen to better adapt the models of power curves with actual operation of the wind farms, which focuses on the effective region of the power curves between the cut-in and cut-out speeds.
The parametric model of the average power curve is mainly based on the average power curve of wind farms 1 and 2. Thus, to ensure a good approximation of the trend, a polynomial function is constructed to suit the characteristics of the average power curve. The average power curve consists of a parametric model that determines a polynomial function, establishing the approximate average relation of the power curve based on the given database.
The polynomial function is composed of specific, well-known sections in the power curve. In this paper, medium curve models are operation models, and the region is of effective operation, i.e., after the cut-in region and before the cut-out region. Therefore, the polyfit function in MATLAB is used to generate the medium curves. Four polynomial regions are considered in each wind farm dataset, as shown in

Fig. 2 Polynomial approximations relative to average power curves in wind farms 1 and 2.
The polynomial forms for each region are expressed as follows. The equations include polynomial coefficients that create a best fit approximation. The region names characterize the relationship between the points, which is specified according to a distinctive polynomial for both wind farms.
1) Cubic region 1 of wind farm 1 is described as .
2) Cubic region 1 of wind farm 2 is described as .
3) Cubic region 2 of wind farm 1 is described as .
4) Cubic region 2 of wind farm 2 is described as .
5) Cubic region 3 of wind farm 1 is described as .
6) Cubic region 3 of wind farm 2 is described as .
7) Constant region of wind farm 1 is described as .
8) Constant region of wind farm 2 is described as .
ANNs are applied in the structure of two models of power curves, and the choice of their input variables is made due to their close correlation with wind speed. More than one hidden layer is used in the architectures, which enhances the approximation precision. The Levenberg Marquardt learning algorithm is applied in ANN models, and the hyperbolic tangent is used as the activation function in each layer [
The PC1-ANN model consists of a multi-layer perception (MLP) neural network with the average wind speed as the input and the average wind power as the output. The PC1-ANN architectures and characteristics for wind farms 1and 2 are shown in Figs. SA2-4 of Supplement A.
The PC2-ANN model consists of an MLP neural network with two inputs: the average wind speed (output) and the average wind direction. The architectures of PC2-ANN [
In this paper, we use the models based on three distinct types of fuzzy systems: Mamdani inference, TSK inference, and an ANFIS neuro-fuzzy system.
1) PC1-fuzzy (Mamdani) Model
The PC1-fuzzy model consists of an FIS with the average wind speed as the input and the average wind power as the output [
The architecture of PC1-fuzzy is shown in Fig. SA8 of Supplement A. PC1-fuzzy resulting from the learning process related to wind farms 1 and 2 is shown in

Fig. 3 PC1-fuzzy resulting from learning process related to wind farms 1 and 2.
2) PC2-fuzzy (Mamdani) Model
The PC2-fuzzy model is a Mamdani FIS with two inputs: the average wind speed and the average wind direction. The output is the average wind power. The learning stage of the PC2-fuzzy (Mamdani) model is manually constructed by trial and error to describe the appropriate position for the pertinence function in an established range and to compare the obtained and expected values. Five logistic sigmoid functions are used for the speeds and seven Gaussian member functions are used for the direction. The same criterion of the five logistic sigmoid functions is used by the output inference. The architecture of PC2-fuzzy is shown in Fig. SA9 of Supplement A.

Fig. 4 PC2-fuzzy surface resulting from learning process related to wind farms 1 and 2.
3) PC1-fuzzy (Sugeno) Model and PC1-fuzzy (Sugeno-ANFIS) Model
PC1-fuzzy (Sugeno) and PC1-fuzzy (Sugeno-ANFIS) apply the wind speed as the input and the average wind power as the output with TSK as the inference. The output is discriminated by ranges. And each range obeys a specific inference function, either constant or linear.
To convert the characteristics of a Mamdani FIS into a Sugeno FIS, the “mam2sug” transformation function in MATLAB is applied. The transformation converts a Mamdani FIS, where the inference is given by the criterion of the maximum and minimum range, to a Sugeno FIS, whose inference or output function is given by linear or constant functions related to each range. The main difference between the traditional Sugeno models and the models developed in this paper is the pre-training, resulting from the use of the “mam2sug” transformation.
After the transformation, the input data clustering becomes better defined, i.e., more classes of data per range of input variables are formed. The increasing number of classes increases the amount of information per track, which improves the performance of the PC1-fuzzy (Sugeno-ANFIS) model and makes it more accurate than a TSK system with standard initialization without pre-training. After the pre-training, PC1-fuzzy (Sugeno) model is obtained without using ANFIS.
The routine for creating the fuzzy PC (Sugeno-ANFIS) is as:
1) Creation of FIS (Mamdani)
a) The learning database is initialized in FIS, and the inputs and outputs are loaded into it.
b) The numbers of intervals of the respective inputs and outputs are selected by successive tests.
c) The type of member function inferred in each interval is chosen through the correlation trend between the input and output variables.
d) The best arrangement of the member functions in each interval is manually adjusted by trial and error until the value of the output reaches a satisfactory approximation level.
2) Creation of pre-established inference (pre-training)
a) The “mam2sug” transformation is used, converting the FIS (Mamdani) into the FIS (Sugeno).
b) The learning database is started in ANFIS.
c) A satisfactory sampling radius is defined on ANFIS.
d) With the value of the sample ray, the grid partition function is chosen. It automatically adjusts the best arrangement of the member functions by interval.
e) FIS is created (Sugeno-ANFIS).
The architecture of the PC1-fuzzy (Sugeno-ANFIS) model and the trend curve resulting from the new learning process for both wind farms are shown in Figs.

Fig. 5 PC1-fuzzy (Sugeno-ANFIS) resulting from learning process related to wind farms 1 and 2.

Fig. 6 Trend curve resulting from learning process related to wind farms 1 and 2.
4) PC2-fuzzy (Sugeno) model and PC2-fuzzy (Sugeno-ANFIS) model
PC2-fuzzy (Sugeno-ANFIS) has similar characteristics compared with those of PC1-fuzzy (Sugeno-ANFIS), differing only in its architecture. The average wind speed and the average wind direction are the inputs and the average wind power is the output. The heuristic and the methods used to define the main parameters of this power approximation block are similar to those described in the previous fuzzy block. Besides, after the pre-training, PC2-fuzzy (Sugeno) model is obtained without using ANFIS. PC2-fuzzy (Sugeno-ANFIS) and the trend surfaces resulting from the learning process are shown in Figs.

Fig. 7 PC2-fuzzy (Sugeno-ANFIS) resulting from learning process related to wind farms 1 and 2.

Fig. 8 Trend surfaces resulting from learning process related to wind farms1 and 2.
The polynomial model of the average power curve is used as a reference in the approximation. The mean absolute error (MAE), normalized mean absolute error (NMAE), and root mean square error (RMSE) between the approximate power values are calculated for the curves and their respective real values according to the database of wind farms 1 and 2.
The power predict error , the mean absolute power prediction error , the normalized mean absolute power prediction error , and the root mean square power prediction error , are calculated in (3)-(6), which is based on the installed capacity of the wind farm and RMSE:
(3) |
(4) |
(5) |
(6) |
where and are the actual and desired outputs of the network, respectively. The gain of the respective power curve model is calculated in (7) when it is compared with the reference model. In this case, the polynomial parametric model of the average power curve is:
(7) |
where is the evaluation criterion of the reference model; and CA(k) is the evaluation criterion of the proposed model. MAE, NMAE, and RMSE can be used as the evaluation criteria.
The performance indices MAE, NMAE and RMSE are presented in
In the proposed method, a second variable along with wind speed is applied as the input to the models of power curves. Two hidden layers are used in the ANN models, which enhances the learning performance of these models.
The significant difference between the error indicators for wind farms 1 and 2 is important to be noted, which indicates that the behavior of most of the algorithms on the same dataset is similar. When a dataset can be easily explained by the models, all of the error indicators are low as shown in
Similar behavior occurs for the Sugeno-ANFIS models since ANFIS is responsible for deep learning by defining the best parameters of the pre-trained FIS. The optimal search accomplishes the influence of neuro-fuzzy inferences in the training process, thus the performances of these models are improved.
Figures

Fig. 9 MAE of simulation set for models of active power curves in wind farms 1 and 2.

Fig. 10 NMAE of simulation set for models of active power curves in wind farms 1 and 2.

Fig. 11 RMSE of simulation set for models of active power curves in wind farms 1 and 2.
These gains are described by MAE, NMAE, and RMSE indices of 50.65%, 50.65%, and 46.68%, respectively. For wind farm 2, the PC1-fuzzy (Sugeno-ANFIS) and PC2-fuzzy (Sugeno-ANFIS) models have the best results, as indicated by the MAE, NMAE, and RMSE of 5.26%, 6.78%, and 4.70%, respectively.
Another relevant result is the advantage of the models with two correlated inputs, which is expected due to the greater amount of information provided to the fuzzy block. The presence of some negative gains indicates lower performance in relation to the reference model.
The performance of the models of power curves on the simulation dataset is shown in the power responses according to Tables

Fig. 12 Responses of power curves for entries of simulation dataset.
The small and similar errors are achieved in the developed models. Small differences may have an impact on the power forecasting, the generation dispatching from other sources, the cost of the generator system, and the finances of the wind farm owner. The energy quantity saved by the proposed methods is shown in Tables
In this paper, a new method is proposed to improve the performance of approximation models of power curves. Pre-established inference is applied, resulting from the conversion of a Mamdani FIS into a Sugeno FIS.
The results indicate that the pre-training is effective in terms of improving the power curves because it selects the clusters with a pre-defined organization tendency. And the result database has less data entropy, which is a decisive factor in obtaining satisfactory results in the learning process.
Two wind farms are studied. The reference model is a polynomial parametric model of the average power curve. The performance evaluation shows that the proposed fuzzy model outperforms the others. The pre-inference criteria improves the performance of the fuzzy models in this paper compared with that of other models. Moreover, the models presenting the best performance are those with two inputs: the average wind speed and the average wind direction. The choice of these variables is based on the study on correlation.
The gains obtained by the new models in relation to the average power curve (the reference model) are particularly satisfactory for wind farm 1, where the gains in MAE, NMAE, and RMSE indices of the highest pre-established inference models are 50.65%, 50.65%, and 46.68%, respectively.
For wind farm 2, due to better wind speed and direction data, the pre-established inference models also have positive gains in MAE, NMAE, and RMSE indices, which are 5.26%, 6.78%, and 4.70%, respectively. However, they are much more modest in relation to the gains for wind farm 1.
The proposed models exhibits better performance compared with those in [
References
J. C. Albuquerque, “Avaliação de curvas de potência em modelos de previsão de geração eólica em curto prazo,” M.Sc. dissertation, Departamento de Engenharia Elétrica, Universidade Federal de Pernambuco, Recife, Brasil, 2015. [百度学术]
R. R. B. de Aquino, J. C. Albuquerque, O. N. Neto et al., “Assessment of power curves in models of wind power forecasting, ” in Proceedings of International Joint Conference on Neural Networks, Vancouver, Canada, Jul. 2016, pp. 3915-3922. [百度学术]
A. Kusiak, H. Zheng, and Z. Song, “On-line monitoring of power curves,” Renewable Energy, vol. 34, no. 6, pp. 1487-1493, Jun. 2009. [百度学术]
A. Kusiak and A. Verma, “Monitoring wind farms with performance curves,” IEEE Transactions on Sustainable Energy, vol. 4, no. 1, pp. 192-199, Jan. 2013. [百度学术]
M. Lydia, A. I. Selvakumar, S. S. Kumar et al., “Advanced algorithms for wind turbine power curve modeling,” IEEE Transactions on Sustainable Energy, vol. 4, no. 3, pp. 827-835, Jul. 2013. [百度学术]
Y. H. Wan, E. Ela, and K. Orwig, “Development of an equivalent wind plant power-curve,” National Renewable Energy Laboratory NREL, Golden, USA, Tech. Rep. NREL/CP-550-48146, Jun. 2010. [百度学术]
A. Sedaghata, A. Hassanzadehb, J. Jamalic et al., “Determination of rated wind speed for maximum annual energy production of variable speed wind turbines,” Applied Energy, vol. 205, no. 1, pp. 781-789, Nov. 2017. [百度学术]
S. A. Akdag and Ö. Güler, “Alternative moment method for wind energy potential and turbine energy output estimation,” Renewable Energy, vol. 120, pp. 69-77, May 2018. [百度学术]
T. Ouyang, A. Kusiak, and Y. He, “Modeling wind-turbine power curve: A data partitioning and mining approach,” Renewable Energy, vol. 102, pp. 1-8, Mar. 2017. [百度学术]
J. Tian, D. Zhou, C. Su et al., “Wind turbine power curve design for optimal power generation in wind farms considering wake effect,” Energies, vol. 10, no. 3, pp. 395-413, Mar. 2017. [百度学术]
T. Jin and Z. Tian, “Uncertainty analysis for wind energy production with dynamic power curves,” in Proceedings of 2010 IEEE 11th International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), Singapore, Jun. 2010, pp. 745-750. [百度学术]
B. Stephen, S. J. Galloway, D. McMillan et al., “A copula model of wind turbine performance,” IEEE Transactions on Power Systems, vol. 26, no. 2, pp. 965-966, May 2011. [百度学术]
C. L. Archer, H. P. Simão, W. Kempton et al.,“The challenge of integrating offshore wind power in the U.S. electric grid, Part I: wind forecast error,” Renewable Energy, vol. 103, pp. 346-360, Apr. 2017. [百度学术]
F. P. G. Márquez, A. M. Tobias, J. M. P. Pérez et al., “Condition monitoring of wind turbines: techniques and methods,” Renewable Energy, vol. 46, pp. 169-178, Oct. 2012. [百度学术]
I. Colak, S. Sagiroglu, and M. Yesilbudak, “Data mining and wind power prediction: a literature review,” Renewable Energy, vol. 46, pp. 241-247, Oct. 2012. [百度学术]
J. Gottschall and J. Peinke “How to improve the estimation of power curves for wind turbines,” Environmental Research Letters, vol. 3, no. 1, pp. 15005-15011, Jan. 2008. [百度学术]
Wind Turbines-Part 12-1: Power Performance Measurements of Electricity Producing Wind Turbines, IEC61400-12-1, 2005. [百度学术]
M. Stiebler, Wind Energy Systems for Electric Power Generation. Berlin: Springer-Verlag, 2008, pp. 17-27. [百度学术]
T. Ackermann and L. Söder, “Wind energy technology and current status: a review,” Renewable and Sustainable Energy Reviews, vol. 4, no. 4, pp. 315-374, Dec. 2000. [百度学术]
T. Ackermann, Wind Power in Power Systems. Hoboken: Wiley, 2012, pp. 47-68. [百度学术]
A. Betz, “The maximum of the theoretically possible exploitation of wind by means of a wind motor,” Wind Engineering, vol. 37, no. 4, pp. 441-446, May 2013. [百度学术]
S. O. Rezende, Sistemas Inteligentes: Fundamentos e Aplicações, Barueri: Manole, Jan. 2005. [百度学术]
M. Sugeno, Industrial Applications of Fuzzy Control. New York: Elsevier Science Inc., 1985. [百度学术]
J. Whale, M. P. McHenry, and A. Malla, “Scheduling and conducting power performance testing of a small turbine,” Renewable Energy, vol. 55, pp. 55-61, Jul. 2013. [百度学术]
S. Shokrzadeh, M. J. Jozani, and E. Bibeau, “Wind turbine power curve modeling using advanced parametric and nonparametric methods,” IEEE Transactions on Sustainable Energy, vol. 5, no. 4, pp. 1262-1269, Oct. 2014. [百度学术]
M. Mehrjoo, M. J. Jozani, and M. Pawlak, “Wind turbine power curve modeling for reliable power prediction using monotonic regression,” Renewable Energy, vol. 147, pp. 214-222, Mar. 2020. [百度学术]
R. K. Pandit, D. Infield, and A. Kolios, “Comparison of advanced non-parametric models for wind turbine power curves,” IET Renewable Power Generation, vol. 13, no. 9, pp. 1503-1510, Jul. 2019. [百度学术]