Abstract
With the popularity of smart meters and the growing availability of high-resolution load data, the research on the dynamics of electricity consumption at finely resolved timescales has become increasingly popular. Many existing algorithms underperform when clustering load profiles contain a large number of feature points. In addition, it is difficult to accurately describe the similarity of profile shapes when load sequences have large fluctuations, leading to inaccurate clustering results. To this end, this paper proposes a high-resolution load profile clustering approach based on dynamic largest triangle three buckets (LTTBs) and multiscale dynamic time warping under limited warping path length (LDTW). Dynamic LTTB is a novel dimensionality reduction algorithm based on LTTB. New sequences are constructed by dynamically dividing the intervals of significant feature points. The extraction of fluctuation characteristics is optimized. New curves with more concentrated features will be applied to the subsequent clustering. The proposed multiscale LDTW is used to generate a similarity matrix for spectral clustering, providing a more comprehensive and flexible matching method to characterize the similarity of load profiles. Thus, the clustering effect of a high-resolution load profile is improved. The proposed approach has been applied to multiple datasets. Experiment results demonstrate that the proposed approach significantly improves the Davies-Bouldin indicator (DBI) and validity index (VI). Therefore, better similarity and accuracy can be achieved using high-resolution load profile clustering.
WITH the development of smart grids and advanced metering infrastructure, a vast amount of fine-grained electricity consumption data has been generated [
Numerous methods, including K-medoids [
To retain the major information related to electricity usage activities and improve computational efficiency, some studies have utilized different dimensionality reduction technologies to reduce the dimensions of the load data such as piecewise aggregate approximation (PAA) [
Dynamic time warping (DTW) can realign the matching relationship between data points to alleviate the problem of time drift [
To address these problems, a high-resolution load profile clustering approach based on dynamic LTTB and a multiscale dynamic time warping under limited warping path length (LDTW) is proposed in this paper. The proposed approach reduces the dimensions of high-resolution load profiles using the dynamic LTTB and obtains a new representative sequence composed of the original data. It can dynamically select the region where the appropriate feature points are located according to the degree of fluctuation in the curve. The use of raw data to form a new sequence ensures that the characteristics will not be flattened because of violent fluctuations over a short time period. Subsequently, the multiscale LDTW is used as a metric to describe the similarity of sequences with regards to shape and value. The method of limiting the overall step size is used to determine the matching relationship of the data points in multiscale LDTW, suppressing pathological alignments without limiting the matching relationship of each point. The similarity matrix of spectral clustering is created accordingly. Finally, spectral clustering improves the similarity of the load profiles in the same cluster and increases the accuracy of load profiling.
The main contributions of this paper are as follows.
1) An improved LTTB algorithm is proposed to address the high volatility of high-resolution load profiles. A dynamic method for determining a region of representative points in the load profiles is provided, and the new sequence is composed of the original data to reduce the possibility of distortion. It can overcome the fixed bucket limitation and maintain a balance between the retention of key information and the computational cost.
2) Multiscale LDTW is proposed to construct a similarity matrix for spectral clustering. The determination of the matching relationship between data points by limiting the overall step size is applied to load profile clustering for the first time. This flexible method suppresses pathological alignment. The discrimination ability of the shape similarity is enhanced, and the accuracy of high-dimensional load profile clustering is improved.
3) An approach to high-resolution load profile clustering is proposed, which improves the effect of clustering and provides a more accurate description of the power consumption behavior.
The main process of load profile clustering includes five steps: input raw data to generate load profiles, preprocess the raw load profiles, reduce the dimensionality of the load profiles using dynamic LTTB, cluster load profile using multiscale LDTW, and extract typical load profiles.
We regard the load data that fall outside three standard deviations of the mean as outlier data and fill in missing data using the k-nearest neighbor (KNN) algorithm. We normalize the data using the min-max method and convert data at different levels into a unified measure. The processed data are arranged to form the load profiles for clustering.
Assuming that the original data have been screened for outliers and that the missing values are filled in, we convert the original data into a new sequence . The th datum is calculated using (1).
(1) |
where is the th record after normalization using the extreme values; and and are the minimum and maximum records, respectively.
The proposed dynamic LTTB has a dynamic bucket size that is different from LTTB with equal-sized buckets. Inspired by the LTD algorithm, dynamic LTTB reassigns each bucket to capture representative feature points. If the fluctuation in a bucket is considerable, it is likely that the data points are better represented as two buckets. Two adjacent buckets can be merged if the fluctuation is low. The steps of the proposed method are as follows.
Step 1: separate all the data points into roughly equal-sized buckets, with the first and last data points from the original data serving as the first and last buckets, respectively.
Step 2: evaluate the fluctuation in the data in each bucket by the variance and adjust the sizes of the buckets accordingly. Select the highest-ranking bucket and divide it into two equal-sized buckets so that an extra bucket can be created. We maintain the total number of iterative buckets by locating and combining an adjacent bucket pair with the lowest total variance sum. Meanwhile, we focus on limiting the size of a single bucket. Taking division and merging as one operation, we obtain the final bucket distribution through iteration. The number of iterations for adjusting the bucket size is calculated using (2).
(2) |
where N is the number of iterations; n is the original count of data points; and T is the downsampling threshold.
To guarantee that a bucket does not cause an excessive time shift after redivision, which will have a negative effect on the description of the load curve characteristics, we need to limit the size of a single bucket. Let the maximum length of the barrel be times the original length and the minimum length be times the original length. The range of the limit length can be adjusted according to different datasets. We select different values of and for multiple power load datasets having different lengths for the experiments. During the experiments, we need to select different values of and to limit the splitting of the buckets, and we compare the effects of dimensionality reduction for different values of and to determine the appropriate values.
Step 3: go through all the buckets and choose the representative points from each. The first and last buckets have only one point that is selected by default. The dynamic LTTB moves from left to right, working with three buckets at a time. The first point in the left corner of the triangle is always fixed as the previously selected point. We select the midpoint of the third bucket as a temporary point. Thus, the dynamic LTTB has two fixed points. The remaining point to be determined is the intermediate data point. We use the effective area (EA) of the data point of the current bucket to determine the most representative point. The EA of a point is the area of a triangle formed by its two adjacent points. The construction of the largest triangle across the buckets is depicted in

Fig. 1 Largest triangle formed by three adjacent buckets.
By adjusting the buckets of the original data and restricting bucket segmentation dynamically, representative data points can be extracted more effectively, and the interval for selecting the feature data can be selectively controlled when the curves have different fluctuations.
The description of the similarity between curves and the choice of clustering algorithm are the important components of the power load profile clustering analysis. The DTW algorithm is frequently used to compare similarities across time series and has a positive impact on curve comparisons. On this basis, the proposed multiscale LDTW offers a more adaptable and multiangle data point matching relationship. Spectral clustering has been widely used because of its excellent clustering effects. This paper proposes spectral clustering based on multiscale LDTW to improve clustering outcomes. It employs multiscale LDTW as a similarity measure to produce a similarity matrix for spectral clustering.
The proposed multiscale LDTW inherits the concept of DTW with a limited warping path length. In the matching process, the soft restriction method of limiting the number of steps in the entire series matching process is used to generate an evaluation index for clustering. On this basis, the original distance measurement method is improved. Moreover, a method for determining the step length for the restriction of series with different characteristics is proposed to balance the topological and direct alignments of the series.
The main idea of multiscale LDTW is to determine the step size of a path to provide more flexible restriction for data point matching and find the optimal matching path. We can observe the process in reverse. Let the total step size of a path be , the current step size in the matching process be , and the corresponding distance be . The last point of the two series must match, and at this time. must originate from the left, lower, or lower left side according to DTW rules. The previous step cannot be used if the step size for these three positions does not satisfy . An additional dimension is added to determine whether the possible step size of the previous data point conforms to the total step size . When filling the distance matrix , each position needs to include the path length corresponding to all possible path lengths, and the minimum is selected in this step.
Therefore, we can improve the calculation process of DTW and obtain:
(3) |
where and represent the positions of the data points in the series and , respectively; and is the distance between the data points and .
The numerical values and derivatives of the series measure the characteristics of the series from different perspectives. The difference in the numerical values indicates the difference between the series themselves, whereas the derivative more closely reflects whether the changes in the trends of the series are similar. In this study, we use a combination of numerical and derivative differences as the distance measure and provide adjustable weights. An element of the distance matrix is calculated as:
(4) |
where is the distance metric obtained after weighting; and are the numerical and derivative differences of each corresponding data point of the two series, respectively; and and are the corresponding weights ().
The weights can be adjusted according to the characteristics of different time series. To calculate the derivative of a data point, we use the following approximation:
(5) |
The use of the total step size as the limiting condition can improve flexibility during the matching process, but there is a problem of determining a limited step size for the profiles. In this study, the standard deviation is used to measure the fluctuation in the sequences in the datasets, and the limited step size is determined accordingly.
The proposed approach determines whether it is necessary to provide a more relaxed step size limitation by measuring the fluctuation in the data points compared with the points at the same position. Let the original step size be the length of the series. The standard deviation of the points at the same position is calculated and recorded, and the difference between the two paired points is compared with the standard deviation of the position. When the difference is small, the fluctuation is not sufficient to increase the step size. Otherwise, the original step size is increased by 1. Then, we iterate over the data points at each location to obtain the final step size .
The calculation process for multiscale LDTW is summarized in
Algorithm 1 : calculation process for multiscale LDTW |
---|
Input: sequences ; limited step size ; distance matrix formed by each combination of data points between the sequences calculated by (4) |
Output: distance between sequences and calculated by multiscale LDTW |
1. Let be the length of the sequence |
2. Let be the matrix recording the cumulative distances computed during the matching process. and q represent the positions of the data points matched by the current series A and B, respectively; diff records the accumulated distance to the current position; and is used to record the matching situation of the previous step |
3. Initialize |
4. Calculate D when or using (3) |
5. for to N do |
6. for to N do |
7. for to do |
8. Calculate by (3) and record the coordinate corresponding to the minimum value using |
9. end for |
10. end for |
11. end for |
12. , |
13. The smallest in the range from MinStep to MaxStep is the multiscale LDTW distance |
Spectral clustering is becoming increasingly popular because of its excellent performance for time-series clustering. The fundamental concept behind spectral clustering is the conversion of all of the power data into points in space. The edge that links the weights of these points between two spots separated by a long distance is lower.
To achieve the goal of clustering, the edge weight between distinct subgraphs after cutting is as low as possible, and the edge weight sum within the subgraph is as high as possible by cutting the graph made of all data points [
Step 1: create a graph for a given set of samples containing load profiles, with each series acting as a vertex on the graph. Calculate the multiscale LDTW distance between each sample.
Step 2: use multiscale LDTW as the distance measure fm to calculate the distance between each series to form the similarity matrix S.
(6) |
Construct the degree matrix . Each diagonal element of is the sum of the elements of each row of the associated similarity matrix S, and all other elements are 0.
(7) |
Calculate the Laplacian matrix using S and and normalize the Laplacian matrix to produce superior clustering results.
(8) |
Step 3: compute the eigenvectors of and arrange the vectors in ascending order of eigenvalues to form the matrix . To form a new matrix , the first k eigenvectors are chosen. The row vector of matrix is used as the new feature of the original series to obtain the clustering result.
The flowchart of the proposed approach is shown in

Fig. 2 Flowchart of proposed approach.
In this section, the proposed approach is evaluated with multiple high-resolution load profile datasets. And it is compared with popular approaches in terms of the effects of dimensionality reduction and the similarity measures.
In this study, multiple publicly available load datasets are used to evaluate the proposed approach. We choose the smart building energy dataset CU-BEMS [
Dataset | Length | Size of samples | CV |
---|---|---|---|
CU-BEMS | 1440 | 101 | 1.42 |
Computer | 720 | 241 | 0.83 |
Powercons | 144 | 176 | 1.06 |
Electric devices | 96 | 1367 | 1.23 |
This subsection introduces the performance measures used in this study. The effects of dimensionality reduction are evaluated using the average distinguished information (ADI). The Davies-Bouldin indicator (DBI) and validity index (VI) are used to measure the clustering effect. The DBI is a commonly used indicator to measure the clustering effect. By comparing the similarity based on both distance and correlation, the VI considers whether the changes in the trends between series are similar in a more thorough manner.
1) ADI [
(9) |
where is the element in a two-dimensional representation dataset Y; M is the total number of series; is the mean of all the th data of the series; and is the length of each series.
2) DBI [
(10) |
where is the th data point in class ; is the center of class ; is the number of data points in class ; is the number of data points in ; is the value of the th attribute of the center point of class ; is the mean distance between the samples in the th class and their cluster centroids; and is the distance between the th class and the th cluster centroid. A smaller DBI indicates a better clustering effect.
3) VI [
(11) |
where is the average value of the distances from all of the objects of the dataset to the corresponding clustering centers; is the average distance between cluster centers; is the average correlation between the consumption profiles and the corresponding clustering centers; and is the average correlation between clustering centers.
In this subsection, we analyze the effects of dimensionality reduction from three aspects, compare the information retention and clustering quality after performing equally-spaced sampling (ES), PAA, LTTB, and dynamic LTTB, and show the details of data processing through an example. We use different methods to reduce the dimensions of the four datasets with different lengths, and the resulting ADIs are listed in
Dataset | New dimension | ADI | |||
---|---|---|---|---|---|
ES | PAA | LTTB | Dynamic LTTB | ||
CU-BEMS | 48 | 0.72 | 0.78 | 0.83 | 0.89 |
24 | 0.71 | 0.69 | 0.74 | 0.88 | |
Computer | 48 | 0.18 | 0.22 | 0.25 | 0.28 |
24 | 0.13 | 0.20 | 0.18 | 0.23 | |
Powercons | 24 | 0.24 | 0.30 | 0.33 | 0.35 |
12 | 0.20 | 0.26 | 0.31 | 0.32 | |
Electric devices | 24 | 0.19 | 0.25 | 0.41 | 0.48 |
12 | 0.22 | 0.17 | 0.39 | 0.47 |
Further, we cluster the dimensionless load profiles created by different methods as clustering targets and compare their clustering effects. In
Dataset | Index | Original data | ES | PAA | LTTB | Dynamic LTTB |
---|---|---|---|---|---|---|
CU-BEMS | DBI | 0.74 | 1.92 | 1.21 | 1.06 | 0.78 |
VI | 0.11 | 0.34 | 0.19 | 0.16 | 0.13 | |
Computer | DBI | 2.52 | 4.11 | 2.97 | 2.89 | 2.77 |
VI | 0.29 | 0.67 | 0.38 | 0.30 | 0.26 | |
Powercons | DBI | 1.51 | 2.57 | 1.73 | 1.69 | 1.63 |
VI | 0.74 | 1.14 | 0.91 | 0.93 | 0.87 | |
Electric devices | DBI | 3.31 | 4.82 | 4.05 | 3.48 | 3.41 |
VI | 0.48 | 0.91 | 0.71 | 0.63 | 0.56 |
To demonstrate the effects of different dimensionality reduction methods more intuitively, we select a series in the Powercons dataset as an example to show the series obtained by different dimensionality reduction methods. The original series length decreases from 144 to 12. The results are shown in

Fig. 3 Example series from Powercons dataset showing effects of different dimensionality reduction methods. (a) ES. (b) PAA. (c) LTTB. (d) Dynamic LTTB.
The sampling time of this dataset is 10 min. The green dotted line represents the division line for each bucket. From this example, we can observe the advantages of dynamic LTTB for dimensionality reduction. The results of ES can be considered as random selection, which is likely to miss key data features. The PAA uses the average value of the data in each bucket as a representative. If the data points in the bucket significantly fluctuate, the representative point may poorly describe the characteristics of the original series compared with the average. LTTB is more flexible in selecting representative points, but it does not perform well in cases where multiple key data points are located in the same bucket. The dynamic LTTB considers the locations of potential key data points during the process of generating buckets and accordingly creates a flexible division. In the case of load profiles with large fluctuations and uncertain locations, the feature retention of the original sequence can be improved, which is conducive to the subsequent clustering.
We use different similarity methods to cluster different datasets and compare their clustering performances and the matching relationships of data points. We perform clustering using the similarity matrix calculated by DTW, derivative dynamic time warping (DDTW), LimitDTW, and multiscale LDTW and evaluate the clustering quality using DBI and VI. The results are presented in
Dataset | Index | DTW | DDTW | LimitDTW | Multiscale LDTW |
---|---|---|---|---|---|
CU-BEMS | DBI | 1.52 | 0.91 | 1.03 | 0.78 |
VI | 0.26 | 0.17 | 0.21 | 0.13 | |
Computer | DBI | 3.43 | 3.07 | 2.96 | 2.77 |
VI | 0.41 | 0.28 | 0.29 | 0.26 | |
Powercons | DBI | 2.18 | 1.74 | 1.95 | 1.63 |
VI | 1.31 | 0.91 | 0.98 | 0.87 | |
Electric devices | DBI | 3.97 | 4.87 | 3.73 | 3.41 |
VI | 0.77 | 0.83 | 0.72 | 0.56 |
To demonstrate the improvement in data point matching with multiscale LDTW more clearly, we take the matching paths of two load profiles in the Powercons dataset obtained by DTW and multiscale LDTW. The sampling time of this dataset is 10 min. The calculated suitable step size for this example is 179. The connection relationships between the data points are shown in

Fig. 4 Connection relationships between data points obtained with DTW and multiscale LDTW. (a) DTW. (b) Multiscale LDTW.
The clustering results for the CU-BEMS dataset in

Fig. 5 Clustering results for CU-BEMS dataset with proposed approach. (a) Cluster 1. (b) Cluster 2. (c) Cluster 3. (d) Cluster 4. (e) Cluster 5. (f) Cluster 6.
According to the results, we can obtain the following characteristics of each cluster. There is a continuous power consumption peak from 08:00 to 20:00 in cluster 1, and it decreases slightly at noon. The load of cluster 2 is very low for a long time. The peak periods of power consumption for cluster 3 are 08:00-12:00 and 13:00-16:00. The overall load level of cluster 4 is low, but the load from 06:00 to 20:00 is relatively high. The peak load of cluster 5 occurs from 00:00 to 08:00 and from 16:00 to 24:00. The load of cluster 6 is maintained at a stable level. Compared with previous methods, the proposed approach achieves a more accurate division. Each cluster has distinct characteristics considering the similarity between samples from multiple perspectives.
In this subsection, we compare DTW, DDTW, and LimitDTW with multiscale LDTW. The average time required to generate a similarity matrix using different methods for four different datasets is shown in

Fig. 6 Average time required to generate a similarity matrix using different methods for four different datasets.
This paper presents a high-resolution load profile clustering approach based on dynamic LTTB and multiscale LDTW that improves the effects of feature extraction and clustering in the data with substantial fluctuations. The proposed dynamic LTTB can scan the possible positions of data points with key features and flexibly adjust the distribution of the buckets in each sequence. The proposed multiscale LDTW suppresses the pathological alignments of the data by limiting the overall step size of the matching between the data points in sequences. The adjustable numerical and derivative distances are combined for measurement. Compared with other popular methods, the proposed approach exhibits better accuracy for most datasets.
In future work, an efficient approach to DTW that can quickly determine the matching relationships between data points is worth studying.
References
Y. Liu, G. Wang, W. Guo et al., “Power data mining in smart grid environment,” Journal of Intelligent & Fuzzy Systems, vol. 40, no. 2, pp. 3169-3175, Feb. 2021. [Baidu Scholar]
Y. Li, D. Han, and Z. Yan, “Long-term system load forecasting based on data-driven linear clustering method,” Journal of Modern Power Systems and Clean Energy, vol. 6, no. 2, pp. 306-316, Mar. 2018. [Baidu Scholar]
M. Wen, R. Xie, K. Lu et al., “Feddetect: a novel privacy-preserving federated learning framework for energy theft detection in smart grid,” IEEE Internet of Things Journal, vol. 9, no. 8, pp. 6069-6080, Sept. 2021. [Baidu Scholar]
Y. Wang, Q. Chen, C. Kang et al., “Load profiling and its application to demand response: a review,” Tsinghua Science and Technology, vol. 20, no. 2, pp. 117-129, Apr. 2015. [Baidu Scholar]
C. Si, S. Xu, C. Wan et al., “Electric load clustering in smart grid: methodologies, applications, and future trends,” Journal of Modern Power Systems and Clean Energy, vol. 9, no. 2, pp. 237-252, Mar. 2021. [Baidu Scholar]
Z. Wang and H. Wang, “Analyzing seasonal variation in residential load patterns via two-stage clustering and relative entropy: poster,” in Proceedings of the Twelfth ACM International Conference on Future Energy Systems, Torino, Italy, Jun. 2021, pp. 286-287. [Baidu Scholar]
D. Zhang, X. Zhao, Y. Guo et al., “Cluster analysis of smart meter load based on electricity behavior characteristics,” in Proceedings of International Conference on Genetic and Evolutionary Computing, Jilin, China, Jan. 2022, pp. 429-437. [Baidu Scholar]
A. Dedić, T. Konjić, M. Ćalasan et al., “Fuzzy C-means clustering applied to load profiling of industrial customers,” Electric Power Components and Systems, vol. 49, no. 11-12, pp. 1-17, Apr. 2022. [Baidu Scholar]
S. Lin, F. Li, E. Tian et al., “Clustering load profiles for demand response applications,” IEEE Transactions on Smart Grid, vol. 10, no. 2, pp. 1599-1607, Nov. 2017. [Baidu Scholar]
Z. Bi, Y. Leng, Z. Liu et al., “An improved spectral clustering algorithm using fast dynamic time warping for power load curve analysis,” in Proceedings of International Conference on Mobile Computing, Applications, and Services, Athens, Greece, Dec. 2020, pp. 143-159. [Baidu Scholar]
J. Menezes and N. Poojary, “Dimensionality reduction and classification of hyperspetral images using DWT and DCCF,” in Proceedings of 2016 3rd MEC International Conference on Big Data and Smart City (ICBDSC), Muscat, Oman, Mar. 2016, pp. 1-6. [Baidu Scholar]
S. Steinarsson, Downsampling Time Series for Visual Representation. Ph.D. dissertation, University of Iceland, Reykjavik, Iceland, 2013. [Baidu Scholar]
J. L. Chen, Y. Huang, W. Qiu et al., “Research on feature extraction method of power grid online data based on big data,” Journal of Physics: Conference Series, vol. 2030, p. 012064, Sept. 2021. [Baidu Scholar]
H. Ding, G. Trajcevski, P. Scheuermann et al., “Querying and mining of time series data: experimental comparison of representations and distance measures,” Proceedings of the VLDB Endowment, vol. 1, no. 2, pp. 1542-1552, Aug. 2008. [Baidu Scholar]
G. L. Ray and P. Pinson, “Online adaptive clustering algorithm for load profiling,” Sustainable Energy, Grids and Networks, vol. 17, p. 100181, Mar. 2019. [Baidu Scholar]
T. Teeraratkul, D. O’Neill, and S. Lall, “Shape-based approach to household electric load curve clustering and prediction,” IEEE Transactions on Smart Grid, vol. 9, no. 5, pp. 5196-5206, Mar. 2017. [Baidu Scholar]
M. Gao, T. Gong, R. Lin et al., “A power load clustering method based on limited DTW algorithm,” in Proceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China, Mar. 2019, pp. 253-256. [Baidu Scholar]
H. Liang and J. Ma, “Develop load shape dictionary through efficient clustering based on elastic dissimilarity measure,” IEEE Transactions on Smart Grid, vol. 12, no. 1, pp. 442-452, Jan. 2021. [Baidu Scholar]
Z. Zhang, R. Tavenard, A. Bailly et al., “Dynamic time warping under limited warping path length,” Information Sciences, vol. 393, pp. 91-107, Jul. 2017. [Baidu Scholar]
A. Ghosal, A. Nandy, A. K. Das et al., “A short review on different clustering techniques and their applications,” in Emerging Technology in Modelling and Graphics. Berlin: Springer, 2019, pp. 69-83. [Baidu Scholar]
M. Pipattanasomporn, G. Chitalia, J. Songsiri et al., “CU-BEMS, smart building electricity consumption and indoor environmental sensor datasets,” Scientific Data, vol. 7, no. 1, pp. 1-14, Jul. 2020. [Baidu Scholar]
H. A. Dau, A. Bagnall, K. Kamgar et al., “The UCR time series archive,” IEEE/CAA Journal of Automatica Sinica, vol. 6, no. 6, pp. 1293-1305, Nov. 2019. [Baidu Scholar]
R. Granell, C. J. Axon, and D. C. Wallom, “Impacts of raw data temporal resolution using selected clustering methods on residential electricity load profiles,” IEEE Transactions on Power Systems, vol. 30, no. 6, pp. 3217-3224, Dec. 2014. [Baidu Scholar]
A. Rajabi, L. Li, J. Zhang et al., “A review on clustering of residential electricity customers and its applications,” in Proceedings of 2017 20th International Conference on Electrical Machines and Systems (ICEMS), Sydney, Australia, Aug. 2017, pp. 1-6. [Baidu Scholar]
Y. Wang, L. Li, and Q. Yang, “Application of clustering technique to electricity customer classification for load forecasting,” in Proceedings of 2015 IEEE International Conference on Information and Automation, Lijiang, China, Aug. 2015, pp. 1425-1430. [Baidu Scholar]