Machine+Learning+for+PV+System+Operational+Fault+Analysis_Literature+Review_Final
Machine+Learning+for+PV+System+Operational+Fault+Analysis_Literature+Review_Final
Abstract. This review paper aims to discover the research gap and
assess the feasibility of a holistic approach for photovoltaic (PV) system
operational fault analysis using machine learning (ML) methods. The
analysis includes the detection and diagnosis of operational faults in
a PV system. Even if standard protective devices are installed in PV
systems, they fail to clear various faults because of low current during low
mismatch levels, high impedance fault, low irradiance, etc. This failure
will increase the energy loss and endanger the PV system’s reliability,
stability, and security. As a result of the ML method’s ability to handle
a non-linear relationship, distinguishing features with similar signatures,
and their online application, they are getting attractive in recent years
for fault detection and diagnosis (FDD) in PV systems. In this paper, a
review of literature on ML-based PV system FDD methods is provided.
It is found that considering their simplicity and performance accuracy,
Artificial Neural networks such as Multi-layer Perceptron are the most
promising approach in finding a central PV system FDD. Besides, the
review paper has identified main implementation challenges and provides
recommendations for future work.
1 Introduction
Owning to the various advantages PV system can provide, the global market
for PV has been increasing sharply. According to [1], the cumulative globally
installed capacity in 2019 increased to about 627 GW. Assuming a medium
scenario where cases like COVID-19 pandemic considered, [14] estimated the
total global installed PV generation capacity to exceed 1.2 TW by 2022. In
addition, the price of electricity from a PV system is constantly decreasing [20].
This all shows a promise for further increase in the PV market in the coming
years.
?
Supported by NTNU
Nomenclature
most frequent and dangerous faults. Nevertheless, most of the literature focused
on PV array fault. Besides, only one paper tried to implement the ML method in
a programmable logic device based on the author’s knowledge. Thus, this paper
aim at answering the reason behind. To the best of the author’s knowledge, this
review paper is the first to review literature, keeping in mind the feasibility of a
holistic fault analysis approach for a PV system specifically for standalone PV
system (SAPVS) as well as categorizing and analyzing FDD into methods based
on ML and deep learning, ensemble learning, and transfer learning.
The paper is organized as follows: After providing a summary of review pa-
pers, the first part of section 2 gives detailed information about various faults
commonly occurring in PV system components. Then, the second part of section
2 provides a comprehensive literature review on PV FDD. Whereas, section 3
presents and discuss all the findings. Finally, the paper concludes by summariz-
ing the main findings and providing some recommendations.
2 Literature Review
Arc Fault (AF) AF is a fault where current flows in the air or dielectric outside
the conductor due to loose connection. It could be a series arc in case of a
connection between modules or a parallel arc in case of a closely placed conductor
at different potential differences [4, 33, 31]. On the contrary to other faults, arc
fault has little effect on the I-V or I-P characteristic of PV arrays. Nevertheless,
it leads to a severe distortion in the output current and voltage waveform [33].
Arc Fault Circuit Interrupters and Arc Fault Detectors are recommended for
clearing this fault. However, multiple of them have to be installed to clear the
fault correctly. Moreover, when they are installed at the inverter side, they fail
to protect the fault as attenuated arc signals reach them. Beside detecting arc
fault, identifying which arc fault is occurring is important as the measure taken
for one will increase the impact of the other [33].
Even it can lead to destruction due to fire hazards as a result of cell/module tem-
perature increase due to the dissipated energy [27]. Furthermore, as it results
in multiple peaks in I-V characteristic curve, identifying the maximum power
point by MPPT will be challenging [33]. Besides, unless a time factor is used, it
is hard to differentiate it from OC fault as their effect on power output charac-
teristic has similarity [27]. Furthermore, to mitigate the problem, bypass diodes
are installed at each module, but this will increase the installation cost [33].
Others In addition to the above main PV array faults others may include degra-
dation faults [18, 34], hot spot fault [39], fault in bypass diode which could be
OC or SC fault [33], and blocking diodes faults [30].
Solar Battery Fault The battery takes around 43% of the life cycle cost of
SAPVS [37]. As a result, it shall get attention, and a good working condition shall
be provided. The main faults that could happen in this PV component includes
external short-circuit fault [36], degradation fault [35], internal fault which could
be GF and SC fault [32], overcharging (over-voltage), undercharging (under-
voltage) and open circuit (total voltage to zero) [39]. The impact of those faults
in a battery may range from decreasing its performance, shorten its lifetime, and
increased maintenance cost to fire hazard explosion [32]. The lack of guidelines
on how to select fuse and circuit breaker is mentioned in [32] as one of the main
challenges in detecting internal faults. Moreover, the gradual change of current
and voltage of a battery makes detecting faults on time extremely difficult.
MPPT Fault An MPPT control system comprises various sensors to get ir-
radiance, temperature, current, and voltage measurements and an optimization
algorithm to search the maximum power point to operate the PV array and
boost the PV system yield. Thus, any error in any part of the MPPT will lead
to a wrong operating power point, which significantly decreases the PV sys-
tem’s output power. Sensor failure and lack of an efficient and effective MPPT
algorithm are the most common fault in MPPT [27, 29].
and localizing the occurrence. This section closely look at the literature on ML
application for FDD in PV systems by classifying them into methods based on
1) machine learning and deep learning, 2) ensemble learning, and 3) transfer
learning.
Methods based on ensemble learning (EL) [13], similarly to [12] the model
is based on I-V characteristics and focus on LL fault at different mismatch and
impedance level. However, here they used probabilistic ensemble learning model
comprising of SVM, NB, and KNN. For decision, the average of all the results
of the algorithm was used. They could achieve an average of 99% and 99.5%
for detecting and classifying LL fault. Moreover, they have evaluated the model
with simulation and experimental data set. In [22] EL method with DT, RF,
DA, etc., was used to detect PS and SC fault, but the focus is still PV array.
They have used electrical parameters as input features.
Among other ML methods, SVM and MLP, in general, have been used exten-
sively to detect and classify faults in a PV system. For evaluating the models,
accuracy and confusion metrics are the most employed performance indices.
However, some have utilized their own metrics and execution time. Due to ML’s
random nature, it is very important to report performance after conducting a
reasonable number of model execution though it takes time.
Looking at the data source where the experimental PV system was installed,
Algeria took the lead, China and Korea take second place. It is also very impor-
tant to analyze the performance of the ML methods under different climatic and
geographical conditions before utilizing them. This is because the challenge for
the PV array and the battery is different depending on the geographical location.
For instance, while snow is a big problem in the polar region, dust, soiling, and
higher operating temperature are huge problems in the equatorial region.
Most of the papers depend on the input features which has been generated
from a simulated PV system. Whereas only a few have included experimental
data. This is because of the difficultly of setting up a PV system only for collect-
ing data. Furthermore, when an available PV system exists as the environmental
condition can not be controlled, it is tedious and time-consuming to generate
a data set that will enable the model to acquire a generalization capacity. Ir-
radiance, temperature, and major points from I-V characteristics are the most
utilized input features in case of a fault in a PV array. In comparison, current
and voltage data are used in case of a fault in a battery, inverter, MPPT, and
others. Electrical and meteorological data are mostly used in ML, whereas image
data are the most common input features for deep learning algorithms such as
CNN. However, recently as 1-D can be transformed to 2-D data, electrical and
meteorological data are also employed for deep learning algorithms in general.
Faults like arc fault that does not reflect its effect on I-V characteristics of
PV arrays, a method that includes the analysis of signal waveform (some kind of
transformation, for example, wavelet) which could show signal distortion effect,
might be an appropriate method to capture most of the faults in a PV system.
Moreover, in most papers, prepossessing of data like normalization has resulted
in better accuracy. Nevertheless, whenever this is not possible deep learning
models are efficient due to their capacity in extracting features automatically.
Even if major progress has been seen in the research area in using the ML
method for FDD in a PV system, only one paper has implemented the ML
method in prototype based on the literature review. Furthermore, so far, this
method is not commercialized. Thus, the authors have identified the following
main challenges.
Table 1: Summary of reviewed literature on PV system FDD using ML methods
[7] 2017 ML KELM I-V char 3 3 PV array SC, OC, PS, Degradation - -
[8] 2016 ML MLP, RBF I, V, peaks from I-V char 3 5 PV array SC,bypass diode, OC, connection, PS - -
[39] 2014 ML - - - - Battery Internal resistance fault, overcharging - -
11
Note: - : Not given, 3: Is used, 5: Not used G: Irradiance, T: Temperature and for other abbreviations please refer to the document
12 T. Zenebe et al.
– Training, validation, and test data set that fit at least major fault in a PV
system, PV type and size are very rare to find.
– Even if most researchers have developed their own data set, most of them
are simulation data. Besides, in DL-based methods, gathering the image data
using a camera and drone is very expensive.
– Many measuring devices and sensors are needed due to the absence of a
proper method for effective input feature selection.
– There is a lack of knowledge on how to generate rare but severe faults.
– Selection of model configurations is done with try and error.
– The model devised so far does not have the modularity and generalization
capacity; as a result, ML model selection varies depending on fault type, the
size, and type of input data.
– Studies that guide integrating the methods with the existing protective de-
vices are not developed very well. Moreover, all the paper does not go in-
depth on how to clear the faults. Once the fault is classified, a method and
strategy are needed to coordinate it with protective devices for clearing the
fault automatically and/or convey the message to the operators for solutions.
– The model’s accuracy is variable as it depends on the data size, data quality,
and the number of input and output features.
– For comparing ML methods based on accuracy, cost, execution time, memory
usage, there are no standards or common testing platforms.
Acknowledgements
We would like to thank our Electrical Power Engineering department at NTNU
for funding this work as part of a PhD. project.
Conflict of interest
The authors declare that they have no conflict of interest.
ML for PV System Operational Fault Analysis 13
References
1. Snapshot of Global PV Markets - 2020 p. 20
2. Ahmad, S., et al.: Fault Classification for Single Phase Photovoltaic Systems using Ma-
chine Learning Techniques. In: 2018 8th IEEE India International Conference on Power
Electronics (IICPE). pp. 1–6. https://doi.org/10.1109/IICPE.2018.8709463
3. Ali, M.U., et al.: A machine learning framework to identify the hotspot
in photovoltaic module using infrared thermography 208, 643–651.
https://doi.org/10.1016/j.solener.2020.08.027
4. Aziz, F., et al.: A Novel Convolutional Neural Network-Based Ap-
proach for Fault Classification in Photovoltaic Arrays 8, 41889–41904.
https://doi.org/10.1109/ACCESS.2020.2977116
5. Basnet, B., et al.: An Intelligent Fault Detection Model for Fault Detection in Photovoltaic
Systems 2020, e6960328. https://doi.org/10.1155/2020/6960328
6. Bognár, A., et al.: An unsupervised method for identifying local PV shad-
ing based on AC power and regional irradiance data 174, 1068–1077.
https://doi.org/10.1016/j.solener.2018.10.007
7. Chen, Z., et al.: Intelligent fault diagnosis of photovoltaic arrays based on optimized kernel
extreme learning machine and IV characteristics. Applied Energy 204, 912–931 (2017)
8. Chine, W., et al.: A novel fault diagnosis technique for photovoltaic systems based on
artificial neural networks. Renewable Energy 90, 501–512 (2016)
9. Cho, T.H., et al.: Comparison of Intelligent Methods of SOC Estimation for Battery of
Photovoltaic System 9(9), 8
10. Da Costa, C.H., et al.: A Comparison of Machine Learning-Based Methods for
Fault Classification in Photovoltaic Systems. In: 2019 IEEE PES Innovative Smart
Grid Technologies Conference - Latin America (ISGT Latin America). pp. 1–6.
https://doi.org/10.1109/ISGT-LA.2019.8895279
11. Dong, A., et al.: Fault Diagnosis and Classification in Photovoltaic Systems Using SCADA
Data. In: 2017 International Conference on Sensing, Diagnostics, Prognostics, and Control
(SDPC). pp. 117–122. https://doi.org/10.1109/SDPC.2017.31
12. Eskandari, A., et al.: Autonomous Monitoring of Line-to-Line Faults in Photovoltaic Sys-
tems by Feature Selection and Parameter Optimization of Support Vector Machine Using
Genetic Algorithms 10(16), 5527. https://doi.org/10.3390/app10165527
13. Eskandari, A., et al.: Line-line fault detection and classification for photovoltaic
systems using ensemble learning model based on I-V characteristics 211, 354–365.
https://doi.org/10.1016/j.solener.2020.09.071
14. Europe, S.P.: Global market outlook for solar power/2020–2024. Solar Power Europe:
Brussels, Belgium (2020)
15. Garoudja, E., et al.: An enhanced machine learning based approach for failures detection
and diagnosis of PV systems. Energy conversion and management 151, 496–513 (2017)
16. Ghaffarzadeh, N., Azadian, A.: A comprehensive review and performance evaluation in
solar (PV) systems fault classification and fault detection techniques
17. Hajji, M., et al.: Multivariate feature extraction based supervised ma-
chine learning for fault detection and diagnosis in photovoltaic systems .
https://doi.org/10.1016/j.ejcon.2020.03.004
18. Harrou, F., et al.: Monitoring of Photovoltaic Systems Using Improved Kernel-Based
Learning Schemes 11(3), 806–818. https://doi.org/10.1109/JPHOTOV.2021.3057169
19. Hong, J., et al.: Fault prognosis of battery system based on accurate voltage
abnormity prognosis using long short-term memory neural networks 251, 113381.
https://doi.org/10.1016/j.apenergy.2019.113381
20. IRENA: Renewable power generation costs in 2018. International Renewable Energy
Agency, Abu Dhabi (2019)
21. Ismail, N., et al.: Short-Circuit Incipient Faults Detection from Single Phase PWM Inverter
using Artificial Neural Network 10, 10
14 T. Zenebe et al.
22. Kapucu, C., et al.: A supervised ensemble learning method for fault diagnosis in photo-
voltaic strings 227, 120463. https://doi.org/10.1016/j.energy.2021.120463
23. Lazzaretti, A.E., et al.: A Monitoring System for Online Fault Detection and Classification
in Photovoltaic Plants 20(17), 4688. https://doi.org/10.3390/s20174688
24. Li, B., et al.: Application of Artificial Neural Networks to photovoltaic fault detection and
diagnosis: A review 138, 110512. https://doi.org/10.1016/j.rser.2020.110512
25. Livera, A., et al.: Advanced Failure Detection Algorithms and Performance Decision Clas-
sification for Grid-Connected PV Systems. https://doi.org/10.4229/EUPVSEC20172017-
6BV.2.13
26. Madeti, S.R., Singh, S.N.: A comprehensive study on different types of
faults and detection techniques for solar photovoltaic system 158, 161–185.
https://doi.org/10.1016/j.solener.2017.08.069
27. Madeti, S.R., Singh, S.N.: Modeling of PV system based on experimental data for fault
detection using kNN method 173, 139–151. https://doi.org/10.1016/j.solener.2018.07.038
28. Malik, A., et al.: Transfer Learning-Based Novel Fault Classification Technique for Grid-
Connected PV Inverter. In: Mekhilef, S., et al. (eds.) Innovations in Electrical and
Electronic Engineering. pp. 217–224. Lecture Notes in Electrical Engineering, Springer.
https://doi.org/10.1007/978-981-16-0749-3-16
29. Mano, M., et al.: Classification and detection of faults in grid connected photovoltaic
system. Int. J. Sci. Eng. Res 7(4), 149–154 (2016)
30. Mellit, A.: Recent Applications of Artificial Intelligence in Fault Diagnosis of Photovoltaic
Systems. In: Mellit, A., Benghanem, M. (eds.) A Practical Guide for Advanced Methods
in Solar Photovoltaic Systems, pp. 257–271. Advanced Structured Materials, Springer
International Publishing. https://doi.org/10.1007/978-3-030-43473-1-13
31. Mellit, A., et al.: Fault detection and diagnosis methods for photovoltaic systems: A review.
Renewable and Sustainable Energy Reviews 91, 1–17 (2018)
32. Nailen, R.L.: Battery protection-where do we stand? IEEE transactions on industry ap-
plications 27(4), 658–667 (1991)
33. Pillai, D.S., Rajasekar, N.: A comprehensive review on protection challenges and fault
diagnosis in PV systems. Renewable and Sustainable Energy Reviews 91, 18–40 (2018)
34. Sabbaghpur Arani, M., Hejazi, M.A.: The Comprehensive Study of Electri-
cal Faults in PV Arrays 2016, 1–10. https://doi.org/10.1155/2016/8712960,
https://www.hindawi.com/journals/jece/2016/8712960/
35. Sabri, N., et al.: Intelligent fault supervisory system applied on stand-alone photovoltaic
system. In: 2018 International Conference on Applied Smart Systems (ICASS). pp. 1–5.
https://doi.org/10.1109/ICASS.2018.8651950
36. Sabri, N., et al.: Faults diagnosis in stand-alone photovoltaic system using artificial neural
network. In: 2018 6th International Conference on Control Engineering & Information
Technology (CEIT). pp. 1–6. IEEE (2018)
37. Sabri, N., et al.: Battery internal fault monitoring based on anomaly detection algorithm.
Advanced Statistical Modeling, Forecasting, and Fault Detection in Renewable Energy
Systems p. 187 (2020)
38. Shin, J.H., Kim, J.O.: On-Line Diagnosis and Fault State Classification Method of Pho-
tovoltaic Plant 13(17), 4584. https://doi.org/10.3390/en13174584
39. Tadj, M., et al.: An innovative method based on satellite image analysis to check fault in a
PV system lead–acid battery 47, 236–247. https://doi.org/10.1016/j.simpat.2014.06.010
40. Youssef, A., et al.: The role of artificial intelligence in photo-voltaic systems design and
control: A review 78, 72–79. https://doi.org/10.1016/j.rser.2017.04.046
41. Zhao, Y., et al.: Line-line fault analysis and protection challenges in solar photovoltaic
arrays. IEEE transactions on Industrial Electronics 60(9), 3784–3795 (2012)