Artificial Intelligence-Based Power Transformer Health Index For Handling Data Uncertainty
Artificial Intelligence-Based Power Transformer Health Index For Handling Data Uncertainty
ABSTRACT Power transformer is a critical and expensive asset in electric transmission and distribution
networks. It is essential to monitor the health condition of all power transformer fleet in such networks
to avoid unwanted outages. The health index (HI) is a quick and efficient way to assess the condition of
power transformers based on multi-criteria. While Power transformer HI method has been well presented
in the literature, not much attention was given to handle the uncertainty and reliability of this method due
to unavailability of used data. Therefore, this paper aims to tackle this issue through employing Artificial
Intelligence (AI)-based techniques to reveal the health condition of power transformers with high accuracy
and at the same time handling data uncertainty. The proposed HI approach assesses the power transformer
insulation system based on oil quality, dissolved gas analysis (DGA), and paper condition. In this regard,
collected data from 504, 150-kV transformers are used to establish the proposed AI-models. Seven AI
algorithms including k-Nearest Neighbor (kNN), Support Vector Machine (SVM), Random Forest (RF),
Naïve Bayes (NB), Artificial Neural Network (ANN), Adaptive Boosting (AdaBoost), and Decision Tree are
investigated. A performance comparison of the proposed AI-based HI models is carried out using the scoring-
weighting-based HI method as the reference. Results show that RF model provides the best performance in
predicting power transformer HI with an accuracy of 97.3%.
INDEX TERMS Power transformer, health index, insulation system, condition monitoring, artificial
intelligence.
I. INTRODUCTION the aging mechanism of the transformer oil and paper insu-
Power transformers are among the most critical and expen- lation have been published in the literatures [3]–[7]. The
sive assets within the electrical transmission and distribution health condition and remnant life of power transformers have
networks. With the continous increase in the load demand, been assessed through various parameters of the insulation
power transformers are operating close to their nominal rat- system [8], [9].
ings and becoming more prone to failures. Therefore, power Reliable and cost effective condition monitoring and asset
transformers need to be continuously monitored during their management techniques are essential for electric utilities in
entire operational life to avoid sudden and catastrophic fail- preparing an appropriate financial plans to estimate the future
ures. Over the past two decades, several condition monitoring cost of maintenance, repair, and replacement of their power
techniques have been developed for power transformers [1]. transformer fleet. In this regard, much research effort has
Among these techniques, insulation system has been a com- been conducted to help utility companies optimize their asset
mon key component to identify transformer health state and maintenance costs. Transformer asset management (TAM)
estimate its useful rmnant life [2]. Several papers to explain practice has been explained in [10]–[12] in which strategic
plans for future maintenance and replacement activities based
The associate editor coordinating the review of this manuscript and on various diagnostic testing methods are presented. The
approving it for publication was Zhouyang Ren . diagnostic testing methods can be generally categorized into
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
VOLUME 9, 2021 150637
D. Rediansyah et al.: Artificial Intelligence-Based Power Transformer Health Index for Handling Data Uncertainty
condition monitoring (CM) and condition assessment (CA) using several factors, industry standards, and utility expert
techniques. CM techniques employ electrical, chemical, and judgment [27]. AI-based general regression neural network
physical tests to be collectively used by the CA tools to (GRNN) was introduced in [28] to calculate the HI based
determine the health condition of the transformer. on four-class transformer conditions; very poor, poor, fair,
The transformer health index (HI), as primarily explained and good. In [29], AI-based HI using Bayesian network was
in [13], is a single factor that utilizes the information from proposed to quantify the parameters contribution through
operating observations, field inspections, and laboratory tests score-probability and population failure statistics. In [30],
to provide a reliable TAM decision. Majority of the data used principal component analysis (PCA) and analytical hierarchy
in the HI model are based on the insulation system testing. process (AHP) were proposed to determine the transformer
Oil insulation tests include dissolved gases analysis (DGA), health condition based on an expert empirical formula.
oil quality analysis (OQA), and furan analysis (FFA). Due to The use of AI methods to define transformer health
the high electric and thermal stress within operating trans- index has been presented in the literatures. For instance,
formers, oil and paper insulation decomposes and releases decision tree, random forest (RF), static vector machine
some gases that dissolve in the oil and decrease its dielectric (SVM), artificial neural network (ANN), and k-nearest neigh-
strength. These gases include hydrogen (H2 ), methane (CH4 ), bor (kNN) methods were used in [27] to automate the assess-
ethylene (C2 H4 ), acetylene (C2 H2 ), ethane (C2 H6 ), carbon ment process. A study in [31] compared several machine
monoxide (CO), and carbon dioxide (CO2 ). DGA test is learning algorithms such as ANN, SVM, and Gaussian
conducted to quantify these gases from which internal trans- bayesian networks (GBN), to evaluate the health condition
former electrical and thermal faults can be identified [14]. of power transformers through probabilistic HI framework.
Considering the guideline in [15], the OQA is determined by An approach is presented in [32] that compared the perfor-
analyzing the oil breakdown voltage (BDV), acidity, water mance of various AI methods such as naïve Bayes (NB),
content, interfacial tension (IFT), dielectric dissipation fac- multinomial logistic regression (MLR), ANN, SVM, kNN,
tor (DDF), and color. FFA is conducted to measure furan one rule (OneR), decision tree (J48), and RF in identifying
compounds that are generated due to cellulose degradation transformer health index. However, regardless the various
and dissolve in the transformer oil [16]. Among the 5 furan AI methods proposed in the literatures to calculate the
compounds, furfuraldehyde (also known as furfural/2FAL) transformer HI, more thorough studies are still required to
dominates the measurements and is correlated to the degree improve the accuracy of such methods, in particular with the
of polymerization of the paper insulation [17]. uncertainty or unavailability of the used data.
A scoring-weighting (also called weighted-sum) is the As discussed in [9], data uncertainty is the main shortcom-
most commonly used method to calculate the HI of power ing of the HI method which affects its reliability and accuracy.
transformers [13], [18]–[23]. The approach starts by com- Data unavailability is the main reason of uncertainty. The
paring each parameter to a scoring table, then weighting study in [33] reported the effect of data unavailability and
each parameter based on its importance. The weights are proposed a remedial approach based on RF to predict the
usually identified by expert personnel. The individual scores missing data and improve the scoring-weighting HI. How-
are combined into a single index that reveals the overall health ever, the development and investigation of AI-based HI model
condition of the transformer. in handling data uncertainty has not yet discussed thoroughly
As the HI is a linear combination of different scores and with proven implementation feasibility.
weighted measurement data, it is a challenging task to deal
with the uncertainty of the used data through the current TABLE 1. Summary of previous researches.
utility practice to identify the HI.
The rapid development of computer science and data
processing has resulted in new HI approaches based on
machine learning algorithms for big data analysis [9]. Arti-
ficial Intelligence (AI)-based HI approach was presented
in [24] using DGA, furan, and oil test data to assess the
health condition of several in-service transformers. However,
due to the limited available data, the prediction accuracy
of the developed neuro-fuzzy model has only been 56.3%.
The study in [25] conducted a transformer HI sensitivity
analysis using self adaptive neuro-fuzzy inference system
(ANFIS) whose parameters were tuned using particle swarm Table 1 shows the summary of the previous researches.
optimizer (PSO). In [26], probabilistic Markov chain models From the above discussion, the gaps in this area of research
were used to predict the future condition of the transformer can be summarized as below:
based on HI calculation using a non-linear optimization tech- • despite the several AI-based methods used to identify the
nique. AI with a fuzzy-based support vector machine (SVM) HI of power transformers, accuracy of such methods is
was employed in transformer HI-based condition assessment still relatively low.
•there is no reliable and widely accepted technique to deal unavailable data on the accuracy of both conventional SW
with data uncertainity which affects the reliability of the and the identified best performing AI-based HI is evaluated
current HI practice. by adjusting the complete data into 50 missing data scenarios.
As such, the main contribution of this paper is to: A preprocessing approach is then proposed to enhance the
• construct an AI-based approach to enrich the current
ability of the AI-based HI model to deal with such unavailable
conventional HI scoring-weighting method for power data and provide an accurate HI value.
transformers.
A. TRANSFORMER POPULATION
• improve the accuracy of the HI calculation using multi-
AI models to process most common routine testing data The data used in this study were collected from PT. PLN
for power transformer insulation system. PERSERO, an Indonesian state electricity company. Data are
• develop an effective correlation for all used parameters
collected from 504 units, all with a specific high voltage
in calculating the transformer HI. of 150 kV with a low voltage of either 70 kV or 20 kV. Major-
• identify the best AI-based combination for dealing with
ity of these transformers use Kraft paper insulation and they
data uncertainty and missing values. are periodically inspected in a condition-based maintenance
scheme by checking their insulation properties through DGA
and dielectric characteristics (oil quality analysis). Dielec-
tric characteristics include oil breakdown voltage (BDV),
water content, interfacial tension (IFT), acidity, and color.
In addition, 2-FAL measurement are provided for some units.
Normally, the oil testing is conducted once a year or more
frequent, if needed. Data used for the analysis in this paper are
the same year data when HI is calculated, therefore omitting
the uncertainty caused by outdated data. The age classifica-
tion of the observed population sample is shown in Figure 2.
TABLE 2. The scoring-weighting for oil quality. TABLE 6. The scoring-weighting paper condition.
where SFj is the parameter’s scoring factor and Wj is the condition monitoring, fault diagnosis and asset manage-
weighting factor. ment [39], [40].
The weighting parameters and factors are obtained by The health index category and description is classified as
using AHP. This technique is built based on the judgment shown in Table 8. A more detailed explanation of the scoring-
of five experts with in-depth experience in transformer weighting method can be found in [20].
E. AI-BASED HI DEVELOPMENT
To establish the AI-based HI model, the collected transformer
data have been divided into 354 training and 150 testing data
sets. Each set comprises data of transformers of different
aging, operating and health conditions.
The output of the AI model is the overall HI which
is validated using the corresponding HI calculated by the
scoring-weighting method. If the AI model does not provide FIGURE 9. Dealing the uncertainty with AI-based HI.
H. EVALUATION
The evaluation of the model is based on the accuracy of the
prediction of the transformer HI category. The classification
accuracy (CA) is the proportion of correctly classified data as
given by (8).
(TP + TN )
Accuracy = (8)
(TP + TN + FP + FN )
where TP is true positive, TN is true negative, FP is false
positive, and FN is false negative. This calculation is based
on the confusion matrix of binary classification in which
the evaluation is considered as a multi-class classification
problem.
will be decreased significantly. The range of the classification The color categories in terms of the accuracy indicator are
accuracy in HI categories will have a value above 80% if the green for high (H) accuracy level, yellow for medium (M)
certainty level is above 85%. accuracy level, and red for low (L) accuracy level.
2) RANDOM FOREST-BASED by considering the missing values with the average or the
The AI-based RF algorithm also suffers when predicting most frequent value. The impact of missing data on the
transformers HI with many unavailable data as confirmed by model has been evaluated, and the proposed RFwP performed
Figure 19 in which R2 is 0.7614. The correlation of the accu- better than other investigated methods and resulted in a
racy and certainty level for this model can be approximated satisfactory classification accuracy even with low certainity
as below. levels. This solves the current common issue of calculating
the transformer HI with unavailable data. Results in this
y = 0.7542x + 0.1232 (10)
paper pave the way to adopt AI-based methods in calculating
In some cases, RF model can predict the HI with an accu- the transformer HI. However, implementing such techniques
racy above 80% even though the certainty level is only 70%. calls for more validation and feasibility studies on large
Nevertheless, the accuracy will be better if the certainty level transformer population with several scenarios of unavailable
is above 80%. data.
[18] A. Naderian, S. Cress, R. Piercy, F. Wang, and J. Service, ‘‘An approach [41] L. Breiman, ‘‘Random forests–random features,’’ Statistic Dept., Univ.
to determine the health index of power transformers,’’ in Proc. Conf. Rec. California, Berkeley, CA, USA, Tech. Rep. 567, 1999, pp. 1–29. [Online].
IEEE Int. Symp. Electr. Insul., Jun. 2008, pp. 192–196. Available: https://www.stat.berkeley.edu/~breiman/random-forests.pdf
[19] I. G. N. S. Hernanda, A. C. Mulyana, D. A. Asfani, I. M. Y. Negara, and
D. Fahmi, ‘‘Application of health index method for transformer condition
assessment,’’ in Proc. IEEE Region 10 Conf. (TENCON), Oct. 2014,
pp. 1–6. DHANU REDIANSYAH (Graduate Student
[20] W. R. Tamma, R. A. Prasojo, and Suwarno, ‘‘High voltage power trans-
Member, IEEE) received the B.Sc. degree from
former condition assessment considering the health index value and its
the Department of Mechanical Engineering, Insti-
decreasing rate,’’ High Voltage, vol. 6, no. 2, pp. 314–327, Apr. 2021.
tut Teknologi Indonesia, in 2009, and the M.Sc.
[21] W. R. Tamma, R. Azis Prasojo, and S. Suwarno, ‘‘Assessment of high
voltage power transformer aging condition based on health index value degree from the School of Electrical Engineer-
considering its apparent and actual age,’’ in Proc. 12nd Int. Conf. Inf. ing and Informatics, Institut Teknologi Bandung,
Technol. Electr. Eng. (ICITEE), Oct. 2020, pp. 292–296. Indonesia, in 2021. Since 2011, he has been
[22] L. En-Wen and S. Bin, ‘‘Transformer health status evaluation model working at PT. PLN (Persero), the Indonesian state
based on multi-feature factors,’’ in Proc. Int. Conf. Power Syst. Technol., of electricity company. He has experienced at a
Oct. 2014, pp. 1417–1422. generation power plant, distribution, and trans-
[23] K. Taengko and P. Damrongkulkamjorn, ‘‘Risk assessment for power trans- mission. His research interests include high voltage equipment condition
formers in PEA substations using health index,’’ in Proc. 10th Int. Conf. monitoring and diagnostic, renewable energy power plant, and power quality.
Electr. Eng./Electron., Comput., Telecommun. Inf. Technol., May 2013,
pp. 1–6.
[24] E. Kadim, N. Azis, J. Jasni, S. Ahmad, and M. Talib, ‘‘Transformers health
index assessment based on neural-fuzzy network,’’ Energies, vol. 11, no. 4, RAHMAN AZIS PRASOJO (Graduate Student
p. 710, Mar. 2018. Member, IEEE) received the B.Sc. degree from the
[25] K. Ibrahim, R. M. Sharkawy, H. K. Temraz, and M. M. A. dan Salama, Department of Electrical Engineering, Politeknik
‘‘Transformer health index sensitivity analysis using neuro-fuzzy mod- Negeri Malang, Malang, Indonesia, in 2015, and
elling,’’ in Proc. 2nd Int. Conf. Adv. Technol. Appl. Sci. (ICaTAS), 2017,
the M.Sc. degree from the School of Electrical
pp. 1–7.
Engineering and Informatics, Institut Teknologi
[26] M. Yahaya, N. Azis, M. Ab Kadir, J. Jasni, M. Hairi, and M. Talib,
‘‘Estimation of transformers health index based on the Markov chain,’’ Bandung, Bandung, Indonesia, in 2017, where he
Energies, vol. 10, no. 11, p. 1824, Nov. 2017. is currently pursuing the Ph.D. degree. Since 2019,
[27] A. D. Ashkezari, H. Ma, T. K. Saha, and C. Ekanayake, ‘‘Application he has been an Assistant Professor with the Depart-
of fuzzy support vector machine for determining the health index of the ment of Electrical Engineering, Politeknik Negeri
insulation system of in-service power transformers,’’ IEEE Trans. Dielectr. Malang. He has published various conference papers and journal articles
Electr. Insul., vol. 20, no. 3, pp. 965–973, Jun. 2013. regarding high voltage power transformer condition monitoring and diag-
[28] M. M. Islam, G. Lee, and S. N. Hettiwatte, ‘‘Application of a general regres- nostics. He has reviewed several reputable journals.
sion neural network for health index calculation of power transformers,’’
Int. J. Electr. Power Energy Syst., vol. 93, pp. 308–315, Dec. 2017.
[29] S. Li, G. Wu, H. Dong, L. Yang, and X. Zhen, ‘‘Probabilistic health index-
based apparent age estimation for power transformers,’’ IEEE Access, SUWARNO (Senior Member, IEEE) received
vol. 8, pp. 9692–9701, 2020. the B.Sc. and M.Sc. degrees from the Depart-
[30] S. Tee, Q. Liu, and Z. Wang, ‘‘Insulation condition ranking of transformers ment of Electrical Engineering, Institut Teknologi
through principal component analysis and analytic hierarchy process,’’ IET Bandung, Indonesia, in 1988 and 1991, respec-
Gener., Transmiss. Distrib., vol. 11, no. 1, pp. 110–117, Jan. 2017. tively, and the Ph.D. degree from Nagoya
[31] J. I. Aizpurua, B. G. Stewart, S. D. J. McArthur, B. Lambert, J. G. Cross,
University, Japan, in 1996. He is currently a Pro-
and V. M. Catterson, ‘‘Improved power transformer condition monitoring
fessor and the Emeritus Dean of the School of
under uncertainty through soft computing and probabilistic health index,’’
Appl. Soft Comput., vol. 85, Dec. 2019, Art. no. 105530.
Electrical Engineering and Informatics, Institut
[32] A. A. Alqudsi dan El-Hag, ‘‘Application of machine learning in trans- Teknologi Bandung. He serves as the Head for
former health index prediction,’’ Energies, vol. 12, no. 14, p. 2694, 2019. the Electrical Power Engineering Research Group.
[33] R. A. Prasojo and A. Abu-Siada, ‘‘Dealing with data uncertainty for He has published over 200 international journal articles or conference papers.
transformer insulation system health index,’’ IEEE Access, vol. 9, His research interests include high voltage insulating materials and technol-
pp. 74703–74712, 2021. ogy and diagnostics of HV equipment. He was the General Chairperson of
[34] Z. Li, Y. Zhang, A. Abu-Siada, X. Chen, Z. Li, Y. Xu, L. Zhang, and several international conferences, such as ICPADM 2006, ICEEI 2007, CMD
Y. Tong, ‘‘Fault diagnosis of transformer windings based on decision tree 2012, ICHVEPS 2017, and ICHVEPS 2019. He serves as the Editor-in-Chief
and fully connected neural network,’’ Energies, vol. 14, no. 6, p. 1531, for the International Journal on Electrical Engineering and Informatics.
Mar. 2021.
[35] A. Abu-Siada, M. Arshad, and S. Islam, ‘‘Fuzzy logic approach to identify
transformer criticality using dissolved gas analysis,’’ in Proc. IEEE PES
Gen. Meeting, Jul. 2010, pp. 1–5. A. ABU-SIADA (Senior Member, IEEE) received
[36] Transformer Reliability Survey, CIGRE WG, document A2.37, TB642, the B.Sc. and M.Sc. degrees in electrical engineer-
Dec. 2015. ing from Ain Shams University, Egypt, in 1998,
[37] IEEE Guide for the Interpretation of Gases Generated in Mineral Oil- and the Ph.D. degree in electrical engineering
Immersed Transformers, IEEE Standard C57.104-2019, 2019.
from Curtin University, Australia, in 2004. He is
[38] Mineral Insulating Oils in Electrical Equipment—Supervision and Main-
currently the Discipline Lead of electrical and
tenance Guidance, IEC Standard 60422:2013, 2013.
computer engineering with Curtin University.
[39] R. A. Prasojo, A. Setiawan, Suwarno, N. U. Maulidevi, and B. Anggoro
Soedjarno, ‘‘Development of analytic hierarchy process technique in deter- His research interests include power electron-
mining weighting factor for power transformer health index,’’ in Proc. 2nd ics, power system stability, condition monitor-
Int. Conf. High Voltage Eng. Power Syst. (ICHVEPS), Oct. 2019. ing, and power quality. He is the Vice Chair of
[40] R. Azis Prasojo, Suwarno, N. Ulfa Maulidevi, and B. Anggoro Soedjarno, the IEEE Computation Intelligence Society, WA Chapter. He is also the
‘‘A multiple expert consensus model for transformer assessment index Editor-in-Chief for the International Journal of Electrical and Electronic
weighting factor determination,’’ in Proc. 8th Int. Conf. Condition Monitor. Engineering and a regular reviewer for various IEEE TRANSACTIONS.
Diagnosis (CMD), Oct. 2020, pp. 234–237.