0% found this document useful (0 votes)
31 views6 pages

Sensors 19 03987 v4

Uploaded by

Joely Silva
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
31 views6 pages

Sensors 19 03987 v4

Uploaded by

Joely Silva
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/341978110

Deep Learning-Based Defect Detection System in Steel Sheet Surfaces

Conference Paper · June 2020


DOI: 10.1109/TENSYMP50017.2020.9230863

CITATIONS READS

29 1,923

2 authors:

Didarul Amin Shamim Akhter


International University of Business Agriculture and Technology Ahsanullah University of Science & Tech
3 PUBLICATIONS 29 CITATIONS 108 PUBLICATIONS 500 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Shamim Akhter on 10 July 2020.

The user has requested enhancement of the downloaded file.


2020 IEEE Region 10 Symposium (TENSYMP), 5-7 June 2020, Dhaka, Bangladesh

Deep Learning-Based Defect Detection System in


Steel Sheet Surfaces
Didarul Amin Shamim Akhter
AISIP Lab, Dept. Of Computer Science and Engineering AISIP Lab, Dept. of Computer Science and Engineering
International University of Business Agriculture and Technology International University of Business Agriculture and Technology
Dhaka, Bangladesh Dhaka, Bangladesh
[email protected] https://orcid.org/0000-0003-1408-9133

Abstract— Steel is one of the most important building In this study, we advance the steel defect inspection methods
materials of modern times and the production process of flat sheet by applying modern segmentation approach to partition the
steel is complicated. Before shipping or delivering steel, sheets image into various regions and designing a new machine
need to undergo a careful inspection procedure to avoid defects learning model to feed the region pixels to detect the defect
and thus localizing and classifying surface defects on a steel sheet region from a single sample image of steel sheet and classify
is crucial. In this study, we advance the steel defect inspection
them according to their corresponding multi-level classes.
methods by designing machine learning models that aim to detect
multi-level defects from sample steel sheet images and classify We apply U-NET and Residual U-NET to solve the given
them according to their corresponding classes. We explore two (2) problem. Dice coefficient method is used to trace the
deep learning methods including U-NET and Deep Residual U- accuracy of the selected machine learning models
NET to solve the steel defect detection problem with a Dice
coefficient accuracy of 0.543 and .731 correspondingly.
II. RELATED WORKS
Keywords—Deep Learning, U-NET, Deep Residual U-NET, Many researchers introduced computer vision steel
Defect Detection, Steel Surface Images. surface inspection systems. Caleb and Steuer in [1] used
artificial neural networks (ANN) to detect defects in hot
rolled steel strip. Pakkanen et al. in [2] applied edge
I. INTRODUCTION histogram, color structure, and homogeneous texture as
The defect detection system in steel sheet surfaces is features extractor, and K-Nearest Neighbor as a classifier on
playing a critical role in the steel sheets industry by detecting, hot-rolled steel strip surface to detect the defected images.
localizing, recognizing and subsequently correcting causative Hongbin and Keesug et al. in [3] applied Support Vector
factors. It is also quite necessary for controlling product Machine (SVM) as a classifier of the inspection system, SVM
quality and generating real-time analysis reports. Detecting gives better performance than ANN for their samples on hot
process involves determining the existence of the steel rolling steel. Smriti and Bhandari in [4] considered edge
surface defects from images taken from the industrial detection with a modified scheme based on heuristics used by
cameras. Localization locates all known content in the scene human inspectors for identifying surface imperfections to
including the defect regions. Recognition takes the defect compute the features then applied SVM as classifier for
regions infers the defect category according to the defect classifying surface images into two (2) classes defective and
appearance. A defect detection system in steel sheet surfaces defect-free; the system was applied on surface texture
is thus a combined process of detection, localization, and database. Sharifzadeh et al. in [5] used image processing
classification. Typically, in steel mills, human inspectors algorithms for detecting four popular classes of steel defects.
manually perform the defect detection process on steel sheets. Liu et al. in [6] used a relevance vector machine as a classifier
However, this procedure is very time consuming, costly but to detect four kinds of defects on the steel surface. Luiz et al.
lower efficient and does not meet up the requirement of real- in [7] adopted Principal Component Analysis as features
time online defect detection. Many recent pieces of research extractor, and Self-Organizing Maps as a classifier to classify
are conducted on a combined approach of computer vision six classes of the hot-rolled steel surface defects. Song and
with machine learning methods to solve the requirements for Yan in [8] proposed a new method Adjacent Evaluation
real-time online defect detection on steel sheets. However, Completed Local Binary Patterns as feature extractor and
they apply some morphological operations on high-frequency employed SVM as a classifier on Northeastern University
images generated by low-cost industrial cameras and simple (NEU) hot-rolled steel strip surface defect database. Song et
classifiers to solve the classification problem. Thus, these al. in [9] adopted a scattering convolution network as a
approaches lower accuracy and unable to handle complex feature extractor and employed SVM as a classifier on the
problems including multi-level classifications or localize the NEU database. Wang et al. [10] proposed fault diagnosis
defected area within a single image. based on a continuous sparse auto-encoder and illustrated the
effectiveness of the presented approach by IEC TC 10 dataset
of transformers faults. Mao et al. in [11] proposed an
intelligent fault-diagnosis by the auto-encoder algorithm the
effectiveness of the presented approach is verified by rolling
element bearings data set. Lu et al. in [12] used stacked de-
noising auto-encoder as fault diagnosis method and rotating

978-1-7281-7366-5/20/$31.00 ©2020 IEEE


machinery datasets were employed to demonstrate the
effectiveness of the proposed method.

All the related works referred above consider


classifier models only to recognize defect classes. Their (a) Sample image with class 1 defects
analyses miss the detection of the defective area or pixel
locations. CNN is largely used when the whole image is
needed to be classified as a class label. But many tasks require
classifying each pixel of the image. Fully connected CNN
like U-NET or Residual U-NET can be used as a pixel-level (b) Sample image with class 2 defects
classifier. We also want to approach a combined model for
defect classification and defect localization using
segmentation and modern fully CNNs. To complete the
above goal, we aim to explore image segmentation using U-
NET and Residual U-NET. They both are efficient to solve (c) Sample image with class 3 defects
multi-label defects on a single image and localize the defect
on images.

III. DATASET AND FEATURE


(d) Sample image with class 4 defects
Our dataset [13] consists of steel sheet images,
corresponding defects classes, and defect regions. Steel sheet
images are categories into two parts including training and
testing. There are 12568 training images and 1801 testing
images of 1600×256×1 size each. Out of all the training
(e) Sample image with multi label defects
images, 5902 images are with defects and 6666 images are
without defects. The distribution of all training images
according to the four (4) defects is presented in fig.1. Few
defected sample images from our dataset are presented in
fig.2
(f) Sample image contain three type defects

Figure 2: Sample images with instance-level defects

IV. METHODOLOGY
A. U-NET
U-NET [14] architecture is presented in fig.3. It consists
of a contracting path (the left side) and an expanding path (the
right side). There are 4 down-sampling layers in the
contracting path. Each down-sampling layer contains two
3x3 convolutional units, each followed by batch
normalization and ReLU, and then a 2x2 max pooling. The
contextual information from the contracting path is then
Figure 1: Number of images for each label.
transferred to the expanding path by skip connection. There
are four (4) up-sampling layers in the expanding path. Each
up-sampling layer contains a 2x2 transposed convolution, a
concatenation with the corresponding feature maps from the
encoding path, and two (2) 3x3 convolutional units, each
followed by batch normalization and ReLU. Finally, there is
a 1x1 convolutional layer and a SoftMax layer to map the
feature vector at each pixel into five (5) different classes (0
for no defect, 1-4 for defects of different classes).
Figure 3: Architecture of the U-NET

B. Deep Residual U-NET


Deep Residual U-NET [15] is presented in fig. 4. It is a
semantic segmentation NN which combines strengths of both
U-NET and Res-Net networks. This combination brings us
two benefits: 1) the residual unit makes ease training of the
network; 2) skip connections within a residual unit and
between low levels and high levels of the network will Figure 4: Architecture of the Deep Residual U-NET
facilitate information propagation without degradation,
making it possible to design a neural network with much
fewer parameters however could achieve comparable ever TABLE I: THE NETWORK STRUCTURE OF RESUNET
better performance on semantic segmentation. In our
proposed work we utilize a 7-level deep architecture and
comprises of three parts: encoding, bridge and decoding. The
first part encodes the input image into compact
representations. The last part recovers the representations to
a pixel-wise categorization, i.e. semantic segmentation. The
middle part serves as a bridge connecting the encoding and
decoding paths. All of the three parts are built with residual
units which consist of two 3 × 3 convolution blocks and an
identity mapping. Each convolution block includes a BN
layer, a ReLU activation layer and a convolutional layer. The
identity mapping connects the input and output of the unit.
Encoding path has three (3) residual units. In each unit,
instead of using pooling operation to downsample, the feature
map size, a stride of 2 is applied to the first convolution block V. RESULT AND ANALYSIS
to reduce the feature map by half. Correspondingly, decoding A. Evaluation Metrics
path composes of three residual units. Before each unit, there The Dice coefficient is used to compare the pixel-wise
is an up-sampling of feature maps from the lower level and agreement between a predicted segmentation and its
concatenation with the feature maps from the corresponding corresponding ground truth. The formula is given by:
encoding path. After the last level of decoding path, a 1 × 1
∗| ∩ |
convolution and a sigmoid activation layer are used to project Dice X, Y | | | |
the multi-channel feature maps into the desired segmentation.
… (1)
In total, we have 15 convolutional layers comparing with 23
Where X is the predicted set of pixels and Y is the ground
layers of U-NET. It is worth noting that the indispensable
truth. The Dice coefficient is defined to be 1 when both X
cropping in U-NET is unnecessary in our network. The
and Y are empty. We used Dice loss function during training,
parameters and output size of each step are presented in
which is defined as
Table-I.

L 1 ∑
∑ ∑
… (2)
Where B is the batch size, n = 4 × 256 × 1600 is the total
number of pixels of the 4 defect classes, yj(i) is the true label
of each pixel (1 for defect and 0 for normal), and y^j(i) is the a)
predicted probability.

B. Image Segmentation and Multi Labels Classification


We apply instance segmentation method which uses
the output of semantic segmentation as input and obtains
instance-aware segmentation result. U-NET is configured
with a batch size of 4 due to the limitation of GPU memory.
Adam is used as the optimizer and Dice loss function during
training. The loss values and the positive Dice score during
training are shown in Fig. 5a and 5b, respectively. After 20
epochs, Dice scores results in 0.543. The U-NET model
presents good performance for predicting defects of class 3 b)
(Fig. 5c). However, it fails to predict many defects of class 1,
2 and 4 (Fig. 5d).
Deep Residual U-NET is configured with a batch size of
4 due to the limitation of GPU memory. We also use Adam
as the optimizer. We use Dice loss function during training.
The loss values and the positive Dice score during training
are shown in Fig. 6a and 6b, respectively. After 20 epochs,
Dice scores are 0.731. The deep residual U-NET model
shows good performance for predicting defects of all classes
(Fig. 6c).

c)

Image (HxW=256x1600)

Predicted-mask (defect class 1)

Predicted-mask (defect class 2)

Predicted-mask (defect class 3)

Figure 5: Results of the U-NET. (a) Loss of the training and


the validation sets. (b) Average positive Dice score of the Predicted-mask (defect class 4)
validation set during training. (c) A sample image containing
Figure 6: Results of the Deep Residual U-NET. (a) Model
defect of class 3. (d) A sample image containing defect of
class 1 accuracy of the training and the validation sets. (b) Model loss
of the training and the validation sets. (c) A predicted image
sample with multiple defects.
VI. CONCLUSION [13] Severstal: Steel defect detection. [Online] Available at:
https://www.kaggle.com/c/severstal-steel-defect-detection.
The proposed work anchorages the segmentation-based [14] O. Ronneberger, P. Fischer, and T. Brox. “U-NET: convolutional
instance segmentation computer vision techniques and networks for biomedical image segmentation”, International
modern deep NNs including U-NET and Residual U-NET to Conference on Medical Image Computing and Computer-assisted
localize steel surface defects and do region-based multiple Intervention, pages 234–241. Springer, 2015 J. Clerk Maxwell, A
Treatise on Electricity and Magnetism, 3rd ed., vol. 2. Oxford:
labels classification. Among U-NET and Residual U-NET, Clarendon, 1892, pp.68–73.
Residual U-NET performs better with the Dice Coefficient [15] Z. Zhang, Q. Liu, and Y. Wang, “Road Extraction by Deep Residual
score of 0.731. In the near future, we will explore more deep U-NET”, IEEE Geoscience and Remote Sensing Letters, vol. 15, no. 5,
NNs on this problem, analyze their performance and improve pp. 749–753, 2018.
classification accuracy.

ACKNOWLEDGMENT
This research is supported and funded by the
Information and Communication Technology (ICT)
Division, Government of the People's Republic of
Bangladesh. The authors would like to express their sincere
gratitude to the International University of Business
Agriculture and Technology (IUBAT), Sector-10, Uttara
Model Town, Dhaka, Bangladesh for their necessary
supports.

REFERENCES
[1] P. Caleb and M. Steuer, “Classification of surface defects on hot rolled
steel using adaptive learning methods”, Fourth International
Conference on Knowledge-Based Intelligent Engineering Systems and
Allied Technologies, Bright, UK, August 30 – September 1, 2000.
[2] J. Pakkanen, J. Iivarinen, R. Rautkorpi, and J. Rauhamaa, “Content-
based retrieval of surface defect images with PicSOM”, International
Journal of Fuzzy Systems, vol. 6, no. 3, pp.160-167, 2004.
[3] C. Keesug, K. Kyungmo, J. Lee, “Development of defect classification
algorithm for POSCO rolling strip surface inspection system”, SICE-
ICASE International Joint Conference, Busan, Korea, Oct. 18-21,
2006.
[4] H. Smriti and S. Bhandari,”A simple approach to surface defect
detection”, The Third International Conference on Industrial and
Information Systems, Kharagpur, India, December 8-10, 2008.
[5] M. Sharifzadeh, S. Alirezaee, R. Amirfattahi, and S. Sadri, “Detection
of steel defect using the image processing algorithms”, 12th IEEE
International Multitopic Conference, Karachi, Pakistan, December 23-
24, 2008.
[6] Y. Liu, Y. Hsu and Y. Sun, “A computer vision system for automatic
steel surface inspection”, 5th IEEE Conference on Industrial
Electronics and Applications, USA, June 15-17, 2010.
[7] A. Luiz, L. Flavio, and E. Paulo, “Automatic detection of surface
defects on rolled steel using computer vision and artificial neural
networks”, 36th Annual Conference on IEEE Industrial Electronics
Society, Glendale, USA, November 7-10, 2010.
[8] K. Song and Y. Yan, “A noise robust method based on completed local
binary patterns for hot-rolled steel strip surface defects”, Applied
Surface Science, vol. 285, pp. 858-864, 2013.
[9] K. Song, S. Hu, Y. Yan, “Automatic recognition of surface defects on
hot-rolled steel strip using scattering convolution network”, Journal of
Computational Information Systems, vol. 7, pp. 3049-3055, 2014.
[10] L. Wang, Z. Xiaoying, P. Jiangnan and T. Gongyou,” Transformer fault
diagnosis using continuous sparse autoencoder”, Springer Plus, vol. 5,
pp. 1-13, 2016.
[11] W. Mao, J. He, Y. Li and Y. Yan,” Bearing fault diagnosis with auto-
encoder extreme learning machine: a comparative study”, Journal of
Mechanical Engineering Science, vol. 1, no. 1, pp. 1–19, 2016.
[12] C. Lu, W. Zhen-Ya, Q. Wei-Li and M. Jian, “Fault diagnosis of rotary
machinery components using a stacked denoising auto-encoder based
health state identification”, Signal Processing, vol. 130, pp. 377-388,
2016.

View publication stats

You might also like