An End-to-End Steel Surface Defect Detection Approach Via Fusing Multiple Hierarchical Features
fine-tune all the models on NEU-DET. The MFN can fuse the selected features into a multilevel feature whose characteristics cover all the stages of the ResNet. Next, a region proposal network (RPN) is adopted to generate proposals based on the multilevel feature, and the DDN then outputs the class scores and the coordinates of the bounding boxes. Finally, we evaluate the proposed method on NEU-DET, and the results demonstrate a clear superiority over other ADI methods.
To summarize, the main contributions of this paper are as follows.
1) The introduction of the end-to-end defect detection pipeline DDN, which integrates the ResNet and the RPN for precise defect classification and localization.
2) The proposed MFN for fusing multilevel features. Compared with other fusing methods, MFN can combine lower level and higher level features, which gives the multilevel features more comprehensive characteristics.
3) A defect detection data set, NEU-DET, for fine-tuning networks, and a demonstration that the proposed DDN achieves very competitive performance on this data set.

II. RELATED WORK

A. Defect Inspection

Generally, a defect classification method includes two parts: a feature extractor and a classifier. The classic feature extractor obtains hand-crafted features such as HOG and LBP, which are usually followed by a classifier, e.g., an SVM. Therefore, the combination of different feature extractors and classifiers produces a variety of defect classification methods. For instance, Song and Yan [3] improve the LBP to resist noise and adopt NNC and SVM to classify defects. Ghorai et al. [9] rely on a small set of wavelet features and use an SVM to perform defect classification. Different from the above two methods, Chu et al. [8] employ a general feature extractor and enhance the SVM. From the perspective of computer vision, the defect classification task is essentially defect image classification, which struggles on complicated defect images. The simple and direct way to solve this is to perform defect localization before defect classification, so that the inspection task classifies regions of defects instead of a whole defect image; this is the defect detection task. For example, the defect detectors in [11] and [12] first perform a 0–1 classification to judge whether features belong to a defect class or a nondefect class, then find defect regions based on the boundary of defect-class features, and finally apply different classification methods to determine the specific class of a defect. In addition, there is another simplified detector for applications requiring quick detection, which only locates regions of defects regardless of their categories [10].
However, the DL-based methods differ radically from the above methods. A hand-crafted feature extractor locally analyses a single image and extracts features, whereas a CNN constructs the representation of all the input data through a large amount of learning. CNNs have fine generalization and transferability, so there are some defect inspection methods based on CNNs. For example, Chen and Ho [21] demonstrate that an object detector like OverFeat [24] can be transferred to be a defect detector by some means. Similarly, [18] and [19] demonstrate that using a sequential CNN to extract features can improve classification accuracy in defect inspection. Also based on a sequential CNN, Ren et al. [17] perform an extra defect segmentation task on classification results to define the boundary of a defect. Moreover, Natarajan et al. [20] employ a deeper neural network, VGG19, for defect classification. With the depth of CNNs, defect classification accuracy has been further improved.

B. Baseline Networks

There are three popular CNN architectures at present, which are used as baseline networks for pretraining. The early successful networks are based on the sequential pipeline architecture [25], which established the basic structure of CNNs and proved the importance of network depth. Subsequently, the Inception networks employed modular units, which increase both the depth and width of a network without increasing the computational cost [26]. The third type is ResNet, which uses residual blocks to make networks deeper without overfitting [23]. ResNet is widely applied in various vision tasks, achieving competitive results with few parameters.
Choosing a proper baseline network is the key to gaining good results with DL methods. A large network has strong representation ability for the input data, and hence extracts features at a highly abstract level, but it also has a great demand for training data.

C. CNN Detectors

CNN detectors aim to classify and locate each target with a bounding box. They are mainly divided into two kinds of methods: region-based methods and direct regression methods. The most famous region-based detectors are the "R-CNN family" [27], [28], [14]. In this framework, thousands of class-independent region proposals are employed for detection. Region-based methods are superior in precision but require slightly more computation. The representative direct regression methods are YOLO [29] and SSD [30]. They directly divide an image into small grids and predict bounding boxes for each grid, which are then regressed to the groundtruth boxes. Direct regression methods are fast at detection but struggle on small instances.

1496 IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 69, NO. 4, APRIL 2020

Fig. 4. DDN. In a single pass, we extract features from each stage of the baseline ConvNet, which are then fused into a multilevel feature by MFN. RPN is adopted to generate ROIs based on the multilevel feature. For each ROI, the corresponding multilevel feature is transformed into a fixed-length feature through the ROI pooling and the GAP layers. Two fc layers process each fixed-length feature and feed into output layers producing two results: a one-of-(C + 1) defect class prediction (cls) and a refined bounding box coordinate (loc).

III. DEFECT DETECTION NETWORK

In this section, the DDN is described in detail (see Fig. 4). A single-scale image of an arbitrary size is processed by a CNN, and the convolutional feature maps at each stage of the ConvNet are produced (ConvNet represents the convolutional part of a CNN). We extract multiple feature maps and then aggregate them in the same dimension by using a lightweight MFN. In this way, MFN features have the characteristics of several hierarchical levels of the ConvNet. Next, RPN [14] is employed to generate region proposals
[regions of interest (ROIs)] over the MFN feature. Finally, the MFN feature corresponding to each ROI is transformed into a fixed-length feature through the ROI pooling [28] and global average pooling (GAP) layers. The feature is fed into two fully connected (fc) layers: a one-of-(C + 1) defect classification layer ("cls") and a bounding-box regression layer ("loc").
The rest of this section introduces the details of DDN and motivates why we need to design MFN into the network for the defect detection task.

A. Baseline ConvNet Architecture

As is well known, pretraining on the ImageNet data set is important for achieving competitive performance, and the pretrained model can then be fine-tuned on a relatively small defect data set. In this paper, we select the recent successful baseline network ResNet as the backbone. ResNet presents several attractive advantages as follows.
1) ResNet can achieve state-of-the-art precision with extremely few parameters, in comparison with CNNs of sequential pipeline architecture of the same magnitude (ResNet50 vs. VGG16, 0.85 M vs. 138 M parameters). This implies that ResNet has lower computational cost and less probability of overfitting.
2) ResNet uses GAP to process the final convolutional feature map instead of two stacked fc layers, which preserves more comprehensive location information of defects in the image.
3) ResNet has a modularized ConvNet, which is easy to integrate.
In this paper, we select ResNet34 and ResNet50 as baseline networks. The detailed structures of both networks are shown in Table I, and residual blocks are denoted as {R2, R3, R4, R5}.

HE et al.: END-TO-END STEEL SURFACE DEFECT DETECTION APPROACH 1497

TABLE I
ARCHITECTURE OF BASELINE NETWORKS

B. Produce Multilevel Features

Previous excellent approaches only utilize high-level features to extract region proposals (e.g., the faster R-CNN extracts proposals from the last convolutional feature maps). In order to obtain quality region proposals, single-level features should
be extended to multilevel features. Obviously, the simplest way is to assemble feature maps from multiple layers [31]. Then comes the question: which layers should be combined? There are two essential conditions: nonadjacency, because adjacent layers have highly local correlation [32], and coverage, including features from low level to high level. For a ResNet, the most intuitive way is to combine the last layers in each residual block.
To fuse features at different levels, the proposed network MFN is appended to the pretrained model. MFN has four branches, denoted as {B2, B3, B4, B5}, and each branch is a small network. B2, B3, B4, and B5 are sequentially connected to the last layers of R2, R3, R4, and R5. When an image flows through the baseline ConvNet, the Ri features are produced in order. The Ri feature means the feature map output from the last layer of the residual block Ri, i = 2, ..., 5. Similarly, the Bi feature is the feature map produced from the last layer of the MFN branch Bi, i = 2, ..., 5. Then, each of the Ri features is led to the corresponding branch in MFN, producing the Bi features. Finally, multilevel features are obtained by concatenating the B2, B3, B4, and B5 features, which come from different stages of the CNN.
As a final note, MFN is efficient in computation and strong in generalization. MFN can reduce the required parameters by modifying the number of filters of the 1 × 1 conv. This operation may hurt accuracy but prevents overfitting in the case of insufficient training data.

C. Extract Region Proposals

The RPN is employed to extract region proposals by sliding on the multilevel feature maps. RPN takes an image of arbitrary size as input and outputs anchor boxes (candidate boxes), each with a score representing whether it is a defect or not. The originality of RPN is the "anchor" scheme that generates anchor boxes in multiple scales and aspect ratios. Anchor boxes are then hierarchically mapped to the input image so that region proposals of multiple scales and aspect ratios are produced. As a result of the resolution size of the MFN feature, the RPN can be considered as sliding on the R4 feature. Following [14], we set three aspect ratios {1:1, 1:2, 2:1}. Considering the multiple sizes of defects, we set four scales {64², 128², 256², 512²}. Therefore, RPN produces 12 anchor boxes at each sliding location.
The region proposal extractor always ends with an ROI pooling layer. This layer performs a max-pooling operation over the feature map inside each ROI to convert it into a small feature vector (512-d for ResNet34 and 2048-d for ResNet50) with a fixed size of W × H (in this paper, 7 × 7). At last, based on these small cubes, the offset of each region proposal with respect to an adjacent groundtruth box is calculated, together with the probability of whether a defect exists.
For a single image, RPN may extract thousands of region proposals. To deal with the redundant information, greedy nonmaximum suppression (NMS) is often applied to eliminate high-overlap region proposals. We set the intersection over union (IOU) threshold for NMS at 0.7, which discards a majority of region proposals. After NMS, the top-K ranked region proposals are selected from the rest. In the following, we fine-tune DDN using the top-300 region proposals owing to their quality, but reduce this number at test time to accelerate detection without harming accuracy.

IV. TRAINING

A. Multitask Loss Function

The defect detection task can be divided into two subtasks, hence DDN has two output layers. The cls layer outputs a discrete probability distribution, k = (k_0, ..., k_C), for each ROI over C + 1 categories (C defect categories plus one background category). As usual, k is computed by a softmax function. The cls loss L_cls is a log loss over two classes (defect or not defect): L_cls(k, k*) = -log k_{k*}, where k* is the groundtruth class. The loc layer outputs bounding box regression offsets, t = (t_x, t_y, t_w, t_h), for each of the C defect categories. As in [28], the loc loss L_loc is a smooth L1 loss: L_loc = SmoothL1(t - t*), where t* is the groundtruth box associated with a positive sample. For bounding box regression, we adopt the parameterizations of t and t* given in [27]

    t_x = (x - x_a)/w_a,    t_y = (y - y_a)/h_a
    t_w = log(w/w_a),       t_h = log(h/h_a)
    t*_x = (x* - x_a)/w_a,  t*_y = (y* - y_a)/h_a
    t*_w = log(w*/w_a),     t*_h = log(h*/h_a)                (1)

where the subscripts x, y, w, and h denote each box's center coordinates and its width and height. The variables x, x_a, and x* represent the predicted box, anchor box, and groundtruth box, respectively (the same rules apply for y, w, and h).
With these definitions, we minimize a multitask loss function, which is defined as

    L(k, k*, t, t*) = L_cls(k, k*) + λ p* L_loc(t, t*)        (2)
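The box parameterization of (1) and the loss of (2) can be checked numerically. The sketch below is a minimal, illustrative implementation; the function names, λ = 1 default, and the indicator `p_star` are my own choices, not notation from the paper.

```python
import math

def encode(box, anchor):
    # Parameterization (1): boxes given as (cx, cy, w, h).
    x, y, w, h = box
    xa, ya, wa, ha = anchor
    return [(x - xa) / wa, (y - ya) / ha,
            math.log(w / wa), math.log(h / ha)]

def smooth_l1(diffs):
    # Smooth L1 applied elementwise to t - t*, then summed.
    total = 0.0
    for d in diffs:
        d = abs(d)
        total += 0.5 * d * d if d < 1.0 else d - 0.5
    return total

def multitask_loss(k, k_star, t, t_star, lam=1.0, p_star=1):
    # Eq. (2): log loss on the groundtruth class plus, for positive
    # samples (p* = 1), a smooth L1 loss on the regression offsets.
    l_cls = -math.log(k[k_star])
    l_loc = smooth_l1(ti - si for ti, si in zip(t, t_star))
    return l_cls + lam * p_star * l_loc
```

Encoding the groundtruth box against an anchor gives t*, and encoding the predicted box against the same anchor gives t; when prediction and groundtruth coincide, the loss reduces to the classification term alone.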
obtaining model M_R. Fine-tune the detector network using the proposals P, obtaining model M_D. Combine M_R and M_D as the final model.
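Only the tail of the training procedure survives in this excerpt, but the steps above (an RPN model M_R, its proposals P, a detector model M_D, then a combined model) suggest an alternating scheme. A skeletal sketch of that orchestration, where all four callables are hypothetical stubs standing in for the real training routines:

```python
def alternating_training(train_rpn, extract_proposals, train_detector, combine):
    """Orchestrate the alternating scheme; the four callables are
    placeholders, not the paper's actual training code."""
    m_r = train_rpn()                   # obtain RPN model M_R
    proposals = extract_proposals(m_r)  # proposals P produced by M_R
    m_d = train_detector(proposals)     # fine-tune detector on P, obtaining M_D
    return combine(m_r, m_d)            # merge M_R and M_D into the final model
```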
TABLE II
DETECTION RESULTS ON NEU-DET

TABLE III
COMBINING LAYERS IN DIFFERENT MANNERS
Fig. 7. Examples of detection results on NEU-DET. For each defect, the yellow box is the bounding box indicating its location and the green label is the class score. The subset to which the image belongs: (a) crazing, (b) inclusion, (c) patches, (d) pitted surface, (e) rolled-in scale, and (f) scratches.

layers, leading to a decline in proposal quality. With an increasing number of proposals, the naive RPN drops more sharply when IOU > 0.7. This is because the naive RPN extracts too many low-quality proposals, and this becomes more obvious as the number of proposals increases. The naive RPN works badly under a strict IOU threshold (e.g., IOU > 0.7). MFN can help the RPN obtain location information from low-level and mid-level features, which gives the RPN a higher tolerance for strict IOU thresholds.

VI. DISCUSSION

In this section, to demonstrate that our design is logical and advanced, we discuss several implicit factors that can influence defect detection.

A. Combine Which Layers for MFN?

MFN combines features from various levels into a multilevel feature, which is effective for improving detection. In Section III-B, it was briefly discussed what kind of layers should be combined. In DDN, we select four layers, namely the last layers of R2, R3, R4, and R5. It is then natural to ask whether other combinations of these four layers may result in better performance. Therefore, we train DDN + ResNet34 with five different combination manners on the NEU-DET data set. As shown in Table III, combining all four layers outperforms the other manners. This indicates that the multilevel feature is effective for improving detection accuracy.
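The IOU comparisons above use the standard box-overlap measure, and the proposal filtering described in Section III-C applies greedy NMS at a 0.7 threshold. A minimal sketch of both, assuming corner-format boxes; the function names are my own:

```python
def iou(a, b):
    # Boxes as (x1, y1, x2, y2); intersection over union.
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def greedy_nms(boxes, scores, thresh=0.7):
    # Repeatedly keep the highest-scoring box and discard any
    # remaining box overlapping it by more than `thresh` IOU.
    order = sorted(range(len(boxes)), key=lambda i: -scores[i])
    keep = []
    while order:
        best, order = order[0], order[1:]
        keep.append(best)
        order = [i for i in order if iou(boxes[best], boxes[i]) <= thresh]
    return keep
```

Lowering `thresh` suppresses more aggressively, which is why a stricter IOU setting exposes the quality gap between naive RPN proposals and MFN-based ones.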
Fig. 8. Recall versus IOU threshold on the NEU-DET at different numbers of region proposals. (a) 50 region proposals. (b) 100 region proposals. (c) 300
region proposals.
Fig. 9. Recall versus number of proposals on the NEU-DET at different IOU thresholds. (a) IOU threshold is 0.5. (b) IOU threshold is 0.6. (c) IOU threshold
is 0.7.
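The recall curves in Figs. 8 and 9 presumably measure the fraction of groundtruth boxes matched by at least one proposal above the IOU threshold; a sketch under that assumption, with illustrative helper names:

```python
def iou(a, b):
    # Boxes as (x1, y1, x2, y2); intersection over union.
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def proposal_recall(gt_boxes, proposals, thresh=0.5):
    # Fraction of groundtruth boxes covered by some proposal
    # with IOU above the threshold.
    hit = sum(1 for g in gt_boxes
              if any(iou(g, p) > thresh for p in proposals))
    return hit / len(gt_boxes)
```

Sweeping `thresh` from 0.5 to 0.7 for a fixed proposal set reproduces the shape of a recall-versus-IOU curve.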
Furthermore, the low-level features (e.g., the R2 feature) should receive more attention than the high-level features (e.g., the R5 feature) for defect detection, because the R2 feature has richer location information than the R5 feature.

B. Is the Simple Design More Effective for MFN?

The major role of MFN is to unify the features from different levels in resolution and dimensionality. To keep the dimension consistent, a straightforward approach is to use a 1 × 1 conv to reduce/increase the dimensionality. There are two placement patterns for the 1 × 1 conv: front-mounted and back-mounted. The front-mounted pattern, which is what we use in this paper, means that a 1 × 1 conv is placed before concatenating the multilevel feature, that is, at the end of each branch of MFN. The back-mounted pattern means that a 1 × 1 conv is placed after concatenating the multilevel feature. This pattern seems simple but in fact needs more parameters. Similar to [34], we also consider using multiple 5 × 5 convs to unify the resolution and dimensionality simultaneously. However, the 5 × 5 conv is an expensive operation: it has the same effect as two stacked 3 × 3 convs but requires additional parameters. Table IV shows the comparison among the three patterns in detail. The front-mounted style uses three times fewer parameters than the back-mounted style, and five times fewer than the hyper style. Therefore, MFN in the front-mounted style is less likely to overfit. Moreover, for the same resolution size, MFN features preserve more complete information owing to their larger dimensionality than the Hyper feature's (512 vs. 126).

C. Do We Need More Defect Data?

As is well known, an object detector can improve its performance with more training data [39]. Is this rule also effective for industrial defect data? To clarify this question, we train the DDN not only on the complete NEU-DET data set but also on each subset separately. As shown
1502 IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, VOL. 69, NO. 4, APRIL 2020
TABLE IV
UNIFORM DIMENSIONALITY IN DIFFERENT STYLES

Fig. 10. Detection time versus number of proposals on the NEU-DET. The detection time refers to the GPU runtime per image. Sliding window, Edge Boxes, and Selective Search are CPU-based methods that are far slower than GPU-based methods in detection speed.

Fig. 11. AP of each defect class on separate training versus complete training.
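The parameter argument behind the front-mounted placement in Section VI-B can be checked with a quick weight count (biases ignored). The channel widths below are illustrative choices consistent with the ResNet34 stage outputs and the 512-d fused feature mentioned earlier, not the values of Table IV:

```python
def conv1x1_params(c_in, c_out):
    # A 1 x 1 convolution has c_in * c_out weights (biases ignored).
    return c_in * c_out

# Hypothetical branch channel widths (ResNet34 stage outputs R2..R5),
# each reduced to 128 channels before concatenation (front-mounted).
branch_channels = [64, 128, 256, 512]
front = sum(conv1x1_params(c, 128) for c in branch_channels)

# Back-mounted: concatenate the raw branch features, then one
# 1 x 1 conv down to the same 512-d fused feature.
back = conv1x1_params(sum(branch_channels), 512)
```

With these widths the back-mounted placement needs four times the weights of the front-mounted one, which is consistent in spirit with the savings reported in Table IV.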
the ability to handle overlapped defects, and a success case is shown in Fig. 7(f). We surmise that the reason is that the "inclusion" and the "patches" in the figure are similar, and they influence each other when they are very close. For the "rolled-in scale," the bounding box may ignore some edge defects, as shown in Fig. 12(d), because such defects are too scattered to define their scope. A more ideal defect detector is still wanted, because there is room for improvement.

VII. CONCLUSION

In this paper, the DDN, a defect inspection system for steel plates, is proposed. This system is a DL network that can obtain the specific category and detailed location of a defect by fusing multilevel features. For defect detection tasks, our system can provide detailed and valuable indicators for a quality assessment system, such as the quantity, category, complexity, and area of a defect. Furthermore, we set up a valuable defect detection data set, NEU-DET. Experiments show that DDN can achieve 99.67% accuracy for the defect classification task and 82.3 mAP for the defect detection task. In addition, the system can run at a detection speed of 20 frames/s while keeping the mAP at 70. In the future, we will focus on two directions: one is data augmentation technology, owing to the expensive manual annotations in detection data sets; the other is to perform the defect segmentation task with DL technologies, which can obtain a more precise defect boundary.

REFERENCES

[1] D. Marr, Vision: A Computational Investigation Into the Human Representation and Processing of Visual Information. Cambridge, MA, USA: MIT, 2010, pp. 3–4.
[2] D. A. Forsyth, Computer Vision: A Modern Approach. Upper Saddle River, NJ, USA: Prentice-Hall, 2002, pp. 482–539.
[3] K. Song and Y. Yan, "A noise robust method based on completed local binary patterns for hot-rolled steel strip surface defects," Appl. Surf. Sci., vol. 285, pp. 858–864, Nov. 2013.
[4] P. Caleb-Solly and J. E. Smith, "Adaptive surface inspection via interactive evolution," Image Vis. Comput., vol. 25, no. 7, pp. 1058–1072, Jul. 2007.
[5] Y. Dong, D. Tao, X. Li, J. Ma, and J. Pu, "Texture classification and retrieval using shearlets and linear regression," IEEE Trans. Cybern., vol. 45, no. 3, pp. 358–369, Mar. 2015.
[6] M. Xiao, M. Jiang, G. Li, L. Xie, and L. Yi, "An evolutionary classifier for steel surface defects with small sample set," EURASIP J. Image Vid. Process., vol. 2017, no. 48, pp. 1–13, Dec. 2017.
[7] Y. Park and I. S. Kweon, "Ambiguous surface defect image classification of AMOLED displays in smartphones," IEEE Trans. Ind. Inform., vol. 12, no. 2, pp. 597–607, Apr. 2016.
[8] M. Chu, J. Zhao, X. Liu, and R. Gong, "Multi-class classification for steel surface defects based on machine learning with quantile hyper-spheres," Chemom. Intell. Lab. Syst., vol. 168, pp. 15–27, Sep. 2017.
[9] S. Ghorai, A. Mukherjee, M. Gangadaran, and P. K. Dutta, "Automatic defect detection on hot-rolled flat steel products," IEEE Trans. Instrum. Meas., vol. 62, no. 3, pp. 612–621, Mar. 2013.
[10] Q. Luo and Y. He, "A cost-effective and automatic surface defect inspection system for hot-rolled flat steel," Robot. Comput.-Integr. Manuf., vol. 38, pp. 16–30, Apr. 2016.
[11] K. Liu, H. Wang, H. Chen, E. Qu, Y. Tian, and H. Sun, "Steel surface defect detection using a new Haar–Weibull-variance model in unsupervised manner," IEEE Trans. Instrum. Meas., vol. 66, no. 10, pp. 2585–2596, Oct. 2017.
[12] M. Chu, R. Gong, S. Gao, and J. Zhao, "Steel surface defects recognition based on multi-type statistical features and enhanced twin support vector machine," Chemom. Intell. Lab. Syst., vol. 171, pp. 140–150, Sep. 2017.
[13] K. He, G. Gkioxari, P. Dollár, and R. Girshick, "Mask R-CNN," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Venice, Italy, Oct. 2017, pp. 2980–2988.
[14] S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards real-time object detection with region proposal networks," in Proc. Neural Inf. Process. Syst. (NIPS), Montreal, QC, Canada, Dec. 2015, pp. 91–99.
[15] J. Yosinski, J. Clune, Y. Bengio, and H. Lipson, "How transferable are features in deep neural networks?" in Proc. Neural Inf. Process. Syst. (NIPS), Montreal, QC, Canada, Dec. 2014, pp. 3320–3328.
[16] Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," Nature, vol. 521, no. 7553, pp. 436–444, May 2015.
[17] R. Ren, T. Hung, and K. C. Tan, "A generic deep-learning-based approach for automated surface inspection," IEEE Trans. Cybern., vol. 48, no. 3, pp. 929–940, Mar. 2018.
[18] Y. Li, G. Li, and M. Jiang, "An end-to-end steel strip surface defects recognition system based on convolutional neural networks," Steel Res. Int., vol. 88, no. 2, Feb. 2017, Art. no. 1600068.
[19] S. Zhou, Y. Chen, and D. Zhang, "Classification of surface defects on steel sheet using convolutional neural networks," Mater. Technol., vol. 51, no. 1, pp. 123–131, Feb. 2017.
[20] V. Natarajan, T.-Y. Hung, S. Vaikundam, and L.-T. Chia, "Convolutional networks for voting-based anomaly classification in metal surface inspection," in Proc. IEEE Int. Conf. Ind. Technol. (ICIT), Toronto, ON, Canada, Mar. 2017, pp. 986–991.
[21] P.-H. Chen and S.-S. Ho, "Is overfeat useful for image-based surface defect classification tasks?" in Proc. IEEE Int. Conf. Image Process. (ICIP), Phoenix, AZ, USA, Sep. 2016, pp. 749–753.
[22] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Anchorage, AK, Jun. 2009, pp. 248–255.
[23] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Boston, MA, USA, Jun. 2015, pp. 770–778.
[24] P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "OverFeat: Integrated recognition, localization and detection using convolutional networks," in Proc. Int. Conf. Learn. Represent. (ICLR), Banff, AB, Canada, Apr. 2014, pp. 1–16.
[25] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Proc. Neural Inf. Process. Syst. (NIPS), Las Vegas, NV, USA, Dec. 2012, vol. 60, no. 2, pp. 1097–1105.
[26] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the Inception architecture for computer vision," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Las Vegas, NV, USA, Jun. 2016, pp. 2818–2826.
[27] J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Columbus, OH, USA, Jun. 2015, pp. 3431–3440.
[28] R. Girshick, "Fast R-CNN," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Santiago, Chile, Dec. 2015, pp. 1440–1448.
[29] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Las Vegas, NV, USA, Jun. 2016, pp. 779–788.
[30] W. Liu et al., "SSD: Single shot multibox detector," in Proc. Springer Euro. Conf. Comput. Vis. (ECCV), Amsterdam, Netherlands, Oct. 2016, pp. 21–37.
[31] L. Zhang, Y. Gao, C. Hong, Y. Feng, J. Zhu, and D. Cai, "Feature correlation hypergraph: Exploiting high-order potentials for multimodal recognition," IEEE Trans. Cybern., vol. 44, no. 8, pp. 1408–1419, Aug. 2014.
[32] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proc. IEEE, vol. 86, no. 11, pp. 2278–2324, Nov. 1998.
[33] X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks," in Proc. 13th Int. Conf. Artif. Intell. Statist., vol. 9, May 2010, pp. 249–256.
[34] T. Kong, A. Yao, Y. Chen, and F. Sun, "HyperNet: Towards accurate region proposal generation and joint object detection," in Proc. IEEE Comput. Vis. Pattern Recognit. (CVPR), Las Vegas, NV, USA, Jun. 2016, pp. 845–853.
[35] C. L. Zitnick and P. Dollár, "Edge boxes: Locating object proposals from edges," in Proc. Springer Euro. Conf. Comput. Vis. (ECCV), Zurich, Switzerland, Oct. 2014, pp. 391–405.
[36] J. R. R. Uijlings, K. E. A. van de Sande, T. Gevers, and A. W. M. Smeulders, "Selective search for object recognition," Int. J. Comput. Vis., vol. 104, no. 2, pp. 154–171, Sep. 2013.
[37] Y. Wei et al., "Cross-modal retrieval with CNN visual features: A new baseline," IEEE Trans. Cybern., vol. 47, no. 2, pp. 449–460, Feb. 2017.
[38] J. Hosang, R. Benenson, P. Dollár, and B. Schiele, "What makes for effective detection proposals?" IEEE Trans. Pattern Anal. Mach. Intell., vol. 38, no. 4, pp. 814–830, Apr. 2016.
[39] X. Zhu, C. Vondrick, C. C. Fowlkes, and D. Ramanan, "Do we need more training data?" Int. J. Comput. Vis., vol. 119, no. 1, pp. 76–92, Aug. 2016.
[40] K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," in Proc. Int. Conf. Learn. Represent. (ICLR), San Diego, CA, USA, May 2015, pp. 1–16.

Yu He received the B.S. degree from the School of Mechanical Engineering and Automation, Liaoning Technical University, Fuxin, China, in 2014, and the M.S. degree from the School of Mechanical Engineering and Automation, Northeastern University, Shenyang, China, in 2016, where he is currently pursuing the Ph.D. degree. His current research interests include deep learning, pattern recognition, and intelligent inspection.

Kechen Song received the B.S., M.S., and Ph.D. degrees from the School of Mechanical Engineering and Automation, Northeastern University, Shenyang, China, in 2009, 2011, and 2014, respectively. Since 2014, he has been a Teacher with Northeastern University. His current research interests include vision-based inspection systems for steel surface defects, surface topography, image processing, and pattern recognition.

Qinggang Meng received the B.S. and M.S. degrees from the School of Electronic Information Engineering, Tianjin University, Tianjin, China, and the Ph.D. degree in computer science from Aberystwyth University, Aberystwyth, U.K. He is currently a Professor with the Department of Computer Science, Loughborough University, Loughborough, U.K. His current research interests include biologically and psychologically inspired learning algorithms and developmental robotics, service robotics, robot learning and adaptation, multi-unmanned aerial vehicle cooperation, driver distraction detection, human motion analysis and activity recognition, activity pattern detection, pattern recognition, artificial intelligence, and computer vision. Dr. Meng is a fellow of the Higher Education Academy, U.K.

Yunhui Yan received the B.S., M.S., and Ph.D. degrees from the School of Mechanical Engineering and Automation, Northeastern University, Shenyang, China, in 1981, 1985, and 1997, respectively. Since 1982, he has been a Teacher with Northeastern University, and became a Professor in 1997. From 1993 to 1994, he was a visiting scholar at the Tohoku National Industrial Research Institute, Sendai, Japan. His current research interests include intelligent inspection, image processing, and pattern recognition.