Identifying Threat Objects Using Faster Region-Based Convolutional Neural Networks (Faster R-CNN)
1. Introduction
Terrorist attacks in many countries result in injuries and deaths of civilians and even military
personnel [1]. The Philippines is no exception, with recent terrorist attacks [2] carried out using
improvised explosive devices (IEDs). An IED is a homemade explosive device built by perpetrators to
harm people. An IED generally contains a power source, a switch, an initiator, wires, and a main charge.
The power source, commonly a 9-volt battery, powers the initiator (electric or non-electric) to start the
detonation of the main charge. The switch controls the arming or firing of the IED.
In the Global Terrorism Index 2022, the Philippines was listed in the top 20 countries most impacted
by terrorism [3]. As a safety measure, tightened security is strictly implemented in public transport
systems such as airport terminals and train stations, as well as in commercial establishments. Pieces of
baggage are scanned using an X-ray machine to identify the objects inside and look for threats like
explosives and bladed weapons. Although this process is effective, the possibility of missed detections is
high during rush hour because of the limited time available to scan thousands of bags and identify threat
objects [4]. As a solution, this paper uses the Faster Region-based Convolutional Neural Network (Faster
R-CNN) to identify threat objects (e.g., battery, mortar, wires) in X-ray images to aid the operator in
deciding whether a piece of baggage poses a threat. Faster R-CNN [5] is a deep learning-based object
detector from the family of region-based convolutional neural networks that introduces the Region
Proposal Network (RPN). This network accepts a feature map and outputs object proposals (bounding
boxes) with corresponding objectness scores.
To date, several studies in the computer vision field have explored Faster R-CNN in many different
applications, such as vehicle detection [6], disease detection [7], [8], face detection [9], [10], ship
detection [11], [12], metal object detection [13], radar images [14], defect detection [15], [16], object
detection in medical images [17], [18], and autonomous driving [19]. Although many researchers have
successfully implemented Faster R-CNN for object detection, only a few studies [20] have explored this
detector for X-ray images because of the limited data available and the complicated procedures involved
in collecting X-ray images. Some researchers used different approaches [21], such as an improved Mask
R-CNN [22], X-ray proposal and discriminative networks [23], and a multi-view branch-and-bound
search algorithm [24], for object detection in X-ray images. The researchers in [25] and [26] implemented
deep learning-based object detectors for identifying threat objects such as IEDs. However, a detailed
evaluation is still needed to determine the right configuration and trade-offs.
The contributions of this paper are as follows: (a) an extensive evaluation of the Faster R-CNN
architecture for threat object detection, (b) an investigation of how the number of bounding box proposals
and the image resolution affect the performance of the threat object detector, and (c) experiments on how
to improve the performance of the threat object detector in terms of mean average precision (mAP) and
speed.
2. Method
The overview of the Faster R-CNN architecture for identifying threat objects is shown in Fig. 1.
Fig. 1. Overview of the Faster R-CNN architecture: the preprocessed input image passes through a CNN
that produces feature maps; the RPN turns anchor boxes into proposals, and fully connected (FC) layers
output the final class and bounding box predictions.
The input is an X-ray image with corresponding class labels and bounding boxes. X-ray images are
fed to a preprocessing stage (resizing and augmentation) before feature extraction. Data augmentation
applies random geometric transformations to the images to enlarge the training data. Features are
extracted by a CNN via transfer learning, with ResNet-101 [27] as the base network. The RPN module
accepts anchor boxes and looks for possible objects in the image. The anchor boxes serve as references at
multiple scales (e.g., 64 × 64, 128 × 128, and 256 × 256) and aspect ratios (e.g., 1:1, 2:1, 1:2), with nine
anchor boxes centered at every sliding-window position. The RPN module then determines the objectness
score of each anchor and proposes regions where objects are possibly located. The objectness score
measures the probability that an anchor contains an object. The output of the RPN module is
bounding box proposals, each with an objectness score. The region of interest (ROI) pooling module
accepts the top N proposals from the RPN module and extracts fixed-size windows of ROI features from
the feature maps. N was varied from 10 to 450 to determine its effect on detection performance. The ROI
pooling module resizes each proposal's feature map to 14 × 14 × D, where D is the depth of the feature
map. Max pooling with a stride of 2 then yields a 7 × 7 × D feature vector that is fed to two fully connected
(FC) layers and finally to two sibling output layers that yield the class label and the bounding box. The
class output has four dimensions (3 classes + 1 background), the classes being battery, mortar, and wires,
while the bounding box output has twelve dimensions (4 coordinates × 3 classes).
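To make the anchor configuration concrete, the following minimal sketch (not the authors' code) generates the nine reference anchors per sliding-window position from the scales and aspect ratios above:

```python
import numpy as np

def generate_anchors(scales=(64, 128, 256), ratios=(1.0, 2.0, 0.5)):
    """Generate the 9 reference anchors (3 scales x 3 ratios) centered at (0, 0).

    Each anchor is (x1, y1, x2, y2). Shifting these by the feature-map
    stride gives the anchors at every sliding-window position.
    """
    anchors = []
    for scale in scales:
        for ratio in ratios:
            # Keep the anchor area equal to scale**2 while varying the ratio.
            w = scale * np.sqrt(ratio)
            h = scale / np.sqrt(ratio)
            anchors.append((-w / 2, -h / 2, w / 2, h / 2))
    return np.array(anchors)

print(generate_anchors().round(1))  # shape (9, 4)
```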
2.1. Dataset
Dataset collection was done using a dual-view X-ray machine. A video recorder was used to capture
the X-ray images displayed on the computer monitor. The images were collected by extracting one out of
every five frames (20%) of a given video file to reduce similarity between consecutive extracted images.
For example, a 60-second video with a frame rate of 30 frames per second (fps) contains 1,800 frames,
from which 360 images are extracted. Once extracted, the images were manually selected based on their
clarity and quality. Finally, the images were labeled according to classes using
LabelImg [28]. The dataset was called IEDXray [25], as shown in Fig. 2, which is composed of X-ray
images of IED replicas without the main charge. The left part of the figure shows the one-channel
histogram (grayscale) of the sample X-ray image. The histogram shows that the pixel intensities of the
image are concentrated approximately between 200 and 255 (white pixels). This dataset contains the
basic circuitry of an IED without explosive material. Six IED types were scanned in the X-ray machine.
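The frame extraction step can be sketched as follows (OpenCV is assumed, and the file names are hypothetical):

```python
import cv2  # OpenCV

def extract_frames(video_path, out_dir, step=5):
    """Save every `step`-th frame of a video as a PNG image."""
    cap = cv2.VideoCapture(video_path)
    index = saved = 0
    while True:
        ok, frame = cap.read()
        if not ok:  # end of video
            break
        if index % step == 0:
            cv2.imwrite(f"{out_dir}/frame_{saved:05d}.png", frame)
            saved += 1
        index += 1
    cap.release()
    return saved

# e.g., a 60 s video at 30 fps (1,800 frames) yields 360 images:
# extract_frames("ied_scan.mp4", "frames")  # hypothetical file names
```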
Detections were evaluated using the intersection over union (IoU) in (1), where $X_G$ is the ground-truth
bounding box, $X_P$ is the predicted bounding box, and $A(\cdot)$ denotes the area of a region.

$IoU = \frac{A(X_G \cap X_P)}{A(X_G \cup X_P)}$ (1)
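A direct sketch of (1) for axis-aligned boxes given as (x1, y1, x2, y2) corners:

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)  # A(X_G intersect X_P)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)   # denominator is A(X_G union X_P)

print(iou((0, 0, 100, 100), (50, 0, 150, 100)))  # 0.333...
```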
Then, the precision P and recall R values were calculated to compute the average precision AP. P in
(2) measures the percentage of correct positive predictions, and R in (3) measures the ability of the model
to find all ground-truth bounding boxes, where TP, FP, and FN are the numbers of true positives, false
positives, and false negatives, respectively.
$P = \frac{TP}{TP + FP}$ (2)

$R = \frac{TP}{TP + FN}$ (3)
Given that the average precision AP is the precision P averaged across all recall values between 0 and 1,
the mAP in (4) can be computed by averaging the AP over all C classes (here, C = 3). The classes were
battery, mortar, and wires.
$mAP = \frac{1}{C} \sum_{i=1}^{C} AP_i$ (4)
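As a quick sketch of (2)-(4), assuming the per-class counts and per-class AP values have already been computed:

```python
def precision(tp, fp):
    """Eq. (2): fraction of positive predictions that are correct."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Eq. (3): fraction of ground-truth objects that are found."""
    return tp / (tp + fn)

def mean_average_precision(ap_per_class):
    """Eq. (4): mAP as the mean of the per-class APs."""
    return sum(ap_per_class) / len(ap_per_class)

# Using the per-class APs reported for 150 proposals in Table 1:
print(mean_average_precision([0.7292, 0.9862, 0.5540]))  # ~0.7565
```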
Table 1. Faster R-CNN performance for different numbers of bounding box proposals

Proposals | mAP    | AP_battery | AP_mortar | AP_wires | Time (ms)
----------|--------|------------|-----------|----------|----------
10        | 0.6733 | 0.6923     | 0.9885    | 0.3391   | 89.55
75        | 0.7510 | 0.7381     | 0.9874    | 0.5274   | 104.48
100       | 0.7359 | 0.7034     | 0.9862    | 0.5180   | 126.87
150       | 0.7565 | 0.7292     | 0.9862    | 0.5540   | 134.33
300       | 0.7222 | 0.6843     | 0.9828    | 0.4994   | 171.64
450       | 0.7449 | 0.7374     | 0.9828    | 0.5146   | 216.42
The precision and recall for each class using 150 bounding box proposals are shown in Table 2. It can
be seen that the Faster R-CNN detected the mortar with high precision (96.67%) and high recall (100%),
while the wires were detected less accurately, with 87.41% precision and 65.10% recall.
The performance of Faster R-CNN for each number of bounding box proposals during evaluation is
shown in Fig. 3. It can be seen that using only 10 bounding box proposals significantly reduces the
performance of the object detector.
The inference time for each number of bounding box proposals was also evaluated. The comparison of
mAP versus time for different numbers of bounding box proposals is presented in Fig. 4. Using 450
bounding box proposals gives the slowest inference, while 10 bounding box proposals are the fastest but
give the lowest mAP. The graph indicates that 75 bounding box proposals offer the best trade-off between
speed and mAP.
The precision and recall for each class using the 900 × 1536 resolution are shown in Table 4. It can be
seen that the Faster R-CNN detected the mortar with high precision (93.55%) and high recall (100%),
while the wires were detected less accurately, with 77.84% precision and 75% recall.
The mAP plot for different image resolutions is shown in Fig. 5. Interestingly, the image size was
observed to affect the performance of the object detector: increasing the image size also increases its
mAP.
As in the bounding box proposal experiment, the inference time at different image resolutions was also
examined. The comparison of mAP versus time for different image resolutions is presented in Fig. 6. A
higher mAP can be achieved by sacrificing the speed of the object detector: every 150-pixel increase in
the shorter edge (and 256-pixel increase in the longer edge) of the input image increases the mAP while
slowing down evaluation.
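For readers reproducing these experiments with an off-the-shelf implementation, the two quantities varied in this paper map, for example, onto constructor arguments of torchvision's Faster R-CNN. The sketch below is only an illustration under that assumption: torchvision's stock detector uses a ResNet-50 FPN backbone rather than the ResNet-101 used here.

```python
import torchvision

# A sketch, not the authors' code: min_size/max_size control the resizing
# of the input image, and rpn_post_nms_top_n_test caps the number of
# proposals kept from the RPN at test time.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(
    weights="DEFAULT",
    min_size=900,                 # shorter edge of the resized input
    max_size=1536,                # upper bound on the longer edge
    rpn_post_nms_top_n_test=150,  # top-N proposals from the RPN
)
model.eval()
```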
After training and evaluating the Faster R-CNN, the trained model was tested on an X-ray image to
verify its detection performance. A Python script was developed that accepts an input image, performs
inference, and outputs the bounding box coordinates and corresponding class labels of the threat objects.
The detection output using Faster R-CNN is shown in Fig. 7. The class label and class score of each
detected object are shown above its bounding box. The model was able to detect all three classes of IED
components, namely battery, mortar, and wires.
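A minimal sketch of such an inference script, assuming a torchvision-style model and hypothetical file names:

```python
import torch
from torchvision.io import read_image
from torchvision.transforms.functional import convert_image_dtype

CLASSES = ["background", "battery", "mortar", "wires"]

def detect(model, image_path, score_threshold=0.5):
    """Run inference on one image and return (box, label, score) triples."""
    image = convert_image_dtype(read_image(image_path), torch.float)
    with torch.no_grad():
        output = model([image])[0]  # torchvision detectors take a list of images
    results = []
    for box, label, score in zip(output["boxes"], output["labels"],
                                 output["scores"]):
        if score >= score_threshold:
            results.append((box.tolist(), CLASSES[label], float(score)))
    return results

# Hypothetical usage:
# model = torch.load("faster_rcnn_iedxray.pt"); model.eval()
# for box, label, score in detect(model, "bag_scan.png"):
#     print(label, f"{score:.2f}", box)
```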
4. Conclusion
This study extensively evaluated Faster R-CNN in identifying threat objects in an X-ray image
dataset. Different experiments were conducted to increase the performance of the threat object detector
by changing the number of bounding box proposals and the input image resolution. These experiments
confirmed that increasing the number of bounding box proposals may lower the mean average precision
(mAP) and slow down detection. The research has also shown that increasing the input image size
positively impacts the mAP at the cost of speed. When using Faster R-CNN, it is recommended to identify
the best trade-off between mAP and speed by balancing the number of bounding box proposals and the
image size. Overall, the experimental results show that the proposed method can reliably identify threat
objects in an X-ray image.
More X-ray images can be added to the training data to improve this study further. The added data
should ideally include objects other than IED components. This may increase the generalizability of the
IED detector and reduce false positives and false negatives. If acquiring additional data is not feasible,
another option is to generate synthetic X-ray images using generative models such as generative
adversarial networks (GANs) and variational autoencoders (VAEs).
Acknowledgment
The authors thank Bulacan State University, De la Salle University, and the Engineering Research
and Development for Technology (ERDT), Department of Science and Technology (DOST) for their
financial support while doing this research.
Declarations
Author contribution. Reagan Galvez performed the manuscript revision, data acquisition, training, and
evaluation. Elmer Dadios provided consultations to improve the content of the paper.
Funding statement. The Philippine Council for Industry, Energy, and Emerging Technology Research
and Development (PCIEERD) funded the research under Project No. 05464.
Conflict of interest. The authors declare no conflict of interest.
Additional information. No additional information is available for this paper.
References
[1] C. Schmeitz, D. Barten, K. Van Barneveld, H. De Cauwer, L. Mortelmans, F. Van Osch, J. Wijnands, E.
C. Tan, and A. Boin, "Terrorist Attacks Against Emergency Medical Services: Secondary Attacks are an
Emerging Risk," Prehosp. Disaster Med., vol. 37, no. 2, pp. 185–191, 2022, doi: 10.1017/S1049023X22000140.
[2] S. Buigut, B. Kapar, and U. Braendle, "Effect of regional terrorism events on Malaysian tourism demand,"
Tour. and Hospit. Res., vol. 22, no. 3, pp. 271–283.
[3] Institute for Economics & Peace, "Global terrorism index 2022: measuring the impact of terrorism," 2022.
Accessed: Dec. 20, 2022. [Online]. Available: http://visionofhumanity.org/reports/
[4] V. Riffo, S. Flores, and D. Mery, “Threat Objects Detection in X-ray Images Using an Active Vision
Approach,” J. Nondestruct. Eval., vol. 36, no. 3, p. 44, Sep. 2017, doi: 10.1007/s10921-017-0419-3.
[5] S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: towards real-time object detection with region
proposal networks,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, Jun. 2017, doi:
10.1109/TPAMI.2016.2577031.
[6] H. Ji, Z. Gao, T. Mei, and Y. Li, “Improved faster r-cnn with multiscale feature fusion and homography
augmentation for vehicle detection in remote sensing images,” IEEE Geosci. Remote Sens. Lett., vol. 16, no.
11, pp. 1761–1765, 2019, doi: 10.1109/LGRS.2019.2909541.
[7] G. Zhou, W. Zhang, A. Chen, M. He, and X. Ma, “Rapid detection of rice disease based on FCM-KM and
faster r-cnn fusion,” IEEE Access, vol. 7, pp. 143190–143206, 2019, doi: 10.1109/ACCESS.2019.2943454.
[8] F. Deng, W. Mao, Z. Zeng, H. Zeng, and B. Wei, “Multiple diseases and pests detection based on federated
learning and improved faster R-CNN,” IEEE Trans. Instrum. Meas., vol. 71, pp. 1–11, 2022, doi:
10.1109/TIM.2022.3201937.
[9] W. Wu, Y. Yin, X. Wang, and D. Xu, “Face detection with different scales based on Faster R-CNN,” IEEE
Trans. Cybern., vol. 49, no. 11, pp. 4017–4028, Nov. 2019, doi: 10.1109/TCYB.2018.2859482.
[10] P. J. Lu and J.-H. Chuang, “Fusion of multi-intensity image for deep learning-based human and face
detection,” IEEE Access, vol. 10, pp. 8816–8823, 2022, doi: 10.1109/ACCESS.2022.3143536.
[11] Z. Lin, K. Ji, X. Leng, and G. Kuang, “Squeeze and excitation rank Faster R-CNN for ship detection in
SAR images,” IEEE Geosci. Remote Sens. Lett., vol. 16, no. 5, pp. 751–755, May 2019, doi:
10.1109/LGRS.2018.2882551.
[12] Y. Li, S. Zhang, and W.-Q. Wang, “A lightweight faster R-CNN for ship detection in SAR images,” IEEE
Geosci. Remote Sens. Lett., vol. 19, pp. 1–5, 2022, doi: 10.1109/LGRS.2020.3038901.
[13] R. Gao et al., “Small foreign metal objects detection in X-Ray images of clothing products using faster R-
CNN and feature pyramid network,” IEEE Trans. Instrum. Meas., vol. 70, pp. 1–11, 2021, doi:
10.1109/TIM.2021.3077666.
[14] R. Gonzales-Martinez, J. Machacuay, P. Rotta, and C. Chinguel, “Hyperparameters tuning of faster R-CNN
deep learning transfer for persistent object detection in radar images,” IEEE Lat. Am. Trans., vol. 20, no. 4,
pp. 677–685, Apr. 2022, doi: 10.1109/TLA.2022.9675474.
[15] Y. Zhang, Z. Zhang, K. Fu, and X. Luo, “Adaptive defect detection for 3-D printed lattice structures based
on improved faster R-CNN,” IEEE Trans. Instrum. Meas., vol. 71, pp. 1–9, 2022, doi:
10.1109/TIM.2022.3200362.
[16] F. Selamet, S. Cakar, and M. Kotan, “Automatic detection and classification of defective areas on metal
parts by using adaptive fusion of faster R-CNN and shape from shading,” IEEE Access, vol. 10, pp. 126030–
126038, 2022, doi: 10.1109/ACCESS.2022.3224037.
[17] Y. Liu, Z. Ma, X. Liu, S. Ma, and K. Ren, “Privacy-preserving object detection for medical images with
faster R-CNN,” IEEE Trans. Inf. Forensics Secur., vol. 17, pp. 69–84, 2022, doi:
10.1109/TIFS.2019.2946476.
[18] Z. Qian et al., “A new approach to polyp detection by pre-processing of images and enhanced faster R-
CNN,” IEEE Sens. J., vol. 21, no. 10, pp. 11374–11381, May 2021, doi: 10.1109/JSEN.2020.3036005.
[19] G. Wang, J. Guo, Y. Chen, Y. Li, and Q. Xu, “A PSO and BFO-based learning strategy applied to Faster
R-CNN for object detection in autonomous driving,” IEEE Access, vol. 7, pp. 18840–18859, 2019, doi:
10.1109/ACCESS.2019.2897283.
[20] S. Akcay, M. E. Kundegorski, C. G. Willcocks, and T. P. Breckon, “Using deep convolutional neural
network architectures for object classification and detection within X-ray baggage security imagery,” IEEE
Trans. Inf. Forensics Secur., vol. 13, no. 9, pp. 2203–2215, Sep. 2018, doi: 10.1109/TIFS.2018.2812196.
[21] D. Mery, D. Saavedra, and M. Prasad, “X-Ray baggage inspection with computer vision: a survey,” IEEE
Access, vol. 8, pp. 145620–145633, 2020, doi: 10.1109/ACCESS.2020.3015014.
[22] J. Zhang, X. Song, J. Feng, and J. Fei, “X-Ray image recognition based on improved Mask R-CNN
algorithm,” Math. Probl. Eng., vol. 2021, pp. 1–14, Sep. 2021, doi: 10.1155/2021/6544325.
[23] B. Gu, R. Ge, Y. Chen, L. Luo, and G. Coatrieux, “Automatic and robust object detection in X-Ray baggage
inspection using deep convolutional neural networks,” IEEE Trans. Ind. Electron., vol. 68, no. 10, pp. 10248–
10257, Oct. 2021, doi: 10.1109/TIE.2020.3026285.
[24] M. Baştan, “Multi-view object detection in dual-energy X-ray images,” Mach. Vis. Appl., vol. 26, no. 7–8,
pp. 1045–1060, Nov. 2015, doi: 10.1007/s00138-015-0706-x.
[25] R. L. Galvez, E. P. Dadios, A. A. Bandala, and R. R. P. Vicerra, “Object detection in x-ray images using
transfer learning with data augmentation,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 9, no. 6, p. 2147, Dec.
2019, doi: 10.18517/ijaseit.9.6.9960.
[26] R. L. Galvez and E. P. Dadios, “Threat object detection and analysis for explosive ordnance disposal robot,”
Glob. J. Eng. Technol. Adv., vol. 11, no. 1, pp. 078–087, Apr. 2022, doi: 10.30574/gjeta.2022.11.1.0074.
[27] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in 2016 IEEE
Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2016, pp. 770–778, doi:
10.1109/CVPR.2016.90.