A Ship Detection Method Base
A Ship Detection Method Base
Jie Z Yufen S
Computer Science and Technology department Computer Science and Technology department
Wuhan University of Technology Wuhan University of Technology
Wuhan, China Wuhan, China
e-mail: [email protected] e-mail: [email protected]
Abstract—A Ship Detection Method based on Recurrent recognition method based on feedforward neural network.
Neural Network in a Marine Radar System is investigated. However, the author has not deleted the feature dimension
Considering that the characteristics of the single-frame radar which exists the problem, and the performance of the model
image are not obvious, we selected image sequences in other classifiers needs to be further expanded. Some
concatenated by single-frame radar images as research objects studies have also selected image sequences that are
and designed a coding mode to complete the mapping of radar concatenated by single-frame radar images as research
targets to vectors. Our model based on RNN made a good objects. Weigen Huang [3] proposed a ship target
performance on the dataset made by ourselves. recognition method based on Kalman filter in 2014 and
realized path prediction through similarity calculation of
Keywords-radar image; sequence; RNN; detection Normalized Cross-correlation function. The focus of this
I. INTRODUCTION model is on the identification and path tracking of a single
ship target in a continuous frame radar image, without
The shipborne radar target recognition, that is, radar effectively distinguishing ship targets and clutter.
detection and identification of the target against a Considering that the characteristics of the single-frame
background of clutter and interference, is of great radar image are not obvious, this paper intends to use the
significance in maritime surveillance, ship route planning, target sequence composed of continuous N-frame radar
risk warning and so on. At this stage, the target recognition image as the feature extraction object. Through RNN, we
of shipborne radar still has great difficulty. On the one hand, hoped that the context information of the radar image target
the radar image is wholly composed of the strength of the sequence could be excavated to enhance the credibility of
target echo, and the number of its features is far less than the radar target recognition, and effectively identify the ship
that of the optical image. The existence of factors such as target and clutter.
the dense distribution of sea targets and the dynamic
variation of sea clutter has increased the difficulty of target II. RELATED WORK
recognition of shipborne radar. On the other hand, shipborne
The research on target recognition of shipborne radar
radar is susceptible to weather and various clutter, noise and
has been continuously developed since the 1950s as a
co-channel interference in practical applications. Target
specific application of radar target recognition. Over the
echo signals and radar images are severely affected,
past decades, with the continuous development of radar
resulting in the inability to distinguish ship targets and
technology and continuous enrichment and improvement of
clutter
At this stage, the field of shipborne radar target various basic theories, shipborne radar target recognition
recognition mainly uses single-frame radar signals as has achieved excellent results. In recent years, the rapid
research objects, whether it is a one-dimensional range growth of techniques such as machine learning and deep
image, SAR image or other forms of radar echo signals. Yu learning has also provided a new solution for shipborne
Zhi-fu [1] proposed a fusion recognition method based on radar target recognition.
PSO-BP neural network in 2012. The BP neural network In 1983, PE Zwicke [4] developed a digitally
optimized by particle swarm optimization (PSO) algorithm implemented Mellin transform and applied it to radar target
was applied to radar image recognition. However, the recognition. The translation and scale invariance of the
problem with the model is that the feature parameters with Mellin transform provides a way to extract features from the
discrete values are not selected, so the recognition efficiency radar target profile that are insensitive to the radar's vertical
of the negative samples needs to be further improved. A and horizontal angles. The authors use the direct extension
problem with the model is that the feature parameters with of the Mellin basic definition to implement the Mellin
discrete values are not selected, so the recognition of the transform. The new method eliminates several problems
negative samples needs to be further improved. In 2013, such as the explosion of the sample number caused by
Nedyalko Petrov [2] designed an encoded mode for radar exponential sampling and the error mixed in the
echo signal eigenvectors and proposed a radar target transformation caused by the calculation of the interpolation
549
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on October 23,2023 at 03:36:30 UTC from IEEE Xplore. Restrictions apply.
feature vector. We borrowed the idea that the RNN model
used dimensional reduction for multidimensional input in
NLP and selected the vector coding method combining
Label Encoding and binary coding to extract 15D for each
input target contour. The feature vector is used for the
calculation of subsequent models.
The first ten-dimensional features of each set of feature
vectors are tagged. The content is the HU moment of the
target contour (HU [0]-HU [6]), the centroid of the target
contour, and the area, representing the physical meaning of
the target contour. The first ten-dimensional features are
used to calculate the similarity between the target contours
Figure 2. The Pre-processing Network in the process of contour sequence extraction. The last five-
dimensional features of feature vectors are binary coding.
A. Target Contour Extraction According to the five characteristics summarized above,
The pre-processing network reads the continuous N- five thresholds are set after several rounds of testing. Each
frame radar image from the corresponding folder. After feature is set 0 or 1 according to whether it reaches the
completing the grey-scale conversion, we use OpenCV’s threshold. The specific meanings of the five-dimensional
findContours () function to extract the contour of each frame features are: 1.Whether the area of the target contour is
of the radar image and complete the preliminary mapping smaller than the limited area S; 2.Whether the target contour
from the target contour to the contour vector. After the is at the edge of the image; 3.Whether there are other
garbage outline with a contour area of 0 is removed, the contours within the range of the limited distance d below the
remaining contour vectors are input to the next stage for target contour; 4.Whether the length of the sequence in
feature vector extraction. which the target contour is located is smaller than the
limited length L; 5.Whether the displacement of the target
B. Feature Vector Extraction contour and the previous one is smaller than limited
According to the different characteristics of the above- displacement X. Table I gives a sample of feature vector
mentioned summarized clutter and ship target in the radar coding.
image, we design the composition rules of the target contour
550
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on October 23,2023 at 03:36:30 UTC from IEEE Xplore. Restrictions apply.
V. RNN
Because the characteristics of single-frame radar image
are not obvious and it is difficult to distinguish from clutter,
this model uses RNN as the classifier. We hoped that the
radar target recognition credibility could be enhanced by
extracting the context information of the radar target
contour sequence to achieve an effective distinction between
the ship target and clutter. In this model, the input of the
RNN is a target contour feature vector of 15-dimensional,
and the output is a 1-dimensional size target contour
category, where 0 represents the target contour as the ship
Figure 4. The RNN structure at a certain moment
target, and 1 represents the clutter. Since the input vector
dimension of the RNN is relatively low and there is no
The loss function of backpropagation in the model uses
situation where the dimensional explosion affects the
the standard logistic regression loss function, and the update
performance of the model, we use the basic RNN unit
of the model parameters such as weights in the model is
structure for training.
realized by the gradient descent. To prevent the model from
the gradient explosion in the back-propagation due to
abnormal data, we adopt the method of gradient clipping to
suppress. When the amplitude of the gradient is greater than
the set threshold, the gradient will be equal to the threshold,
that is, the gradient stops increasing.
B. Category Annotation
In the test mode, the model will classify the
corresponding target contour according to the output of the
RNN, which complete the final step of the end-to-end
output of the entire model. According to the results of
several rounds of model training tests, the boundary value
Figure 3. The RNN structure between the ship target and the clutter is stable at around
0.6590, so we set the threshold to 0.6590. For the contour
Fig. 3 shows the RNN structure under a complete with the value of the output layer higher than 0.6590, we use
sequence of a target, where w(t), s(t), s(t-1), y(t) putText () in the library OpenCV to tag it as clutter, and the
respectively represents the input layer, the hidden layer of contour whose output value is less than or equal to 0.6590 is
the current and the previous moment, the output layer, and tagged as a ship.
U, V, W are the corresponding weight matrixes. In the
actual model, w(t) and s(t-1) are combined into the input VI. EXPERIMENT
layer.
A. Data Preparation
A. Forward Propagation and Back Propagation In the field of radar image target recognition, there is no
Like other RNN models, the RNN of our model includes more general training data set for the academic research of
two parts: forward propagation and backpropagation. The shipborne radar target recognition. Considering the
forward propagation is mainly used for the identification influence of different data sets on model parameters and
between ship targets and clutter. Also, the information recognition effects, our team came to the Yangtze River
contained in the current feature vector is passed to the next. with 4G radar. Field data collection was carried out on the
Backpropagation is mainly used to update the parameters edge of Wuchang River Beach. To eliminate the influence
and weights of the model. of some factors in the acquisition process on the
According to the structure diagram of the RNN at a experimental results, we removed some radar images with
certain moment given in Fig. 4, we can clearly understand radar noise and fusion between two targets in the dataset.
how we calculate the category probability for the radar The 1000 consecutive radar images with clear radar
target contour at each moment. Considering that the contours and less radar noise are selected for model training
problem dealt with this model is a two-class problem (the and testing. At the same time, in order to avoid the
distinction between ship target and clutter), the target output interference of the complex radar contours on the bridge
function uses the sigmoid function, so the RNN output pier and both sides of the Yangtze River, we selected the
formula at each moment is: target area of the same size that contains only the ship and
the buoy (i.e. non-hull) and uniformly cut the radar image.
s( ) () s - (1) For the data used for model training, we use the labelImg
() s (2) tool to mark the location and the information of category of
each target in the radar image.
() (3)
551
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on October 23,2023 at 03:36:30 UTC from IEEE Xplore. Restrictions apply.
TABLE II. THE TEST RESULTS
contours contours
552
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on October 23,2023 at 03:36:30 UTC from IEEE Xplore. Restrictions apply.
[4] Zwicke P E, Kiss I. A new implementation of the mellin transform [7] Granger E, Rubin M, Grossberg S, et al. A what-and-where fusion
and its application to radar classification of ships[J]. IEEE neural network for recognition and tracking of multiple radar
Transactions on Pattern Analysis & Machine Intelligence, 1983, emitters[J]. Neural Netw, 2001, 14(3):325-344.
5(2):191-9. [8] Azimi-Sadjadi M R, Yao D, Huang Q, et al. Underwater target
[5] Yin Z, Yang W, Yang Z, et al. A Study on Radar Emitter Recognition classification using wavelet packets and neural networks[J]. IEEE
Based on SPDS Neural Network[J]. Information Technology Journal, Transactions on Neural Networks, 2000, 11(3):784-794.
2011, 10(4):883-888. [9] Ting C, Wei G, Bing S. A new radar emitter recognition method
[6] Wu F, Wang C, Jiang S, et al. Classification of Vessels in Single-Pol based on pulse sample figure[C]// Eighth International Conference on
COSMO-SkyMed Images Based on Statistical and Structural Fuzzy Systems & Knowledge Discovery. IEEE, 2011.
Features[J]. Remote Sensing, 2015, 7(5):5511-5533. [10] Mikolov T, Chen K, Corrado G, et al. Efficient Estimation of Word
Representations in Vector Space[J]. Computer Science, 2013.
553
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on October 23,2023 at 03:36:30 UTC from IEEE Xplore. Restrictions apply.