Ship Detection For KOMPSAT-3A Optical Images Using Binary Features and Adaboost Classification
Ship Detection For KOMPSAT-3A Optical Images Using Binary Features and Adaboost Classification
Jae Young Chang* , Han Oh, Seung-Jae Lee, and Kwang Jae Lee
National Satellite Operation and Application Center, Korea Aerospace Research Institute (KARI)
IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium | 978-1-7281-6374-1/20/$31.00 ©2020 IEEE | DOI: 10.1109/IGARSS39084.2020.9323847
*
E-mail: [email protected]
Supervised machine learning is the process of selecting fea- 2.1. Binary features
tures considered to be important to distinguish the objects of
interest from the background and determining their weights 2.1.1. MCT (Modified Census Transform) feature
through trial and error. This requires a dataset that represents
A MCT feature is defined as an ordered set of comparisons of
the correct answer.
pixel intensities in a local neighborhood representing which
Some datasets have been published for object detection in
pixels have lesser intensity than the intensity mean [5]. Let
satellite images, and they differ in the type of sensor used for
N (x) define a local spatial neighborhood of the pixel at x.
image acquisition, the number of object types, and the label ¯
The intensity mean on this neighborhood is denoted by I(x).
data format. xView dataset consists of horizontal rectangle ¯ ¯
Let a comparison function ζ I(x), I(y) be 1 if I(x) < I(y)
labels for 60 kinds of objects for WorldView-3 optical images N
and let denote the concatenation operation, then a MCT
[1]. In the case of ship objects, such horizontal rectangular
feature is defined as .
label format is not suitable in consideration of long shapes
and clustered situations. DOTA dataset, on the other hand, O
¯
Γ(x) = ζ I(x), I(y) (1)
consists of rotated rectangle labels for 16 kinds of objects for
y∈N (x)
various GSD images, which are suitable for representing ship
position [2]. In this paper, we adopt the rotated rectangle for- An important property of the feature is its robustness to
mat. monotonic gray-scale changes caused, for example, by illu-
In the field of satellite image analysis, deep-learning- mination variations. Another important property is its com-
based detection technology, which is the mainstream in com- putational simplicity, which makes it the feature of the first
puter vision, is being introduced. It is easy to apply as an cascade classifier for early rejection of candidates.
Authorized licensed use limited to: ANNA UNIVERSITY. Downloaded on August 28,2024 at 08:33:23 UTC from IEEE Xplore. Restrictions apply.
x
(x2, y2) 2.2.1. Preparation of training samples
(x1, y1)
The size of the detection template is defined as 50×18 pixels,
h2 θ y and it includes a margin of 5.5 pixels around. So the minimum
detectable breadth of the ship is 7 pixels and is actually 3.85
h1
w2 m. Although the input image is color, only brightness is used
w1 to classify the ship regardless of the color change according
(a) (b) to the atmospheric state. Some examples of training samples
are shown in Fig. 2.
Fig. 1. (a) Two LAB features and (b) an RE feature.
969
Authorized licensed use limited to: ANNA UNIVERSITY. Downloaded on August 28,2024 at 08:33:23 UTC from IEEE Xplore. Restrictions apply.
(a) MCT (b) LAB (c) RE (d) RF
Table 1. Boosting process with binary features
• Given (F1 , c1 ), ..., (F2 , cm ) where ci = 0 for ships
and ci = 1 for backgrounds. Fi is a feature vector of
i-th sample.
i
• A feature value of i-th sample is specified as Fpq ,
stage
where p is a feature type and q is a feature index.
1 1
• Initialize D1 (i) = 2l , 2n where l and n are the number
of ships and backgrounds.
• For t = 1, ..., T :
- Select a feature type pt randomly from four feature
types and prepare a corresponding feature pool.
- Generate weak classifier wt with a feature Fpit qt , error
Fig. 3. Visualization of selected weak classifiers. (a) Posi-
t and distribution Dt .
1 1−t
tions of MCT features are depicted as red dots. (b) Areas of
- Choose αt = 2 ln t LAB features are superimposed by weights. (c) (d) Positions
of referenced pixels by each feature, RE and RF, are superim-
- Update the distribution
( posed by weights.
Dt (i) e−αt if wt (Fpit qt ) = ci
Dt+1 = Zt × αt
e if wt (Fpit qt ) 6= ci
where Zt is chosen so that Dt+1 is a distribution. classifier mainly observes the lateral edges of the ship. The
- Save the selected feature index qt to a set Qpt RE-based classifier mainly observes the ship’s center and
head, while the RF-based classifier observes the entire inter-
• The resulting elementary classifier of a single feature nal area. Table 2 shows the distribution of weights by the
can be obtained by a combination of the corresponding feature type. The highest proportion of RF can be interpreted
weak classifiers. as difficult to find effective weak classifiers at consistent
positions. The lowest proportion of LAB, which captures
T
X the dominant edges of the ship, shows that the boundaries
hpq (Fpq ) = αt wt (Fpq )I(pt = p)I(qt = q) between ships and backgrounds are very subtle.
t=1
where I(·) is the indicator function that takes 1 if the Table 2. Distribution of weights by feature type.
argument is true and 0 otherwise. Feature type MCT LAB RE RF
• The final strong classifier is the sum of all selected el- Sum of α 501.5 324.0 580.6 715.1
ementary classifiers. Ratio(%) 23.6 15.3 27.4 33.7
X4 X
H(F) = hpq (Fpq )
p=1 q∈Qp
970
Authorized licensed use limited to: ANNA UNIVERSITY. Downloaded on August 28,2024 at 08:33:23 UTC from IEEE Xplore. Restrictions apply.
ages acquired from high resolution optical satellites such as
Table 3. Detection performance on KOMPSAT-3A test set
KOMPSAT-3A. In addition, we introduced the method to ex-
Elementary
Recall Precision F2 score clude confusing samples to reduce the class overlap problem.
classifiers
To improve the performance of the proposed algorithm,
512 78.7 78.2 78.6 we can extend classifiers with other aspect ratios and add fea-
640 72.9 85.8 75.1 tures such as HOG (Histogram of Oriented Gradients), which
768 70.5 88.2 73.4 more efficiently represents subtle changes in ship’s appear-
896 67.7 90.9 71.3 ance. This is the subject of ongoing research.
90.0 5. REFERENCES
(a) [5] B. Froba and A. Ernst, “Face detection with the modified
census transform,” in Proc. of Sixth IEEE International
Conference on Automatic Face and Gesture Recognition,
May 2004, pp. 91–96.
[6] S. Yan, S. Shan, X. Chen, and W. Gao, “Locally assem-
bled binary (lab) feature with feature-centric cascade for
fast and accurate face detection,” in Proc. of CVPR, June
(b) 2008, pp. 1–7.
[7] M. Ozuysal, M. Calonder, V. Lepetit, and P. Fua, “Fast
keypoint recognition using random ferns,” IEEE Trans.
on Pattern Analysis and Machine Intelligence, vol. 32,
no. 3, pp. 448–461, March 2010.
[8] A. Vezhnevets and O. Barinova, “Avoiding boosting
overfitting by removing confusing samples,” in Proc. of
(c) ECML, 2007, pp. 430–441.
4. CONCLUSION
971
Authorized licensed use limited to: ANNA UNIVERSITY. Downloaded on August 28,2024 at 08:33:23 UTC from IEEE Xplore. Restrictions apply.