Color Exploitation in Hog-Based Traffic Sign Detection
Color Exploitation in Hog-Based Traffic Sign Detection
[ [ [ [ [ [
(c) Triangular signs
2670
Authorized licensed use limited to: UNIVERSITE DE SFAX. Downloaded on November 13,2024 at 10:22:17 UTC from IEEE Xplore. Restrictions apply.
Fig. 3. Example image showing several correctly detected traffic signs, indicated by cyan rectangles and one undetected sign indicated by a
red rectangle
subset of the images to obtain a training set, and use the other images times vary between 30 seconds and a few minutes. Note that in a
(approx. 3,000) for testing. single pass of the specific algorithm, all traffic sign classes are de-
We evaluate the detection of three different classes of traffic tected simultaneously, whereas the generic detector locates only a
signs: blue-circular, red-circular and triangular signs, and we train single class of signs.
the system using 170, 74 and 53 samples, respectively. Each class We have applied both the specific algorithm and the HOG de-
contains intra-class variation due to the various signs in the class and tectors to the dataset (see Figure 1) and the results are shown in Fig-
due to the different imaging conditions, as shown in Figure 1. The ure 4. The AUC scores are summarized in Table 1. Figure 2 shows
distribution of the different types of signs in the training sets is rep- an example output image of the CIELab detector. In general, we ob-
resentative for the total dataset. The resolution of the images in our serve that the HOG detector outperforms the dedicated algorithm in
dataset is 4800 × 2400 pixels. most cases. We have found that the choice for a color space has a
Using the generic HOG detection algorithm, we train a differ- significant impact on the detection performance. For blue circular
ent detector for each class from the positive object samples and a traffic signs, the performance in the CIELab color space is superior
common set of negative samples in the form of images containing to other color spaces. For red circular traffic signs, the CIELab and
no traffic signs. Additionally, for each class, the positive samples YCbCr color spaces give similar performance, while for red triangu-
of the other classes are added as negative examples. The positive lar signs the performance in the YCbCr space is the highest. Detec-
samples are traffic signs having a resolution of 24 × 24 pixels in a tion in the RGB and HSV color spaces is suboptimal in these exper-
48 × 48 pixel region. Dalal and Triggs [3] have found that the use of iments. It is interesting to note that performance in the H-channel is
contextual information is beneficial. For training the SVM, the pro- almost identical to the performance in the HSV-space. This indicates
posed iterative approach is used with an initial set of 200 randomly that saturation and intensity information is largely irrelevant for the
selected background samples. considered traffic sign detection application.
We consider different versions of the HOG algorithm. Whereas
Dalal and Triggs propose to use the gradient in the color channel Name Red circ. AUC Blue circ. AUC Triangular AUC
with maximum gradient magnitude, traditional HOG only uses a sin- Dedicated 41.6% 56.2% 45.5%
gle color channel. The green channel of the RGB color space is often H(HSV) 32.0% 70.3% 50.0%
employed for traffic sign detection, but this causes many misdetec- HSV 32.0% 70.4% 50.0%
tions between red and blue signs. We have found that the H-channel CIELab 56.0% 85.0% 65.7%
of HSV color space gives better results. In our experiments we will
RGB 46.4% 56.9% 52.8%
use the H-channel detector as a single channel detector. Results for
YCbCr 55.7% 69.2% 74.6%
the G-channel detector are omitted because the performance is sig-
nificantly less. To incorporate more color information, we concate-
nate HOG descriptors for each color channel. We compare results Table 1. Detection performance of the Dedicated algorithm and the
for the following color spaces: RGB, HSV, CIELab and YCbCr. HOG detector in several color spaces, for three different classes of
In our experiments, we have used the following settings for our traffic signs. The highest scores are indicated in bold.
HOG detector: cell size 4 × 4 pixels, 9 orientation bins and 4 block
normalizations (b = 2). For each color channel, the dimensionality
of the feature vector is 2, 304. Applying the single-channel HOG
detector on a 4800 × 700 image takes about 23 seconds using a sin- 4. CONCLUSIONS
gle CPU-core at 2.7 GHz. Each image is downscaled in 35 steps
using a scaling factor of 1.05. This leads to the detection of traffic We have evaluated two different algorithms for traffic sign detection
signs ranging from 24 × 24 pixels to 132 × 132 pixels. Because of on a large-scale dataset. A dedicated algorithm uses a processing
the preprocessing steps in the specific algorithm, the execution time chain of three stages to detect traffic signs, which has been manu-
varies significantly over the total set of images. Typical execution ally tuned. We compare this to the generic Histogram of Oriented
2671
Authorized licensed use limited to: UNIVERSITE DE SFAX. Downloaded on November 13,2024 at 10:22:17 UTC from IEEE Xplore. Restrictions apply.
Gradients (HOG) algorithm, that automatically learns its detector
from a set of training images. In addition to the standard HOG al-
gorithm, we propose an extension that simultaneously uses infor-
mation of multiple color channels and show that it outperforms the
&!%! !"#$%
single-channel algorithm. Furthermore, we have employed an iter-
ative technique for SVM training which is novel in this context, to
deal with the large variation in background appearance. This signifi-
cantly lowers memory consumption and therefore allows the utiliza-
tion of more background images in the training process.
Experimental results show that for the considered task, the
generic HOG algorithm significantly outperforms the dedicated al-
' (
gorithm in most cases by a range of 10–30%. The choice of the
color space has a profound effect on the performance. We have
found that the CIELab and YCbCr spaces provide the best per-
formance, probably due to the availability of two dedicated color
!"#$%
channels fitting to the traffic signs. The HSV and RGB spaces are
&!%!
less suitable for traffic sign detection. Furthermore, we have shown
' (
that performance of the single channel H-detector is nearly identical
to the performance of the HSV-detector. This indicates that satura-
(a) Circular blue traffic signs.
tion and intensity information is largely irrelevant for the considered
traffic sign detection application and thus that color is the dominant
feature. !"#$%
&!%!
' (
5. REFERENCES
!"#$%
[1] H. Fleyeh and M. Dougherty, “Road and traffic sign detection
and recognition,” in Proc. 16th Mini EURO Conf. and 10th
Meeting of EWGT, September 2005, pp. 644–653.
[2] Paul Viola and M. Jones, “Rapid object detection using a
2672
Authorized licensed use limited to: UNIVERSITE DE SFAX. Downloaded on November 13,2024 at 10:22:17 UTC from IEEE Xplore. Restrictions apply.