Why Deeplab v3
Why Deeplab v3
1 - Information Loss:
With Max pooling, there is a significant reduction in the resolution of the image, leading to a
loss of information
Atrous Convolution:
Atrous convolution, also known as dilated convolution, is a variant of traditional convolution that allows
for the expansion of the receptive field without increasing the number of parameters. It is a
fundamental operation used in many advanced convolutional neural network (CNN) architectures,
including DeepLab.
ASPP uses in DeepLab, to capture multi-scale contextual information and improve the accuracy
of segmentation results. It consists of parallel atrous convolutions with different dilation rates
and a global pooling branch.
Here's how ASPP works:
1. Parallel Atrous Convolutions: ASPP includes multiple atrous convolutional layers with
different dilation rates.
2. Global Pooling Branch: ASPP also includes a global pooling branch that performs
pooling operations, such as average pooling or max pooling, on the feature maps. This
branch aggregates information across the entire spatial extent of the feature maps,
capturing global context information. The pooled feature representation is then used to
capture high-level contextual information.
3. Fusion: The outputs from the parallel atrous convolutional layers and the global pooling
branch are combined through concatenation or aggregation operations. This fusion step
brings together features captured at different scales and with different contextual
information.