Research on Damage Defect Detection Based on Computer Vision
Fig. 1 High Level Diagram of Damage Defect Detection Computer Vision Model
II. MODELS AND ALGORITHMS FOR DAMAGE DEFECT DETECTION BASED ON COMPUTER VISION

In any industry, the biggest challenge is to transform a business problem into a machine learning (ML) and AI problem statement. There are many real-world examples where automation and ML can be applied. Framing the ML problem is the most creative step in solving it, and the most approachable way into it. In this paper, our main focus is to distinguish defect from no-defect cases, improving the customer experience by identifying damaged packages in the first place and not delivering damaged products. This is an image classification problem in which a binary classifier predicts the defect and no-defect categories with a probability or confidence score. Image processing and image classification are core fields of computer vision and deep learning, and they offer an opportunity for rapid development in deep learning and machine learning. Advanced image processing algorithms help to extract useful features from the images and categorize them into pre-defined classes.

ImageNet [3] is a large-scale dataset of images organized by category; it lets researchers use pre-trained models directly and start their work with a small dataset. ImageNet contains 14 million images in 20,000+ categories. Image classification [4] involves categorizing or labelling images or specific areas within them. This process offers a comprehensive understanding of the image's content and can be carried out using either traditional methods or deep learning techniques. Popular deep learning CNN architectures include AlexNet, ResNet20, VGGNet16, and ResNet50. This paper presents experiments based on transfer learning with pre-trained ResNet50 models, since ResNet50 has many advantages over other deep neural network architectures in terms of training time, learning pattern, etc. Transfer learning is a reliable approach for experimenting with a small dataset of images: transfer learning models are already trained on a large dataset and can be adapted to give the desired output by tuning hyperparameters and making architectural changes.
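The binary defect / no-defect decision with a confidence score can be illustrated with a minimal sketch. It assumes the classifier's final layer emits a single raw score (logit) for the defect class, which the paper does not specify; the function names `sigmoid`, `classify`, and `is_high_confidence`, the 0.5 decision point, and the 0.95 cut-off are all illustrative choices rather than the authors' implementation.

```python
import math
from typing import Tuple


def sigmoid(logit: float) -> float:
    """Map a raw classifier score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-logit))


def classify(logit: float) -> Tuple[str, float]:
    """Return (label, confidence) for a binary defect / no-defect classifier.

    `logit` stands in for the output of a hypothetical fine-tuned network;
    confidence is the probability assigned to the predicted label.
    """
    p_defect = sigmoid(logit)
    if p_defect >= 0.5:
        return "defect", p_defect
    return "no-defect", 1.0 - p_defect


def is_high_confidence(confidence: float, min_confidence: float = 0.95) -> bool:
    """Flag predictions whose confidence reaches the chosen cut-off (assumed 0.95)."""
    return confidence >= min_confidence


if __name__ == "__main__":
    for logit in (-3.0, 0.2, 4.0):
        label, confidence = classify(logit)
        flag = "high" if is_high_confidence(confidence) else "low"
        print(f"logit={logit:+.1f} -> {label} (confidence {confidence:.3f}, {flag})")
```

In practice the logit would come from the last layer of the fine-tuned network, and the cut-off would be chosen on a validation set rather than fixed in advance.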
III. IMAGE AUGMENTATION

Training a model on large datasets presents a significant practical challenge in deep learning. Deep learning and computer vision tasks have achieved remarkable results thanks to the use of convolutional neural networks (CNNs). However, a diverse training dataset has become a necessity for computer vision models. Image augmentation [5] is a process that diversifies the data based on the existing dataset. Computer vision models rely on a large amount of data for training, but for certain types of images there aren't enough examples to provide a robust representation of the data. For example, if we are training a model to identify images of defects in products, there are many images of Empty and No Defect cases but relatively few images of the different types of defects. Let's say the split is 95% No Defects and 5% Defects. This means that the training process may learn a poor representation of the data, resulting in models that are highly specific and less robust. In our case, the models will be highly accurate for No Defects but less accurate for all the different types of defects.

In this use case, image augmentation is a well-suited approach for handling the imbalanced dataset. Image augmentation involves creating additional examples from the original dataset with slight alterations, such as flipping images vertically or creating mirrored images. This increases the number of images in the underrepresented classes and yields a more balanced dataset. In this experiment, images are augmented by rotating them to different angles, along with cropping and blurring. With image augmentation, specificity increases by 12% and sensitivity by 2% from AlexNet to ResNet50. The model is able to identify 82% (441/536) of missort defects and 61% (464/750) of the overall defects with high (95%) confidence and a 91% precision rate.
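The idea of rebalancing a skewed dataset through augmentation can be sketched as follows. This is an illustration under stated assumptions, not the pipeline used in the experiments: images are represented as nested lists of grayscale pixel values, only vertical flips, mirroring, and 90-degree rotation are shown (cropping and blurring are omitted because they would need an imaging library), and the helper `balance_by_augmentation` and its oversample-to-the-largest-class policy are hypothetical.

```python
import random
from typing import Dict, List

Image = List[List[int]]  # grayscale image as rows of pixel values


def flip_vertical(img: Image) -> Image:
    """Flip top-to-bottom by reversing the row order."""
    return [row[:] for row in reversed(img)]


def mirror(img: Image) -> Image:
    """Mirror left-to-right by reversing each row."""
    return [row[::-1] for row in img]


def rotate_90(img: Image) -> Image:
    """Rotate 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


AUGMENTATIONS = [flip_vertical, mirror, rotate_90]


def balance_by_augmentation(
    dataset: Dict[str, List[Image]], seed: int = 0
) -> Dict[str, List[Image]]:
    """Grow every class to the size of the largest one with augmented copies.

    Assumes each class holds at least one image. Originals are kept first;
    augmented copies of randomly chosen originals are appended after them.
    """
    rng = random.Random(seed)
    target = max(len(images) for images in dataset.values())
    balanced: Dict[str, List[Image]] = {}
    for label, images in dataset.items():
        augmented = list(images)
        while len(augmented) < target:
            source = rng.choice(images)
            transform = rng.choice(AUGMENTATIONS)
            augmented.append(transform(source))
        balanced[label] = augmented
    return balanced


if __name__ == "__main__":
    # Toy 95% / 5% split, mirroring the example split discussed in the text.
    toy = {
        "no_defect": [[[0, 0], [0, 0]] for _ in range(95)],
        "defect": [[[1, 2], [3, 4]] for _ in range(5)],
    }
    balanced = balance_by_augmentation(toy)
    print({label: len(images) for label, images in balanced.items()})
```

Augmentation of this kind should be applied to the training split only, so that validation and test images remain unaltered originals.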
IV. CONCLUSION
REFERENCES