Deep Learning Computer Vision Notes
Deep Learning Computer Vision Notes
Vision
1. Introduction to Deep Learning
Deep Learning is a subfield of machine learning that uses algorithms inspired by the
structure and function of the brain called artificial neural networks.
It is particularly effective for tasks involving large amounts of unstructured data such as
images, audio, and text.
- Neuron: Basic unit that takes input, processes it using an activation function, and gives
output.
- Layers: Input, hidden, and output layers.
- Activation Functions: Sigmoid, ReLU, Tanh, Softmax.
- Forward Propagation and Backpropagation.
Computer Vision is the field of study that enables machines to interpret and make decisions
based on visual data.
It includes techniques for acquiring, processing, analyzing, and understanding images and
videos.
5. Applications of Computer Vision
- Image Classification
- Object Detection (YOLO, SSD, Faster R-CNN)
- Face Recognition
- Image Segmentation
- Optical Character Recognition (OCR)
7. Challenges