0% found this document useful (0 votes)
66 views2 pages

Deep Learning Computer Vision Notes

The document provides an overview of Deep Learning and its application in Computer Vision, highlighting the use of artificial neural networks for processing unstructured data. It covers the basics of neural networks, including layers and activation functions, and delves into Convolutional Neural Networks (CNNs) used for image-related tasks. Additionally, it discusses various applications, tools, challenges, and techniques in the field of Computer Vision.

Uploaded by

MarieFernandes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
66 views2 pages

Deep Learning Computer Vision Notes

The document provides an overview of Deep Learning and its application in Computer Vision, highlighting the use of artificial neural networks for processing unstructured data. It covers the basics of neural networks, including layers and activation functions, and delves into Convolutional Neural Networks (CNNs) used for image-related tasks. Additionally, it discusses various applications, tools, challenges, and techniques in the field of Computer Vision.

Uploaded by

MarieFernandes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Notes: Deep Learning and Computer

Vision
1. Introduction to Deep Learning

Deep Learning is a subfield of machine learning that uses algorithms inspired by the
structure and function of the brain called artificial neural networks.
It is particularly effective for tasks involving large amounts of unstructured data such as
images, audio, and text.

2. Neural Networks Basics

- Neuron: Basic unit that takes input, processes it using an activation function, and gives
output.
- Layers: Input, hidden, and output layers.
- Activation Functions: Sigmoid, ReLU, Tanh, Softmax.
- Forward Propagation and Backpropagation.

3. Convolutional Neural Networks (CNNs)

CNNs are specialized neural networks for processing image data.


- Convolution Layer: Applies filters to extract features.
- Pooling Layer: Reduces spatial size (Max Pooling, Average Pooling).
- Fully Connected Layer: Final decision-making layer.
- Used for: Image classification, object detection, face recognition.

4. Introduction to Computer Vision

Computer Vision is the field of study that enables machines to interpret and make decisions
based on visual data.
It includes techniques for acquiring, processing, analyzing, and understanding images and
videos.
5. Applications of Computer Vision

- Image Classification
- Object Detection (YOLO, SSD, Faster R-CNN)
- Face Recognition
- Image Segmentation
- Optical Character Recognition (OCR)

6. Tools and Libraries

- TensorFlow and Keras


- PyTorch
- OpenCV
- FastAI
- Scikit-image

7. Challenges

- Requirement of large datasets


- High computational power
- Overfitting and underfitting
- Data annotation and labeling

You might also like