L10-DL Intro

The document provides an overview of deep learning, including the universal approximation theorem and the evolution of neural networks from single-layer perceptrons to deep architectures. It discusses the capabilities of convolutional neural networks (CNNs) inspired by the visual cortex, highlighting their advantages in processing high-dimensional data. Additionally, it covers the applications of deep learning in computer vision, such as classification, detection, segmentation, and style transfer.


Introduction to Deep Learning

Professor Qiang Yang


Universal approximation theorem: neural networks with at
least one hidden layer of sufficiently many
sigmoid/tanh/Gaussian units can approximate any continuous
function arbitrarily closely.
• Although a two-layer network has universal approximation
capabilities, it may require exponentially many hidden
neurons.
• For many years the two-layer network was the most widely
used architecture, because it proved difficult to train
networks with more than two layers effectively.
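As a minimal illustration (not from the slides), the sketch below fits a single-hidden-layer tanh network to sin(x). The hidden weights are fixed at random values and only the linear output layer is fitted by least squares; all sizes and names are illustrative, yet even this crude scheme approximates the target closely:

```python
import numpy as np

rng = np.random.default_rng(0)

# Target function to approximate on [-pi, pi]
x = np.linspace(-np.pi, np.pi, 200)[:, None]
y = np.sin(x).ravel()

# One hidden layer of tanh units with random, fixed input weights;
# only the linear output layer is fitted (by least squares).
n_hidden = 50
W = rng.normal(scale=2.0, size=(1, n_hidden))
b = rng.normal(scale=2.0, size=n_hidden)
H = np.tanh(x @ W + b)                      # hidden activations, shape (200, n_hidden)
w_out, *_ = np.linalg.lstsq(H, y, rcond=None)

max_err = np.max(np.abs(H @ w_out - y))
print(f"max |error| with {n_hidden} tanh units: {max_err:.5f}")
```

Increasing `n_hidden` shrinks the error further, while narrow networks illustrate the theorem's caveat: width, not just existence, matters.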
• McCulloch-Pitts neuron (1943)
• Perceptron (Rosenblatt, 1962)
• Minsky and Papert (1969): showed the limited capabilities of single-layer
networks https://leon.bottou.org/publications/pdf/perceptrons-2017.pdf
• Backpropagation (1980s): in practice only the weights in the final two
layers learned useful values; earlier stages still relied on hand-crafted features.
• LeNet, LeCun (1998): no ReLU, no softmax, no Adam
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=726791
• Deep learning: LeCun, Bengio, Hinton (2015)
https://www.cs.toronto.edu/~hinton/absps/NatureDeepReview.pdf
Artificial General Intelligence (AGI)
The field of Artificial Intelligence seeks to recreate the powerful
capabilities of the brain in machines.

Many of the AI systems in current use fall short of the tremendous breadth of
capabilities of the human brain.
An artificial general intelligence (AGI) is a hypothetical type of
intelligent agent which, if realized, could learn to accomplish any
intellectual task that human beings or animals can perform (see
Wikipedia).

Generative AI: Deep learning models that generate outputs in the form of
images, video, audio, text, and candidate drug molecules.

Large language models (LLMs) such as GPT-4 have been described as early,
incomplete forms of AGI (Bubeck et al., 2023).

https://arxiv.org/abs/2303.12712
Predicting the 3D shape of a protein using AlphaFold (Jumper et al., 2021)
https://www.nature.com/articles/s41586-021-03819-2
https://generated.photos/
Figure: number of compute cycles needed to train SOTA neural networks (1 petaflop = 10^15 floating-point operations)
Rectified Linear Unit (ReLU)
• ReLU units compute a linear weighted sum of their inputs.
• The output is a non-linear function of the total input.
• This is the most widely used activation function.

Written as: f(x) = max{0, x}

A smooth approximation of the ReLU is the "softplus" function:

f(x) = ln(1 + e^x)
https://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf
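The two activations above can be written in a few lines of NumPy (the helper names `relu` and `softplus` are illustrative, not from the slides):

```python
import numpy as np

def relu(x):
    # f(x) = max{0, x}, applied element-wise
    return np.maximum(0.0, x)

def softplus(x):
    # Smooth approximation of ReLU: f(x) = ln(1 + e^x)
    return np.log1p(np.exp(x))

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))      # negative inputs are clamped to 0
print(softplus(x))  # close to relu(x) for large |x|, smooth near 0
```

Note that softplus(0) = ln 2 ≈ 0.693 while relu(0) = 0; the two functions agree increasingly well as |x| grows.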
Softmax function (normalized exponential function)

For an input of [1, 2, 3, 4, 1, 2, 3], the softmax is

[0.024, 0.064, 0.175, 0.475, 0.024, 0.064, 0.175].

The softmax function highlights the largest values and
suppresses the others.

Unlike the "max" function, softmax is differentiable.

Convolutional Neural Networks (CNNs)
Visual Cortex Inspired CNN Model
Hubel and Wiesel received the 1981 Nobel Prize in Physiology or Medicine.

The classic experiment showed how the visual cortex processes information
in a hierarchical way, extracting increasingly complex information. They
showed that there is a topographical map in the visual cortex that
represents the visual field, where nearby cells process information from
nearby visual fields.

They identified two types of neuron cells: simple cells whose output is
maximized by straight edges having particular orientations within their
receptive field, and complex cells which have larger receptive fields and
combine the outputs of the simple cells. They also discovered that
neighbouring cells have similar and overlapping receptive fields.

This gave rise to the concept of sparse interactions in CNNs, where the
network focuses on local information rather than taking in the complete
global information.
Advantages of CNNs
1. They have sparse connections instead of full
connectivity, which reduces the number of parameters
and makes CNNs efficient for processing
high-dimensional data.
2. Weight sharing takes place: the same filter weights
are shared across the entire image, reducing
memory requirements and giving translation
equivariance.
3. CNNs use the important concept of subsampling,
or pooling, in which the most prominent pixels are
propagated to the next layer and the rest are dropped.
This provides a fixed-size output matrix, which is
typically required for classification, and approximate
invariance to small translations and rotations.
Introduction
• Traditional pattern recognition models use hand-crafted
features and a relatively simple trainable classifier:

hand-crafted feature extractor → "simple" trainable classifier → output

• This approach has the following limitations:

• It is very tedious and costly to develop hand-crafted
features.
• The hand-crafted features are usually highly dependent
on one application and cannot be transferred easily to
other applications.
Deep Learning
• Deep learning seeks to learn rich hierarchical
representations (i.e. features) automatically through a
multi-stage feature learning process:

low-level features → mid-level features → high-level features → trainable classifier → output

Feature visualization of a convolutional net trained on ImageNet
(Zeiler and Fergus, 2013)
Learning Hierarchical Representations

low-level features → mid-level features → high-level features → trainable classifier → output

Increasing level of abstraction

• Hierarchy of representations with increasing levels of
abstraction; each stage is a kind of trainable nonlinear
feature transform.
• Pixel → edge → texton → motif → part → object
Forward problem: predicting an output based on known input
variables.
Inverse problem: inferring hidden or unobserved variables
from observed data, i.e. finding the underlying causes or
parameters that generated the data, often by using a known
forward model that describes how the data was produced.
Inverse problems are often "ill-posed": they may not have a
unique solution, may be highly sensitive to noise in the
data, or may lack stability, making them challenging to
solve directly with traditional methods.
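A toy illustration of ill-posedness (the numbers are entirely illustrative): when the forward model A is nearly singular, tiny noise in the observations is amplified into a large error in the naively recovered parameters:

```python
import numpy as np

# Forward model y = A x with a nearly singular (ill-conditioned) A
A = np.array([[1.0, 1.0],
              [1.0, 1.0001]])
x_true = np.array([1.0, 1.0])
y = A @ x_true

# Tiny perturbation in the observed data...
y_noisy = y + np.array([0.0, 1e-3])

# ...produces a huge error in the naive inverse solution
x_hat = np.linalg.solve(A, y_noisy)
print(np.linalg.cond(A))   # condition number, roughly 4e4
print(x_hat)               # far from x_true despite 1e-3 noise
```

Here a perturbation of 0.001 in one measurement moves the solution from (1, 1) to roughly (-9, 11), which is why inverse problems typically need regularization or learned priors rather than direct inversion.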
The preference for one choice over others is called
inductive bias or prior knowledge: the set of assumptions
that the learner uses to predict outputs for inputs it has
not encountered.
Examples of inverse problems

1. Determining the internal defects of a rotating system
from sensor measurements at the surface.
2. Filling in missing parts of an image based on the
surrounding pixels.

https://av.tib.eu/media/21899

"Deep Convolutional Neural Network for Inverse
Problems in Imaging"
https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=7949028
Computer Vision (the automatic analysis and interpretation of image data)
Applications of ML in computer vision:
1. Classification (image recognition).
2. Detection: detecting objects in an image and their locations within the
image.
3. Segmentation, in which each pixel is classified individually,
thereby dividing the image into regions sharing a common label.

An image and its corresponding semantic segmentation, in which each pixel
is coloured according to its class.

4. Caption generation, in which a textual description is generated
automatically from an image.
5. Inpainting, in which a region of an image is replaced with synthesized
pixels that are consistent with the rest of the image.

On the left is the original image; in the middle, an image with sections
removed; on the right, the image with inpainting.

6. Style transfer, in which an input image in one style is
transformed into a corresponding image in a different
style.
7. Super-resolution, in which the resolution of the image is
improved.
8. Scene reconstruction, in which one or more two-dimensional
images of a scene are used to reconstruct a 3-D
representation.
