AI Slide 2

The document provides an overview of deep learning concepts, including supervised and unsupervised learning, neural networks, convolutional networks, and reinforcement learning. It highlights the use of TensorFlow for building and training models, particularly focusing on applications of Generative Adversarial Networks (GANs) and their capabilities in generating realistic images and data augmentation. Additionally, it discusses various techniques such as backpropagation, data preprocessing, and the architecture of convolutional neural networks.

Uploaded by

curvelearning52

Introduction

• Basics of Tensorflow
• Machine Learning: analytic solution vs. gradient descent
Supervised Learning (image recognition)
• Neural networks reminder
• Convolution Networks
• Going deeper with ConvNets
Unsupervised Learning
• Autoencoder
• Generative Adversarial Network
Sequence modelling
• RNN, LSTM
• Word2vec
Reinforcement Learning
• Deep Q-learning
• Frozen Lake
DIGITAL VISUALIZATION OF IMAGE: MNIST DATASET

• Handwritten digits
• 60,000 training images and 10,000 test images
• 28x28 grayscale images
• Each image is a 28x28 matrix with values between 0 and 255
• Data preprocessing = rescaling to [0,1]
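The rescaling step in the last bullet can be sketched as follows. This is a minimal illustration in plain Python; the function name `rescale_image` and the small `patch` array are hypothetical stand-ins for a real 28x28 MNIST digit.

```python
def rescale_image(image):
    """Map integer pixel values in [0, 255] to floats in [0, 1]."""
    return [[pixel / 255.0 for pixel in row] for row in image]


# Hypothetical 2x3 patch standing in for a 28x28 MNIST image.
patch = [[0, 128, 255],
         [64, 32, 16]]
scaled = rescale_image(patch)
```

Dividing by 255 keeps the relative intensities unchanged while putting every input in the same small range, which usually makes gradient-based training better behaved.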
DIGITAL VISUALIZATION OF IMAGE: Inputs and Outputs

[Figure: a 256 x 256 matrix is fed to the DL model, which outputs a 4-element vector. A second diagram maps elements of a set X to elements of a set Y.]

With deep learning, we are searching for a surjective (or onto) function f from a set X to a set Y.
NEURAL NETWORKS
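Supervised Deep Learning with Neural Networks

[Figure: network with an input layer, hidden layers, and an output layer]

From one layer to the next

[Figure: inputs X1, X2, X3 with weights W1, W2, W3 feeding an output unit]

f is the activation function, Wi is the weight, and bi is the bias.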
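The slide's formula did not survive extraction, so the sketch below assumes the standard form y = f(sum_i Wi * Xi + b) for a single unit. The names `neuron` and `sigmoid`, and the choice of sigmoid as the default activation, are illustrative assumptions rather than the deck's own code.

```python
import math


def sigmoid(a):
    """Logistic activation, squashing any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-a))


def neuron(inputs, weights, bias, activation=sigmoid):
    """Weighted sum of the inputs plus a bias, passed through f."""
    pre_activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(pre_activation)


# Three inputs X1..X3 with weights W1..W3 (values chosen arbitrarily).
y = neuron([1.0, 2.0, 3.0], [0.5, -0.25, 0.0], bias=0.0)
```

Here the weighted sum is 0.5 - 0.5 + 0 = 0, and the sigmoid of 0 is 0.5. A layer is simply many such units sharing the same inputs.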
Activation Functions

Image Credit: towardsdatascience.com

BACKPROPAGATION

• Forward activation: predict the output
• Compute the loss
• Backward error: correct the parameters

[Figure: X → f_θ → ŷ, compared with y. The forward pass produces ŷ, the error is measured between ŷ and y, and the error is backpropagated over the network using the derivative of the function.]
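The three steps above can be sketched on a deliberately tiny network with one hidden unit. The architecture (a tanh unit followed by a linear output), the squared-error loss, and all names below are assumptions made for illustration only.

```python
import math


def forward(x, w1, w2):
    """Forward pass: hidden activation h, then prediction y_hat."""
    h = math.tanh(w1 * x)
    y_hat = w2 * h
    return h, y_hat


def loss_fn(y_hat, y):
    """Squared error between the prediction and the target."""
    return 0.5 * (y_hat - y) ** 2


def backward(x, y, w1, w2):
    """Backward pass: apply the chain rule from the loss back to each weight."""
    h, y_hat = forward(x, w1, w2)
    d_y_hat = y_hat - y                # dL/dy_hat
    d_w2 = d_y_hat * h                 # dL/dw2
    d_h = d_y_hat * w2                 # dL/dh
    d_w1 = d_h * (1.0 - h * h) * x     # tanh'(a) = 1 - tanh(a)^2
    return d_w1, d_w2


grads = backward(0.5, 1.0, 0.8, -0.3)
```

A common sanity check is to compare these analytic gradients with finite differences of `loss_fn`; the two should agree to several decimal places.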
Training - Minimizing the Loss

[Figure: network from input to output with parameters (W1, b1), (W2, b2), (W3, b3) and loss L]

The loss function L is defined with regard to the weights and biases.

The weight update is computed by moving a step in the opposite direction of the cost gradient.

Iterate until L stops decreasing.
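The slide's equations were lost in extraction, so the following is only a sketch under stated assumptions: a one-weight linear model, mean squared error as the loss L, and the usual update rule "parameter minus learning rate times gradient". The function names, the learning rate, and the stopping tolerance are all illustrative choices.

```python
def mse(w, b, xs, ys):
    """Mean squared error of the linear model w * x + b."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gradient_descent(xs, ys, lr=0.05, tol=1e-12, max_steps=10000):
    """Step against the gradient until the loss stops decreasing."""
    w, b = 0.0, 0.0
    n = len(xs)
    previous = mse(w, b, xs, ys)
    current = previous
    for _ in range(max_steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w               # move opposite to the gradient
        b -= lr * grad_b
        current = mse(w, b, xs, ys)
        if previous - current < tol:   # L no longer decreases
            break
        previous = current
    return w, b, current


# Toy data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
w, b, final_loss = gradient_descent(xs, ys)
```

On this toy data the procedure recovers a slope close to 2 and an intercept close to 1. A learning rate that is too large would make the loss oscillate or diverge instead.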

CONVOLUTIONAL NETWORKS
Convolution in 2D
Convolution Kernel
Max Pooling
Pooling - Max-Pooling and Sum-Pooling
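As a minimal sketch of the operations named on these slides, the code below slides a kernel over a small image and then pools non-overlapping windows. Like most deep learning libraries it does not flip the kernel (strictly, a cross-correlation). The helper names `conv2d` and `pool2d` and the example values are hypothetical.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[i + di][j + dj] * kernel[di][dj]
                 for di in range(kh) for dj in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def pool2d(image, size, reduce_fn):
    """Apply reduce_fn (e.g. max or sum) to non-overlapping size x size windows."""
    return [[reduce_fn([image[i + di][j + dj]
                        for di in range(size) for dj in range(size)])
             for j in range(0, len(image[0]) - size + 1, size)]
            for i in range(0, len(image) - size + 1, size)]


image = [[1, 2, 3, 0],
         [4, 5, 6, 1],
         [7, 8, 9, 2],
         [0, 1, 2, 3]]
kernel = [[1, 0],
          [0, -1]]

features = conv2d(image, kernel)       # 3x3 feature map
max_pooled = pool2d(image, 2, max)     # [[5, 6], [8, 9]]
sum_pooled = pool2d(image, 2, sum)     # [[12, 10], [16, 16]]
```

Max-pooling keeps the strongest response in each window, while sum-pooling accumulates all of them; both shrink a 4x4 input to 2x2 here.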
Convolutional Neural Networks
A convolutional neural network (CNN, or ConvNet) is a class of deep, feed-forward
artificial neural networks that explicitly assumes that the inputs are images, which allows
us to encode certain properties into the architecture.

(Image Credit: https://becominghuman.ai)

Deep Learning for Facial Recognition

(Image Credit: www.edureka.co)

MNIST - CNN Visualization
CONVNET

« Convolutional neural networks »

• Created by Yann LeCun (90’s)

• Well known since 2000

• Big acceleration with GPUs

• Computer vision

• NLP

• Artificial Intelligence

• Convolution & Pooling

ConvNets are usually evaluated on ImageNet (about 1.2 million training images and 1000 classes in the ILSVRC benchmark)

CONVNETS

• AlexNet: 80.1%
• GoogLeNet (Inception): 93.4%
FEATURE MAPS

Layer 1: ~ Gabor filters

FILTERS
FINE-TUNING

[Figure: early layers marked FROZEN, later layers marked FINE-TUNED]

• Filters after the first convolutional layer are generic (Gabor filters)
• The deeper you go in the network, the more task-specific the filters become
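The frozen versus fine-tuned split can be sketched as an update step that skips frozen layers. This is a toy illustration with one scalar parameter per layer; the layer names, gradient values, and the `sgd_step` helper are hypothetical.

```python
def sgd_step(params, grads, trainable, lr=0.1):
    """Update only the layers marked trainable; frozen layers keep their values."""
    return {name: (value - lr * grads[name] if trainable[name] else value)
            for name, value in params.items()}


# Hypothetical pretrained parameters, one scalar per layer for brevity.
params = {"conv1": 1.0, "conv2": -0.5, "dense": 0.25}
grads = {"conv1": 0.2, "conv2": -0.4, "dense": 1.0}

# Generic early filters stay frozen; the task-specific last layer is fine-tuned.
trainable = {"conv1": False, "conv2": False, "dense": True}

updated = sgd_step(params, grads, trainable)
```

Freezing the generic early layers means far fewer parameters are updated, which is why fine-tuning needs less data and compute than training from scratch.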
Transfer Learning
Large compute overhead for power-limited edge computing applications!
CNN Implementation - Data Augmentation (DA)

DA helps to populate artificial training instances from the existing training data sets.
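A minimal sketch of how artificial instances can be derived from one existing image. The two transformations (horizontal flip and a one-pixel shift) are common choices picked here for illustration; the deck does not say which ones it uses, and the function names are hypothetical.

```python
def flip_horizontal(image):
    """Mirror each row of the image left to right."""
    return [list(reversed(row)) for row in image]


def shift_right(image, pixels, fill=0):
    """Shift the image to the right, padding the left edge with fill."""
    width = len(image[0])
    return [([fill] * pixels + list(row))[:width] for row in image]


def augment(image):
    """Return the original image plus two artificial variants."""
    return [image, flip_horizontal(image), shift_right(image, 1)]


original = [[1, 2, 3],
            [4, 5, 6]]
variants = augment(original)
```

Each variant keeps the label of the original image, so one labelled example becomes three. Transformations should be chosen so that the label really is preserved (flipping a digit such as 2, for instance, would not be).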
Object Detection

• Clean background, static objects
• Cluttered background, movable objects
• Cluttered background, static objects

YOLO
Object detection using regression

Instead of detecting possible regions of interest using a Region Proposal Network and then performing recognition on those regions separately, YOLO performs all of its predictions with the help of a single fully connected layer.
WHAT IS TENSORFLOW?
• A Python library
• pip install tensorflow
• Developed by Google
• Open source
• Library for numerical computation using data flow graphs
• Runs on CPU and GPU
• Used in research and industry
PRINCIPLE
« HELLO WORLD »

• Introduction exercises
• Difference between constant, variable, and placeholder
• Constant = a fixed variable
• With a placeholder, you need to feed data to your graph during your session
• TensorFlow workflow:
• Draw your graph
• Feed data
• … and optimize
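The difference between the three node types can be illustrated with a toy data flow graph in plain Python. This is not TensorFlow's API: the classes `Constant`, `Variable`, `Placeholder`, `Add`, `Mul` and the `run` function are hypothetical and only mimic the "draw the graph, then feed data" workflow.

```python
class Constant:
    """A value fixed when the graph is drawn."""
    def __init__(self, value):
        self.value = value

    def eval(self, feed):
        return self.value


class Variable(Constant):
    """A value stored in the graph that can be changed, e.g. by an optimizer."""
    def assign(self, value):
        self.value = value


class Placeholder:
    """An empty slot whose value must be fed when the graph is run."""
    def eval(self, feed):
        return feed[self]   # raises KeyError if no data was fed


class Add:
    def __init__(self, left, right):
        self.left, self.right = left, right

    def eval(self, feed):
        return self.left.eval(feed) + self.right.eval(feed)


class Mul(Add):
    def eval(self, feed):
        return self.left.eval(feed) * self.right.eval(feed)


def run(node, feed=None):
    """Evaluate a node of the graph, feeding data to the placeholders."""
    return node.eval(feed or {})


# Draw the graph: y = w * x + b
x = Placeholder()
w = Variable(2.0)
b = Constant(1.0)
y = Add(Mul(w, x), b)

first = run(y, {x: 3.0})    # feed data: 2.0 * 3.0 + 1.0
w.assign(0.5)               # an optimizer would update the variable
second = run(y, {x: 3.0})   # same graph, new variable value
```

Nothing is computed while the graph is being drawn; values only flow when `run` is called with data for every placeholder.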
AUTOENCODER
NEURAL NETWORK LEARNING

X → f → Y
Supervised learning
‣ the labels y are given

X → f → Z → g → X
Self-supervised learning
‣ y is no longer needed
AUTOENCODER

X → f (encoder) → Z (latent) → g (decoder) → X

• Learning a compact data representation
• Encode the input to a smaller latent space
• Decode from the latent space back to the input
• Predict the input from the input
• Loss function = mean squared error
• f and g are neural networks
• SGD as usual
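The bullets above can be sketched with the smallest possible autoencoder: a linear encoder f that compresses 2-D points to a 1-D latent z, and a linear decoder g that maps z back to 2-D. The data, the initial weights, and the use of full-batch gradient descent (rather than true SGD) are simplifying assumptions for illustration.

```python
def train_autoencoder(data, steps=500, lr=0.05):
    """Fit a linear encoder (2-D to 1-D) and decoder (1-D to 2-D) by gradient descent."""
    enc = [0.5, 0.1]   # encoder weights: z = enc . x
    dec = [0.3, 0.4]   # decoder weights: x_hat = dec * z
    n = len(data)
    history = []
    for _ in range(steps):
        g_enc, g_dec, loss = [0.0, 0.0], [0.0, 0.0], 0.0
        for x in data:
            z = enc[0] * x[0] + enc[1] * x[1]          # encode
            x_hat = [dec[0] * z, dec[1] * z]           # decode
            err = [x_hat[0] - x[0], x_hat[1] - x[1]]   # reconstruction error
            loss += (err[0] ** 2 + err[1] ** 2) / n
            d_z = 0.0
            for j in range(2):
                g_dec[j] += 2 * err[j] * z / n         # dL/d dec[j]
                d_z += 2 * err[j] * dec[j]             # dL/dz
            for k in range(2):
                g_enc[k] += d_z * x[k] / n             # dL/d enc[k]
        history.append(loss)
        enc = [enc[k] - lr * g_enc[k] for k in range(2)]
        dec = [dec[j] - lr * g_dec[j] for j in range(2)]
    return enc, dec, history


# Points on the line x2 = 2 * x1: 2-D data that is really 1-D.
data = [(t, 2.0 * t) for t in (-1.0, -0.5, 0.0, 0.5, 1.0)]
enc, dec, history = train_autoencoder(data)
```

No labels are involved: the target is the input itself. Because the points lie on a line, a single latent number is enough to reconstruct them, so the reconstruction loss in `history` falls as training proceeds.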
AUTOENCODER

Operation in encoder CNN
[Figure: three convolution cases: no padding and no stride; no padding with stride; padding with stride]

Operation in decoder CNN
[Figure: operation with padding and stride]

[Figure: reconstructions at Epoch 1 (top) vs Epoch 10 (bottom)]
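The effect of padding and stride on feature-map size follows the standard output-size formulas, sketched below. "No stride" is read here as stride 1, and the decoder operation is assumed to be a transposed convolution; neither is stated explicitly on the slides. The function names are hypothetical.

```python
def conv_output_size(n, kernel, stride=1, padding=0):
    """Output width of a convolution over an input of width n."""
    return (n + 2 * padding - kernel) // stride + 1


def transposed_conv_output_size(n, kernel, stride=1, padding=0):
    """Output width of a transposed convolution (no output padding)."""
    return (n - 1) * stride - 2 * padding + kernel


# Encoder side, starting from a 28-pixel-wide MNIST image.
no_pad_no_stride = conv_output_size(28, kernel=3)                    # 26
no_pad_stride = conv_output_size(28, kernel=3, stride=2)             # 13
pad_stride = conv_output_size(28, kernel=4, stride=2, padding=1)     # 14

# Decoder side: undo the last encoder step.
restored = transposed_conv_output_size(14, kernel=4, stride=2, padding=1)  # 28
```

With kernel 4, stride 2, and padding 1, the encoder halves the width and the matching transposed convolution doubles it back, which is why this combination is a frequent choice in convolutional autoencoders.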
GENERATIVE MODELS
GENERATIVE MOMENT MATCHING NETWORKS

[Figure sequence:
1. Autoencoder: X → f (encoder) → Z (latent) → g (decoder) → X', trained with an RMSE loss between X' and X.
2. A generated latent Z is predicted from noise drawn from N(0,1).
3. The generated latent is compared with the encoder latent Z = f(X) using MMD.
4. The decoder g maps the generated latent to a generated X.
5. Full pipeline: N(0,1) noise → generated latent Z̃ → g (decoder) → generated X̃, alongside X → f (encoder) → Z.]
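MMD stands for maximum mean discrepancy, a distance between two sets of samples. The sketch below computes the simple biased estimate of squared MMD for 1-D samples; the Gaussian kernel, its bandwidth, and the function names are assumptions, since the slides do not specify them.

```python
import math


def gaussian_kernel(a, b, bandwidth=1.0):
    """Similarity between two scalars, equal to 1 when they coincide."""
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))


def mmd_squared(xs, ys, bandwidth=1.0):
    """Biased estimate of squared MMD between two sets of 1-D samples."""
    def mean_kernel(us, vs):
        total = sum(gaussian_kernel(u, v, bandwidth) for u in us for v in vs)
        return total / (len(us) * len(vs))

    return mean_kernel(xs, xs) + mean_kernel(ys, ys) - 2.0 * mean_kernel(xs, ys)


encoder_latents = [-1.0, 0.0, 1.0]
close_latents = [-0.9, 0.1, 1.1]
far_latents = [4.0, 5.0, 6.0]

same = mmd_squared(encoder_latents, encoder_latents)
near = mmd_squared(encoder_latents, close_latents)
far = mmd_squared(encoder_latents, far_latents)
```

The value is zero for identical sample sets and grows as the two sets drift apart, so minimizing it pushes the generated latents toward the distribution of the encoder latents.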
Discriminator

▪ The discriminator is a convolutional neural network consisting of many hidden layers and one output layer. Its output is binary: 1 means the provided data is classified as real, and 0 means it is classified as fake.

▪ The discriminator is trained on real data, so it learns what actual data looks like and which features data must have to be classified as real.
Generator

▪ The generator is an inverse convolutional neural network: it does the opposite of what a CNN does. A CNN takes an actual image as input and is expected to output a class label, whereas the generator takes a random input (a vector of values) and is expected to output an actual image.

▪ In simple terms, it generates data from a piece of data using what it has learned so far.
GENERATIVE ADVERSARIAL NETWORKS
Intuition

[Figure: the generator produces fake money. The discriminator receives both fake money and real money and must decide for each one: fake or real?]
GENERATIVE ADVERSARIAL NETWORKS

• The discriminator is trained on actual data to classify whether given data is real or not. The generator starts generating data from a random input, and the discriminator analyzes that data and checks how close it is to being classified as real.

• If the generated data does not contain enough features to be classified as real by the discriminator, the generator weights are readjusted using backpropagation to create new data that is better than the previous attempt.

• This process keeps repeating as long as the discriminator keeps classifying the generated data as fake.

• Eventually, the generator becomes so accurate that it is hard to distinguish between real data and the data it generates.
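The training signal described above can be sketched with the usual binary cross-entropy losses: the discriminator is rewarded for outputting 1 on real data and 0 on generated data, and the generator is rewarded when the discriminator outputs 1 on generated data. The loss form (including the non-saturating generator loss) and all names are assumptions, since the slides give no formulas.

```python
import math


def binary_cross_entropy(prediction, label, eps=1e-12):
    """Penalty for a predicted probability against a 0/1 label."""
    p = min(max(prediction, eps), 1.0 - eps)   # avoid log(0)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def discriminator_loss(d_real, d_fake):
    """D should output 1 on real samples and 0 on generated ones."""
    real_term = sum(binary_cross_entropy(p, 1) for p in d_real) / len(d_real)
    fake_term = sum(binary_cross_entropy(p, 0) for p in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_loss(d_fake):
    """G wants D to output 1 (real) on generated samples."""
    return sum(binary_cross_entropy(p, 1) for p in d_fake) / len(d_fake)


# Hypothetical discriminator outputs early in training: fakes are easy to spot.
early_d = discriminator_loss(d_real=[0.9, 0.8], d_fake=[0.1, 0.2])
early_g = generator_loss(d_fake=[0.1, 0.2])

# Later in training: the discriminator can no longer tell them apart.
late_d = discriminator_loss(d_real=[0.5, 0.5], d_fake=[0.5, 0.5])
late_g = generator_loss(d_fake=[0.5, 0.5])
```

In a full training loop the two losses are minimized in alternation, each with respect to its own network's weights, while the other network is held fixed.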
GENERATIVE ADVERSARIAL NETWORKS

[Figure: noise Z → Generator G → fake image → Discriminator D → Y (real or not?). A real image is also fed to D, which outputs Y.]

• G and D are neural networks

• Find a G that minimizes the accuracy of the best D

• Alternate optimization of G and D

http://blog.aylien.com/introduction-generative-adversarial-networks-code-tensorflow/
GAN: APPLICATIONS
Generate Examples for Image Datasets
GANs can be used to generate new examples for image datasets in various domains, such as medical
imaging, satellite imagery, and natural language processing. By generating synthetic data,
researchers can augment existing datasets and improve the performance of machine learning
models.
Generate Photographs of Human Faces
GANs can generate realistic photographs of human faces, including images of people who do not
exist in the real world. You can use these rendered images for various purposes, such as creating
avatars for online games or social media profiles.
Generate Realistic Photographs
GANs can generate realistic photographs of various objects and scenes, including landscapes,
animals, and architecture. These rendered images can be used to augment existing image datasets
or to create entirely new datasets.
Generate Cartoon Characters
GANs can be used to generate cartoon characters that are similar to those found in popular movies
or television shows. These generated characters can be used to create new content or customize existing
characters in games and other applications.
Image-to-Image Translation
GANs can translate images from one domain to another, such as converting a photograph of a real-
world scene into a line drawing or a painting. You can create new content or transform existing
images in various ways.
Text-to-Image Translation
GANs can be used to generate images based on a given text description. You can use it to create
visual representations of concepts or generate images for machine learning tasks.
Semantic-Image-to-Photo Translation
GANs can translate images from a semantic representation (such as a label map or a
segmentation map) into a realistic photograph. You can use it to generate synthetic data for
training machine learning models or to visualize concepts more practically.
Face Frontal View Generation
GANs can generate frontal views of faces from images that show the face at an angle. You
can use it to improve face recognition algorithms' performance or synthesize pictures for
use in other applications.
Generate New Human Poses
GANs can generate images of people in new poses, including poses that are difficult or
impossible for humans to achieve. This can be used to create new content or to augment
existing image datasets.
Photos to Emojis
GANs can be used to convert photographs of people into emojis, creating a more
personalized and expressive form of communication.
Photograph Editing
GANs can be used to edit photographs in various ways, such as changing the background,
adding or removing objects, or altering the appearance of people or animals in the image.
Face Aging
GANs can be used to generate images of people at different ages, allowing users to visualize
how they might look in the future or to see what they might have looked like in the past.
Photo Blending
GANs can blend two or more photographs, creating a new image that combines elements from
the original images.
Super Resolution
GANs can enhance images' resolution, allowing users to produce higher-quality versions of low-
resolution images.
Photo Inpainting
GANs can fill in missing or damaged parts of photographs, creating a more complete and visually
appealing image.
Clothing Translation
Clothing translation is converting an image of clothing from one style or design to another. GANs
have been used to develop systems that can translate images of clothing from one type to
another, such as changing the color or pattern of a shirt or dress.
Video Prediction
Video prediction is generating future frames of a video based on a given sequence of past frames.
GANs have been used to develop systems that can generate realistic, high-quality video frames
that accurately predict the future evolution of the scene.
3D Object Generation
3D object generation creates 3D models of objects or scenes from 2D images or other data. GANs
have been used to develop systems that can generate realistic, high-quality 3D models of objects
and settings, such as buildings, cars, and people. You can use these systems for various
applications, such as virtual reality, video games, and computer-aided design.
GAN: EXAMPLES

Ongoing topic…
Sequence Generation through SeqGAN
Sequence Generation GAN
GAN Startup Landscape

[Figure: startups plotted by geographic reach (vertical axis, low to high) against application diversity (horizontal axis, low to high)]
