Artificial Neural Network Course Slides
Lecture 1: Introduction
Spring 2025
Prerequisites
AIS301: Machine Learning
Grading Policy
Reference: https://www.deeplearningbook.org/
Textbook
• A great book!
Topics
Deep neural networks
How to train them
How to measure their performance
How to make that performance better
Warm-Up
https://play.blooket.com/play?id=3123283
What is AI?
• AI aims to simulate intelligent behavior.
What is Machine Learning?
• Machine Learning (ML) is the subfield of AI in which models learn patterns from data rather than following hand-written rules.
What is Deep Learning?
• Deep Learning (DL) is a type of ML based on deep neural networks.
[Figure: a model maps a real-world input to a real-world output]
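At its core, a deep learning model is just a function mapping an input to an output. A minimal sketch of such a function (the layer sizes and random weights below are illustrative, not from the slides):

```python
import numpy as np

def relu(z):
    # Rectified linear unit: zero out negative values.
    return np.maximum(0.0, z)

def forward(x, W1, b1, W2, b2):
    """A tiny two-layer network: input -> hidden (ReLU) -> output."""
    h = relu(W1 @ x + b1)   # hidden representation
    return W2 @ h + b2      # model output

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)   # 3 inputs -> 4 hidden units
W2, b2 = rng.normal(size=(2, 4)), np.zeros(2)   # 4 hidden units -> 2 outputs
y = forward(np.array([1.0, 0.5, -0.2]), W1, b1, W2, b2)
```

Training, covered later in the course, means adjusting W1, b1, W2, and b2 so that the outputs match the data.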
Simple example …
• Predict the height of a child given his/her age.
[Figure: a model maps the age of a child (input) to the height of the child (output)]
What is a supervised learning model?
Regression
Figures from http://udlbook.com
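The height-from-age task above is a regression problem, and the simplest possible model is a straight line fitted by least squares. A minimal sketch (the age/height numbers are made up for illustration):

```python
import numpy as np

# Toy data: (age in years, height in cm) -- illustrative numbers only.
ages = np.array([2.0, 4.0, 6.0, 8.0, 10.0])
heights = np.array([86.0, 102.0, 115.0, 127.0, 138.0])

# Least-squares fit of height = w * age + b.
A = np.stack([ages, np.ones_like(ages)], axis=1)
(w, b), *_ = np.linalg.lstsq(A, heights, rcond=None)

def predict(age):
    # Model output for a new input.
    return w * age + b
```

Least squares chooses w and b to minimize the squared error between predictions and observed heights; later lectures replace the straight line with more flexible neural networks trained by the same principle.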
What is a supervised learning model?
[Figure: examples of supervised learning tasks, each mapping an input through a deep learning model to an output]
“Given a news article, I want to predict whether it is political, sports, or economic.”
What type of problem is this?
➢ multivariate classification
➢ univariate regression
➢ binary classification
➢ multivariate regression
➢ multiclass classification
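The article task is multiclass classification: the model produces one score per class, and a softmax turns the scores into probabilities. A minimal sketch (the class scores below are made up):

```python
import numpy as np

def softmax(logits):
    # Subtract the max for numerical stability, then normalize.
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

classes = ["political", "sports", "economic"]
scores = np.array([2.0, 0.5, 0.1])   # hypothetical model outputs
probs = softmax(scores)
predicted = classes[int(np.argmax(probs))]
```

The predicted class is simply the one with the highest probability; here that is "political", because it received the largest score.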
Other types of examples …
Translation
[Figure: a deep learning model maps a sentence in one language to its translation in another]
Image captioning
[Figure: a deep learning model maps an image to a text caption]
Image generation from text
[Figure: a deep learning model maps a text prompt to a generated image]
What do these examples have in common?
• Very complex relationship between input and output
• Sometimes there may be many possible valid answers
• But outputs (and sometimes inputs) obey rules
DeepCluster: Deep Clustering for Unsupervised Learning of Visual Features (Caron et al., 2018)
Unsupervised Learning
• Learning about a dataset without labels
• e.g., clustering
• Generative models can create examples
• e.g., generative adversarial networks
• Probabilistic generative models learn a distribution over the data
• e.g., variational autoencoders,
• e.g., normalizing flows,
• e.g., diffusion models
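Clustering is the simplest of these ideas to sketch. A minimal k-means implementation on toy 2-D points (this is a hand-rolled illustration, not the DeepCluster method cited above):

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Minimal k-means: alternate assignment and centroid update."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # Assign each point to its nearest center.
        dists = np.linalg.norm(X[:, None] - centers[None], axis=2)
        labels = dists.argmin(axis=1)
        # Move each center to the mean of its assigned points.
        for j in range(k):
            if (labels == j).any():
                centers[j] = X[labels == j].mean(axis=0)
    return labels, centers

# Two well-separated toy clusters; no labels are provided.
X = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.2, 4.9]])
labels, centers = kmeans(X, k=2)
```

No labels were given, yet the algorithm discovers the two groups on its own, which is exactly what "learning about a dataset without labels" means.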
Generative models
Figures from http://udlbook.com
Conditional synthesis
Figures from http://udlbook.com
ChatGPT
Reinforcement learning
• Build an agent that lives in a world and can perform certain actions at each time step.
• Goal: take actions that change the state so that you receive rewards.
• You don’t receive any data – you have to explore the environment yourself to gather data as you go.
Example: chess
• States are valid states of the chess board
• Actions at a given time are valid possible moves
• Positive rewards for taking pieces, negative rewards for losing them
Figures from http://udlbook.com
Why is this difficult?
• Stochastic
• Make the same move twice, the opponent might not do the same thing
• Rewards also stochastic (opponent does or doesn’t take your piece)
• Temporal credit assignment problem
• Did we get the reward because of this move? Or because we made good tactical decisions somewhere in the past?
• Exploration-exploitation trade-off
• If we found a good opening, should we use this?
• Or should we try other things, hoping for something better?
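The exploration-exploitation trade-off can be simulated with a tiny epsilon-greedy agent choosing between two hypothetical openings with unknown win rates (all numbers below are invented for illustration):

```python
import random

def epsilon_greedy(values, eps, rng):
    """With probability eps pick a random action (explore),
    otherwise pick the best action seen so far (exploit)."""
    if rng.random() < eps:
        return rng.randrange(len(values))
    return max(range(len(values)), key=lambda a: values[a])

# Two hypothetical openings; the agent does not know these win rates.
true_win_rate = [0.4, 0.6]
values, counts = [0.0, 0.0], [0, 0]   # estimated value and play count per opening
rng = random.Random(0)

for _ in range(2000):
    a = epsilon_greedy(values, eps=0.1, rng=rng)
    reward = 1.0 if rng.random() < true_win_rate[a] else 0.0
    counts[a] += 1
    values[a] += (reward - values[a]) / counts[a]   # running average of reward
```

With eps = 0.1 the agent keeps trying both openings occasionally, so its value estimates stay informed, while spending most games on whichever opening currently looks better.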
Landmarks in Machine and Deep Learning
• 1958 Perceptron (simple ‘neural’ model)
• 1986 Backpropagation (practical training of deep neural networks)
• 1989 Convolutional networks (Supervised learning)
• 2012 AlexNet Image classification (Supervised learning)
• 2014 Generative adversarial networks (Unsupervised learning)
• 2014 Deep Q-Learning -- Atari games (Reinforcement learning)
• 2016 AlphaGo (Reinforcement learning)
• 2017 Machine translation (Supervised learning)
• 2019 Language models ((Un)supervised learning)
• 2022 DALL·E 2 Image synthesis from text prompts ((Un)supervised learning)
• 2022 ChatGPT ((Un)supervised learning)
• 2023 GPT-4 Multimodal model ((Un)supervised learning)
2018 Turing award winners
Where are we going?
• Supervised learning (overview with regression example)
• Shallow neural networks (a more flexible model)
• Deep neural networks (an even more flexible model)
• Loss functions (guiding the training)
• How to train neural networks (gradient descent and variants)
• How to measure performance of neural networks (generalization)