
CS60010: Deep Learning

Sudeshna Sarkar
Spring 2018

8 Jan 2018
INTRODUCTION
Milestones: Digit Recognition
LeNet (1989): recognized zip codes. Built by Yann LeCun, Bernhard Boser and others; ran live in the US Postal Service.
Milestones: Image Classification
Convolutional NNs: AlexNet (2012), trained on 200 GB of ImageNet data.
Human performance: 5.1% error.
Milestones: Speech Recognition
Recurrent Nets: LSTMs (1997):
Milestones: Language Translation
Sequence-to-sequence models with LSTMs and attention:

Source: Luong, Cho and Manning, ACL Tutorial 2016.


Milestones: Deep Reinforcement Learning
In 2013, DeepMind's arcade player bests human experts on six Atari games. (DeepMind was acquired by Google in 2014.)

In 2016, DeepMind's AlphaGo defeats former world champion Lee Sedol.
Learning about Deep Neural Networks

Yann LeCun: DNNs require "an interplay between intuitive insights, theoretical modeling, practical implementations, empirical studies, and scientific analyses."

In other words, there isn't a framework or core set of principles to explain everything (cf. graphical models for machine learning).
This Course

Goals:
• Introduce deep learning.
• Review principles and techniques for understanding deep networks.
• Develop skill at designing networks for applications.
This Course

• Times: Mon 12-1, Tue 10-12, Thu 8-9

• Assignments (pre-midterm): 20%
• Post-midterm assignments / Project: 20%
• Midterm: 30%
• Endterm: 30%

• TAs: Ayan Das, Alapan Kuila, Aishik Chakraborty, Ravi Bansal, Jeenu Grover

• Moodle: DL Deep Learning

• Course Home Page: cse.iitkgp.ac.in - TBD
Prerequisites

• Knowledge of calculus and linear algebra
• Probability and statistics
• Machine learning
• Programming in Python
Logistics

• 3 hours of lecture
• 1 hour of programming / tutorial

• Attendance is compulsory

Phases of Neural Network Research

• 1940s-1960s: Cybernetics: brain-like electronic systems; morphed into modern control theory and signal processing.
• 1960s-1980s: Digital computers, automata theory, computational complexity theory: simple shallow circuits are very limited…
• 1980s-1990s: Connectionism: complex, non-linear networks, back-propagation.
• 1990s-2010s: Computational learning theory, graphical models: learning is computationally hard, simple shallow circuits are very limited…
• 2006: Deep learning: end-to-end training, large datasets, explosion in applications.
Citations of the “LeNet” paper
• Recall that LeNet was a modern visual classification network that recognized digits for zip codes. Its citations over time look like this:

[Figure: citation counts of the LeNet paper over time, annotated with the second phase, the deep learning "winter", and the third phase.]

• The 2000s were a golden age for machine learning and marked the ascent of graphical models. But not so for neural networks.
Why the success of DNNs is surprising
• From both complexity and learning theory perspectives, simple networks are very limited.
• Parity can't be computed with a small network.
• It is NP-hard to learn "simple" functions like 3-SAT formulae, i.e. training a DNN is NP-hard.
Why the success of DNNs is surprising
• The most successful DNN training algorithm is a version of gradient descent, which will only find local optima. In other words, it's a greedy algorithm. Backprop (see the sketch below):

  loss = f(g(h(y)))
  d loss/dy = f'(g(h(y))) × g'(h(y)) × h'(y)

• Greedy algorithms are even more limited in what they can represent and how well they learn.

• If a problem has a greedy solution, it's regarded as an "easy" problem.
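
A minimal Python sketch of this chain-rule computation. The scalar functions f, g, h below are made-up examples for illustration, not from the slides:

import numpy as np

# Hypothetical composition loss = f(g(h(y))) with simple scalar functions.
def h(y): return y ** 2           # h(y) = y^2,    h'(y) = 2y
def g(u): return np.sin(u)        # g(u) = sin(u), g'(u) = cos(u)
def f(v): return 3.0 * v          # f(v) = 3v,     f'(v) = 3

def loss(y):
    return f(g(h(y)))

def dloss_dy(y):
    # Backprop: forward pass stores intermediates, then local derivatives are multiplied.
    u = h(y)
    df_dv = 3.0
    dg_du = np.cos(u)
    dh_dy = 2.0 * y
    return df_dv * dg_du * dh_dy

y = 0.7
eps = 1e-6
numeric = (loss(y + eps) - loss(y - eps)) / (2 * eps)
print(dloss_dy(y), numeric)   # the analytic and finite-difference values should agree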
Why the success of DNNs is surprising
• In graphical models, values in a network represent random variables and have a clear meaning. The network structure encodes dependency information, i.e. you can represent rich models.

• In a DNN, node activations encode nothing in particular, and the network structure only encodes (trivially) how they derive from each other.
Why the success of DNNs is obvious
• Hierarchical representations are ubiquitous in AI. Computer vision:
Why the success of DNNs is obvious
• Natural language:
Why the success of DNNs is obvious
• Human learning is deeply layered.
Why the success of DNNs is obvious
• What about greedy optimization?
• Less obvious, but it looks like many learning problems (e.g. image classification) are actually "easy", i.e. they have reliable steepest-descent paths to a good model.

Ian Goodfellow – ICLR 2015 Tutorial


Representations Matter
[Figure: the same data plotted in Cartesian coordinates (x, y) and in polar coordinates (r, θ).]
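
To make the figure's point concrete, here is a small sketch with synthetic data (a toy example of my own, not the slide's data): a circular class boundary needs both x and y in Cartesian coordinates, but becomes a single threshold on r in polar coordinates.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic 2-D points: class 1 lies inside a circle of radius 1, class 0 outside.
xy = rng.uniform(-2, 2, size=(1000, 2))
labels = (np.hypot(xy[:, 0], xy[:, 1]) < 1.0).astype(int)

# Polar representation of the same points.
r = np.hypot(xy[:, 0], xy[:, 1])
theta = np.arctan2(xy[:, 1], xy[:, 0])

# In polar coordinates a one-feature threshold on r classifies the data perfectly.
pred = (r < 1.0).astype(int)
print("accuracy using only r:", (pred == labels).mean())   # 1.0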
Representation Learning
• Use machine learning to discover not only the mapping from representation to output but also the representation itself.
• Learned representations often result in much better performance than can be obtained with hand-designed representations.
• They also enable AI systems to rapidly adapt to new tasks, with minimal human intervention.
Depth
[Figure: a deep image-recognition model builds up its representation layer by layer: visible layer (input pixels) → 1st hidden layer (edges) → 2nd hidden layer (corners and contours) → 3rd hidden layer (object parts) → output (object identity: CAR, PERSON, ANIMAL).]

[Figure: flowchart comparing four approaches. Rule-based systems: input → hand-designed program → output. Classic machine learning: input → hand-designed features → mapping from features → output. Representation learning: input → features → mapping from features → output. Deep learning: input → simple features → additional layers of more abstract features → mapping from features → output.]
ML BASICS
Definition
• Mitchell (1997): "A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E."
Linear Regression
• In the case of linear regression, the output is a linear function of the input. Let ŷ be the value that our model predicts y should take on. We define the output to be

  ŷ = wᵀx

• Performance is measured by the mean squared error on the test set:

  MSE_test = (1/m) ‖ŷ^(test) − y^(test)‖₂²
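
A minimal numpy sketch of these two formulas; the placeholder names w, X_test, y_test and the toy numbers are mine, not from the slides:

import numpy as np

def predict(w, X):
    # Linear model: each prediction is w^T x, i.e. y_hat = X @ w for a design matrix X.
    return X @ w

def mse(y_hat, y):
    # Mean squared error: (1/m) * sum of squared differences.
    return np.mean((y_hat - y) ** 2)

# Toy usage with made-up values.
w = np.array([2.0, -1.0])
X_test = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
y_test = np.array([2.0, -1.0, 1.0])
print(mse(predict(w, X_test), y_test))   # 0.0 here, since y_test = X_test @ w exactly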
Normal Equations
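
The heading refers to the closed-form least-squares solution w = (XᵀX)⁻¹Xᵀy. A sketch under that assumption, with synthetic data invented for the demo:

import numpy as np

rng = np.random.default_rng(1)

# Made-up training data: y = X @ w_true plus a little noise.
w_true = np.array([1.5, -3.0])
X = rng.normal(size=(100, 2))
y = X @ w_true + 0.01 * rng.normal(size=100)

# Normal equations: solve (X^T X) w = X^T y.
# Solving the linear system is numerically preferable to forming the inverse explicitly.
w_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(w_hat)   # should be close to w_true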
