CS229 Notes 1
Andrew Ng
Major categories of DL models
1. General neural networks.
2. Sequence models (1D sequences): RNN, GRU, LSTM, CTC, attention models, ….
3. Image models: 2D and 3D convolutional networks.
4. Advanced/future tech: unsupervised learning (sparse coding, ICA, SFA, …), reinforcement learning, ….

Trend #2: The rise of end-to-end learning
• Learning with integer or real-valued outputs.
• Learning with complex (e.g., string-valued) outputs.
End-to-end learning: Speech recognition
[Figure: traditional pipeline vs. end-to-end model]
This works well given enough labeled (audio, transcript) data. End-to-end learning works only when you have enough (x, y) data to learn a function of the needed level of complexity.

End-to-end learning: Autonomous driving
[Figure: traditional pipeline vs. end-to-end model]
Given the safety-critical requirement of autonomous driving, and thus the need for extremely high levels of accuracy, a pure end-to-end approach is still challenging to get to work.
Compared to earlier eras, we still talk about bias and variance, but somewhat less
about the “tradeoff” between them.
Basic recipe for machine learning
• Training error high? (Bias)
  Yes → bigger model; train longer; new model architecture.
  No → continue.
• Dev error high? (Variance)
  Yes → more data; regularization; new model architecture.
  No → done!

Automatic data synthesis examples
• OCR: text against random backgrounds.
• Speech recognition: synthesize clean audio against different background noises (see the sketch below).
• NLP (grammar correction): synthesize random grammatical errors.
Sometimes synthesized data that appears great to human eyes is actually very impoverished in the eyes of ML algorithms, and covers only a minuscule fraction of the actual distribution of data. E.g., images of cars extracted from video games.
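The speech-recognition bullet above lends itself to a concrete illustration. Below is a minimal sketch of synthesizing noisy training audio by mixing clean recordings with background noise at random signal-to-noise ratios; the function name, the SNR range, and the stand-in arrays are illustrative assumptions, not something specified in the notes.

import numpy as np

def mix_with_noise(clean, noise, snr_db):
    """Mix a clean speech waveform with background noise at a target SNR (in dB)."""
    # Tile or trim the noise so it covers the whole clean utterance.
    reps = int(np.ceil(len(clean) / len(noise)))
    noise = np.tile(noise, reps)[: len(clean)]
    # Scale the noise to hit the requested signal-to-noise ratio.
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2) + 1e-12
    scale = np.sqrt(clean_power / (noise_power * 10 ** (snr_db / 10)))
    return clean + scale * noise

rng = np.random.default_rng(0)
# Stand-ins for real recordings: 1 s of "speech", 3 s of "car noise" at 16 kHz.
clean_utterances = [rng.standard_normal(16000)]
background_noises = [0.1 * rng.standard_normal(48000)]

# One synthetic noisy copy per utterance, at a random SNR between 0 and 20 dB.
synthetic = [
    mix_with_noise(c, background_noises[rng.integers(len(background_noises))],
                   snr_db=rng.uniform(0, 20))
    for c in clean_utterances
]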
Different training and test set distributions
Say you want to build a speech recognition system for a new in-car rearview mirror product. You have 50,000 hours of general speech data, and 10 hours of in-car data. How do you split your data? This is a bad way to do it:
• Training and Dev: general speech data (50,000 hours).
• Test: in-car data (10 hours).
Having mismatched dev and test distributions is not a good idea. Your team may spend months optimizing for dev set performance only to find it doesn’t work well on the test set.

Better way: make the dev and test sets come from the same distribution.
• Training (~50,000h): general speech data.
• Training-Dev (20h): general speech data, held out from training.
• Dev (5h) and Test (5h): in-car data.
(See the split sketch below.)
With this split, the gaps between successive error rates tell you what to work on:
Human level error ………........ 1%
    (“avoidable bias”)
Training error ……….............. 1.1%
    (overfitting of training set)
Training-Dev error ………....... 1.5%
    (data mismatch)
Dev set error ……………......... 8%
    (overfitting of dev set)
Test set error ……………......... 8.5%
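As a minimal sketch of the “better way” split above, the helper below assumes the two pools of utterances are already loaded as Python lists, and carves out a fraction of the general pool (roughly 20h out of 50,000h) as the train-dev slice; the function name and sizes are illustrative, not prescribed by the notes.

import random

def split_for_mismatched_data(general_pool, in_car_pool, train_dev_frac=20 / 50000, seed=0):
    """Split as the notes recommend: dev and test both come from the (small)
    in-car distribution, and a small held-out slice of general data
    ("train-dev") lets variance and data mismatch be measured separately."""
    rng = random.Random(seed)
    general = list(general_pool)      # copy so the caller's lists aren't mutated
    in_car = list(in_car_pool)
    rng.shuffle(general)
    rng.shuffle(in_car)

    n_train_dev = max(1, int(len(general) * train_dev_frac))
    train_dev, train = general[:n_train_dev], general[n_train_dev:]

    half = len(in_car) // 2           # 10h of in-car data -> 5h dev, 5h test
    dev, test = in_car[:half], in_car[half:]
    return train, train_dev, dev, test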
New recipe for machine learning
• Training error high? (Bias)
  Yes → bigger model; train longer; new model architecture.
• Train-Dev error high? (Variance)
  Yes → more data; regularization; new model architecture.
• Dev error high? (Train-test data mismatch)
  Yes → make the training data more similar to the test data; data synthesis (domain adaptation); new model architecture.
• Test error high? (Overfit dev set)
  Yes → get more dev set data.
• If none of the above: done!
(A code sketch of this decision sequence appears after this section.)

General Human/Bias/Variance analysis
For each of the two distributions — general speech data (50,000 hours) and in-car speech data (10 hours) — measure:
• Performance of humans → human-level error. (Carry out a human evaluation on the in-car data to measure it there.)
• Performance on examples you’ve trained on → training error. (Insert some in-car data into the training set to measure it there.)
• Performance on examples you haven’t trained on → Training-Dev error (general speech) and Dev/Test error (in-car).
The gap between human-level error and training error is the “avoidable bias”; the gap between training error and Training-Dev error is the “variance”/degree of overfitting; and the gap between Training-Dev error and Dev/Test error is the data mismatch.
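Here is one way the “new recipe” above could be written down as code. This is only a sketch under assumed names: err maps split names to error rates, human_level approximates Bayes error, and the tolerance tol is an arbitrary illustrative threshold, none of which come from the notes.

def new_recipe_step(err, human_level, tol=0.005):
    """Walk the decision sequence from the 'new recipe' and suggest actions."""
    if err["train"] - human_level > tol:          # avoidable bias
        return ["bigger model", "train longer", "new model architecture"]
    if err["train_dev"] - err["train"] > tol:     # variance / overfitting of training set
        return ["more data", "regularization", "new model architecture"]
    if err["dev"] - err["train_dev"] > tol:       # train-test data mismatch
        return ["make training data more similar to test data",
                "data synthesis", "domain adaptation"]
    if err["test"] - err["dev"] > tol:            # overfitting of the dev set
        return ["get more dev set data"]
    return ["done"]

# The error ladder from the in-car speech example above:
print(new_recipe_step({"train": 0.011, "train_dev": 0.015, "dev": 0.08, "test": 0.085},
                      human_level=0.01))
# -> the data-mismatch actions, since the largest gap is between Training-Dev and Dev error.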
You’ll often see the fastest performance improvements on a task while the ML is performing worse than humans.
• Human-level performance is a proxy for Bayes optimal error, which we can never surpass.
• You can rely on human intuition: (i) have humans provide labeled data; (ii) do error analysis to understand how humans got examples right; (iii) estimate bias/variance.
E.g., on an image recognition task, training error = 8%, dev error = 10%. What do you do? Two cases:
• Case 1: human level error = 1%, training set error = 8%, dev set error = 10%. The “avoidable bias” (training error minus human level error) dominates, so focus on bias.
• Case 2: human level error = 7.5%, training set error = 8%, dev set error = 10%. The “variance” (dev error minus training error) dominates, so focus on variance.
(A quick numeric check of both cases appears at the end of this section.)

What is “human-level error”? Suppose that on an image labeling task:
Typical human ………………..… 3% error
Typical doctor …………………... 1% error
Experienced doctor ……………. 0.7% error
Team of experienced doctors …. 0.5% error
Answer: for the purpose of driving ML progress, 0.5% is the best answer, since it’s closest to Bayes error.
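The two cases above can be checked with a few lines of arithmetic; the variable names here are illustrative.

train_err, dev_err = 0.08, 0.10
for human_err in (0.01, 0.075):                    # case 1 and case 2
    avoidable_bias = train_err - human_err
    variance = dev_err - train_err
    focus = "bias" if avoidable_bias > variance else "variance"
    print(f"human={human_err:.3f}: bias gap={avoidable_bias:.3f}, "
          f"variance gap={variance:.3f} -> focus on {focus}")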
AI Product Management
The availability of new supervised DL algorithms means we’re rethinking the workflow of how teams collaborate to build applications using DL. A Product Manager (PM) can help an AI team prioritize the most fruitful ML tasks. E.g., should you improve speech performance with car noise, café noise, or low-bandwidth audio, improve performance for accented speech, improve latency, reduce binary size, or something else?

What can AI do today? Some heuristics for PMs:
• If a typical person can do a mental task with less than one second of thought, we can probably automate it using AI either now or in the near future.
• For any concrete, repeated event that we observe (e.g., whether a user clicks on an ad; how long it takes to deliver a package; …), we can reasonably try to predict the outcome of the next event (e.g., whether the user clicks on the next ad).

How should PMs and AI teams work together? Here’s one default split of responsibilities:

Product Manager (PM) responsibility:
• Provide dev/test sets, ideally drawn from the same distribution.
• Provide an evaluation metric for the learning algorithm (accuracy, F1, etc.).

AI Scientist/Engineer responsibility:
• Acquire training data.
• Develop a system that does well, according to the provided metric, on the dev/test data.