Lecture 9 - RL

Any Questions?



Week  Date       Topic                               Note
1     2/13~2/17  Introduction
2     2/20~2/24  Machine Learning I
3     2/27~3/3   Machine Learning II                 No Class (2/28)
4     3/6~3/10   Machine Learning III                HW1 announce (3/6)
5     3/13~3/17  Problem Solving by Searching
6     3/20~3/24  Adversarial Search                  HW1 due and HW2 announce (3/21)
7     3/27~3/31  Markov Decision Process
8     4/3~4/7    Spring Break                        No Class (4/4); HW2 due and HW3 announce (4/7)
9     4/10~4/14  Reinforcement Learning
10    4/17~4/21  Constraint Satisfaction Problems    HW3 due and HW4 announce (4/21)
11    4/24~4/28  Bayesian Network
12    5/1~5/5    Knowledge, Reasoning, and Planning
13    5/8~5/12   3D Computer Vision                  HW4 due and HW5 announce (5/12)
14    5/15~5/19  Guest Talk (NVIDIA Research)
15    5/22~5/26  Guest Talk (TBA)                    HW5 due (5/26)
16    5/29~6/2   Guest Talk (NVIDIA Research)
17    6/5~6/9    Final Project Demo
18    6/13~6/17

*The above dates might change.


https://mingyuliu.net/



https://youtu.be/3GPNsPMqY8o



Intended Learning Outcomes
• List existing definitions of artificial intelligence
  • First lecture
• Identify challenges in the field of artificial intelligence
  • Covered throughout the semester
• Explain selected modeling approaches for identified problems
  • e.g., A* search (state-based model) for route planning (problem)
• Apply the learned models to solve problems
  • HW1 ~ HW5
• Develop your own solution for a problem
  • Final Project



Machine Learning
• Minimizing an objective on a given dataset (also called the training dataset)



State-based Models
• Search problems: UCS and A* search
• Adversarial search: Minimax, Alpha-Beta pruning, Expectimax
• MDPs: Policy evaluation, Value iteration



Example: Map Coloring

How to solve this problem?


Variable-based Models
• Variables
  • Solutions to problems ==> assignments to variables
• Examples of variable-based models
  • Constraint satisfaction problems
  • Bayesian networks
  • Hidden Markov models



The Robot System
A hierarchical design with the following five major levels:
1. The robot vehicle and its connection to user programs
   • Chapters 2 and 3, and Appendices A and G
2. Low-level actions (LLAs) (e.g., roll)
   • Chapter 4
3. Intermediate-level actions (ILAs) (e.g., push)
   • Chapters 5 and 6, and Appendix B
4. Planning (i.e., STRIPS)
   • Constructs sequences of ILAs needed to carry out specific tasks
   • Chapter 7
5. The executive, the program that actually invokes and monitors execution of the ILAs
   • Chapter 8

http://ai.stanford.edu/~nilsson/OnlinePubs-Nils/shakey-the-robot.pdf



A Task for Shakey
Fetch a box from an adjacent room and take it back to Room1.
How to achieve it?

Intermediate-level Actions
1. Robot GOTHRU D1 from R1 to R2
2. Robot PUSHTHRU Box1 through D1 from R2 to R1

How to obtain this conclusion?



ChatGPT for Robot Action Generation?

https://github.com/microsoft/ChatGPT-Robot-Manipulation-Prompts



Reinforcement Learning
Spring 2023

Yi-Ting Chen
https://lilianweng.github.io/posts/2018-02-19-rl-overview/
AlphaGo, AlphaZero
• DeepMind, 2016+



RL in Intelligent Systems

https://www.vis.xyz/pub/roach/
https://rl-at-scale.github.io/



End-to-end Driving Models
• Pros:
  • Jointly optimizes the targeted objective (i.e., a safe, comfortable, and efficient driving experience)
  • Cheap annotations
• Cons:
  • Limited generalization
  • Limited interpretability

https://www.cvlibs.net/publications/Renz2022CORL.pdf



End-to-end Driving Models

https://github.com/autonomousvision/transfuser



IL-based End-to-end Driving Models: Challenges
To train a smart end-to-end driving model with an Imitation Learning (IL) approach, we need lots of high-quality expert demonstrations.
● Human drivers as experts
  ○ High-quality data
  ○ Too expensive to collect lots of data 😵
● Automated experts (rule-based)
  ○ Efficiently generate large-scale demonstrations in driving simulators
  ○ Hand-crafted rule-based experts usually perform suboptimally 😵
We need another expert!



RL-based End-to-end Driving Expert
Why is it desirable to introduce Reinforcement Learning (RL) as a driving expert?
● Pros
  ○ An RL-based expert is automated, so it can be used to generate large-scale demonstrations efficiently in driving simulators
● Cons
  ○ RL-based end-to-end driving models are limited to driving simulators
  ○ Training RL-based models from scratch in the real world could lead to fatal car accidents



https://www.unrealengine.com/marketplace/en-US/product/japanese-street/reviews



Roach - the Reinforcement Learning Coach
STEP 1: Train RL expert
Input: bird’s-eye view semantic segmentation image
Output: low-level actions (steering, throttle and brake)

https://arxiv.org/pdf/2108.08265.pdf



Roach - the Reinforcement Learning Coach (cont.)
STEP 2: Train IL agent
Input: single camera image
Output: low-level actions

Imitation learning agents learn from supervision signals provided by RL experts

https://arxiv.org/pdf/2108.08265.pdf



Performance Comparison - IL Trained by Different Experts

https://arxiv.org/pdf/2108.08265.pdf



Demo: IL Agent Supervised by RL Expert



IL-based End-to-end Driving Models: Challenges
To train a smart end-to-end driving model with an Imitation Learning (IL) approach, we need lots of high-quality expert demonstrations.
● Human drivers as experts
  ○ High-quality data
  ○ Too expensive to collect lots of data 😵
● Automated experts (rule-based)
  ○ Efficiently generate large-scale demonstrations in driving simulators
  ○ Hand-crafted rule-based experts usually perform suboptimally 😵
We need another expert!



Any Questions?



Wayve

https://youtu.be/Va-F4qtTQ6g



Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

https://ai.googleblog.com/2023/04/robotic-deep-rl-at-scale-sorting-waste.html
https://rl-at-scale.github.io/



The Hanging Task

reaching → grasping → moving → hanging



6D Pose Estimation: Template Pose Matching

● How to obtain the 6D pose of an object?
  ○ Template pose matching

(Pipeline figure: capture RGBD and annotate keypoints for both the object template, whose 6D pose is known, and the object in an unknown pose; match the corresponding points to obtain the object's 6D pose.)



Grasping: other approaches
● How to obtain the 6D pose of an object?
○ Learning-based methods

https://arxiv.org/pdf/2104.01542v2.pdf



Deep RL at Scale
How can the various tools in the deep reinforcement learning toolbox be put together to address such a complex and diverse real-world task?
• The use of simulation to overcome the high sample complexity
  • Hard to characterize every scene in advance
• The use of prior data
  • Computer vision datasets or data from the Internet
• Large-scale collective learning involving fleets of multiple robots
  • Training from scratch in the real world presents a major exploration challenge

https://rl-at-scale.github.io/
https://rl-at-scale.github.io/assets/rl_at_scale.pdf



Markov Decision Processes (MDPs)
• An MDP is defined by:
  • A set of states s ∈ S
  • A set of actions a ∈ A
  • A transition function T(s, a, s')
    • Probability that action a taken in s leads to s', i.e., P(s' | s, a)
    • Also called the dynamics model
  • A start state
  • Maybe a terminal state
  • A reward function R(s, a, s')
    • Reward for the transition (s, a, s')

RL: Unknown Transition Functions or Rewards
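To make the definition concrete, here is a minimal sketch (not from the lecture) of an MDP represented as plain Python data; the two-state "stay/quit" numbers below are made up for illustration.

# A minimal sketch (not from the lecture) of an MDP as plain Python data.
# The two-state "stay/quit" numbers below are hypothetical.
from dataclasses import dataclass, field

@dataclass
class MDP:
    states: set
    actions: dict          # state -> available actions
    T: dict                # (s, a) -> {s': probability}
    R: dict                # (s, a, s') -> reward
    start: str
    terminal: set = field(default_factory=set)

mdp = MDP(
    states={"in", "end"},
    actions={"in": ["stay", "quit"], "end": []},
    T={("in", "stay"): {"in": 2/3, "end": 1/3}, ("in", "quit"): {"end": 1.0}},
    R={("in", "stay", "in"): 4, ("in", "stay", "end"): 4, ("in", "quit", "end"): 10},
    start="in",
    terminal={"end"},
)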


A Computational Approach to Learning from Interactions
• Reinforcement learning problem:
  • Learning what to do – how to map situations to actions – so as to maximize a numerical reward signal
• Formalize the problem via dynamical systems theory:
  • Markov Decision Processes (MDPs)



MDPs vs RL
• MDPs (offline)
  • Known transition model
  • Find a policy to maximize expected utility

• RL (online)
  • Unknown transition model
  • Perform actions in the world to find it out and collect rewards
  • The way humans work: we go through life, taking various actions, getting feedback. We get rewarded for doing well and learn along the way



Reinforcement Learning Framework

Stanford CS221



MDP dynamics known?
• Yes → Iterative methods (e.g., value iteration and policy evaluation)
• No → Learn a model of the dynamics?
  • Yes → Model-based RL
  • No → State space of manageable size?
    • Yes → Model-free RL
    • No → Hand-crafted state features?
      • Yes → Model-free RL with function approximation
      • No → Deep Reinforcement Learning


Model-based RL
• An MDP is defined by:
  • A set of states s ∈ S
  • A set of actions a ∈ A
  • A transition function T(s, a, s')
    • Probability that action a taken in s leads to s', i.e., P(s' | s, a)
    • Also called the dynamics model
  • A start state
  • Maybe a terminal state
  • A reward function R(s, a, s')
    • Reward for the transition (s, a, s')

Estimate the transition function and reward of the MDP from data!



Model-based Monte Carlo
• Monte Carlo is a standard way to estimate the expectation of a random variable by taking an average over samples of that random variable
• Data collected by following some policy: episodes of (s, a, r, s') transitions

• Transition: T̂(s, a, s') = (# times (s, a, s') is observed) / (# times action a is taken in state s)

• Reward: R̂(s, a, s') = average of the rewards r observed on transitions (s, a, s')



Example
Input policy π over states A, B, C, D, E; assume γ = 1

Observed Episodes (Training):
  Episode 1: B, east, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 2: B, east, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 3: E, north, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 4: E, north, C, -1; C, east, A, -1; A, exit, x, -10

Learned Model:
  T(B, east, C) = 1.00
  T(C, east, D) = 0.75
  T(C, east, A) = 0.25
  ...
  R(B, east, C) = -1
  R(C, east, D) = -1
  R(D, exit, x) = +10
  ...

Berkeley CS188
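A minimal sketch of the count-based estimates above, using the four episodes from this example; the (state, action, reward, next_state) tuple encoding is an assumption, not the lecture's notation.

# A minimal sketch of model-based Monte Carlo: estimate T and R by counting.
from collections import defaultdict

episodes = [
    [("B", "east", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("B", "east", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("E", "north", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("E", "north", -1, "C"), ("C", "east", -1, "A"), ("A", "exit", -10, "x")],
]

counts = defaultdict(int)     # (s, a, s') -> count
totals = defaultdict(int)     # (s, a)     -> count
rewards = defaultdict(list)   # (s, a, s') -> observed rewards

for episode in episodes:
    for s, a, r, s_next in episode:
        counts[(s, a, s_next)] += 1
        totals[(s, a)] += 1
        rewards[(s, a, s_next)].append(r)

T_hat = {k: c / totals[(k[0], k[1])] for k, c in counts.items()}
R_hat = {k: sum(rs) / len(rs) for k, rs in rewards.items()}

print(T_hat[("C", "east", "D")])   # 0.75
print(T_hat[("C", "east", "A")])   # 0.25
print(R_hat[("D", "exit", "x")])   # 10.0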



Model-based Monte Carlo
• Monte Carlo is a standard way to estimate the expectation of a random variable by taking an average over samples of that random variable
• Data collected by following some policy: episodes of (s, a, r, s') transitions

• Transition: T̂(s, a, s') = (# times (s, a, s') is observed) / (# times action a is taken in state s)

• Reward: R̂(s, a, s') = average of the rewards r observed on transitions (s, a, s')

Once we know the transition function and reward, iterative methods (e.g., value iteration) can be applied to find the optimal policy.
Problem
• If we never observe a pair (s, a), e.g., (A, 'south'), we cannot estimate the transition function and the corresponding reward
• In reinforcement learning, we need a policy to explore states
  • This is the concept of Exploration!
  • We will discuss the idea later
• This distinguishes reinforcement learning from supervised learning



MDP dynamics known?
• Yes → Iterative methods (e.g., value iteration and policy evaluation)
• No → Learn a model of the dynamics?
  • Yes → Model-based RL
  • No → State space of manageable size?
    • Yes → Model-free RL
    • No → Hand-crafted state features?
      • Yes → Model-free RL with function approximation
      • No → Deep Reinforcement Learning


Model-free RL
• Estimate the optimal value directly
  • We do not explicitly estimate the transitions and rewards
  • Instead, we estimate Q(s, a) directly



Model-free Monte Carlo
• Data (following a policy π): episodes of states, actions, and rewards
• Qπ(s, a) is the expected utility of taking action a from state s, and then following policy π
• Utility (discounted sum of rewards from time t):
  U_t = R_{t+1} + γ R_{t+2} + γ² R_{t+3} + ...
• Estimate: Q̂π(s, a) = average of the utilities U_t observed after taking action a in state s



Example
Input policy π over states A, B, C, D, E; assume γ = 1

Observed Episodes (Training):
  Episode 1: B, east, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 2: B, east, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 3: E, north, C, -1; C, east, D, -1; D, exit, x, +10
  Episode 4: E, north, C, -1; C, east, A, -1; A, exit, x, -10

Output values: A = -10, B = +8, C = +4, D = +10, E = -2

Example for state C:
  Episode 1: (-1 + 10) = 9
  Episode 2: (-1 + 10) = 9
  Episode 3: (-1 + 10) = 9
  Episode 4: (-1 - 10) = -11
  Avg: (9 + 9 + 9 - 11) / 4 = 4
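A minimal sketch of the model-free Monte Carlo estimate, reproducing the value 4 computed for state C above; the episode encoding mirrors the earlier sketch and is an assumption.

# A minimal sketch of model-free Monte Carlo: estimate Q^pi(s, a) by averaging
# the observed utilities that follow each occurrence of (s, a).
from collections import defaultdict

gamma = 1.0
episodes = [  # (state, action, reward, next_state), the same four episodes as above
    [("B", "east", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("B", "east", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("E", "north", -1, "C"), ("C", "east", -1, "D"), ("D", "exit", +10, "x")],
    [("E", "north", -1, "C"), ("C", "east", -1, "A"), ("A", "exit", -10, "x")],
]

utilities = defaultdict(list)   # (s, a) -> list of observed utilities
for episode in episodes:
    for t, (s, a, _, _) in enumerate(episode):
        # utility = discounted sum of rewards from time t onward
        u = sum(gamma ** k * r for k, (_, _, r, _) in enumerate(episode[t:]))
        utilities[(s, a)].append(u)

Q_hat = {sa: sum(us) / len(us) for sa, us in utilities.items()}
print(Q_hat[("C", "east")])     # (9 + 9 + 9 - 11) / 4 = 4.0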



On-policy vs Off-policy
• Model-based Monte Carlo is off-policy
  • The model does not depend on the exact policy, as long as it was able to explore all (s, a) pairs

• Model-free Monte Carlo is on-policy: it depends strongly on the policy that is followed
  • The value being computed is dependent on the policy used to generate the data



Problem
• We use U_t as an estimate of Qπ(s, a)
• U_t only corresponds to a single episode

• Could we do better in estimating Qπ(s, a)?



A Convex Formulation of the Q-value Estimate
• Original formulation: Q̂π(s, a) = average of the utilities u observed after taking action a in state s
• Iterative procedure for building the mean as new numbers come in:
  Q̂π(s, a) ← (1 − η) Q̂π(s, a) + η u,   with η = 1/n for the n-th sample of (s, a)



Example (state C, the four episodes above, γ = 1)
Utilities observed from C:
  E1: (-1 + 10) = 9
  E2: (-1 + 10) = 9
  E3: (-1 + 10) = 9
  E4: (-1 - 10) = -11

Iteration 1: η = 1   → Q̂ = 9
Iteration 2: η = 1/2 → Q̂ = (1/2)(9) + (1/2)(9) = 9
Iteration 3: η = 1/3 → Q̂ = (2/3)(9) + (1/3)(9) = 9
Iteration 4: η = 1/4 → Q̂ = (3/4)(9) + (1/4)(-11) = 4

Same as the batch average: (9 + 9 + 9 - 11) / 4 = 4
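A minimal sketch of this iterative (convex-combination) update for a single (s, a) pair, reproducing the iterations above; starting from 0 and using η = 1/n for the n-th sample recovers the batch average.

# A minimal sketch of the running-mean (convex) update for a single (s, a) pair.
utilities = [9, 9, 9, -11]      # utilities observed from state C in the example

q_hat, n = 0.0, 0
for u in utilities:
    n += 1
    eta = 1.0 / n
    q_hat = (1 - eta) * q_hat + eta * u
    print(q_hat)                # 9.0, 9.0, 9.0, 4.0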
SARSA
• On each transition (s, a, r, s', a'), combine the observed data (the reward r) with the current estimate of Qπ(s', a') (based on the data seen before):
  Q̂π(s, a) ← (1 − η) Q̂π(s, a) + η [ r + γ Q̂π(s', a') ]
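A minimal sketch of the SARSA update applied to one (s, a, r, s', a') experience; Q is assumed to be a dict from (state, action) to the current estimate, and eta and gamma are illustrative values.

# A minimal sketch of the SARSA update on a single (s, a, r, s', a') experience.
def sarsa_update(Q, s, a, r, s_next, a_next, eta=0.1, gamma=1.0):
    # the target uses the action the policy actually took in s'
    target = r + gamma * Q.get((s_next, a_next), 0.0)
    Q[(s, a)] = (1 - eta) * Q.get((s, a), 0.0) + eta * target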



Problem
• Model-free Monte Carlo and SARSA only give us Q̂π, the value of the policy that generated the data
• How can we obtain Q_opt in order to act optimally?



Optimal Q-value
• Q_opt(s, a): the optimal Q-value if we take action a in state s (and act optimally afterwards)
• Q_opt(s, a) = Σ_s' T(s, a, s') [ R(s, a, s') + γ max_{a'} Q_opt(s', a') ]

How to obtain the optimal Q-value in a model-free manner?



Q-Learning
Optimal Q-value update on each observed (s, a, r, s'):
  Q̂_opt(s, a) ← (1 − η) Q̂_opt(s, a) + η [ r + γ max_{a'} Q̂_opt(s', a') ]

• The difference between Q-learning and SARSA is that Q-learning involves a maximum over actions rather than just taking the action of the policy

Stanford CS221
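For contrast with SARSA, here is a minimal sketch of the Q-learning update; the only difference is the max over next actions in the target. The actions argument (the actions available in s') is an assumption of this sketch.

# A minimal sketch of the Q-learning update: the target uses a max over next
# actions instead of the action the policy actually took.
def q_learning_update(Q, actions, s, a, r, s_next, eta=0.1, gamma=1.0):
    best_next = max(Q.get((s_next, a2), 0.0) for a2 in actions)   # max over actions
    target = r + gamma * best_next
    Q[(s, a)] = (1 - eta) * Q.get((s, a), 0.0) + eta * target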



Q-learning

Stanford CS221



Markov Decision Processes (MDPs)
• An MDP is defined by:
  • A set of states s ∈ S
  • A set of actions a ∈ A
  • A transition function T(s, a, s')
    • Probability that action a taken in s leads to s', i.e., P(s' | s, a)
    • Also called the dynamics model
  • A start state
  • Maybe a terminal state
  • A reward function R(s, a, s')
    • Reward for the transition (s, a, s')

RL: Unknown Transition Functions or Rewards


Self-driving Cab
• How can we use RL techniques to pick up a passenger at one location and drop them off at another?
  • Drop off the passenger at the right location
  • Save the passenger's time by taking the minimum time possible to drop off
  • Take care of the passenger's safety and traffic rules



State Space
• Current location of the cab: 5 x 5 grid (25)
• Passenger location: R, G, B, Y, or inside the cab (5)
• Destination: R, G, B, Y (4)

• In total, we have 25 x 5 x 4 = 500 states



Action Space
• South, north, east, west, pickup, and dropoff

• Note that there are certain actions that the cab cannot perform due to obstacles (e.g., walls)



Rewards
• The agent should receive a high positive reward for a successful dropoff because this behavior is highly desired
• The agent should be penalized if it tries to drop off a passenger at a wrong location
• The agent should get a slight negative reward for every time-step it has not yet reached the destination. "Slight" negative because we would prefer our agent to arrive late rather than make wrong moves trying to reach the destination as fast as possible



Q-learning
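As a concrete end-to-end sketch, the loop below runs tabular Q-learning on the taxi task, assuming OpenAI Gym's Taxi-v3 environment (500 states, 6 actions, matching the spaces above). Hyperparameters and the episode count are illustrative, and the reset/step calls follow the classic gym API (gymnasium returns slightly different tuples).

# A minimal sketch of tabular Q-learning on a Taxi-like task (assumes gym's Taxi-v3).
import random
import numpy as np
import gym

env = gym.make("Taxi-v3")
Q = np.zeros((env.observation_space.n, env.action_space.n))   # 500 x 6 table
alpha, gamma, epsilon = 0.1, 0.9, 0.1

for episode in range(10000):
    state = env.reset()
    done = False
    while not done:
        # epsilon-greedy action selection
        if random.random() < epsilon:
            action = env.action_space.sample()
        else:
            action = int(np.argmax(Q[state]))
        next_state, reward, done, _ = env.step(action)
        # Q-learning update
        best_next = np.max(Q[next_state])
        Q[state, action] = (1 - alpha) * Q[state, action] + alpha * (reward + gamma * best_next)
        state = next_state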



Any Questions?



Is Q-learning on-policy or off-policy?

Model-free Q-learning is off-policy, since it can learn the optimal policy using data collected from other policies.

Stanford CS221
Different RL Algorithms



How to determine the exploration policy?

We need to determine an exploration policy that specifies how to act in order to collect data for learning



Exploration Policy
• No exploration, all exploitation
  • Set π_act(s) = argmax_a Q̂_opt(s, a)
  • Once the agent finds a policy it believes is optimal, it will not try other actions
• No exploitation, all exploration
  • Set π_act(s) = a random action from Actions(s)
  • It doesn't exploit what it learns and ends up with a very low utility



Epsilon-greedy Algorithm
• The natural thing to do when you have two extremes is to interpolate between them
  • With probability ε, explore: choose a random action from Actions(s)
  • With probability 1 − ε, exploit: choose argmax_a Q̂_opt(s, a)
• Start with a high ε and gradually decrease it over time
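A minimal sketch of an ε-greedy policy with a decaying ε; Q maps (state, action) to the current estimate, and the decay schedule is an illustrative assumption.

# A minimal sketch of an epsilon-greedy exploration policy with a decaying epsilon.
import random

def epsilon_greedy(Q, state, actions, epsilon):
    if random.random() < epsilon:
        return random.choice(actions)                              # explore
    return max(actions, key=lambda a: Q.get((state, a), 0.0))      # exploit

epsilon, epsilon_min, decay = 1.0, 0.05, 0.999
for episode in range(10000):
    # ... run one episode, choosing actions with epsilon_greedy(...) ...
    epsilon = max(epsilon_min, epsilon * decay)   # start high, decrease over time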



MDP dynamics known?
• Yes → Iterative methods (e.g., value iteration and policy evaluation)
• No → Learn a model of the dynamics?
  • Yes → Model-based RL
  • No → State space of manageable size?
    • Yes → Model-free RL
    • No → Hand-crafted state features?
      • Yes → Model-free RL with function approximation
      • No → Deep Reinforcement Learning


In Q-learning
• Basic Q-learning keeps a table of all Q-values
• In realistic situations, we cannot possibly learn about every single state!
  • Too many states to visit them all in training
  • Too many states to hold the Q-table in memory



Function Approximation
(Similar to the evaluation function discussed in Adversarial Search)

• Function approximation fixes the issue of large lookup tables by parameterizing Q̂(s, a) with a weight vector w and a feature vector φ(s, a), e.g., Q̂(s, a; w) = w · φ(s, a)

Do you remember where we learned this?



Q-learning with Function Approximation
• We hope Q̂_opt(s, a; w) ≈ Q_opt(s, a) by using some parametric function, e.g., Q̂_opt(s, a; w) = w · φ(s, a)
• Loss function (objective): MSE between the prediction Q̂_opt(s, a; w) and the target r + γ max_{a'} Q̂_opt(s', a'; w)
• Gradient descent (partial derivative with respect to w) to minimize this objective
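A minimal sketch of this update with a linear approximator Q̂(s, a; w) = w · φ(s, a); the feature function phi is assumed to be given, and the target is treated as a constant when taking the gradient.

# A minimal sketch of Q-learning with linear function approximation.
import numpy as np

def q_hat(w, phi, s, a):
    return float(np.dot(w, phi(s, a)))

def fa_q_learning_update(w, phi, actions, s, a, r, s_next, eta=0.01, gamma=1.0):
    target = r + gamma * max(q_hat(w, phi, s_next, a2) for a2 in actions)
    prediction = q_hat(w, phi, s, a)
    # gradient of 0.5 * (prediction - target)^2 w.r.t. w is (prediction - target) * phi(s, a)
    w -= eta * (prediction - target) * phi(s, a)
    return w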



Gradient Descent
• Objective function:
  TrainLoss(θ) = (1 / |D_train|) Σ_{(x, y) ∈ D_train} (θ · φ(x) − y)²
• Partial derivative with respect to θ:
  ∇_θ TrainLoss(θ) = (1 / |D_train|) Σ_{(x, y) ∈ D_train} 2 (θ · φ(x) − y) φ(x)
• Update:
  θ ← θ − η ∇_θ TrainLoss(θ)
Stanford CS231n Note
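A small numeric sketch of the update above on a toy one-feature dataset (the data and step size are made up); θ converges toward the least-squares solution.

# A small numeric sketch of gradient descent on the training loss above.
train = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]   # (x, y) pairs, with phi(x) = x
theta, eta = 0.0, 0.05

for step in range(100):
    grad = sum(2 * (theta * x - y) * x for x, y in train) / len(train)
    theta -= eta * grad                         # theta converges toward 2.0

print(theta)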



MDP dynamics known?
• Yes → Iterative methods (e.g., value iteration and policy evaluation)
• No → Learn a model of the dynamics?
  • Yes → Model-based RL
  • No → State space of manageable size?
    • Yes → Model-free RL
    • No → Hand-crafted state features?
      • Yes → Model-free RL with function approximation
      • No → Deep Reinforcement Learning


Deep Reinforcement Learning
• Instead of manually determining the states, let's take raw data (i.e., images) as states

• Parameterize Q̂(s, a) with neural networks that take images as input

• Deep Q-Networks (DeepMind, 2015)

• The field of Deep RL is growing very fast!

• http://rail.eecs.berkeley.edu/deeprlcourse/



Breakout Game
• Actions:
  • move_left
  • move_right
  • do_not_move
• Rewards
  • If the ball hits a brick, reward = 1
  • Otherwise, reward = 0
• End
  • If the ball falls off the screen, the game ends
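A minimal PyTorch sketch of a DQN-style Q-network for a Breakout-like game with the three actions above: stacked grayscale frames in, one Q-value per action out. The layer sizes follow the common DQN recipe but are assumptions here, not the lecture's exact model.

# A minimal PyTorch sketch of a DQN-style Q-network: raw (stacked) image frames in,
# one Q-value per action out. Layer sizes are illustrative.
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    def __init__(self, num_actions, in_channels=4):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        self.head = nn.Sequential(nn.Linear(64 * 7 * 7, 512), nn.ReLU(), nn.Linear(512, num_actions))

    def forward(self, x):               # x: (batch, 4, 84, 84) stacked grayscale frames
        return self.head(self.features(x / 255.0))

q_net = QNetwork(num_actions=3)         # Breakout-like: move_left, move_right, do_not_move
q_values = q_net(torch.zeros(1, 4, 84, 84))
action = int(q_values.argmax(dim=1))    # greedy action for this (dummy) observation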



Three Different Learning Paradigms
• Supervised Learning
  • The model learns from training data that consists of labeled pairs of inputs and outputs
  • For example, in chess, you would need all the moves a player can make in each state, along with labels indicating whether each move is good or not
• Unsupervised Learning
  • The model learns the hidden structure in the input data
• Reinforcement Learning
  • The model learns by maximizing the cumulative reward
  • The data is the environment that you are interacting with

