
A CHECKERS LEARNING PROBLEM
“Machine Learning” by Tom Mitchell
PROBLEM
• Task T: playing checkers
• Performance measure P: percent of games won in the world
tournament
• Training experience E: games played against itself
APPROACH

1. The exact type of knowledge to be learned
2. A representation for this target knowledge
3. A learning mechanism

The type of training experience available can have a significant impact on the success or failure of the learner.
TARGET FUNCTION
• The goal is to reduce the problem of improving performance P at task T to the problem of learning some particular target function.
• Indirect training experience: sequences of legal moves and whether the game was won or lost.
• Direct training experience: for a given board, the moves required to win.

TARGET FUNCTION
• An evaluation function that assigns a numerical score to any given board state.
• V : B → ℝ denotes that V maps any legal board state from the set B to some real value (we use ℝ to denote the set of real numbers).

1. if b is a final board state that is won, then V(b) = 100
2. if b is a final board state that is lost, then V(b) = -100
3. if b is a final board state that is drawn, then V(b) = 0
4. if b is not a final state in the game, then V(b) = V(b'), where b' is the best final board state that can be achieved starting from b and playing optimally until the end of the game (assuming the opponent plays optimally as well).
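
Case 4 makes the definition recursive but non-operational: evaluating it requires searching ahead to the end of the game. A minimal Python sketch of the recursion, assuming hypothetical helpers is_final, outcome, and successors standing in for a real checkers engine (none of these appear in the text):

def ideal_v(b, black_to_move=True):
    # Ideal target function V: searches to the end of the game, so it is
    # non-operational in practice. Scores are from black's perspective.
    if is_final(b):                       # hypothetical helper
        return outcome(b)                 # +100 won, -100 lost, 0 drawn
    values = [ideal_v(s, not black_to_move)
              for s in successors(b, black_to_move)]  # hypothetical helper
    # Both sides play optimally: black maximizes, red minimizes.
    return max(values) if black_to_move else min(values)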
TARGET FUNCTION
• An operational description of the ideal target function V is required.
• The learning algorithm is expected to acquire only some approximation to the target function; for this reason the process of learning the target function is often called function approximation.

On one hand, we wish to pick a very expressive representation to allow representing as close an approximation as possible to the ideal target function V. On the other hand, the more expressive the representation, the more training data the program will require in order to choose among the alternative hypotheses it can represent.
Problem Representation
• A simple representation: for any given board state, the function will be calculated as a linear combination of the following board features (a feature-extraction sketch follows the list).
• x1: the number of black pieces on the board
• x2: the number of red pieces on the board
• x3: the number of black kings on the board
• x4: the number of red kings on the board
• x5: the number of black pieces threatened by red (i.e., which can be captured on red's next turn)
• x6: the number of red pieces threatened by black
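
A minimal sketch of extracting these six features, assuming a hypothetical board encoding (a dict mapping squares to the labels "black", "red", "black_king", "red_king") and a hypothetical count_threatened helper; neither appears in the text:

def features(board):
    pieces = list(board.values())
    return [
        pieces.count("black"),               # x1: black pieces
        pieces.count("red"),                 # x2: red pieces
        pieces.count("black_king"),          # x3: black kings
        pieces.count("red_king"),            # x4: red kings
        count_threatened(board, by="red"),   # x5: black pieces threatened by red
        count_threatened(board, by="black"), # x6: red pieces threatened by black
    ]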
TARGET FUNCTION
• Thus, our learning program will represent V̂(b) as a linear function of the form

V̂(b) = w0 + w1·x1 + w2·x2 + w3·x3 + w4·x4 + w5·x5 + w6·x6

where w0 through w6 are numerical coefficients, or weights, to be chosen by the learning algorithm. Learned values for the weights w1 through w6 will determine the relative importance of the various board features in determining the value of the board, whereas the weight w0 will provide an additive constant to the board value.
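
A minimal sketch of evaluating this linear form; the weights below are illustrative values, not learned ones:

def v_hat(weights, x):
    # weights = [w0, w1, ..., w6]; x = [x1, ..., x6]
    return weights[0] + sum(w * xi for w, xi in zip(weights[1:], x))

# Opening position: 12 pieces per side, no kings, nothing threatened.
w = [0.0, 1.0, -1.0, 1.5, -1.5, -0.5, 0.5]   # illustrative weights
print(v_hat(w, [12, 12, 0, 0, 0, 0]))        # prints 0.0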
ESTIMATING TRAINING VALUES
• In order to learn the target function we require a set of training examples, each describing a specific board state b and the training value Vtrain(b) for b. In other words, each training example is an ordered pair of the form ⟨b, Vtrain(b)⟩.
• Rule for estimating training values:

Vtrain(b) ← V̂(Successor(b))
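
A hedged sketch of this rule, reusing the v_hat and features sketches above; successor(b) is a hypothetical helper returning the board after the program's move and the opponent's reply:

def estimate_training_value(weights, b):
    # Vtrain(b) <- V̂(Successor(b)): score b using the current
    # estimate of its successor state.
    return v_hat(weights, features(successor(b)))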
ADJUSTING THE WEIGHTS
• One common approach is to define the best hypothesis, or set of weights, as that which minimizes the squared error E between the training values and the values predicted by the hypothesis V̂:

E ≡ Σ⟨b, Vtrain(b)⟩ ∈ training examples (Vtrain(b) − V̂(b))²

Thus, we seek the weights, or equivalently the V̂, that minimize E for the observed training examples.
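
A minimal sketch of computing E, assuming each training example is a (feature-vector, training-value) pair:

def squared_error(weights, examples):
    # examples: list of (x, v_train) pairs
    return sum((v_train - v_hat(weights, x)) ** 2
               for x, v_train in examples)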
LMS Training
• The least mean squares, or LMS, training rule is one of several algorithms that incrementally refine the weights.

LMS weight update rule:
• For each training example ⟨b, Vtrain(b)⟩
• Use the current weights to calculate V̂(b)
• For each weight wi, update it as

wi ← wi + η (Vtrain(b) − V̂(b)) xi

where η is a small constant (e.g., 0.1) that moderates the size of the weight update.
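
A minimal sketch of one LMS step, reusing v_hat above; eta = 0.1 matches the small constant suggested in the book, and the bias weight w0 is updated with an implicit feature x0 = 1:

def lms_update(weights, x, v_train, eta=0.1):
    # Move each weight a small step in the direction that reduces
    # the error (Vtrain(b) - V̂(b)) on this example.
    error = v_train - v_hat(weights, x)
    weights[0] += eta * error                 # bias weight w0 (x0 = 1)
    for i, xi in enumerate(x, start=1):
        weights[i] += eta * error * xi
    return weights

Repeating this update over many training examples drives the weights toward values that minimize E.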
The Final Design
