
Artificial Intelligence

Week 8
Knowledge in Learning
LEARNING OUTCOMES

At the end of this session, students will be able to:


LO2 Explain how to use knowledge representation for reasoning purposes
LO3 Apply various techniques to an agent when acting under certainty
OUTLINE

1. A Logical Formulation of Learning


2. Knowledge in Learning
3. Explanation Based Learning
4. Inductive Logic Programming
5. Passive and Active Reinforcement Learning
6. Generalization in Reinforcement Learning
7. Application of Reinforcement Learning
8. Summary
A LOGICAL FORMULATION OF LEARNING
o Study learning methods that can take advantage of prior knowledge
about the world. In most cases, the prior knowledge is represented
as general first-order logical theories; thus, for the first time we
bring together the work on knowledge representation and learning.
o The logical formulation of learning may seem like a lot of extra work
at first, but it turns out to clarify many of the issues in learning.
A LOGICAL FORMULATION OF LEARNING
• The hypothesis is represented by a set of logical sentences
• Example descriptions and classifications will also be logical
sentences.
• A new example can be classified by inferring a classification sentence
from the hypothesis and the example description
A LOGICAL FORMULATION OF LEARNING
o Goal and Hypotheses:
Goal predicate Q: WillWait
o Learning: find an equivalent logical expression with which we can classify examples
o Each hypothesis proposes such an expression, a candidate definition of Q:
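One plausible candidate definition in the restaurant (WillWait) domain, given purely as an illustration rather than as the slide's own formula:

∀r WillWait(r) ⇔ Patrons(r, Some) ∨ (Patrons(r, Full) ∧ Hungry(r) ∧ Type(r, Thai))

that is, a single logical expression whose extension is meant to agree with all of the observed examples.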
A LOGICAL FORMULATION OF LEARNING
o An example: an object of some logical description to which the goal concept may or may not apply
o The classification of the examples
o Each hypothesis hj has the form

∀x Q(x) ⇔ Cj(x)

where Cj(x) is a candidate definition
A LOGICAL FORMULATION OF LEARNING
o The relations between f and h are: ++, −−, +− (false negative), −+ (false positive)
o An example is a false negative for the hypothesis if the hypothesis says it should be negative but in fact it is positive. For instance, a restaurant that hr classifies as negative but that in fact turns out positive would be a false negative for the hypothesis hr.
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search (adjusting the extension of the current hypothesis hr):
keep the initial hypothesis; on a false negative, generalize it; on a false positive, specialize it.

Generalization, e.g. via dropping conditions:
Alternate(x) ∧ Patrons(x, Some)  →  Patrons(x, Some)
Specialization, e.g. via adding conditions or via removing disjuncts:
Patrons(x, Some)  →  Alternate(x) ∧ Patrons(x, Some)
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search

Drawbacks:
1. Checking all previous instances over again is expensive.
2. It is difficult to find good heuristics, and backtracking is slow in the hypothesis space (which is doubly exponential).
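A minimal, runnable sketch of the generalize/specialize loop described above, under an assumed toy representation (a hypothesis is a list of attribute-value conjunctions, interpreted disjunctively); the full CURRENT-BEST-LEARNING algorithm also considers dropping or adding individual conditions and backtracks over its choices:

```python
def predicts(hypothesis, example):
    """A hypothesis (list of conjunctions) classifies an example positive if any conjunction matches."""
    return any(all(example.get(a) == v for a, v in conj.items()) for conj in hypothesis)

def consistent(hypothesis, seen):
    return all(predicts(hypothesis, ex) == label for ex, label in seen)

def current_best_learning(examples):
    hypothesis, seen = [], []          # start with the always-negative hypothesis
    for example, label in examples:
        seen.append((example, label))
        if predicts(hypothesis, example) == label:
            continue
        if label:   # false negative -> generalize: add a disjunct covering the example
            candidate = hypothesis + [dict(example)]
        else:       # false positive -> specialize: drop the disjuncts that cover it
            candidate = [c for c in hypothesis
                         if not all(example.get(a) == v for a, v in c.items())]
        if consistent(candidate, seen):
            hypothesis = candidate
        else:       # a full implementation would backtrack here
            raise RuntimeError("no consistent refinement found")
    return hypothesis

# Toy restaurant-style data (attribute names are illustrative).
data = [({"patrons": "some", "hungry": True},  True),
        ({"patrons": "none", "hungry": True},  False),
        ({"patrons": "full", "hungry": False}, False)]
print(current_best_learning(data))     # [{'patrons': 'some', 'hungry': True}]
```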
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search
Least commitment:
Instead of keeping around one hypothesis and using backtracking, keep
all consistent hypotheses (and only those).

Incremental: old instances do not have to be rechecked


KNOWLEDGE IN LEARNING

o The preceding section described the simplest setting for inductive


learning. To understand the role of prior knowledge, we need to
talk about the logical relationships among hypotheses, example
descriptions, and classifications.
o Let Descriptions denote the conjunction of all the example
descriptions in the training set, and let Classifications denote the
conjunction of all the example Classifications. Then a Hypothesis
that "explains the observations" must satisfy the following property
(recall that |= means "logically entails"):

Hypothesis ∧ Descriptions |= Classifications
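As a small illustration (using the WillWait vocabulary of the earlier slides, not an example taken from this slide): if Hypothesis is ∀x WillWait(x) ⇔ Patrons(x, Some) and Descriptions contains Patrons(X1, Some), then the classification WillWait(X1) is logically entailed, so this hypothesis explains that observation.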


EXPLANATION BASED LEARNING
o Explanation-based learning is a method for extracting general rules from individual observations.
o The technique of memoization has long been used in computer science to speed up programs by saving the results of computation. The basic idea of memo functions is to accumulate a database of input-output pairs; when the function is called, it first checks the database to see whether it can avoid solving the problem from scratch (a minimal illustration follows below).
o Explanation-based learning takes this a good deal further, by creating general rules that cover an entire class of cases.
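A minimal illustration of the memo-function idea itself (not of EBL); the recursive function below is just an illustrative stand-in:

```python
from functools import lru_cache

@lru_cache(maxsize=None)          # cache input-output pairs; repeated calls become lookups
def fib(n: int) -> int:
    return n if n < 2 else fib(n - 1) + fib(n - 2)

print(fib(200))   # fast, because each subproblem is computed once and then reused
```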
EXPLANATION BASED LEARNING

The basic EBL process works as follows (a small worked illustration follows the steps):


o Given an example, construct a proof that the goal predicate applies to
the example using the available background knowledge.
o In parallel, construct a generalized proof tree for the variabilized goal
using the same inference steps as in the original proof.
o Construct a new rule whose left-hand side consists of the leaves of the
proof tree and whose right-hand side is the variabilized goal (after
applying the necessary bindings from the generalized proof).
o Drop any conditions from the left-hand side that are true regardless of
the values of the variables in the goal.
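As a small illustration (borrowing the family-tree vocabulary of the later ILP slides rather than the slide's own example): suppose the background knowledge contains Mother(x, y) ⇒ Parent(x, y) and Parent(x, z) ∧ Parent(z, y) ⇒ Grandparent(x, y), and we observe the example Grandparent(Mum, Charles) together with the facts Mother(Mum, Elizabeth) and Mother(Elizabeth, Charles). The proof of the example uses the two Mother facts as leaves; repeating the same proof with variables in place of the constants, and taking its leaves as the left-hand side, yields the general rule

Mother(x, z) ∧ Mother(z, y) ⇒ Grandparent(x, y)

which then handles every such case in one step, without re-deriving the intermediate Parent conclusions.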
LEARNING AND USING RELEVANCE
INFORMATION
o The learning algorithm we now present is based on a straightforward attempt to find the simplest determination consistent with the observations.
o A determination is consistent with a set of examples if every pair of examples that matches on the predicates on the left-hand side also matches on the goal predicate (a small consistency-check sketch follows below).
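A minimal sketch of that consistency test, assuming a propositional encoding in which each example is a dict of attribute values plus a goal value (the attribute names below are illustrative, not from the slides):

```python
from itertools import combinations

def determination_consistent(lhs_attrs, examples, goal="goal"):
    """True if every pair of examples agreeing on all lhs_attrs also agrees on the goal."""
    for e1, e2 in combinations(examples, 2):
        if all(e1[a] == e2[a] for a in lhs_attrs) and e1[goal] != e2[goal]:
            return False
    return True

# Toy data: nationality determines language, but first name does not.
examples = [
    {"name": "Ana",   "nationality": "BR", "goal": "Portuguese"},
    {"name": "Bruno", "nationality": "BR", "goal": "Portuguese"},
    {"name": "Ana",   "nationality": "DE", "goal": "German"},
]
print(determination_consistent(["nationality"], examples))  # True
print(determination_consistent(["name"], examples))         # False (the two Anas differ on the goal)
```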
INDUCTIVE LOGIC PROGRAMMING
o Inductive logic programming (ILP) combines inductive methods with
the power of first-order representations, concentrating in particular on
the representation of hypotheses as logic programs.

o It has gained popularity for three reasons:


1. ILP offers a rigorous approach to the general knowledge-based
inductive learning problem.
2. ILP offers complete algorithms for inducing general, first-order
theories from examples, which can therefore learn successfully in
domains where attribute-based algorithms are hard to apply.
3. Inductive logic programming produces hypotheses that are
(relatively) easy for humans to read.
INDUCTIVE LOGIC PROGRAMMING
o The general knowledge-based induction problem is to "solve" the entailment constraint
Background ∧ Hypothesis ∧ Descriptions |= Classifications
for the unknown Hypothesis, given the Background knowledge and examples described by Descriptions and Classifications.
o The descriptions will consist of an extended family tree, described in terms of Mother, Father, and Married relations and Male and Female properties.
INDUCTIVE LOGIC PROGRAMMING
o The sentences in Classifications depend on the target concept being
learned.
o For example: Grandparent, BrotherInLaw, or Ancestor
o The complete set of Grandparent classifications contains 20 × 20 = 400 conjuncts of the form
Grandparent(Mum, Charles), Grandparent(Elizabeth, Beatrice), . . . , ¬Grandparent(Mum, Harry), ¬Grandparent(Spencer, Peter), . . .
INDUCTIVE LOGIC PROGRAMMING
Hypothesis: without background knowledge, the definition of Grandparent must be expressed directly in terms of Mother and Father:
Grandparent(x,y) ⇔ [∃z Mother(x,z) ∧ Mother(z,y)] ∨ [∃z Mother(x,z) ∧ Father(z,y)] ∨ [∃z Father(x,z) ∧ Mother(z,y)] ∨ [∃z Father(x,z) ∧ Father(z,y)]
INDUCTIVE LOGIC PROGRAMMING

Decision-Tree-Learning
o To apply an attribute-based learner such as Decision-Tree-Learning, each example pair must be recast as an object with attributes, e.g.
o Grandparent(⟨Mum, Charles⟩) . . .
o FirstElementIsMotherOfElizabeth(⟨Mum, Charles⟩)

The reader will certainly have noticed that a little bit of background
knowledge would help in the representation of the Grandparent
definition. For example, if Background included the sentence
Parent(x,y) ⇔ [Mother(x,y)∨Father(x,y)],
then the definition of Grandparent would be reduced to
Grandparent(x,y) ⇔ [∃z Parent(x,z)∧Parent(z,y)]
INDUCTIVE LOGIC PROGRAMMING

Two principal approaches to ILP:


o Top-down inductive learning method: using a generalization of decision
tree methods
o Inductive learning with inverse deduction: using techniques based on
inverting a resolution proof
INDUCTIVE LOGIC PROGRAMMING

Top-down inductive learning method


Suppose we are trying to learn a definition of the Grandfather(x, y) predicate, starting from the clause with an empty body, ⇒ Grandfather(x, y).
Here are three potential additions (a simplified sketch of this top-down loop follows):
1. Father(x, y) ⇒ Grandfather(x, y)
2. Parent(x, z) ⇒ Grandfather(x, y)
3. Father(x, z) ⇒ Grandfather(x, y)
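A simplified, propositional sketch of that top-down loop (sequential covering with greedy literal selection); real FOIL works with first-order literals, introduces new variables such as z above, and chooses literals with an information-gain measure, all of which are elided here:

```python
def covers(clause, example):
    """A clause (conjunction of literal tests) covers an example if every test holds."""
    return all(test(example) for test in clause)

def learn_one_clause(pos, neg, literals):
    """Greedily add the literal that excludes the most still-covered negatives."""
    clause, neg = [], list(neg)
    while neg:
        best = max(literals, key=lambda lit: sum(not lit(e) for e in neg))
        if all(best(e) for e in neg):      # no literal makes progress; give up on this clause
            break
        clause.append(best)
        pos = [e for e in pos if best(e)]
        neg = [e for e in neg if best(e)]
    return clause, pos                      # the clause and the positives it covers

def top_down_learn(pos, neg, literals):
    """Outer loop: keep learning clauses until every positive example is covered."""
    clauses, uncovered = [], list(pos)
    while uncovered:
        clause, covered = learn_one_clause(uncovered, neg, literals)
        if not covered:                     # avoid looping forever on unlearnable data
            break
        clauses.append(clause)
        uncovered = [e for e in uncovered if e not in covered]
    return clauses

# Illustrative toy data (attribute names are assumptions, not from the slides).
positives = [{"age": 70, "has_grandchild": True}, {"age": 65, "has_grandchild": True}]
negatives = [{"age": 70, "has_grandchild": False}, {"age": 30, "has_grandchild": True}]
literals  = [lambda e: e["age"] >= 60, lambda e: e["has_grandchild"]]
print(len(top_down_learn(positives, negatives, literals)))   # 1 learned clause
```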
INDUCTIVE LOGIC PROGRAMMING

Inductive learning with inverse deduction


Inverse resolution is based on the observation that if the example Classifications follow from Background ∧ Hypothesis ∧ Descriptions, then one must be able to prove this fact by resolution (because resolution is complete). Inverse resolution runs such a proof backward, generating candidate hypotheses; the book illustrates this with the family tree example.
PASSIVE REINFORCEMENT LEARNING

o An autonomous agent should learn to choose optimal actions in each state to achieve its goals
o The agent learns how to achieve that goal by trial-and-error interactions with its environment
o In passive learning, the agent simply watches the world going by and tries to learn the utilities of being in various states
o In active learning, the agent does not simply watch, but also acts
PASSIVE REINFORCEMENT LEARNING

The agent’s policy π is fixed: in state s, it always executes the action π(s).
Its goal is simply to learn how good the policy is—to learn the utility
function Uπ(s).
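For reference, the utility of a state under the fixed policy π is the expected discounted sum of rewards obtained by executing π from that state (γ is the discount factor):

Uπ(s) = E[ Σ_{t=0..∞} γ^t R(S_t) ],  where S_0 = s and the expectation is over the state sequences generated by executing π.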
PASSIVE REINFORCEMENT LEARNING

o The transition model P(s′ | s, a) specifies the probability of reaching state s′ from state s after doing action a;
o R(s) is the reward function.
o The agent executes a set of trials in the environment using its policy π. In each trial, the agent starts in state (1,1) and experiences a sequence of state transitions until it reaches one of the terminal states, (4,2) or (4,3). Its percepts supply both the current state and the reward received in that state. Typical trials might look like this:
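A typical trial is a sequence of (state, reward) pairs; below is a minimal sketch of passive temporal-difference (TD) learning over such trials. The particular grid coordinates, the -0.04 step reward, and the fixed learning rate are assumptions in the spirit of the 4×3 world, and TD is only one of the passive methods covered:

```python
GAMMA = 1.0    # no discounting, as in the 4x3 example
ALPHA = 0.1    # fixed learning rate (a real agent would decay it over time)

def td_passive(trials):
    """Estimate U(s) for the fixed policy from observed trials of (state, reward) pairs."""
    U = {}
    for trial in trials:
        for (s, r), (s_next, _) in zip(trial, trial[1:]):
            U.setdefault(s, 0.0)
            U.setdefault(s_next, 0.0)
            # nudge U(s) toward the observed one-step return
            U[s] += ALPHA * (r + GAMMA * U[s_next] - U[s])
        terminal, reward = trial[-1]
        U[terminal] = reward          # a terminal state's utility is just its reward
    return U

# One trial of the kind described on the slide: start at (1,1), reach terminal (4,3).
trial = [((1, 1), -0.04), ((1, 2), -0.04), ((1, 3), -0.04),
         ((2, 3), -0.04), ((3, 3), -0.04), ((4, 3), +1.0)]
print(td_passive([trial]))
```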
ACTIVE REINFORCEMENT LEARNING

An active agent must consider:
o What actions to take?
o What their outcomes may be?

Update utility equation:
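Two standard forms of the update: a model-based (ADP) active agent updates utilities with the Bellman equation, while a model-free active agent uses the Q-learning update instead:

U(s) ← R(s) + γ max_a Σ_{s′} P(s′ | s, a) U(s′)

Q(s, a) ← Q(s, a) + α ( R(s) + γ max_{a′} Q(s′, a′) − Q(s, a) )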


APPLICATION OF REINFORCEMENT
LEARNING
Game Playing

1. Checkers program written by Arthur Samuel (1959, 1967)


Samuel first used a weighted linear function for the evaluation of
positions, using up to 16 terms at any one time
2. Backgammon program TD-GAMMON (1992)
The TD-GAMMON project was an attempt to learn from self-play
alone. The only reward signal was given at the end of each game.
TD-GAMMON learned to play considerably better than
NEUROGAMMON, even though the input representation contained
just the raw board position with no computed features. This took
about 200,000 training games and two weeks of computer time.
APPLICATION OF REINFORCEMENT
LEARNING
Robot Control

1. BOXES algorithm (Michie and Chambers 1968)


BOXES was implemented with a real cart and pole. The algorithm first discretized the four-dimensional state space into boxes. Negative reinforcement was associated with the final action in the final box and then propagated back through the sequence.
2. PEGASUS algorithm (Bagnell and Schneider, 2001)
Application of reinforcement learning to helicopter flight
SUMMARY
o The use of prior knowledge in learning leads to a picture of
cumulative learning, in which learning agents improve their
learning ability as they acquire more knowledge.
o Explanation-based learning (EBL) extracts general rules from single examples by explaining the examples and generalizing the explanation. It provides a deductive method for turning first-principles knowledge into useful, efficient, special-purpose expertise.
o Relevance-based learning (RBL) uses prior knowledge in the form of
determinations to identify the relevant attributes, thereby
generating a reduced hypothesis space and speeding up learning.
RBL also allows deductive generalizations from single examples.
SUMMARY
o Inductive logic programming (ILP) techniques perform knowledge-based inductive learning on knowledge that is expressed in first-order logic. ILP methods can learn relational knowledge that is not expressible in attribute-based systems.
o The overall agent design dictates the kind of information that must
be learned. The three main designs we covered were the model-
based design, using a model P and a utility function U ; the model-
free design, using an action-utility function Q; and the reflex
design, using a policy π.
o When the learning agent is responsible for selecting actions while it
learns, it must trade off the estimated value of those actions
against the potential for learning useful new information. An exact
solution of the exploration problem is infeasible, but some simple heuristics do a reasonable job.
REFERENCES

Stuart Russell and Peter Norvig. 2010. Artificial Intelligence: A Modern Approach. Pearson Education, New Jersey. ISBN 9780132071482. Chapter 19.
Knowledge in Learning and Human Learning: http://l3d.cs.colorado.edu/courses/AI-96/learning-2.pdf
Scaling Learning Algorithms towards AI: http://yann.lecun.com/exdb/publis/pdf/bengio-lecun-07.pdf
https://slideplayer.com/slide/15478257/
https://www.slideshare.net/ersaranya/reinforcement-learning-7313
Thank You...
