AI Bayes Theorem

Bayes' theorem allows determining the probability of an event given uncertain knowledge by relating conditional probabilities and marginal probabilities. It calculates the posterior probability P(A|B) from the prior probability P(A), the likelihood P(B|A), and the marginal probability P(B). Bayesian networks are probabilistic graphical models that represent conditional dependencies between random variables using a directed acyclic graph. Each node corresponds to a variable and arcs represent causal relationships. Bayesian networks use Bayes' theorem and joint probability distributions to answer probabilistic queries about events.

Bayes' theorem:

Bayes' theorem is also known as Bayes' rule, Bayes' law, or Bayesian reasoning. It determines the probability of an event with uncertain knowledge.

In probability theory, it relates the conditional probability and marginal probabilities of two random events.

Bayes' theorem was named after the British mathematician Thomas Bayes.

Bayesian inference is an application of Bayes' theorem and is fundamental to Bayesian statistics.

It is a way to calculate the value of P(B|A) with the knowledge of P(A|B).


Bayes' theorem allows updating the probability prediction of an event by observing new information from the real world.

Example: If the risk of cancer is related to a person's age, then by using Bayes' theorem we can determine the probability of cancer more accurately with the help of the person's age.

Bayes' theorem can be derived using the product rule and the conditional probability of event A with known event B:

From the product rule we can write:

1. P(A ⋀ B) = P(A|B) P(B)

Similarly, for the probability of event B with known event A:

2. P(A ⋀ B) = P(B|A) P(A)

Equating the right-hand sides of both equations, we get:

P(A|B) = P(B|A) P(A) / P(B)    ...(a)

The above equation (a) is called Bayes' rule or Bayes' theorem. This equation is the basis of most modern AI systems for probabilistic inference.

It shows the simple relationship between joint and conditional probabilities.


Here,

P(A|B) is known as the posterior, which we need to calculate. It is read as the probability of hypothesis A given that evidence B has occurred.

P(B|A) is called the likelihood: assuming the hypothesis is true, it is the probability of observing the evidence.

P(A) is called the prior probability: the probability of the hypothesis before considering the evidence.

P(B) is called the marginal probability: the probability of the evidence alone.
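As a quick numeric check of equation (a), here is a minimal sketch in Python. The probability values are made-up illustrative assumptions, not taken from the text:

```python
# Illustrative (assumed) values for a toy hypothesis/evidence pair
prior = 0.25        # P(A): hypothesis before seeing evidence
likelihood = 0.5    # P(B|A): probability of evidence given the hypothesis
marginal = 0.3125   # P(B): overall probability of the evidence

# Bayes' theorem: posterior = likelihood * prior / marginal
posterior = likelihood * prior / marginal
print(posterior)  # 0.4
```

Observing evidence B raised the probability of hypothesis A from 0.25 to 0.4, which is exactly the "updating on new information" described above.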

Example-1:

Question: What is the probability that a patient has the disease meningitis, given a stiff neck?

Given Data:

A doctor is aware that the disease meningitis causes a patient to have a stiff neck 80% of the time. He is also aware of some more facts, which are given as follows:

o The known probability that a patient has meningitis is 1/30,000.
o The known probability that a patient has a stiff neck is 2%.

Let a be the proposition that the patient has a stiff neck and b be the proposition that the patient has meningitis, so that P(a|b) = 0.8, P(b) = 1/30000, and P(a) = 0.02. We can then calculate:

P(b|a) = P(a|b) P(b) / P(a) = (0.8 × 1/30000) / 0.02 ≈ 0.00133

Hence, roughly 1 patient in 750 with a stiff neck has meningitis.
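The same calculation can be reproduced in a few lines of Python, using only the values stated in the given data:

```python
# Given data from the meningitis example
p_a_given_b = 0.8    # P(a|b): stiff neck given meningitis
p_b = 1 / 30000      # P(b): prior probability of meningitis
p_a = 0.02           # P(a): probability of a stiff neck

# Bayes' theorem: P(b|a) = P(a|b) P(b) / P(a)
p_b_given_a = p_a_given_b * p_b / p_a
print(p_b_given_a)  # ≈ 0.001333, i.e. about 1 in 750
```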

Bayesian Belief Network in artificial intelligence

A Bayesian belief network is a key computer technology for dealing with probabilistic events and for solving problems that involve uncertainty. We can define a Bayesian network as:

"A Bayesian network is a probabilistic graphical model which represents a set of variables and their conditional dependencies using a directed acyclic graph."

It is also called a Bayes network, belief network, decision network, or Bayesian model.

Bayesian networks are probabilistic because they are built from a probability distribution, and they use probability theory for prediction and anomaly detection.

Real-world applications are probabilistic in nature, and to represent the relationships between multiple events we need a Bayesian network. It can be used in various tasks including prediction, anomaly detection, diagnostics, automated insight, reasoning, time-series prediction, and decision making under uncertainty.

A Bayesian network can be used for building models from data and expert opinions, and it consists of two parts:

o Directed acyclic graph
o Table of conditional probabilities

The generalized form of a Bayesian network that represents and solves decision problems under uncertain knowledge is known as an influence diagram.

A Bayesian network graph is made up of nodes and arcs (directed links), where:

o Each node corresponds to a random variable, and a variable can be continuous or discrete.
o Arcs, or directed arrows, represent the causal relationships or conditional probabilities between random variables. These directed links connect pairs of nodes in the graph. A link indicates that one node directly influences the other node; if there is no directed link between two nodes, they are independent of each other.
o For example, in a graph with nodes A, B, C, and D representing random variables: if node B is connected to node A by a directed arrow from A to B, then node A is called the parent of node B, while a node C with no link from node A is independent of node A.
The Bayesian network has mainly two components:

o Causal component
o Actual numbers

Each node in the Bayesian network has a conditional probability distribution P(Xi | Parents(Xi)), which determines the effect of the parents on that node.

A Bayesian network is based on the joint probability distribution and conditional probability, so let's first understand the joint probability distribution:

Joint probability distribution:

If we have variables x1, x2, x3, ..., xn, then the probabilities of the different combinations of x1, x2, x3, ..., xn are known as the joint probability distribution.

P[x1, x2, x3, ..., xn] can be expanded by the chain rule as:

= P[x1 | x2, x3, ..., xn] P[x2, x3, ..., xn]

= P[x1 | x2, x3, ..., xn] P[x2 | x3, ..., xn] .... P[xn-1 | xn] P[xn]

In a Bayesian network, for each variable Xi the conditional dependencies simplify this to:

P(Xi | Xi-1, ..., X1) = P(Xi | Parents(Xi))
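The chain-rule expansion can be sketched for three binary-valued variables. The conditional probabilities below are illustrative assumptions, not values from the text:

```python
# Chain rule for three variables:
# P(x1, x2, x3) = P(x1 | x2, x3) * P(x2 | x3) * P(x3)
p_x3 = 0.2              # P(x3), assumed
p_x2_given_x3 = 0.7     # P(x2 | x3), assumed
p_x1_given_x2_x3 = 0.9  # P(x1 | x2, x3), assumed

joint = p_x1_given_x2_x3 * p_x2_given_x3 * p_x3
print(joint)  # 0.126
```

In a Bayesian network, each conditioning set shrinks to just the node's parents, so a sparse graph needs far fewer numbers than the full joint table.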

Example: Harry installed a new burglar alarm at his home to detect burglary. The alarm responds reliably to a burglary but also responds to minor earthquakes. Harry has two neighbors, David and Sophia, who have taken responsibility for informing Harry at work when they hear the alarm. David always calls Harry when he hears the alarm, but sometimes he gets confused with the phone ringing and calls then too. On the other hand, Sophia likes to listen to loud music, so sometimes she misses the alarm. Here we would like to compute the probability of the burglar alarm.

Solution:
o The Bayesian network for the above problem is given below. The network structure shows that Burglary and Earthquake are the parent nodes of Alarm and directly affect the probability of the alarm going off, while David's and Sophia's calls depend on the alarm.
o The network represents the assumptions that the neighbors do not directly perceive the burglary, do not notice the minor earthquake, and do not confer before calling.
o The conditional distribution for each node is given as a conditional probability table, or CPT.
o Each row in a CPT must sum to 1 because the entries in the row represent an exhaustive set of cases for the variable.
o In a CPT, a boolean variable with k boolean parents requires 2^k rows of probabilities. Hence, if there are two parents, the CPT will contain 4 probability values.

List of all events occurring in this network:

o Burglary (B)
o Earthquake (E)
o Alarm (A)
o David calls (D)
o Sophia calls (S)

We can write the events of the problem statement in the form of the probability P[D, S, A, B, E], and rewrite it using the joint probability distribution:

P[D, S, A, B, E] = P[D | S, A, B, E] P[S, A, B, E]

= P[D | S, A, B, E] P[S | A, B, E] P[A, B, E]

= P[D | A] P[S | A, B, E] P[A, B, E]

= P[D | A] P[S | A] P[A | B, E] P[B, E]

= P[D | A] P[S | A] P[A | B, E] P[B | E] P[E]


Problem:

Calculate the probability that the alarm has sounded, but neither a burglary nor an earthquake has occurred, and both David and Sophia have called Harry.

From the formula for the joint distribution, we can write the problem statement in the form of a probability distribution:

P(S, D, A, ¬B, ¬E) = P(S|A) · P(D|A) · P(A|¬B ∧ ¬E) · P(¬B) · P(¬E)

= 0.75 × 0.91 × 0.001 × 0.998 × 0.999

= 0.00068045

Hence, a Bayesian network can answer any query about the domain by using the joint distribution.
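The worked query can be reproduced directly from the factorization. The values below are the ones used in the arithmetic above; note that P(¬B) = 0.998 and P(¬E) = 0.999 imply priors P(B) = 0.002 and P(E) = 0.001, and the full CPTs are not reproduced here:

```python
# CPT entries needed for the query P(S, D, A, ¬B, ¬E),
# taken from the worked example above
p_s_given_a = 0.75            # P(Sophia calls | Alarm)
p_d_given_a = 0.91            # P(David calls | Alarm)
p_a_given_not_b_not_e = 0.001 # P(Alarm | ¬Burglary, ¬Earthquake)
p_not_b = 0.998               # P(¬Burglary)
p_not_e = 0.999               # P(¬Earthquake)

# Joint probability via the factorization
# P(S|A) * P(D|A) * P(A|¬B,¬E) * P(¬B) * P(¬E)
p = p_s_given_a * p_d_given_a * p_a_given_not_b_not_e * p_not_b * p_not_e
print(round(p, 8))  # 0.00068045
```

Any other joint query over these five events could be computed the same way by picking the matching CPT entries, which is exactly why the network can answer arbitrary queries about the domain.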

The semantics of Bayesian networks:

There are two ways to understand the semantics of a Bayesian network, which are given below:

1. To understand the network as a representation of the joint probability distribution.

This is helpful for understanding how to construct the network.

2. To understand the network as an encoding of a collection of conditional independence statements.

This is helpful in designing inference procedures.
