Chapter 5
Chapter 5
For example, an airline passenger who is offered “the pasta dish or the chicken”
does not know what lurks beneath the tinfoil cover.
• The pasta could be delicious or congealed, the chicken juicy or overcooked
beyond recognition.
• We can think of the set of outcomes for each action as a lottery—think of each
action as a ticket.
• A lottery L with possible outcomes S1,...,Sn that occur with probabilities p1,...,pn
is written
L = [p1, S1; p2, S2; ... pn, Sn]
• Each outcome Si of a lottery can be either an atomic state or another lottery.
• The primary issue for utility theory is to understand how preferences between
complex lotteries are related to preferences between the underlying states in
those lotteries.
• To address this issue we list six constraints that we require any reasonable
preference relation to obey
1.Orderability: Given any two lotteries, a rational agent must either prefer one to the
other or else rate the two as equally preferable. That is, the agent cannot avoid deciding.
6. Decomposability: Compound lotteries can be reduced to simpler ones using the laws
of probability. This has been called the “no fun in gambling” rule because it says that
two consecutive lotteries can be compressed into a single equivalent lottery, as shown
in Figure 5.1 (b)
• Notice that the axioms of utility theory are really axioms about preferences—
they say nothing about a utility function.
• But in fact from the axioms of utility we can derive the following consequences
(for the proof, see von Neumann and Morgenstern, 1944):
1. Existence of Utility Function: If an agent’s preferences obey the axioms of
utility, then there exists a function U such that U(A) > U(B) if and only if A is
preferred to B, and U(A) = U(B) if and only if the agent is indifferent between A
and B.
2. Expected Utility of a Lottery: The utility of a lottery is the sum of the probability
of each outcome times the utility of that outcome.
• In other words, once the probabilities and utilities of the possible outcome states
are specified, the utility of a compound lottery involving those states is
completely determined.
• Because the outcome of a nondeterministic action is a lottery, it follows that an
agent can act rationally— that is, consistently with its preferences—only by
choosing an action that maximizes expected utility.
Decision Theory :
Types of Decision Making:
• Under Certainty: have complete knowledge
• Under Risk: State of nature is unknown and have only probability of occurrence.
• Under Uncertainty: Hardly any knowledge of state of nature and have no
probability of occurrence.
• Under Conflicts: Neither states of nature are completely known nor are
completely uncertain. E.g. Branding Advertisement.
Definition: Decision theory deals with methods for determining the optimal course of
action when a number of alternatives are available and their consequences cannot be
forecast with certainty.
Components:
• Decision Maker: Individual / Group
• Course of Action: All possible actions
• State of Nature: more possible future events, called states of nature, that might
occur
• Payoff: What we get in return, profit/loss.
Decision Tree :
• A decision tree is a map of the possible outcomes of a series of related choices.
• It allows an individual or organization to weigh possible actions against one
another based on their costs, probabilities, and benefits.
• They can can be used either to drive informal discussion or to map out an
algorithm that predicts the best choice mathematically.
• Decision trees are commonly used in operations research, specifically in decision
analysis, to help identify a strategy most likely to reach a goal, but are also a
popular tool in machine learning.
• A decision tree is a flowchart-like structure in which each internal node
represents a "test" on an attribute, each branch represents the outcome of the test,
and each leaf node represents a class label (decision taken after computing all
attributes).
• The paths from root to leaf represent classification rules.
A decision tree consists of three types of nodes:
1. Decision nodes – typically represented by squares, shows a decision to be made.
2. Chance nodes – typically represented by circles, shows the probabilities of
certain results.
3. End nodes – typically represented by triangles, shows the final outcome of a
decision path.
Advantages
• Easy to understand
• They can be useful with or without hard data, and any data requires
minimal preparation.
• New options can be added to existing trees.
• Their value in picking out the best of several options.
• How easily they combine with other decision making tools.
Disadvantages
• No Convergence Path.
• Tree is very large become excessively complex.
Sequential Decision Problem:
• For example, the lower-right corner shows that when player O chooses action
two and E also chooses two, the payoff is +4 for E and −4 for O.
• Each player in a game must adopt and then execute a strategy (which is the
name used in game theory for a policy).
• A pure strategy is a deterministic policy; for a single-move game, a pure
strategy is just a single action.
• For many games an agent can do better with a mixed strategy, which is a
randomized policy that selects actions according to a probability distribution.
• The mixed strategy that chooses action a with probability p and action b
otherwise is written [p: a; (1 − p): b].
• For example, a mixed strategy for two-finger Morra might be [0.5: one;
0.5:two].
• A strategy profile is an assignment of a strategy to each player; given the
strategy profile, the game’s outcome is a numeric value for each player.
• A solution to a game is a strategy profile in which each player adopts a rational
strategy.