Or II Unit 1 Theory Notes
Or II Unit 1 Theory Notes
SYLLABUS
Game theory, terminologies of Game Theory, Two person zero sum Games, The maximin-
minimax principle, Game without saddle point - mixed strategies, graphical solution of 2 x n
and m x 2 Games, Dominance property, Decision theory introduction, Decision under
certainty, Decision under risk, Decision under uncertainty and decision tree.
Game theory
Game theory is a type of decision theory in which one's choice of action is determined after
taking into account all possible alternatives available to an opponent playing the same game
rather than just by the possibilities of several outcome results.
Game
Game is defined as an activity between two or more person involving activities by each
person according to a set of rules, at the end of which each person receives some benefit or
satisfaction or suffer loss (negative benefit).
The set of rules defines the Game going through the set of rules by the participants defines
the play
Types of game
A) On the basis of Characteristics of game theory.
There can be various types of games, They can be classified on the basis of the following
characteristics
1. Chance of strategy
If in a game, activities are determined by skills, it is said to be a game of strategy, If they are
determined by the chance, it is a chance game of chance.
In general a game may involve game of strategy as well as game of chance.
2. Number of persons
A game is called as n - person game if the number of person playing is in the person is ‘n’
individual or a group aiming at a particular objective.
3. Number of activities
This maybe finite or in finite.
4. Number of alternative (choices) available to each person in a particular activity may also
be finite or infinite. A finite game has a finite number of activities, each involving a finite
number of alternatives otherwise a game is said to be infinite.
5. Information to the players about the past activities of other players is completely
available, partly available or not available at all.
6. Payoff - A quantitative measure of satisfaction a person gets at the end of each play is
called a payoff. It is a real valued function of variables in the game.
B) Competitive Game
A competitive situation is called a competitive game if it has a following for properties
i) There are finite number (n) of competitors (Called Players).
ii) Each player has a list of finite number of possible activities.
iii) A play is it said to occur when each player chooses one of his activities.
iv) Every combination of activities determines outcome.
Strategy
A strategy for a given player is a set of rules that satisfy which of the available course of
action we should make at each play.
This strategy may of two kinds,
1) Pure Strategy
If the player select the same strategy each time, then it is referred to as pure strategy. In this
case each player knows exactly what the other player is going to do. The objective of the
players is to maximize games or minimize losses.
2) Mixed Strategy
When the player use a combination of strategies and each player always kept guessing as to
which course of action is to be selected by other player at a particular occasion then this is
known as mixed strategy. Thus there is a probabilistic situation and objective of the player
player is to maximize expected gains or minimize expected losses.
Payoff Matrix
suppose the player A has m strategies and the player B has n strategies then a payoff matrix
can be formed by adopting the following rules
1. Row designation for each matrix are strategies available to player A
2. Column designation for each matrix are strategies available to player B
3. Cell entries 𝑉𝑖𝑗 is the payment to player A in A’s payoff matrix when A chooses the activity
i and B chooses the activity j
4. With a ‘Zero – Sum’, Two person Game the cell entry in the player B’s payoff matrix will be
negative of the corresponding cell entry 𝑉𝑖𝑗 in the player A’s payoff matrix so that sum of
payoff matrices for player A and Player B is ultimately zaro.
Player A’s Payoff Matrix
Player B
1 2 . . . n
1 𝑉11 𝑉12 . . . 𝑉1𝑛
2 𝑉21 𝑉22 . . . 𝑉2𝑛
Player A . . . . . . .
. . . . . . .
. . . . . .
m 𝑉𝑚1 𝑉𝑚2 . . . 𝑉𝑚𝑛
Example -
Consider two person coin tossing game. Each player tossed an unbiased coin simultaneously.
Player B pays Rs. 9 to A if {H, H} occur and Rs. 5, if {T, T} occurs; otherwise player A pays Rs. 4
to B. This two person game is zero-sum game, since the winnings of one player are the
looses of the other. Each player has his choices from amongst two pure strategies H and T.
Clearly, the entries in B’s payoff matrix will be just the negative of the corresponding entries
in A’s payoff matrix so that the sum of payoff matrices for player A and B is ultimately a null
matrix.
We generally display the payoff matrix of that player who is indicated on the left side of the
matrix. For example, A's payoff matrix may be displayed as below.
Player B
H T
Player A H 9 -4
T -4 5
If the maximin value equals the minimax value, then the game is said to have a saddle
point, which is also called as equilibrium point, and the corresponding strategies are called
optimum strategies. The amount of payoff at an equilibrium point is known as the value of
the game.
To illustrate maximin-minimax principle, let us consider two-person zero-sum game with the
following 3 × 2 payoff metrix for player A
Player B
𝑩𝟏 𝑩𝟐 Row Minima
𝑨𝟏 9 2 2
Player A 𝑨𝟐 8 6 6 Maximin Value
𝑨𝟑 6 4 4
Column 9 6
Maxima
Minimax
value
Saddle Point
A Saddle point of a payoff matrix is a position of such an element in the payoff matrix which
is minimum in its row and maximum in its column.
Mathematically in payoff matrix {𝑉𝑖𝑗 } the position of element such that
Optimal Strategy
The course of action which maximizes the profit of a player or minimizes his loss is called an
Optimal Strategy
If the payoff matrix {𝑉𝑖𝑗 } has the saddle point (r , s) then the player (A and B) are said to have
𝑟 𝑡ℎ and 𝑠 𝑡ℎ optimal strategy respectively.
Co-ordinates are
𝑎22 − 𝑎21
𝑥1 = (𝑎
22 + 𝑎11 ) − (𝑎21 + 𝑎12 )
𝑎11 − 𝑎12
𝑥2 = (𝑎
22 + 𝑎11 ) − (𝑎21 + 𝑎12 )
𝑎22 − 𝑎12
𝑦1 = (𝑎
22 + 𝑎11 ) − (𝑎21 + 𝑎12 )
𝑎11 − 𝑎21
𝑦2 = (𝑎
22 + 𝑎11 ) − (𝑎21 + 𝑎12 )
2. Graphical method
The graphical method is used to solve the Games whose payoff matrix has
i) Two rows and n columns (2 x n)
ii) m rows and two columns (m x 2)
3. Dominance Property
Sometimes it is observed that one of the pure strategies of either player is always inferior to
at least one of the remaining ones. The superior strategies are said to dominate the inferior
ones. Clearly, a player would have no incentive to use inferior strategies which are
dominated by the superior ones. In such cases of dominance, we can reduce the size of the
payoff matrix by deleting those strategies which are dominated by the others. thus if each
element in one row, say kth of the Payoff matrix 𝑎𝑖𝑗 is greater than or equal to the
corresponding elements in some other row, Say rth, then the player A will never choose rth
strategy.
2. If all the elements of a column, say kth are less than or equal to the corresponding
element of any other column, Say rth then rth column is dominated by the kth column.
3. Dominated rows or column maybe deleted to reduce the size of the payoff matrix, as the
optimal strategies will remain unaffected
The modified dominance property
The dominance property is not always based on the superiority of pure strategies only. A
given strategy can also be said to be dominated if it is inferior to an average of two or more
other pure strategies.
Decision analysis
Decision theory may be defined as approach which results in the selection from a set of
alternative course of action, The course of action which is considered to be more satisfying
than others or best to meet the objective of the decision problem, as judged by the decision
maker.
Decision-making becomes essential for the management of an organisation or by an
individual when he felt dissatisfied with the existing decision or when alternative course of
actions are available. Decision theory plays an important role in helping the decision maker
to make better decision under uncertain future conditions.
In the present advancing computer age the decision makers of a business organization
cannot afford to make a decision by merely guessing as the wrong decision may lead a
disastrous consequences.
The overall technique that requires the basic knowledge of mathematics, statistics and
computer helps the decision maker to make effective decision in the best interest of the
organization.
Decision-making environment
Decision analysis is used to determine optimum strategies where a decision Maker is faced
with several decision alternatives. We may come across several decision making situations.
1. Decision under certainty
Whenever there exists only one outcome for a decision. We are dealing with this category.
Examples of this are linear programming, transportation, assignment and sequencing, etc.
2. Decision under uncertainty
This refer to situation where more than one outcome can result from any single decision.
Under conditions of uncertainty, only payoff are known and nothing is known about the
likelihood of each state of nature. There are several different criterion.
A. Criterion of Pessimism
Maximin Criterion
The maximin criteria is based upon the conservative approach to assume that the worst
possibility is going to happen. The decision Maker considers each strategy and locates the
minimum payoff for each and then selects that alternative which maximize the minimum
payoff.
This criterion consists of two steps
Step 1. Determine the minimum assured payoff for each alternative
Step 2. To choose that alternative, which corresponds to the maximum of above minimum
payoffs.
Minimax Criterion
When dealing with the costs, the maximum cost associated with each alternative is
considered and the alternative that minimizes this maximum cost is chosen. In this context
the criterion is the minimax criterion and maybe carried out in two steps.
Step 1 Determine the maximum possible costs for each alternative.
Step 2. Choose that alternative which corresponds to the minimum of the above costs.
B. Criterion of Optimism
Maximax criterion
The maximax criterion is based upon ‘extreme optimism’. The Decision maker selects that
particular strategy which corresponds to the maximum of the maximum payoff for each
strategy.
The maximax criteria thus consist of following steps.
D. Hurwicz Criterion
The Hurwicz criterion stipulates that decision maker’s view may fall somewhere between the
extreme pessimism of the maximum criterion and the extreme optimism of the maximum
criterion. The criterion provides the mechanism by which a balance between extreme
pessimism and extreme optimism is made by weighing them with certain degrees of
optimism and pessimism.
The basic steps of Hurwicz criterion may be summarised as below.
Step 1. Choose an appropriate degree of optimism (or pessimism) of a decision Maker. Let α
be his degree of optimism so that 1 - α be his degree of pessimism.
Step 2. Determine the maximum as well as minimum payoff for each alternative and obtain
the quantities
h = α x Maximum + (1 – α) x Minimum for each alternative.
Step 3. Select the strategy with minimum value of h.
E) Laplace Criterion
The Laplace criterion uses all the information by assigning equal probabilities to the possible
payoffs for each action and then selecting that alternative which corresponds to the
maximum expected payoff.
The basic steps of this criterion may be summarized as:
Step 1. Assign equal probabilities (1/n) to each payoff of a strategy (having n possible
payoffs).
Step 2. Determine the expected payoff value for each alternative.
Step 3. Select that alternative which corresponds to the maximum of the above expected
payoffs.