Classification - Data Mining
Classification - Data Mining
Let H be SOme
hypoheei) such tht Hhe dota uple
belong to a cpecike clas C.
4Pos
fo casifeation poblems,
Ce wan t to delemine PCu)
probabibtyy hat dce hypotheis HH holdr qiren tthe
hypotheit
-
obsesved data tuple X.
-POH/1) postesex(e0 posteori prbabi lity,of
H Condiioned on X.
Example'acig lncome
Custoymer aage
X $40,000
Suppose H i the hypolhesi1 that ouy Cutomer
hypothesi Ahat
wtll by3 a computer ,hen
P(HI) he po babi l: by that cwtomer X loill by
Co mputer gien by bng that oe ktoco Customer
age s fncome.
Con tra_ts, P) i pior probahi lity or pioni prob,
af H
Example:pls
Hhat is pro babil;ty that any given cuit
Co mputes, reqardles of aqe i income.
+Similary enplain,
postenio psor
estimated Foon
* PCH), P) i p IH) can be
He give n data.
Boyes' theoveon i wsehul tor calalating posteiov
pobabilthy
P(#/) PA)P4)
PCx)
Naive Bayeiao Cantieaton
Sollos
)Let D be tvoining set of tuplei a anociated clai
(abel.
O-dmesional atile
-each tupk vepeiented 3
an
to be maimi zed.
pob.is not Kao con,then
-T! clas piox
-all clases are cqualy Chely,
Hhe
ie, Pa): PCC) rCcm)
marimige P(x / c ) o h
othentse P*/c)rc)
probalil'he ybe extimated by
-Clan priov
P(C): |c,ol / i p b
Lino. ot taining tuple pt dan Gin D
9)Tt cwould be comre xpenive to Compute Pt)
toheA foY the with many
datosets many attibute,6
das
-to reduce this he nafve asumpHon
Condt ianal tndependente i made.Cie, there are nO
dependence relatonships among
among the attrbute:)
apectuely
lel late' cla labet attibute be buys- CompukeY
asociated clan lalel fox X:yes
Cbuy-com puler =ye)
has noB been diCvekged .& Hereore
eist ay Conhoow -valued attibute
ogfeCop foshut)Shaopespo):0
-a)pny