Independent and identically distributed random variables
In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usually abbreviated as i.i.d., iid, or IID. IID was first defined in statistics and finds application in different fields such as data mining and signal processing.
Contents
Introduction
Definition
Definition for two random variables
Definition for more than two random variables
Definition for independence
Examples
Example 1
Example 2
Example 3
Example 4
Generalizations
Exchangeable random variables
Lévy process
In machine learning
Why assume the data in machine learning are independent and identically distributed?
See also
References
Further reading
Introduction
In statistics, we commonly deal with random samples. A random sample can be thought of as a set of objects that are chosen randomly. Or, more formally, it is "a sequence of independent, identically distributed (IID) random variables".

In other words, the terms random sample and IID are basically one and the same. In statistics, we usually say "random sample", but in probability it is more common to say "IID".
Identically distributed means that there are no overall trends: the distribution doesn't fluctuate and all items in the sample are taken from the same probability distribution.

Independent means that the sample items are all independent events. In other words, they aren't connected to each other in any way; knowledge of the value of one variable gives no information about the value of the other and vice versa.
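As a minimal Python illustration of these two properties (the choice of a standard normal distribution is an assumption of the example, not part of the definition):

```python
import numpy as np

rng = np.random.default_rng(0)

# 1000 i.i.d. draws: every draw comes from the same fixed distribution
# (identically distributed), and no draw depends on any other (independent).
sample = rng.normal(loc=0.0, scale=1.0, size=1000)
```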
Application
Independent and identically distributed random variables are often used as an assumption, which tends to simplify the underlying mathematics. In practical applications of statistical modeling, however, the assumption may or may not be realistic.
The i.i.d. assumption is also used in the central limit theorem, which states that the probability distribution of the sum (or average) of i.i.d. variables with finite variance approaches a normal distribution.
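A quick empirical sketch of this statement, assuming i.i.d. Uniform(0, 1) summands (an arbitrary finite-variance choice for the example):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50

# Averages of n i.i.d. Uniform(0, 1) variables, repeated 100,000 times.
averages = rng.uniform(0.0, 1.0, size=(100_000, n)).mean(axis=1)

# By the CLT these averages are approximately Normal(0.5, 1 / (12 * n)).
print(averages.mean(), averages.var(), 1 / (12 * n))
```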
Often the i.i.d. assumption arises in the context of sequences of random variables. Then "independent and identically distributed" implies that an element in the sequence is independent of the random variables that came before it. In this way, an i.i.d. sequence is different from a Markov sequence, where the probability distribution for the nth random variable is a function of the previous random variable in the sequence (for a first-order Markov sequence). An i.i.d. sequence does not imply the probabilities for all elements of the sample space or event space must be the same. For example, repeated throws of loaded dice will produce a sequence that is i.i.d., despite the outcomes being biased.
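To make the contrast concrete, the sketch below draws a loaded-die sequence (i.i.d. but biased) next to a first-order Markov sequence; the transition rule is chosen purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
faces = np.arange(1, 7)

# i.i.d. but biased: a loaded die, the same unfair distribution on every throw.
loaded = np.array([0.05, 0.05, 0.05, 0.05, 0.05, 0.75])
iid_seq = rng.choice(faces, size=20, p=loaded)

# First-order Markov: the distribution of each throw depends on the previous one.
def markov_step(prev: int) -> int:
    probs = np.full(6, 0.04)
    probs[prev - 1] = 0.80  # strong tendency to repeat the last face
    return int(rng.choice(faces, p=probs))

markov_seq = [int(rng.choice(faces))]
for _ in range(19):
    markov_seq.append(markov_step(markov_seq[-1]))
```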
Definition
Definition for two random variables
Suppose that the random variables $X$ and $Y$ are defined to assume values in $I \subseteq \mathbb{R}$. Let $F_X(x) = \operatorname{P}(X \leq x)$ and $F_Y(y) = \operatorname{P}(Y \leq y)$ be the cumulative distribution functions of $X$ and $Y$, respectively, and denote their joint cumulative distribution function by $F_{X,Y}(x, y) = \operatorname{P}(X \leq x \land Y \leq y)$.

Two random variables $X$ and $Y$ are identically distributed if and only if $F_X(x) = F_Y(x)$ for all $x \in I$.

Two random variables $X$ and $Y$ are independent if and only if $F_{X,Y}(x, y) = F_X(x) \cdot F_Y(y)$ for all $x, y \in I$.

Two random variables $X$ and $Y$ are i.i.d. if they are independent and identically distributed, i.e. if and only if both of the above conditions hold.

Definition for more than two random variables

The definition extends naturally to more than two random variables: $n$ random variables $X_1, \ldots, X_n$ are i.i.d. if they are independent and identically distributed, i.e. if and only if

$F_{X_1}(x) = F_{X_k}(x)$ for all $k \in \{1, \ldots, n\}$ and all $x \in I$, and

$F_{X_1, \ldots, X_n}(x_1, \ldots, x_n) = F_{X_1}(x_1) \cdot \ldots \cdot F_{X_n}(x_n)$ for all $x_1, \ldots, x_n \in I$,

where $F_{X_1, \ldots, X_n}(x_1, \ldots, x_n) = \operatorname{P}(X_1 \leq x_1 \land \ldots \land X_n \leq x_n)$ denotes the joint cumulative distribution function of $X_1, \ldots, X_n$.

Definition for independence

In probability theory, two events $A$ and $B$ are called independent if and only if $\operatorname{P}(AB) = \operatorname{P}(A)\operatorname{P}(B)$, where $\operatorname{P}(AB)$ is short for $\operatorname{P}(A \cap B)$.

Suppose there are two events of the experiment, $A$ and $B$. If $\operatorname{P}(A) > 0$, there is the conditional probability $\operatorname{P}(B \mid A)$. Generally, the occurrence of $A$ has an effect on the probability of $B$; this is called conditional probability, and only when the occurrence of $A$ has no effect on the occurrence of $B$ is $\operatorname{P}(B \mid A) = \operatorname{P}(B)$.

Note: if $\operatorname{P}(A) > 0$ and $\operatorname{P}(B) > 0$, then $A$ and $B$ being mutually independent cannot hold at the same time as their being mutually incompatible; that is, independent events must be compatible, and mutually exclusive events must be related.

Suppose $A$, $B$ and $C$ are three events. If $\operatorname{P}(AB) = \operatorname{P}(A)\operatorname{P}(B)$, $\operatorname{P}(BC) = \operatorname{P}(B)\operatorname{P}(C)$, $\operatorname{P}(AC) = \operatorname{P}(A)\operatorname{P}(C)$ and $\operatorname{P}(ABC) = \operatorname{P}(A)\operatorname{P}(B)\operatorname{P}(C)$ are all satisfied, then the events $A$, $B$ and $C$ are independent of each other.

A more general definition: there are $n$ events $A_1, A_2, \ldots, A_n$. If the probabilities of the product events for any $2, 3, \ldots, n$ of these events equal the product of the probabilities of the individual events, then the events $A_1, A_2, \ldots, A_n$ are independent of each other.
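The three-event case shows why all four conditions are needed: pairwise independence alone does not imply $\operatorname{P}(ABC) = \operatorname{P}(A)\operatorname{P}(B)\operatorname{P}(C)$. Bernstein's classic four-outcome example, reproduced here as a small Python check (the events are the standard textbook choice, not taken from this article), makes the gap explicit:

```python
# Bernstein's example: four equally likely outcomes {1, 2, 3, 4}.
outcomes = {1, 2, 3, 4}
A, B, C = {1, 4}, {2, 4}, {3, 4}

def p(event: set) -> float:
    """Probability of an event under the uniform distribution on `outcomes`."""
    return len(event) / len(outcomes)

# Pairwise independence holds: each pairwise intersection is {4} with P = 1/4.
assert p(A & B) == p(A) * p(B)
assert p(B & C) == p(B) * p(C)
assert p(A & C) == p(A) * p(C)

# Mutual independence fails: P(ABC) = 1/4 but P(A)P(B)P(C) = 1/8.
assert p(A & B & C) != p(A) * p(B) * p(C)
```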
Examples
Example 1
A sequence of outcomes of spins of a fair or unfair roulette wheel is i.i.d. One implication of this is that if the roulette ball lands on "red", for example, 20 times in a row, the next spin is no more or less likely to be "black" than on any other spin (see the gambler's fallacy).

A sequence of fair or loaded dice rolls is i.i.d.

A sequence of fair or unfair coin flips is i.i.d.

In signal processing and image processing, the notion of transformation to i.i.d. implies two specifications, the "i.d." part and the "i." part:

(i.d.) the signal level must be balanced on the time axis;

(i.) the signal spectrum must be flattened, i.e. transformed by filtering (such as deconvolution) to a white noise signal, i.e. a signal where all frequencies are equally present, as sketched below.
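A minimal sketch of the spectral-flattening ("i.") step, assuming a simple FFT-based whitening filter; the method and names here are illustrative, not a prescribed algorithm:

```python
import numpy as np

def whiten(signal: np.ndarray, eps: float = 1e-12) -> np.ndarray:
    """Flatten the spectrum: keep each frequency's phase, set magnitude to 1."""
    spectrum = np.fft.rfft(signal)
    flat = spectrum / (np.abs(spectrum) + eps)
    return np.fft.irfft(flat, n=len(signal))

rng = np.random.default_rng(0)
# A "colored" signal: white noise smoothed by a moving average.
colored = np.convolve(rng.normal(size=1024), np.ones(8) / 8, mode="same")
white = whiten(colored)  # approximately flat spectrum, closer to i.i.d. samples
```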
Example 2
Toss a coin 10 times and record how many times the coin lands on heads.

1. Independent – each outcome of landing will not affect the other outcomes, which means the 10 results are independent from each other.
2. Identically distributed – the coin is a homogeneous material, so each time the probability for heads is 0.5, which means the probability is identical for each toss.
Example 3
Roll a die 10 times and record how many times the result is 1.

1. Independent – each outcome of the die will not affect the next one, which means the 10 results are independent from each other.
2. Identically distributed – the die is a homogeneous material, so each time the probability for the number 1 is 1/6, which means the probability is identical for each roll.
Example 4
Choose a card from a standard deck of cards containing 52 cards, then place the card back in the deck. Repeat this 52 times. Record the number of kings that appear.

1. Independent – each outcome of the card will not affect the next one, which means the 52 results are independent from each other.
2. Identically distributed – after drawing one card and replacing it, each time the probability for a king is 4/52, which means the probability is identical for each draw.
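Since each of Examples 2 to 4 is a fixed number of i.i.d. Bernoulli trials, the recorded counts are binomially distributed; a small simulation sketch under that reading:

```python
import numpy as np

rng = np.random.default_rng(42)

heads = rng.binomial(n=10, p=0.5)     # Example 2: heads in 10 coin tosses
ones = rng.binomial(n=10, p=1 / 6)    # Example 3: ones in 10 die rolls
kings = rng.binomial(n=52, p=4 / 52)  # Example 4: kings in 52 draws with replacement
print(heads, ones, kings)
```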
Generalizations
Many results that were first proven under the assumption that the random variables are i.i.d. have been shown to be true even under a weaker distributional assumption.
Exchangeable random variables
The most general notion which shares the main properties of i.i.d. variables are exchangeable random variables, introduced by Bruno de Finetti. Exchangeability means that while variables may not be independent, future ones behave like past ones; formally, any value of a finite sequence is as likely as any permutation of those values, i.e. the joint probability distribution is invariant under the symmetric group.

This provides a useful generalization; for example, sampling without replacement is not independent, but is exchangeable.
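A small check of this claim for an urn model (the urn contents below are an arbitrary illustration): two draws without replacement are dependent, yet the joint distribution is unchanged when the two positions are swapped.

```python
from fractions import Fraction

# Urn with 3 red and 2 blue balls; draw twice without replacement.
counts = {"red": 3, "blue": 2}
total = sum(counts.values())

def joint(first: str, second: str) -> Fraction:
    """P(X1 = first, X2 = second) for two draws without replacement."""
    left = counts[second] - (1 if first == second else 0)
    return Fraction(counts[first], total) * Fraction(left, total - 1)

# Not independent, but exchangeable: swapping the positions changes nothing.
assert joint("red", "blue") == joint("blue", "red")
assert joint("red", "red") == Fraction(3, 5) * Fraction(2, 4)
```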
Lévy process
In stochastic calculus, i.i.d. variables are thought of as a discrete time Lévy process: each variable gives how much one changes from one time to another. For example, a sequence of Bernoulli trials is interpreted as the Bernoulli process. One may generalize this to include continuous time Lévy processes, and many Lévy processes can be seen as limits of i.i.d. variables; for instance, the Wiener process is the limit of the Bernoulli process.
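A minimal sketch of this discrete-time picture, assuming the standard square-root-of-n scaling for the random-walk approximation of the Wiener process:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# i.i.d. +1/-1 increments: the running sum is a discrete-time Levy process
# (a simple random walk built from Bernoulli trials).
steps = rng.choice(np.array([-1, 1]), size=n)
walk = np.cumsum(steps)

# Scaled walk: walk / sqrt(n) over n steps approximates a Wiener process on [0, 1].
wiener_approx = walk / np.sqrt(n)
```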
In machine learning
Why assume the data in machine learning are independent and identically distributed?
Machine learning uses currently acquired massive quantities of data to deliver faster, more accurate results. Therefore, we need to use historical data with overall representativeness. If the data obtained are not representative of the overall situation, then the rules will be summarized badly or wrongly.

Through the i.i.d. hypothesis, the number of individual cases in the training sample can be greatly reduced.
This assumption makes maximization very easy to calculate mathematically. Observing the assumption of independent and identical distribution simplifies the calculation of the likelihood function in optimization problems. Because of the assumption of independence, the likelihood function can be written like this:
$l(\theta) = P(x_1, x_2, x_3, \ldots, x_n \mid \theta) = P(x_1 \mid \theta) P(x_2 \mid \theta) P(x_3 \mid \theta) \cdots P(x_n \mid \theta)$

In order to maximize the probability of the observed event, take the log function and maximize the parameter $\theta$; that is to say, compute

$\operatorname{argmax}_{\theta} \log(l(\theta))$

where

$\log(l(\theta)) = \log(P(x_1 \mid \theta)) + \log(P(x_2 \mid \theta)) + \log(P(x_3 \mid \theta)) + \ldots + \log(P(x_n \mid \theta))$
The computer is very efficient at calculating multiple additions, but it is not as efficient at calculating multiplications; this simplification is the core reason for the increase in computational efficiency. The log transformation also, in the process of maximizing, turns many exponential functions into linear functions.
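A minimal sketch of this sum-of-logs computation, assuming i.i.d. Gaussian observations with unknown mean (the model and the grid search are illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=3.0, scale=1.0, size=500)  # i.i.d. observations x_1..x_n

def log_likelihood(theta: float) -> float:
    # Independence turns the product of densities into a sum of logs:
    # log l(theta) = sum_i log P(x_i | theta), here a unit-variance Gaussian.
    return float(np.sum(-0.5 * np.log(2 * np.pi) - 0.5 * (data - theta) ** 2))

# argmax_theta log l(theta), approximated over a grid of candidates.
grid = np.linspace(0.0, 6.0, 601)
theta_hat = grid[np.argmax([log_likelihood(t) for t in grid])]
print(theta_hat)  # close to data.mean(), the Gaussian MLE for the mean
```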
This hypothesis makes it easy to use the central limit theorem in practical applications, for two reasons.

1. Even if the sample comes from a more complex non-Gaussian distribution, it can also approximate well, because it can be deduced from the central limit theorem to the Gaussian distribution: for a large number of observable samples, "the sum of many random variables will have an approximately normal distribution".

2. The second reason is that the accuracy of the model depends on the simplicity and representative power of the model unit, as well as the data quality. The simplicity of the unit makes it easy to interpret and scale, and the representative power and scale-out of the unit improve the model accuracy. As in a deep neural network, each neuron is very simple but has strong representative power, representing more complex features layer by layer to improve model accuracy.
See also
+ De Finetti's theorem
+ Pairwise independent variables
+ Central limit theorem
References
1. Clauset, Aaron (2011). "A brief primer on probability distributions" (PDF). Santa Fe Institute. http://tuvalu.santafe.edu/~aaronc/courses/7000/csci7000-001_2011_L0.pdf
2. Stephanie (2016-05-11). "IID Statistics: Independent and Identically Distributed Definition and Examples". Statistics How To. Retrieved 2021-12-09. https://www.statisticshowto.com/iid-statistics/
3. Hampel, Frank (1998). "Is statistics too difficult?". Canadian Journal of Statistics. 26 (3): 497–513. doi:10.2307/3315772. hdl:20.500.11850/145503. JSTOR 3315772. S2CID 53717661.
4. Blum, J. R.; Chernoff, H.; Rosenblatt, M.; Teicher, H. (1958).