Random Variables

Random variables are crucial in statistics, representing values that cannot be predicted with certainty and can be either discrete or continuous. Discrete random variables have countable values and use a probability mass function (PMF) for their distribution, while continuous random variables can take any real value and utilize a probability density function (PDF). Understanding these concepts is essential for data scientists, as they are foundational to modeling uncertainty in data.

Uploaded by

Vishu Chahal

Random variables are one of the most important concepts in statistics. In this
blog post, we will discuss what they are, their different types, and how they are
related to probability distributions. We will also provide examples so that
you can better understand the concept. As a data scientist, you need a strong
understanding of random variables and how to work with them.
What is a random variable and what are some examples?
A random variable is a variable whose value is determined by the outcome of a
random experiment. The key difference between an ordinary variable and a
random variable is that the value of the random variable cannot be predicted
with certainty. Random variables can be either scalar or vector-valued. A
vector-valued random variable takes a vector of values, that is, several
numbers observed together, for each outcome. Some examples of random
variables include:
• X: Number of times a 6 occurs when a fair die is rolled 10 times: X can take
values from 0 to 10, with 10 being the least probable.
• X: Number of heads when a fair coin is flipped 10 times: X can take values
from 0 to 10, with 0 and 10 being the least probable.
• X: Number of students scoring more than 80 marks in a test
• X: The number of people coming to the shop between 11 AM and 12:00
PM: X can take the values 0, 1, 2, and so on, with varying probabilities.
• X: Number of cars passing a building between 9 and 10 AM
• X: Number of people traveling by flight on any particular day from an
airport
• X: Number of sales on any particular day
• X: Number of people visiting a website on any particular day
Random variables are denoted using uppercase letters. For example, X can be
used to denote a random variable. If we have two random variables, say X and
Y, then we can use the notation (X, Y) to represent them jointly.
P(X = x) represents the probability of the random variable X taking on a
particular value, x. For example, when X represents the number of heads in 10
coin flips, P(X = 5) represents the probability that the number of heads is
exactly 5.
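To make P(X = 5) concrete, the following is a minimal sketch that computes it exactly for the number of heads in 10 flips and then checks the result by simulation. It assumes a fair coin; the function names prob_heads and simulate_heads are made up for this illustration.

```python
import math
import random

def prob_heads(k, n=10, p=0.5):
    """Exact P(X = k) for the number of heads in n independent flips."""
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def simulate_heads(k=5, n=10, trials=50_000, seed=0):
    """Estimate P(X = k) by repeating the n-flip experiment many times."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        heads = sum(rng.random() < 0.5 for _ in range(n))  # assumes a fair coin
        if heads == k:
            hits += 1
    return hits / trials

exact = prob_heads(5)
estimate = simulate_heads()
print(f"exact     P(X=5) = {exact:.4f}")
print(f"simulated P(X=5) = {estimate:.4f}")
```

The exact value is 252/1024, or about 0.246, and the simulated estimate should land close to it.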
How are random variables related to the probability distribution?
As mentioned above, random variables are variables whose values can't be
predicted with certainty. Thus, a random variable must be associated with
a probability distribution, which specifies the probability of the random
variable taking each of its possible values. For example, when X represents the
number of times a 6 occurs when a die is rolled 8 times, the probability
distribution of this random variable consists of the values P(X=0), P(X=1),
P(X=2), P(X=3), P(X=4), P(X=5), P(X=6), P(X=7), and P(X=8).
The probability distribution of a random variable is a function that maps values
that the random variable can take to their corresponding probabilities. In other
words, it tells us how likely it is for the random variable to take on each value.
One of the most widely used probability distributions is the normal
distribution, also called the Gaussian distribution. Other common types
include uniform distributions, binomial distributions, and Poisson distributions.
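The dice example above can be worked out exactly, because the number of sixes in a fixed number of rolls follows a binomial distribution. The sketch below assumes a fair die and independent rolls, and lists the probability of every value X can take, including 0.

```python
import math

n, p = 8, 1 / 6  # 8 rolls; probability of a six on each roll (fair die assumed)

# P(X = k) for every value k that X can take, from 0 sixes up to 8 sixes
pmf = {k: math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)}

for k, prob in pmf.items():
    print(f"P(X={k}) = {prob:.6f}")

# The probabilities of all possible values must add up to 1
total = sum(pmf.values())
print(f"total = {total:.6f}")
```

The most likely value here is X = 1, and the probabilities fall off quickly as the count of sixes grows.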
Types of random variables & probability distributions
Random variables can be of two different types:
• Discrete random variables
• Continuous random variables
Discrete random variables: These random variables can take on only a
countable number of values, whether finite or countably infinite. Some
examples of discrete random variables are:
• The number of students in a classroom
• The number of cars sold by a company in a day
• The number of people visiting a website in an hour
The probability distribution of a discrete random variable is described by
what is called the probability mass function (PMF). The PMF is a function
that assigns a probability to each discrete outcome. The idea extends to
several discrete random variables considered together: the function that
assigns a probability to each combination of their values is called a joint
probability distribution. P(X=x, Y=y) denotes the probability that X is equal
to x and Y is equal to y simultaneously.
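As a small illustration of a joint distribution, the sketch below builds P(X=x, Y=y) for a made-up experiment: two fair coins are flipped, X is 1 if the first coin shows heads (else 0), and Y is the total number of heads. The experiment and variable names are assumptions chosen for this example only.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Enumerate the four equally likely outcomes of two fair coin flips (1 = heads)
joint = defaultdict(Fraction)
for first, second in product([0, 1], repeat=2):
    x = first            # X: whether the first coin shows heads
    y = first + second   # Y: total number of heads
    joint[(x, y)] += Fraction(1, 4)

# Summing the joint PMF over the other variable gives each marginal PMF
marginal_x = defaultdict(Fraction)
marginal_y = defaultdict(Fraction)
for (x, y), prob in joint.items():
    marginal_x[x] += prob
    marginal_y[y] += prob

for (x, y), prob in sorted(joint.items()):
    print(f"P(X={x}, Y={y}) = {prob}")
print("P(Y=1) =", marginal_y[1])
```

Exact fractions are used so that the probabilities visibly sum to 1 without floating-point rounding.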
Continuous random variables: These random variables can take on any real
value within an interval. Some examples of continuous random variables are:
• The time taken to complete a task
• The height of a person
• The weight of a person
• The income of a person
The probability distribution of a continuous random variable is described by
what is called the probability density function (PDF). The PDF does not
assign probabilities to individual outcomes. Instead, it gives the density of
probability around each value. The probability of a continuous random
variable taking on any particular value is 0, but the probability of it taking
values in a range can be non-zero. To calculate that probability, we need to
integrate the PDF over the range.
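Integrating a PDF over a range can be sketched numerically. The example below assumes a standard normal distribution (mean 0, standard deviation 1) purely for illustration and uses a simple midpoint rule; the helper names normal_pdf and integrate are not from any library.

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density of a normal distribution with mean mu and std. deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))

def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

# P(-1 <= X <= 1): area under the PDF between -1 and 1
prob_range = integrate(normal_pdf, -1.0, 1.0)

# P(X = 1): a range of zero width has zero area, so the probability is 0
prob_point = integrate(normal_pdf, 1.0, 1.0)

print(f"P(-1 <= X <= 1) = {prob_range:.4f}")
print(f"P(X = 1)        = {prob_point}")
```

The first result is roughly 0.68, the familiar share of a normal distribution within one standard deviation of the mean, while the single-point probability is exactly 0.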
How to work with random variables in Python
In Python, the random variable having integer values can be generated using
the randint() function in the random module. This function takes two
parameters: the lower limit & upper limit.
For example, if we want to generate a random variable that can take on integer
values between 0 and 100, we will use the following code:
1import numpy as np
2X = np.random.randint(0, 100)
3print(X)
We can also generate an array of floats using the rand() function in the NumPy
library. The rand() function takes the dimensions of the array as arguments
and returns floats drawn uniformly from the interval [0, 1). For example, if we
want to generate an array of floats with shape (10, ), we will use the
following code:
import numpy as np
# 10 floats drawn uniformly from [0, 1)
X = np.random.rand(10)
print(X)
To define the probability distribution of a discrete random variable, we can
use the scipy.stats library in Python. The scipy.stats library contains a number
of probability distributions as well as functions to calculate different statistical
properties such as mean, variance, etc.
For example, let's say we have a random variable X that can take five different
values with different probabilities. We can define the distribution and print
its PMF using the following code:
# Define a discrete random variable and print its PMF
from scipy.stats import rv_discrete
import numpy as np

x = np.arange(5)                   # possible values: 0, 1, 2, 3, 4
P_x = [0.1, 0.4, 0.3, 0.1, 0.1]    # their probabilities, summing to 1
X = rv_discrete(name='X', values=(x, P_x))
print(X.pmf(x))
The above probability distribution can be plotted using the following code:
# Plot the probability distribution of the discrete random variable
# (reuses x and X from the previous example)
import matplotlib.pyplot as plt

fig, ax = plt.subplots(1, 1)
ax.plot(x, X.pmf(x), 'ro', ms=12, mec='r')     # red marker at each probability
ax.vlines(x, 0, X.pmf(x), colors='r', lw=4)    # vertical line from 0 to the marker
plt.show()
The resulting plot shows a red marker for each value of x, sitting on top of a
vertical line whose height is the probability P(X = x).
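The same five values and probabilities also determine the statistical properties mentioned earlier, such as the mean and variance. The sketch below computes them directly from their definitions using only the standard library, and then draws samples to check the mean empirically. The sample size and seed are arbitrary choices for this illustration.

```python
import random

x = [0, 1, 2, 3, 4]
P_x = [0.1, 0.4, 0.3, 0.1, 0.1]

# Mean: each value weighted by its probability
mean = sum(xi * pi for xi, pi in zip(x, P_x))

# Variance: probability-weighted squared distance from the mean
variance = sum((xi - mean) ** 2 * pi for xi, pi in zip(x, P_x))

# Draw samples according to the PMF and compare the sample mean
rng = random.Random(0)  # fixed seed so the run is repeatable
samples = rng.choices(x, weights=P_x, k=50_000)
sample_mean = sum(samples) / len(samples)

print(f"mean        = {mean:.2f}")
print(f"variance    = {variance:.2f}")
print(f"sample mean = {sample_mean:.2f}")
```

The mean works out to 1.7 and the variance to 1.21. An rv_discrete object from scipy.stats exposes the same quantities through its mean() and var() methods.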
Conclusion
In conclusion, random variables are an important concept in statistics that
allows us to model events that have uncertainty. There are two main types of
random variables: discrete and continuous. Discrete random variables can take
on only a countable number of values, while continuous random variables can
take on any real value within an interval. The probability distribution of a
random variable specifies how likely the variable is to take each of its
possible values. For discrete random variables, the function that assigns a
probability to each individual outcome is called the probability mass
function (PMF). For continuous random variables, the function that gives the
density of probability at each value is called the probability density
function (PDF), and probabilities are obtained by integrating it over a range.
