Probability in Data Science

The document discusses various statistical distributions including binomial, Poisson, and hypergeometric distributions. It defines key terms like probability distribution function, probability mass function, cumulative distribution function, mean, variance, and covariance. It explains that the binomial distribution models experiments with a fixed number of trials with two outcomes, the Poisson distribution models rare, independent events over an interval, and the hypergeometric distribution is used when sampling without replacement from a finite population. Examples are given of calculating probabilities and values using the binomial and Poisson distributions in Python.

Uploaded by

gopal_svsemails8998

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

196 views25 pages

Probability in Data Science

Uploaded by

gopal_svsemails8998

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

You are on page 1/ 25

Consultancies

Bi reporting tools hr.scandinaviantech.se

Power BI
Data Studio
Google Big Query
Empirical
Discrete
continus

Distribution Describe the shape of the data

PDF(continous) Probability Distribution Function /probability Density function

PMF(discrete) Values taken by X random variable and their associateed probabilities

CDF add the PDF to get CDF and denoted as F(X)

E(X)=Mean Expected value sum of X*P(X)

V(X)

If two random variables move in the same direction, then the covariance will be positive, if they move in the opposite dire
The covariance tells the sign but not the magnitude about how strongly the variables are positively or negatively related.
correlation coefficient provides such measure of how strongly the variables are related to each other.

for Two Random variables X,Y

covariance

correlation coeffiencent

m-Solope of a regresion equation

Binomial Distribution- Assumtions

•Experiment involves n identical trials
•Each trial has exactly two possible outcomes: success and failure
•Each trial is independent of the previous trials
• p is the probability of a success on any one trial
q = (1-p) is the probability of a failure on any one trial
•p and q are constant throughout the experiment
•X is the number of successes in the n trials
Poisson Distribution
•Describes discrete occurrences over a continuum or interval
•A discrete distribution
•Describes rare events
•Each occurrence is independent any other occurrences.
•The number of occurrences in each interval can vary from zero to infinity.
•The expected number of occurrences must hold constant throughout the experiment.

The Hypergeometric Distribution

•The binomial distribution is applicable when selecting from a finite population with replacement or from an infinite popu
•The hypergeometric distribution is applicable when selecting from a finite population without replacement.
if Z table is one sided ie st
ad
move in the opposite direction the covariance will be negative.
tively or negatively related. The
ated to each other.

ion equation
nt or from an infinite population without replacement.
replacement.
if Z table is one sided ie starting form 0 instead of -infinite then we need to
add 0.5 to value in the table
Import scipy 19 T
import numpy as np 25

from import scipy.stats import binom

from scipy.stats import poission

binom(2,20,0.06) 2 represent <=2

20 represents no.of samples

possion
to get zvalue give the area
sf
binom(k,n,p)=(19,25,0.65)

k<=2,20,0.06
p(x<=2) cummuilative binom
cdf(k,n,p)

lambda =3.2
X=5
survival function

ExamBase NDDC Postgraduate Scholarship Aptitude Test Questions Bank For Engineering
No ratings yet
ExamBase NDDC Postgraduate Scholarship Aptitude Test Questions Bank For Engineering
217 pages
Enterprise Supply Planning (ESP) : by Venkata Gopala
100% (1)
Enterprise Supply Planning (ESP) : by Venkata Gopala
33 pages
Artificial Intelligence/ Machine Learning in Nuclear Medicine and Hybrid Imaging
No ratings yet
Artificial Intelligence/ Machine Learning in Nuclear Medicine and Hybrid Imaging
216 pages
A2. Revision 2
No ratings yet
A2. Revision 2
2 pages
Thesis 3
100% (1)
Thesis 3
15 pages
Random Variables: Petter Mostad 2005.09.19
No ratings yet
Random Variables: Petter Mostad 2005.09.19
24 pages
Chapter 6
No ratings yet
Chapter 6
5 pages
Lab-2: Probability Distributions Name: Objective:To Compute Probability Density Function (PDF) and Cumulative Distribution Function (CDF) Outcomes
No ratings yet
Lab-2: Probability Distributions Name: Objective:To Compute Probability Density Function (PDF) and Cumulative Distribution Function (CDF) Outcomes
15 pages
1732725913353_STT201
No ratings yet
1732725913353_STT201
19 pages
Priyanshu Majumder - 34900321060-1
No ratings yet
Priyanshu Majumder - 34900321060-1
37 pages
Study Guide
No ratings yet
Study Guide
9 pages
STAT 342 Statistical Methods For Engineers
No ratings yet
STAT 342 Statistical Methods For Engineers
26 pages
Stats
No ratings yet
Stats
24 pages
Probability Notes
No ratings yet
Probability Notes
39 pages
MTH618 WK 3 Lect 1: Random Variables. Probability Distributions
No ratings yet
MTH618 WK 3 Lect 1: Random Variables. Probability Distributions
17 pages
Ex No-4
No ratings yet
Ex No-4
4 pages
1853_Random Variable & Distribution
No ratings yet
1853_Random Variable & Distribution
43 pages
Chapter 7 Eng
No ratings yet
Chapter 7 Eng
59 pages
R-6 Theory
No ratings yet
R-6 Theory
4 pages
Chapter 2
No ratings yet
Chapter 2
8 pages
Study Guide
No ratings yet
Study Guide
8 pages
TOPIC TWO. RANDOM VARIABLE AND PROBABILITY DISTRIBUTION pptx
No ratings yet
TOPIC TWO. RANDOM VARIABLE AND PROBABILITY DISTRIBUTION pptx
43 pages
Exam P Review Sheet
No ratings yet
Exam P Review Sheet
12 pages
(Lecture 4) Discrete Probability Distributions
No ratings yet
(Lecture 4) Discrete Probability Distributions
57 pages
Prob. Distri.
No ratings yet
Prob. Distri.
36 pages
Discrete Distribution
No ratings yet
Discrete Distribution
19 pages
4 - Probability Theory II
No ratings yet
4 - Probability Theory II
85 pages
Probability Distribution: Shreya Kanwar (16eemme023)
No ratings yet
Probability Distribution: Shreya Kanwar (16eemme023)
51 pages
Unit 4
No ratings yet
Unit 4
22 pages
5221 Basic Probability Distributions in R MCA MMS 20MCA2CC9
No ratings yet
5221 Basic Probability Distributions in R MCA MMS 20MCA2CC9
30 pages
jml5 1
No ratings yet
jml5 1
63 pages
Student_Notes_2.2
No ratings yet
Student_Notes_2.2
6 pages
Probability and Statistic Chapter3
No ratings yet
Probability and Statistic Chapter3
55 pages
Business Inferential Statistics Lessons
No ratings yet
Business Inferential Statistics Lessons
7 pages
Basic Statistics in Fluid Mechanics
No ratings yet
Basic Statistics in Fluid Mechanics
34 pages
Probability MCQ's
No ratings yet
Probability MCQ's
24 pages
statatics and probability chapter 3 and 4
No ratings yet
statatics and probability chapter 3 and 4
10 pages
Sta Statistical Theory
No ratings yet
Sta Statistical Theory
104 pages
Week 1
No ratings yet
Week 1
10 pages
Applied Statics - Merged
No ratings yet
Applied Statics - Merged
58 pages
lecture_note_3
No ratings yet
lecture_note_3
11 pages
FOW9 - SB - Note Chapter 6&7
No ratings yet
FOW9 - SB - Note Chapter 6&7
13 pages
Probability Distributions (2)
No ratings yet
Probability Distributions (2)
23 pages
STAE Lecture Notes - LU5
No ratings yet
STAE Lecture Notes - LU5
22 pages
Tài liệu 5
No ratings yet
Tài liệu 5
19 pages
Unit 3 - DISCRETE AND CONTINOUS PROBABILITY DISTRIBUTIONS PDF
No ratings yet
Unit 3 - DISCRETE AND CONTINOUS PROBABILITY DISTRIBUTIONS PDF
37 pages
Review 44
No ratings yet
Review 44
28 pages
Stats 1 - IITM BS Notes - Part 5
No ratings yet
Stats 1 - IITM BS Notes - Part 5
17 pages
Probability Distributions 2
No ratings yet
Probability Distributions 2
36 pages
Discrete RV and Binomial, Poisson, And Hypergeometric Distributions - Lecture Notes [Dr AKM Azad] (1)
No ratings yet
Discrete RV and Binomial, Poisson, And Hypergeometric Distributions - Lecture Notes [Dr AKM Azad] (1)
3 pages
Mstat Note7 Random Variable f23
No ratings yet
Mstat Note7 Random Variable f23
76 pages
Assignment 1: MSC Statistics
No ratings yet
Assignment 1: MSC Statistics
5 pages
Joint Distribution
No ratings yet
Joint Distribution
11 pages
Appendix A Probability and Statistics
No ratings yet
Appendix A Probability and Statistics
12 pages
Chapter3 220928 093636
No ratings yet
Chapter3 220928 093636
70 pages
Unit II (Part B) Random Variable
No ratings yet
Unit II (Part B) Random Variable
8 pages
Chapter 2
No ratings yet
Chapter 2
25 pages
Lec 01
No ratings yet
Lec 01
44 pages
Probability Distributions
No ratings yet
Probability Distributions
17 pages
Probdist Ref
No ratings yet
Probdist Ref
256 pages
Chapter 3
No ratings yet
Chapter 3
62 pages
Statistical Tools
No ratings yet
Statistical Tools
79 pages
Probability
No ratings yet
Probability
36 pages
Lesson
No ratings yet
Lesson
24 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Jagananna Ammavodi 2020-2021 MPPSPAPAYAPALLI (28183902103) Category of School: Primary Final Ineligible Students (List-1)
No ratings yet
Jagananna Ammavodi 2020-2021 MPPSPAPAYAPALLI (28183902103) Category of School: Primary Final Ineligible Students (List-1)
2 pages
Bdy Card More
No ratings yet
Bdy Card More
1 page
Data Science Introduction
100% (1)
Data Science Introduction
54 pages
02RCM February 2005
No ratings yet
02RCM February 2005
79 pages
IR Source Destination Time of Execution
No ratings yet
IR Source Destination Time of Execution
2 pages
Interpolated Filters Small
No ratings yet
Interpolated Filters Small
6 pages
Platform Installadmin en
No ratings yet
Platform Installadmin en
299 pages
26-Mar-11 26-Mar-11 Cash Deposit-Cash Deposit Self - 26-Mar-11 26-Mar-11 INTER CITY CHARGES - 38976288
No ratings yet
26-Mar-11 26-Mar-11 Cash Deposit-Cash Deposit Self - 26-Mar-11 26-Mar-11 INTER CITY CHARGES - 38976288
4 pages
Shri Saibaba Sansthan Trust, Shirdi: Acknowledgement Id:1500654406
No ratings yet
Shri Saibaba Sansthan Trust, Shirdi: Acknowledgement Id:1500654406
2 pages
Digital Circuits Microprocessor Bits Analog C Programing Datastructers Cmos Basics
No ratings yet
Digital Circuits Microprocessor Bits Analog C Programing Datastructers Cmos Basics
1 page
Chapter 3 in Stallings
No ratings yet
Chapter 3 in Stallings
65 pages
Huff Man
No ratings yet
Huff Man
4 pages
UNIT I 1. Explain NAND Gate Primitive
No ratings yet
UNIT I 1. Explain NAND Gate Primitive
2 pages
Class XII Date Sheet 2011
No ratings yet
Class XII Date Sheet 2011
5 pages
Arabia, 2018
No ratings yet
Arabia, 2018
10 pages
Grade 11 Fal Task 5 Qp-1
No ratings yet
Grade 11 Fal Task 5 Qp-1
16 pages
A Hybrid CNN-LSTM: A Deep Learning Approach For Consumer Sentiment Analysis Using Qualitative User-Generated Contents
No ratings yet
A Hybrid CNN-LSTM: A Deep Learning Approach For Consumer Sentiment Analysis Using Qualitative User-Generated Contents
15 pages
FTKF B
No ratings yet
FTKF B
2 pages
Table of Contents
No ratings yet
Table of Contents
14 pages
Caroline Mbindiyo of Amref Presentation On Mhealth at Tandaa Symposium On Technology For Social Good
No ratings yet
Caroline Mbindiyo of Amref Presentation On Mhealth at Tandaa Symposium On Technology For Social Good
16 pages
Solar Cell Fundamentals Lab Lecture 5 4 Point Probing Using Keithley Pro 4 6-2-11
No ratings yet
Solar Cell Fundamentals Lab Lecture 5 4 Point Probing Using Keithley Pro 4 6-2-11
11 pages
Gpower Tutorial - Unlocked
No ratings yet
Gpower Tutorial - Unlocked
43 pages
Productividad Economica.
No ratings yet
Productividad Economica.
8 pages
PALM Flat Scanner: For General Weld Inspection
No ratings yet
PALM Flat Scanner: For General Weld Inspection
4 pages
2022 年下二笔三笔实务真题
No ratings yet
2022 年下二笔三笔实务真题
5 pages
Latest Transcript 2024
No ratings yet
Latest Transcript 2024
2 pages
Installation - g404 - en - Ecomax Hobart Glass Washer (L Club Bar and D Club Bar)
No ratings yet
Installation - g404 - en - Ecomax Hobart Glass Washer (L Club Bar and D Club Bar)
1 page
Gabriela Nouzeilles - Violence and Photography
No ratings yet
Gabriela Nouzeilles - Violence and Photography
11 pages
Website_Design_Trust_and_Culture_An_Eigh
No ratings yet
Website_Design_Trust_and_Culture_An_Eigh
5 pages
Student's Handbook
100% (1)
Student's Handbook
15 pages
Solutions Short Notes - Learning Tales ND 7
No ratings yet
Solutions Short Notes - Learning Tales ND 7
2 pages
Factor of Safety
100% (1)
Factor of Safety
4 pages
Amo Grade 9
100% (1)
Amo Grade 9
17 pages
DrZakirNaik-FORMS OF DAWAH
100% (2)
DrZakirNaik-FORMS OF DAWAH
4 pages
Defense OSINT Strategy 2024-2028
No ratings yet
Defense OSINT Strategy 2024-2028
16 pages
E-Tech Lesson Guide - Unit 3-5
No ratings yet
E-Tech Lesson Guide - Unit 3-5
16 pages
PHD Openings in Data Science & Machine Learning & Advanced Analytics Spring 2020
No ratings yet
PHD Openings in Data Science & Machine Learning & Advanced Analytics Spring 2020
1 page
Complex Analysis: The Extended Complex Plane Proofs of Theorems
No ratings yet
Complex Analysis: The Extended Complex Plane Proofs of Theorems
15 pages
124
No ratings yet
124
20 pages