0% found this document useful (0 votes)

74 views15 pages

Basic - Statsitics - Level 1

The document contains a table with 12 entries listing various data types as either discrete or continuous. It also poses three questions: 1) Identify the data types in a provided list 2) Identify the level of measurement for various variables as nominal, ordinal, interval, or ratio 3) Calculate probabilities for events involving dice rolls, coin tosses, and balls drawn from a bag.

Uploaded by

Aqibansari Shaik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views15 pages

Basic - Statsitics - Level 1

Uploaded by

Aqibansari Shaik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Activity Data Type

Number of beatings from Wife Discrete

Results of rolling a dice Discrete
Weight of a person Continuous
Weight of Gold Continuous
Distance between two places Continuous
Length of a leaf Continuous
Dog's weight Continuous
Blue Color Discrete
Number of kids Discrete
Number of tickets in Indian railways Discrete
Number of times married Discrete
Gender (Male or Female) Discrete
Q1) Identify the Data type for the Following:

Q2) Identify the Data types, which were among the following
Nominal, Ordinal, Interval, Ratio.
Data Data Type
Gender Nominal
High School Class Ranking Ordinal
Celsius Temperature Interval
Weight Ratio
Hair Color Nominal
Socioeconomic Status Ordinal
Fahrenheit Temperature Interval
Height Ratio
Type of living accommodation Ordinal
Level of Agreement Ordinal
IQ (Intelligence Scale) Ratio
Sales Figures Ratio
Blood Group Nominal
Time Of Day Ordinal
Time on a Clock with Hands Interval
Number of Children Nominal
Religious Preference Nominal
Barometer Pressure Interval
SAT Scores Interval
Years of Education Ordinal
Q3) Three Coins are tossed, find the probability that two heads and one tail are
obtained?
Ans:
P (Two heads and one tail) = N (Event (Two heads and one tail)) / N (Event (Three
coins tossed))
= 3/8 = 0.375 = 37.5%

Q4) Two Dice are rolled, find the probability where sum is
a) Equal to 1
b) Less than or equal to 4
c) Divisible by 2 and 3
Ans:
Number of possible outcomes for the above event is
N (Event (Two dice rolled)) = 6^2 = 36
a.) P (sum is Equal to 1) = ‘0’ zero null nada none.
b.) P (Sum is less than or equal to 4) = N (Event (Sum is less than or equal to
4)) / N (Event (Two dice rolled))
= 6 / 36 = 1/6 = 0.166 = 16.66%
c.) P (Sum is divisible by 2 and 3) = N (Event (Sum is divisible by 2 and 3)) / N
(Event (Two dice rolled))
= 6 / 36 = 1/6 = 0.16 = 16.66%
Q5) A bag contains 2 red, 3 green and 2 blue balls. Two balls are drawn at random.
What is the probability that none of the balls drawn is blue?
Ans: Total number of balls =7 balls
N (Event (2 balls are drawn randomly from bag) = 7! / 2! * 5!
= (7*6*5*4*3*2*1) /
(2*1) * (5*4*3*2*1)
N (Event (2 balls are drawn randomly from bag) = (7*6)/ (2*1) = 21
If none of them drawn 2 balls are blue = 7 – 2 = 5
N (Event (None of the balls drawn is blue) = 5! / 2! * 3! = (5*4) / (2*1)
= 10
P (None of the balls drawn is blue) = N (Event (None of the balls drawn is blue) /
N (Event (2 balls are drawn randomly from
bag)
= 10 / 21
Q6) Calculate the Expected number of candies for a randomly selected child
Below are the probabilities of count of candies for children (ignoring the nature of
the child-Generalized view)
CHILD Candies count Probability
A 1 0.015
B 4 0.20
C 3 0.65
D 5 0.005
E 6 0.01
F 2 0.120
Child A – probability of having 1 candy = 0.015.
Child B – probability of having 4 candies = 0.20
Ans:
0.015+0.8+1.95+0.025+0.06+0.24 = 3.09
Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range &
comment about the values / draw inferences, for the given dataset
- For Points, Score, Weigh>
Find Mean, Median, Mode, Variance, Standard Deviation, and Range and
also Comment about the values/ Draw some inferences.
Use Q7.csv file
Ans:
Mean for Points = 3.59, Score = 3.21 and Weigh = 17.84
Median for Points = 3.69, Score = 3.32 and Weigh = 17.71
Mode for Points = 3.07, Score = 3.44 and Weigh = 17.02
Variance for Points = 0.28, Score = 0.95, Weigh = 3.19
Standard Deviation for Points = 0.53, Score = 0.97, Weigh = 1.78
Range [Min-Max] for Points [3.59 – 4.93], Score [3.21 – 5.42] and Weigh [17.84 –
22.9]
Draw Inferences
Q8) Calculate Expected Value for the problem below
a) The weights (X) of patients at a clinic (in pounds), are
108, 110, 123, 134, 135, 145, 167, 187, 199
Assume one of the patients is chosen at random. What is the Expected Value
of the Weight of that patient?
Ans: Expected value = Sum (X * Probability of X)
= (1/9)(108)+ (1/9)(110)+ (1/9)(123)+ (1/9)(134)+ (1/9)(145)+ (1/9)(167)+ (1/9)
(187)+ (1/9)(199)
= 145.33
Q9) Calculate Skewness, Kurtosis & draw inferences on the following data
Cars speed and distance
Use Q9_a.csv
Ans:
q9a = pd.read_csv("C:/Users/Moin Dalvi/Documents/EXcelR Study and
Assignment Material/Data Science Assignments/Basic Statistics 1/Q9_a.csv",
index_col = 'Index')
print('For Cars Speed', "Skewness value=", np.round(q9a.speed.skew(),2), 'and' ,
'Kurtosis value=', np.round(q9a.dist.skew(),2))
For Cars Speed Skewness value= -0.12 and Kurtosis value= 0.81

print('Skewness value =', np.round(q9a.dist.skew(),2),'and', 'Kurtosis value =',

np.round(q9a.dist.kurt(),2), 'for Cars Distance')
Skewness value = 0.81 and Kurtosis value = 0.41 for Cars Distance

SP and Weight (WT)

Use Q9_b.csv
Ans:
q9b =pd.read_csv("C:/Users/Moin Dalvi/Documents/EXcelR Study and
Assignment Material/Data Science Assignments/Basic Statistics 1/Q9_b.csv")

print('For SP Skewness =', np.round(q9b.SP.skew(),2), 'kurtosis =',

np.round(q9b.WT.kurt(),2))
For SP Skewness = 1.61 kurtosis = 0.95

print('For WT Skewness =', np.round(q9b.SP.skew(),2), 'Kurtosis =',

np.round(q9b.WT.kurt(),2))
For WT Skewness = 1.61 Kurtosis = 0.95

Q10) Draw inferences about the following boxplot & histogram

Ans: The histograms peak has right skew and tail is on right. Mean > Median. We
have outliers on the higher side.

Ans: The boxplot has outliers on the maximum side.

Q11) Suppose we want to estimate the average weight of an adult male in

Mexico. We draw a random sample of 2,000 men from a population of 3,000,000
men and weigh them. We find that the average person in our sample weighs 200
pounds, and the standard deviation of the sample is 30 pounds. Calculate
94%,98%,96% confidence interval?
Ans:
conf_94 =stats.t.interval(alpha = 0.94, df=1999, loc=200, scale=30/np.sqrt(2000))
print(np.round(conf_94,0))
print(conf_94)
For 94% confidence interval Range is [ 198.73 – 201.26]
For 98% confidence interval range is [198.43 – 201.56]
For 96% confidence interval range is [198.62 – 201.37]

Q12) Below are the scores obtained by a student in tests

34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
1) Find mean, median, variance, standard deviation.
Ans: Mean =41, Median =40.5, Variance =25.52 and Standard Deviation =5.05
2) What can we say about the student marks?
Ans: we don’t have outliers and the data is slightly skewed towards right because
mean is greater than median.
Q13) What is the nature of skewness when mean and median of data are equal?
Ans: No skewness is present we have a perfect symmetrical distribution
Q14) What is the nature of skewness when mean > median?
Ans: Skewness and tail is towards Right
Q15) What is the nature of skewness when median > mean?
Ans: Skewness and tail is towards left
Q16) What does positive kurtosis value indicates for a data?
Ans: Positive kurtosis means the curve is more peaked and it is Leptokurtic
Q17) What does negative kurtosis value indicates for a data?
Ans: Negative Kurtosis means the curve will be flatter and broader
Q18) Answer the below questions using the below boxplot visualization.
What can we say about the distribution of the data?
Ans: The above Boxplot is not normally distributed the median is towards the
higher value
What is nature of skewness of the data?
Ans: The data is a skewed towards left. The whisker range of minimum value is
greater than maximum
What will be the IQR of the data (approximately)?
Ans: The Inter Quantile Range = Q3 Upper quartile – Q1 Lower Quartile = 18 – 10
=8

Q19) Comment on the below Boxplot visualizations?

Draw an Inference from the distribution of data for Boxplot 1 with respect Boxplot
2.
Ans: First there are no outliers. Second both the box plot shares the same median
that is approximately in a range between 275 to 250 and they are normally
distributed with zero to no skewness neither at the minimum or maximum whisker
range.
Q 20) Calculate probability from the given dataset for the below cases

Data _set: Cars.csv

Calculate the probability of MPG of Cars for the below cases.
MPG <- Cars $ MPG
a. P(MPG>38)
Ans: Prob_MPG_greater_than_38 = np.round(1 - stats.norm.cdf(38, loc=
q20.MPG.mean(), scale= q20.MPG.std()),3)
print('P(MPG>38)=',Prob_MPG_greater_than_38)

P(MPG>38)= 0.348

b. P(MPG<40)
Ans: prob_MPG_less_than_40 = np.round(stats.norm.cdf(40, loc =
q20.MPG.mean(), scale = q20.MPG.std()),3)
print('P(MPG<40)=',prob_MPG_less_than_40)

P(MPG<40)= 0.729

c. P (20<MPG<50)
Ans: prob_MPG_greater_than_20 = np.round(1-stats.norm.cdf(20, loc =
q20.MPG.mean(), scale = q20.MPG.std()),3)
print('p(MPG>20)=',(prob_MPG_greater_than_20))
p(MPG>20)= 0.943

prob_MPG_less_than_50 = np.round(stats.norm.cdf(50, loc = q20.MPG.mean(),

scale = q20.MPG.std()),3)
print('P(MPG<50)=',(prob_MPG_less_than_50))
P(MPG<50)= 0.956

prob_MPG_greaterthan20_and_lessthan50= (prob_MPG_less_than_50) -
(prob_MPG_greater_than_20)
print('P(20<MPG<50)=',(prob_MPG_greaterthan20_and_lessthan50))
P(20<MPG<50)= 0.013000000000000012

Q 21) Check whether the data follows normal distribution

a) Check whether the MPG of Cars follows Normal Distribution
Dataset: Cars.csv
Ans:
a.) MPG of cars follows normal distribution

b) Check Whether the Adipose Tissue (AT) and Waist Circumference

(Waist) from wc-at data set follows Normal Distribution
Dataset: wc-at.csv
Ans: Adipose Tissue (AT) and Waist does not follow Normal Distribution
Q 22) Calculate the Z scores of 90% confidence interval,94% confidence
interval, 60% confidence interval
Ans:
# z value for 90% confidence interval
print('Z score for 60% Conifidence Intervla =',np.round(stats.norm.ppf(.05),4))
Z score for 60% Conifidence Intervla = -1.6449

# z value for 94% confidence interval

print('Z score for 60% Conifidence Intervla =',np.round(stats.norm.ppf(.03),4))
Z score for 60% Conifidence Intervla = -1.8808

# z value for 60% confidence interval

print('Z score for 60% Conifidence Intervla =',np.round(stats.norm.ppf(.2),4))
Z score for 60% Conifidence Intervla = -0.8416

Q 23) Calculate the t scores of 95% confidence interval, 96% confidence

interval, 99% confidence interval for sample size of 25
Ans:
# t score for 95% confidence interval
print('T score for 95% Confidence Interval =',np.round(stats.t.ppf(0.025,df=24),4))
T score for 95% Confidence Interval = -2.0639

# t value for 94% confidence interval

print('T score for 94% Confidence Inteval =',np.round(stats.t.ppf(0.03,df=24),4))
T score for 94% Confidence Inteval = -1.974

# t value for 99% Confidence Interval

print('T score for 95% Confidence Interval =',np.round(stats.t.ppf(0.005,df=24),4))
T score for 95% Confidence Interval = -2.7969

Q 24) A Government company claims that an average light bulb lasts 270 days.
A researcher randomly selects 18 bulbs for testing. The sampled bulbs last an
average of 260 days, with a standard deviation of 90 days. If the CEO's claim were
true, what is the probability that 18 randomly selected bulbs would have an
average life of no more than 260 days
Hint:
rcode  pt(tscore,df)
df  degrees of freedom
Ans: import numpy as np
Import scipy as stats
t_score = (x - pop mean) / (sample standard daviation / square root of sample size)
(260-270)/90/np.sqrt(18))
t_score = -0.471
stats.t.cdf(t_score, df = 17)
0.32 = 32%

Assignment Excelr
0% (1)
Assignment Excelr
9 pages
Brown y Forsythe 1974 The ANOVA and Multiple Comparisons For Data With Heterogeneous Variances
No ratings yet
Brown y Forsythe 1974 The ANOVA and Multiple Comparisons For Data With Heterogeneous Variances
7 pages
Probability Advanced Question Sheets
100% (1)
Probability Advanced Question Sheets
46 pages
STATS - DOANE - Chapter 15 Chi-Square Tests
100% (1)
STATS - DOANE - Chapter 15 Chi-Square Tests
189 pages
Assignment
83% (6)
Assignment
16 pages
Republic of The Philippines Department of Education: M8Ge-Iva - 1
No ratings yet
Republic of The Philippines Department of Education: M8Ge-Iva - 1
5 pages
Assignment
43% (7)
Assignment
9 pages
Basuc Statshi
100% (3)
Basuc Statshi
20 pages
I Year QT Notes
No ratings yet
I Year QT Notes
283 pages
Sample Size Calculator
No ratings yet
Sample Size Calculator
3 pages
Arjun S Assignment 1 Basic Stat1
88% (8)
Arjun S Assignment 1 Basic Stat1
21 pages
Basic Statistics 1
100% (2)
Basic Statistics 1
12 pages
Assignment
75% (4)
Assignment
13 pages
Basic Statisticks 1 - Assignment - Vivek T
100% (7)
Basic Statisticks 1 - Assignment - Vivek T
18 pages
Assignment - Basics Statics Level 1
100% (2)
Assignment - Basics Statics Level 1
15 pages
Basic Stat-1, Descriptive Statistics and Probability
100% (1)
Basic Stat-1, Descriptive Statistics and Probability
13 pages
Assignment (Key) 1
100% (1)
Assignment (Key) 1
16 pages
Slides Module 4 Lesson 2
No ratings yet
Slides Module 4 Lesson 2
34 pages
Assignment 1
100% (1)
Assignment 1
16 pages
Assignment
50% (2)
Assignment
10 pages
Statistics CBT Questions
No ratings yet
Statistics CBT Questions
19 pages
Metode de Cercetare in Studiul Organizatiilor
100% (3)
Metode de Cercetare in Studiul Organizatiilor
354 pages
Probability Distribution
No ratings yet
Probability Distribution
16 pages
Assignment
No ratings yet
Assignment
12 pages
Herrypramono,+##Default - Groups.name - ProductionEditor##,+Layout 2+Artikel+2+JMan+Vol+19+No.+1+ (CC)
No ratings yet
Herrypramono,+##Default - Groups.name - ProductionEditor##,+Layout 2+Artikel+2+JMan+Vol+19+No.+1+ (CC)
21 pages
Basic Stats Assignment 1
No ratings yet
Basic Stats Assignment 1
22 pages
Assignment 1
100% (1)
Assignment 1
15 pages
Module 2
100% (1)
Module 2
7 pages
Deva 1
No ratings yet
Deva 1
12 pages
2a EDA
50% (2)
2a EDA
11 pages
Adetya Wulandari UTS Prof. Akhyar
No ratings yet
Adetya Wulandari UTS Prof. Akhyar
12 pages
Constructing Probability Distribution
No ratings yet
Constructing Probability Distribution
14 pages
Assignment 1
0% (1)
Assignment 1
12 pages
Assignment
85% (33)
Assignment
13 pages
Assignment
100% (1)
Assignment
10 pages
Assignment 3
No ratings yet
Assignment 3
4 pages
Basic Statistics - 1
No ratings yet
Basic Statistics - 1
21 pages
Business Statistics 1st Model Test 2 Chapter
No ratings yet
Business Statistics 1st Model Test 2 Chapter
2 pages
Assignment
No ratings yet
Assignment
14 pages
Assignment: - Basic Statistics 1: Q1) Identify The Data Type For The Following
No ratings yet
Assignment: - Basic Statistics 1: Q1) Identify The Data Type For The Following
17 pages
Answers To Additional Business Exercises Chapter 19 Two Way Between Groups ANOVA
No ratings yet
Answers To Additional Business Exercises Chapter 19 Two Way Between Groups ANOVA
3 pages
Sample Size Justification: This Is An Unsubmitted Preprint Shared For Feedback
No ratings yet
Sample Size Justification: This Is An Unsubmitted Preprint Shared For Feedback
31 pages
Week 7 Problem Class: Statistics
No ratings yet
Week 7 Problem Class: Statistics
2 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Kategori Nafsu Makan Perubahan Nafsu Makan Intervensi Crosstabulation
No ratings yet
Kategori Nafsu Makan Perubahan Nafsu Makan Intervensi Crosstabulation
1 page
Laboratory Report 5
No ratings yet
Laboratory Report 5
30 pages
Assignment
No ratings yet
Assignment
7 pages
Assignment
No ratings yet
Assignment
10 pages
Assignment
No ratings yet
Assignment
19 pages
Testing The Difference Between Two Means: Dependent Samples: Bluman, Chapter 9
No ratings yet
Testing The Difference Between Two Means: Dependent Samples: Bluman, Chapter 9
17 pages
Assignment
No ratings yet
Assignment
11 pages
Comparing Groups For Statistical Differences - How To Choose The Right Statistical Test - Biochemia Medica
No ratings yet
Comparing Groups For Statistical Differences - How To Choose The Right Statistical Test - Biochemia Medica
8 pages
Assignment
No ratings yet
Assignment
11 pages
Continuous Continuous Continuous Continuous Continuous: Discrete Discrete
No ratings yet
Continuous Continuous Continuous Continuous Continuous: Discrete Discrete
15 pages
Name: Siti Mursyida Abdul Karim (Data Science Program) Topic: Assignment - EDA
100% (1)
Name: Siti Mursyida Abdul Karim (Data Science Program) Topic: Assignment - EDA
13 pages
Exercise Sheet 1 Mathematics and Statistics
No ratings yet
Exercise Sheet 1 Mathematics and Statistics
9 pages
Assignment - Answer Excel Descriptive Analytics
No ratings yet
Assignment - Answer Excel Descriptive Analytics
11 pages
Assignment1 of Data Science
No ratings yet
Assignment1 of Data Science
11 pages
Assignment
No ratings yet
Assignment
10 pages
Assignment
No ratings yet
Assignment
7 pages
Assignment (1) SOlution
No ratings yet
Assignment (1) SOlution
15 pages
Assignment
No ratings yet
Assignment
11 pages
Assignment
No ratings yet
Assignment
7 pages
Assignmeant-1 Sharan S
No ratings yet
Assignmeant-1 Sharan S
20 pages
Assignent-1 SDS
No ratings yet
Assignent-1 SDS
17 pages
Assignment
No ratings yet
Assignment
18 pages
Business Analytics
No ratings yet
Business Analytics
47 pages
Assignment 1
No ratings yet
Assignment 1
15 pages
Assignment
No ratings yet
Assignment
8 pages
Activity Data Type
No ratings yet
Activity Data Type
11 pages
Tutorial - Minitab - Part II - 2011
No ratings yet
Tutorial - Minitab - Part II - 2011
18 pages
Assignment
No ratings yet
Assignment
7 pages
Assignment (Answers)
100% (1)
Assignment (Answers)
9 pages
Activity
No ratings yet
Activity
11 pages
File 1704710882 0003256 MTH145Assignment1UNIT-1-1
No ratings yet
File 1704710882 0003256 MTH145Assignment1UNIT-1-1
2 pages
Basic Statistics Level 1
No ratings yet
Basic Statistics Level 1
7 pages
Random Variables
No ratings yet
Random Variables
26 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
51 pages
Grade 12 Revision Probability Paper 2 Type Questions Name: Date
No ratings yet
Grade 12 Revision Probability Paper 2 Type Questions Name: Date
10 pages
Continuous Continuous Continuous Continuous Continuous: Discrete Discrete
No ratings yet
Continuous Continuous Continuous Continuous Continuous: Discrete Discrete
13 pages
Assignment
No ratings yet
Assignment
5 pages
TR 2501 Cal Labs-Scales and Balances-13976-3
No ratings yet
TR 2501 Cal Labs-Scales and Balances-13976-3
11 pages
Assignment 1 1
No ratings yet
Assignment 1 1
13 pages
SHS Statistics and Probability Q3 Mod1 Random Variables and v4
100% (2)
SHS Statistics and Probability Q3 Mod1 Random Variables and v4
41 pages
Assignment 1 Ans (Reference)
No ratings yet
Assignment 1 Ans (Reference)
18 pages
MUAC 2.7 NACS Users Guide Apr2016
No ratings yet
MUAC 2.7 NACS Users Guide Apr2016
2 pages
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
No ratings yet
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
11 pages
Assignment 2
No ratings yet
Assignment 2
7 pages

Basic - Statsitics - Level 1

Uploaded by

Basic - Statsitics - Level 1

Uploaded by

Activity Data Type

Number of beatings from Wife Discrete

print('Skewness value =', np.round(q9a.dist.skew(),2),'and', 'Kurtosis value =',

SP and Weight (WT)

print('For SP Skewness =', np.round(q9b.SP.skew(),2), 'kurtosis =',

print('For WT Skewness =', np.round(q9b.SP.skew(),2), 'Kurtosis =',

Q10) Draw inferences about the following boxplot & histogram

Ans: The boxplot has outliers on the maximum side.

Q11) Suppose we want to estimate the average weight of an adult male in

Q12) Below are the scores obtained by a student in tests

Q19) Comment on the below Boxplot visualizations?

Data _set: Cars.csv

prob_MPG_less_than_50 = np.round(stats.norm.cdf(50, loc = q20.MPG.mean(),

Q 21) Check whether the data follows normal distribution

b) Check Whether the Adipose Tissue (AT) and Waist Circumference

# z value for 94% confidence interval

# z value for 60% confidence interval

Q 23) Calculate the t scores of 95% confidence interval, 96% confidence

# t value for 94% confidence interval

# t value for 99% Confidence Interval

You might also like