Basic Statistics 1 - Assignment - Vivek T

This document contains questions related to data types, probability, statistics, and data analysis. It includes questions about identifying data types, calculating probabilities, finding measures of central tendency and dispersion for datasets, checking for normal distributions, and interpreting boxplots and other visualizations. The questions cover concepts like discrete vs continuous data, nominal vs ordinal vs ratio variables, mean, median, mode, variance, standard deviation, skewness, kurtosis, and confidence intervals.

Uploaded by

Sunil kumar Kurella


Q1) Identify the Data type for the Following:

Activity                                Data Type
Number of beatings from Wife            Discrete
Results of rolling a dice               Discrete
Weight of a person                      Continuous
Weight of Gold                          Continuous
Distance between two places             Continuous
Length of a leaf                        Continuous
Dog's weight                            Continuous
Blue Color                              Categorical
Number of kids                          Discrete
Number of tickets in Indian railways    Discrete
Number of times married                 Discrete
Gender (Male or Female)                 Categorical (nominal)

Q2) Identify the Data types, which were among the following
Nominal, Ordinal, Interval, Ratio.

Data                              Data Type
Gender                            Nominal
High School Class Ranking         Ordinal
Celsius Temperature               Interval
Weight                            Ratio
Hair Color                        Nominal
Socioeconomic Status              Ordinal
Fahrenheit Temperature            Interval
Height                            Ratio
Type of living accommodation      Nominal
Level of Agreement                Ordinal
IQ (Intelligence Scale)           Interval
Sales Figures                     Ratio
Blood Group                       Nominal
Time Of Day                       Interval
Time on a Clock with Hands        Interval
Number of Children                Ratio
Religious Preference              Nominal
Barometer Pressure                Ratio
SAT Scores                        Interval
Years of Education                Ratio

Celsius and Fahrenheit temperatures are interval data because their zero point is arbitrary; weight, height, number of children and years of education are ratio data because they have a true zero.

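Q3) Three Coins are tossed, find the probability that two heads and one tail are
obtained?
Ans: Of the 2^3 = 8 equally likely outcomes, 3 have exactly two heads (HHT, HTH, THH), so the probability is 3/8.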
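
The counting argument for three coins can be checked by brute force. The following is a minimal sketch in R that enumerates every outcome; the variable names are illustrative and not part of the original assignment.

```r
# Enumerate all 2^3 = 8 outcomes of three fair coin tosses.
tosses <- expand.grid(c1 = c("H", "T"), c2 = c("H", "T"), c3 = c("H", "T"),
                      stringsAsFactors = FALSE)

# Count heads in each outcome, then take the share with exactly two heads.
heads <- rowSums(tosses == "H")
p_two_heads <- mean(heads == 2)
print(p_two_heads)
```

Because every outcome is equally likely, the proportion of rows with two heads is the probability itself.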
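Q4) Two Dice are rolled, find the probability that sum is
a) Equal to 1 Ans: 0 (the smallest possible sum is 2)
b) Less than or equal to 4 Ans: 6/36 = 1/6
c) Sum is divisible by 2 and 3 Ans: the sum must be divisible by 6, i.e. equal to 6 or 12; there are 5 + 1 = 6 such outcomes, so 6/36 = 1/6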
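
The three dice probabilities can be verified by listing all 36 outcomes. This is a small sketch in R; it reads "divisible by 2 and 3" as divisible by both, which is an interpretation of the question's wording.

```r
# All 36 equally likely outcomes of rolling two dice.
dice <- expand.grid(d1 = 1:6, d2 = 1:6)
s <- dice$d1 + dice$d2

p_sum_1 <- mean(s == 1)                      # a) sum equal to 1
p_le_4  <- mean(s <= 4)                      # b) sum less than or equal to 4
p_div_6 <- mean(s %% 2 == 0 & s %% 3 == 0)   # c) divisible by both 2 and 3

print(c(p_sum_1, p_le_4, p_div_6))
```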

Q5) A bag contains 2 red, 3 green and 2 blue balls. Two balls are drawn at
random. What is the probability that none of the balls drawn is blue?
Ans: The probability that none of the balls drawn is blue is 5C2/7C2 = 10/21
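
The ratio of combinations can be evaluated directly with base R's choose(); this short sketch only restates the arithmetic above.

```r
# Ways to pick 2 of the 5 non-blue balls, over ways to pick any 2 of 7 balls.
p_no_blue <- choose(5, 2) / choose(7, 2)
print(p_no_blue)
```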

Q6) Calculate the Expected number of candies for a randomly selected child
Below are the probabilities of count of candies for children(ignoring the nature of
the child-Generalized view)
CHILD    Candies count (x)    Probability p(x)
A        1                    0.015
B        4                    0.20
C        3                    0.65
D        5                    0.005
E        6                    0.01
F        2                    0.120
Child A: probability of having 1 candy = 0.015.
Child B: probability of having 4 candies = 0.20
Ans: The expected number of candies for a randomly selected child is
Summation(x*p(x)) = 1*0.015 + 4*0.20 + 3*0.65 + 5*0.005 + 6*0.01 + 2*0.120 = 3.09
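
The expected value is a probability-weighted sum, which takes one line in R. The sketch below copies the table's values and also checks that the probabilities form a valid distribution.

```r
# Candy counts and their probabilities, copied from the table.
x <- c(1, 4, 3, 5, 6, 2)
p <- c(0.015, 0.20, 0.65, 0.005, 0.01, 0.120)

# A valid probability distribution must sum to 1.
stopifnot(isTRUE(all.equal(sum(p), 1)))

expected_candies <- sum(x * p)
print(expected_candies)
```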
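Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range &
comment about the values / draw inferences, for the given dataset
- For Points, Score, Weigh
Find Mean, Median, Mode, Variance, Standard Deviation, and Range
and also Comment about the values/ Draw some inferences.
Ans: Using R. Note that base R's mode() reports the storage mode of the object ("numeric"), not the statistical mode (the most frequent value), so the mode() output below does not answer the Mode part of the question.
> mean(Q7$Weigh)
[1] 17.84875
> mode(Q7$Weigh)
[1] "numeric"
> median(Q7$Weigh)
[1] 17.71
> range(Q7$Weigh)
[1] 14.5 22.9
> var(Q7$Weigh)
[1] 3.193166
> sd(Q7$Weigh)
[1] 1.786943

> mean(Q7$Points)
[1] 3.596563
> mode(Q7$Points)
[1] "numeric"
> median(Q7$Points)
[1] 3.695
> range(Q7$Points)
[1] 2.76 4.93
> var(Q7$Points)
[1] 0.2858814
> sd(Q7$Points)
[1] 0.5346787

> mean(Q7$Score)
[1] 3.21725
> mode(Q7$Score)
[1] "numeric"
> median(Q7$Score)
[1] 3.325
> range(Q7$Score)
[1] 1.513 5.424
> var(Q7$Score)
[1] 0.957379
> sd(Q7$Score)
[1] 0.9784574

Inferences: For Weigh the mean (17.85) is slightly above the median (17.71), which suggests a mild right skew. For Points and Score the mean is below the median (3.60 vs 3.70 and 3.22 vs 3.33), which suggests a mild left skew. Weigh has the largest spread (SD 1.79) and Points the smallest (SD 0.53).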
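
Base R has no built-in function for the statistical mode, so a small helper is needed for the Mode part of Q7. The following sketch defines a hypothetical stat_mode() helper (the name is an assumption, not a base R function) and runs it on a made-up vector, since the Q7 data is not reproduced here.

```r
# Hypothetical helper: return the most frequent value(s) of a vector.
# All tied values are returned, in order of first appearance.
stat_mode <- function(v) {
  u <- unique(v)
  counts <- tabulate(match(v, u))
  u[counts == max(counts)]
}

# Illustrative data only; replace with Q7$Points, Q7$Score or Q7$Weigh.
demo_modes <- stat_mode(c(2, 3, 3, 5, 5, 7))
print(demo_modes)
```

Returning every tied value matters for measurements like these, where two values can easily share the highest count.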

Q8) Calculate Expected Value for the problem below

a) The weights (X) of patients at a clinic (in pounds), are
108, 110, 123, 134, 135, 145, 167, 187, 199
Assume one of the patients is chosen at random. What is the Expected
Value of the Weight of that patient?
Ans: Each patient is equally likely to be chosen, so the expected value of the weight is the mean of all the patients' weights, i.e.
(108 + 110 + 123 + 134 + 135 + 145 + 167 + 187 + 199)/9 = 1308/9 ≈ 145.33 pounds
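
With nine equally likely patients, the expected value is the sum of weight times 1/9, which equals the ordinary mean. A brief R sketch of that equivalence:

```r
# Patient weights in pounds, from the question.
weights <- c(108, 110, 123, 134, 135, 145, 167, 187, 199)

# Each patient has probability 1/9 of being chosen.
p <- rep(1 / length(weights), length(weights))
expected_weight <- sum(weights * p)

print(expected_weight)
print(mean(weights))  # same value, computed as the sample mean
```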
Q9)
a. Calculate Skewness, Kurtosis & draw inferences on the following data
Cars speed and distance
Ans: Using the moments package in R to find the skewness and kurtosis. The kurtosis() function in moments returns Pearson's kurtosis, for which a normal distribution scores 3, so values are compared with 3 rather than with 0.
> skewness(Q9$speed)
[1] -0.1139548   # negative skewness: slight left skew, i.e. the longer tail is on the left and most of the data lies on the right side
> hist(Q9$speed)
> kurtosis(Q9$speed)
[1] 2.422853   # below 3: flatter than a normal distribution, with lighter tails

> skewness(Q9$dist)
[1] 0.7824835   # positive skewness: right skew, i.e. the longer tail is on the right and most of the data lies on the left side

> kurtosis(Q9$dist)
[1] 3.248019   # slightly above 3: close to normal, with slightly heavier tails
b. SP and Weight (WT)

Ans:
> skewness(Q9_b$SP)
[1] 1.581454   # positive, so SP is right skewed
> kurtosis(Q9_b$SP)
[1] 5.723521   # well above 3: sharp peak and heavy tails
> skewness(Q9_b$WT)
[1] -0.6033099   # negative, so WT is left skewed

> kurtosis(Q9_b$WT)
[1] 3.819466   # above 3: sharper peak and heavier tails than a normal distribution
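
The moment-based definitions behind these numbers are easy to write in base R. The sketch below defines two hypothetical helpers, moment_skewness() and moment_kurtosis(), using the population-moment formulas (third and fourth central moments scaled by powers of the second). It applies them to R's built-in cars data on the assumption that this is the same speed and distance data used in Q9a.

```r
# Skewness: third central moment divided by the variance to the power 3/2.
moment_skewness <- function(x) {
  d <- x - mean(x)
  mean(d^3) / mean(d^2)^1.5
}

# Pearson (non-excess) kurtosis: fourth central moment over squared variance.
# A normal distribution scores 3 on this scale.
moment_kurtosis <- function(x) {
  d <- x - mean(x)
  mean(d^4) / mean(d^2)^2
}

# Built-in dataset; assumed here to match the Q9 data.
print(c(moment_skewness(cars$speed), moment_kurtosis(cars$speed)))
print(c(moment_skewness(cars$dist),  moment_kurtosis(cars$dist)))
```

Subtracting 3 from the kurtosis gives excess kurtosis, which is the scale on which "positive" and "negative" kurtosis are usually discussed.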

Q10) Draw inferences about the following boxplot & histogram

Ans (histogram):

- The 50-100 weight bin has the highest frequency, about 180
- The 0-50 weight bin has a frequency of about 80
- The 100-150 weight bin has a frequency of about 120
- The 350-400 weight bin has the lowest frequency, about 5
- Positive skewness, i.e. the data is right skewed
- The data is not normally distributed

Ans (boxplot):

- 7 outliers are present in the box plot
- Positive skewness, i.e. the data is right skewed
- The data is not normally distributed
- Q1 is smaller than Q3

Q11) Suppose we want to estimate the average weight of an adult male in
Mexico. We draw a random sample of 2,000 men from a population of
3,000,000 men and weigh them. We find that the average person in our
sample weighs 200 pounds, and the standard deviation of the sample is 30
pounds. Calculate the 94%, 96% and 98% confidence intervals.

Ans: We do not know the population standard deviation, so we use the
t-distribution with n - 1 degrees of freedom to determine the CI.
\bar{X} = 200 pounds, S = 30 pounds, n = 2000
\bar{X} \pm t_{1-\alpha/2,\,n-1} \cdot \frac{S}{\sqrt{n}}
Here S/\sqrt{n} = 30/\sqrt{2000} ≈ 0.6708.

Confidence interval for 94% (\alpha = 0.06):
R code: qt(0.97, 1999) = 1.88
Substituting values in the equation: 200 ± 1.88 × 30/√2000 ≈ 200 ± 1.26
Hence the confidence interval for 94% is [198.74, 201.26]

Confidence interval for 96% (\alpha = 0.04):
R code: qt(0.98, 1999) = 2.05
Substituting values in the equation: 200 ± 2.05 × 30/√2000 ≈ 200 ± 1.38
Hence the confidence interval for 96% is [198.62, 201.38]

Confidence interval for 98% (\alpha = 0.02):
R code: qt(0.99, 1999) = 2.328
Substituting values in the equation: 200 ± 2.328 × 30/√2000 ≈ 200 ± 1.56
Hence the confidence interval for 98% is [198.44, 201.56]
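
The three intervals follow the same recipe, so they can be wrapped in one function. This is a minimal sketch in R; ci_t() is a hypothetical helper name, and it uses the two-sided critical value qt(1 - alpha/2, n - 1).

```r
# Hypothetical helper: two-sided t confidence interval for a mean,
# given the sample mean, sample SD, sample size and confidence level.
ci_t <- function(xbar, s, n, level) {
  alpha  <- 1 - level
  tcrit  <- qt(1 - alpha / 2, df = n - 1)
  margin <- tcrit * s / sqrt(n)
  c(lower = xbar - margin, upper = xbar + margin)
}

# Sample summary from the question: mean 200, SD 30, n = 2000.
for (level in c(0.94, 0.96, 0.98)) {
  print(round(ci_t(200, 30, 2000, level), 2))
}
```

With n = 2000 the t critical values are almost identical to the normal ones, so qnorm() would give practically the same intervals.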

Q12) Below are the scores obtained by a student in tests
34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
1) Find mean, median, variance, standard deviation?
Ans: Mean=41, Median=40.5, Variance=25.52941, SD=5.052664
2) What can we say about the student marks?
Ans: The average of the student's marks is 41
The student's marks range from 34 to 56
The mode is 41
Most of the scores are between 35 and 42
The mean (41) is slightly above the median (40.5) because of the two high scores, 49 and 56, so the marks are slightly right skewed
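
The summary statistics for the test scores can be reproduced with base R. In this sketch, var() and sd() use the sample (n - 1) denominator, which matches the values quoted in the answer.

```r
# Test scores from the question.
scores <- c(34, 36, 36, 38, 38, 39, 39, 40, 40, 41, 41, 41, 41, 42, 42, 45, 49, 56)

summary_stats <- c(
  mean     = mean(scores),
  median   = median(scores),
  variance = var(scores),   # sample variance, n - 1 denominator
  sd       = sd(scores)
)
print(summary_stats)
```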
Q13) What is the nature of skewness when mean, median of data are equal?
Ans: When the mean and median (and mode) are equal, there is no skewness, i.e. the distribution is symmetric. A normal distribution is one such case, but equal mean and median alone do not prove that the data is normally distributed.
Q14) What is the nature of skewness when mean > median?
Ans: If the mean is greater than the median, the distribution is positively skewed
Q15) What is the nature of skewness when median > mean?
Ans: If the mean is less than the median, the distribution is negatively skewed
Q16) What does a positive kurtosis value indicate for a data?
Ans: A positive (excess) kurtosis value indicates that the distribution has heavier tails and a sharper peak than the normal distribution.

Q17) What does a negative kurtosis value indicate for a data?

Ans: A negative (excess) kurtosis value means that the distribution is flatter, with lighter tails, than a normal curve with the same mean and standard deviation.

Q18) Answer the below questions using the below boxplot visualization.

What can we say about the distribution of the data?

- No outliers
- Q1 is about 10 and Q3 is about 18
- Median between 15 and 16
- The middle 50% of the data lies in the range 10 to 18
- Not following normal distribution
- Left skewness of data, since the median sits closer to Q3 than to Q1
What is nature of skewness of the data? Ans: Left skewness
What will be the IQR of the data (approximately)? Ans: IQR = Q3 - Q1 = 18 - 10 = 8

Q19) Comment on the below Boxplot visualizations?

Draw an Inference from the distribution of data for Boxplot 1 with respect to
Boxplot 2.
- Both plots look symmetric, which is consistent with normally distributed data, although a boxplot alone cannot confirm normality.
- We can say that box plot 1 is for a sample distribution and box plot 2 is for a population or a sample with larger size.
- No outliers
- In both plots the box runs from Q1 (25th percentile) to Q3 (75th percentile), so the IQR covers the middle 50% of the data. With the median centred in the box, mean, median and mode are approximately equal in both distributions.
Q 20) Calculate probability from the given dataset for the below cases

Data _set: Cars.csv

Calculate the probability of MPG of Cars for the below cases.
MPG <- Cars$MPG
a. P(MPG>38)
b. P(MPG<40)
c. P(20<MPG<50)
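
One common way to answer Q20 is to treat MPG as approximately normal, with mean and standard deviation estimated from the data, and read the probabilities off pnorm(). The sketch below assumes that approach; prob_normal() is a hypothetical helper, and mpg_demo is made-up data standing in for Cars$MPG because Cars.csv is not included here.

```r
# Hypothetical helper: P(lower < X < upper) under a normal distribution
# whose mean and SD are estimated from the sample x.
prob_normal <- function(x, lower = -Inf, upper = Inf) {
  m <- mean(x)
  s <- sd(x)
  pnorm(upper, mean = m, sd = s) - pnorm(lower, mean = m, sd = s)
}

# Made-up values for illustration only; use Cars$MPG with the real data.
mpg_demo <- c(22, 28, 31, 34, 35, 36, 38, 40, 43, 47)

p_gt_38 <- prob_normal(mpg_demo, lower = 38)              # a. P(MPG > 38)
p_lt_40 <- prob_normal(mpg_demo, upper = 40)              # b. P(MPG < 40)
p_20_50 <- prob_normal(mpg_demo, lower = 20, upper = 50)  # c. P(20 < MPG < 50)
print(c(p_gt_38, p_lt_40, p_20_50))
```

If the normal assumption is doubtful, the empirical proportion, for example mean(mpg_demo > 38), is a simple alternative.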

Q 21) Check whether the data follows normal distribution

a) Check whether the MPG of Cars follows Normal Distribution
Dataset: Cars.csv
Ans: MPG of cars not following the normal distribution
b) Check whether the Adipose Tissue (AT) and Waist Circumference (Waist)
from the wc-at data set follow Normal Distribution
Dataset: wc-at.csv
Ans: Variable Waist circumference (Waist) does not follow normal
distribution.
Variable 'AT' adipose tissue follows normal distribution.
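
The answers above do not say how normality was checked. One standard option in base R is the Shapiro-Wilk test, optionally backed by a Q-Q plot. The sketch below assumes that method; check_normal() is a hypothetical helper and the two vectors are simulated stand-ins, since Cars.csv and wc-at.csv are not included here.

```r
# Hypothetical helper: Shapiro-Wilk test with a yes/no decision at level alpha.
# A small p-value is evidence against normality.
check_normal <- function(x, alpha = 0.05) {
  p <- shapiro.test(x)$p.value
  list(p_value = p, reject_normality = p < alpha)
}

# Simulated data for illustration only.
set.seed(42)
normal_demo <- rnorm(100, mean = 35, sd = 9)   # drawn from a normal distribution
skewed_demo <- rexp(100, rate = 0.1)           # strongly right skewed

print(check_normal(normal_demo))
print(check_normal(skewed_demo))

# A Q-Q plot gives a visual check; points far from the line suggest non-normality.
# qqnorm(skewed_demo); qqline(skewed_demo)
```

Failing to reject normality does not prove the data is normal; it only means the test found no strong evidence against it.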

Q 22) Calculate the Z scores of 90% confidence interval, 94% confidence
interval, 60% confidence interval
Ans:
qnorm(0.95) #Z score for 90% confidence interval is 1.64485
qnorm(0.97) #Z score for 94% confidence interval is 1.8807
qnorm(0.80) #Z score for 60% confidence interval is 0.8416

Q 23) Calculate the t scores of 95% confidence interval, 96% confidence
interval, 99% confidence interval for sample size of 25
Ans: With a sample size of 25 the degrees of freedom are 25 - 1 = 24.
qt(0.975,24) #t score for 95% confidence interval is 2.0638
qt(0.98,24) #t score for 96% confidence interval is 2.171
qt(0.995,24) #t score for 99% confidence interval is 2.7969
Q 24) A Government company claims that an average light bulb lasts 270
days. A researcher randomly selects 18 bulbs for testing. The sampled bulbs
last an average of 260 days, with a standard deviation of 90 days. If the
CEO's claim were true, what is the probability that 18 randomly selected
bulbs would have an average life of no more than 260 days?

Hint:

R code: pt(tscore, df), where df is the degrees of freedom
