0% found this document useful (0 votes)
238 views17 pages

Assignment: - Basic Statistics 1: Q1) Identify The Data Type For The Following

The document contains 20 questions related to basic statistics concepts. It covers identifying data types, calculating probabilities for different scenarios involving dice rolls and coin tosses, finding mean, median, mode, variance, standard deviation and range for given data sets, calculating confidence intervals and skewness and kurtosis. The questions aim to test understanding of key descriptive statistics concepts and calculations.

Uploaded by

CHANDAN
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
238 views17 pages

Assignment: - Basic Statistics 1: Q1) Identify The Data Type For The Following

The document contains 20 questions related to basic statistics concepts. It covers identifying data types, calculating probabilities for different scenarios involving dice rolls and coin tosses, finding mean, median, mode, variance, standard deviation and range for given data sets, calculating confidence intervals and skewness and kurtosis. The questions aim to test understanding of key descriptive statistics concepts and calculations.

Uploaded by

CHANDAN
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

ASSIGNMENT : – BASIC STATISTICS 1

Q1) Identify the Data type for the Following:


Activity Data Type
Number of beatings from Wife Discrete(count)
Results of rolling a dice Discrete(count)
Weight of a person Continuous
Weight of Gold Continuous
Distance between two places Continuous
Length of a leaf Continuous
Dog's weight Continuous
Blue Color Discrete(categorial)
Number of kids Discrete(count)
Number of tickets in Indian railways Discrete(count)
Number of times married Discrete(count)
Gender (Male or Female) Discrete(binary)

Q2) Identify the Data types, which were among the following:

Data Data Type


Gender Nominal
High School Class Ranking Nominal
Celsius Temperature Nominal
Weight Ratio
Hair Color Nominal
Socioeconomic Status Ordinal
Fahrenheit Temperature Ratio
Height Ratio
Type of living accommodation Nominal
Level of Agreement Ordinal
IQ(Intelligence Scale) Ratio
Sales Figures Ratio
Blood Group Nominal
Time Of Day Ordinal
Time on a Clock with Hands Interval
Number of Children Ratio
Religious Preference Nominal
Barometer Pressure Ratio
SAT Scores Interval
Years of Education Ratio

Q3) Three Coins are tossed, find the probability that two heads and one tail are
obtained?

Ans. Possible results after tossing 3 coins-

HHH, HHT, HTH, THH, HTT, THT, TTH, TTT i.e. 8 possible results.

Possible results with 2 heads-

HHH, HHT, HTH, THH i.e. 4 possible results.

P(2 Heads)= N(possible results with 2 Heads) / N(possible results after


tossing 3 coins)

P(2 Heads)= 4/8 = 0.5

Q4) Two Dice are rolled, find the probability that sum is

a) Equal to 1
b) Less than or equal to 4
c) Sum is divisible by 2and 3
Ans. Possible outcomes when 2 Dice are rolled-

(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 1), (2, 2), (2, 3), (2, 4), (2, 5)

(2, 6), (3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3, 6), (4, 1), (4, 2), (4, 3), (4, 4)

(4, 5), (4, 6), (5, 1), (5, 2), (5, 3), (5, 4), (5, 5), (5, 6), (6, 1), (6, 2), (6, 3),

(6, 4), (6, 5), (6, 6) i.e. 36 possible outcomes.


a) Equal to 1-
Probability of getting sum 1 is 0 because minimum possible sum of
outcomes is 2.

b) Less than or equal to 4-


(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (3, 1) i.e. 6 possible outcomes.
P(sum less than or equal to 4) = 6/36 = 1/6 = 0.1666

c) Sum is divisible by 2 and 3-


(1, 1), (1, 2), (1, 3), (1, 5), (2, 1), (2, 2), (2, 4), (2, 6), (3, 1), (3, 3), (3, 5),

(3, 6), (4, 2), (4, 4), (4, 5), (4, 6), (5, 1), (5, 3), (5, 4), (5, 5), (6, 2), (6, 3),

(6, 4), (6, 6) i.e. 23 possible outcomes.

P(sum is divisible by 2 and 3) = 23/36

Q5) A bag contains 2 red, 3 green and 2 blue balls. Two balls are drawn at random.
What is the probability that none of the balls drawn is blue?

Ans. Total number of balls= (2 + 3 + 2)= 7

N(Possible events) = 7C2 = 21

Number of 2 balls, none of which is blue i.e. N(Possible events with no blue
balls)= 5C2

i.e. N(Possible events with no blue balls)= 10

P(Possible events with no blue balls) = N(Possible events with no blue balls)/

N(Possible events)

=10/21 = 0.476
Q6) Calculate the Expected number of candies for a randomly selected child

Below are the probabilities of count of candies for children(ignoring the nature of
the child-Generalized view)

CHILD Candies count Probability


A 1 0.015
B 4 0.20
C 3 0.65
D 5 0.005
E 6 0.01
F 2 0.120
Child A – probability of having 1 candy = 0.015.

Child B – probability of having 4 candies = 0.20

Ans. Expected number of candies for a randomly selected child


= 1 * 0.015 + 4*0.20 + 3 *0.65 + 5*0.005 + 6 *0.01 + 2 * 0.12
= 0.015 + 0.8 + 1.95 + 0.025 + 0.06 + 0.24
= 3.090
= 3.09

Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range &
comment about the values / draw inferences, for the given dataset

- For Points,Score,Weigh>
Find Mean, Median, Mode, Variance, Standard Deviation, and Range and
also Comment about the values/ Draw some inferences.
Ans.

Mean-
Points = 115.09/32= 3.59
Scores = 102.952/32= 3.22
Weigh = 27.16/32= 17.85
Median-
Points = (3.69+3.7)/2= 7.39//2= 3.695
Scores = (3.215+3.435)/2= 6.65/2= 3.325
Weigh = (17.6+17.82)/2= 35.42/2= 17.71

Mode-
Points = 3.92
Scores = 3.44
Weigh = 17.02

Variance-
Points =8.862/32= 0.2769
Score = 29.678748/32= 0.9275
Weigh = 98.98815/32= 3.0093379688

Standard Deviation-
Points = 0.526
Scores = 0.9630
Weigh = 1.734744

From Rstudio-
X1 Points Score Weigh
Length:32 Min. :2.760 Min. :1.513 Min. :14.50
Class :character 1st Qu.:3.080 1st Qu.:2.581 1st Qu.:16.89
Mode :character Median :3.695 Median :3.325 Median :17.71
Mean :3.597 Mean :3.217 Mean :17.85
3rd Qu.:3.920 3rd Qu.:3.610 3rd Qu.:18.90
Max. :4.930 Max. :5.424 Max. :22.90

> sd(Points)
[1] 0.5346787
> sd(Score)
[1] 0.9784574
> sd(Weigh)
[1] 1.786943
Q8) Calculate Expected Value for the problem below

a) The weights (X) of patients at a clinic (in pounds), are


108, 110, 123, 134, 135, 145, 167, 187, 199

Assume one of the patients is chosen at random. What is the Expected Value
of the Weight of that patient?

Ans.

The Expected value of random = 145.34

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data

Cars speed and distance


Ans.

Right Skewed (+Ve Skewed)

Negative kurtosis

SP and Weight(WT)
Ans.

a) Left Skewed for SP and positive kurtosis


b)Left Skewed for WT and Negative kurtosis
Calculation of Skewness:
>skewness(Speed_and_weight$SP) #
[1] -0.4076957
> skewness(Speed_and_weight$WT) #
[1] -1.28736Calculation of Kurtosis:
> kurtosis(Speed_and_weight$SP) #
[1] 2.086738
> kurtosis(Speed_and_weight$WT) #
[1] 3.818813

Q10) Draw inferences about the following boxplot & histogram


Ans.

By seeing histogram graph we can it is Right skewed, because histogram tells the
shape of plot.8
The main purpose of box plot is finding the outliers, by seeing the above boxplot,
we can see that there are outliers beyond the upper extreme.

Q11)Suppose we want to estimate the average weight of an adult male in


Mexico. We draw a random sample of 2,000 men from a population of 3,000,000
men and weigh them. We find that the average person in our sample weighs 200
pounds, and the standard deviation of the sample is 30 pounds. Calculate
94%,98%,96% confidence interval ?

Ans.

X+/-(Z1-α. σ/sqrt(n)
Degrees of freedom= 2000-1= 1999
Confidence interval= 94%(1-σ/2)= 1-0.03) =0.97 for confidene interval for 94% is 1.882
Confidence interval for 98%= 2.33
Confidence interval for 96% = 2.05

Q12)Below are the scores obtained by a student in tests


34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56

1) Find mean,median,variance,standard deviation.


2) What can we say about the student marks?
Ans.

Mean= 41
Median= 40
Variance= 24.111
Standard deviation= 4.910

Marks are normally distributed. There are no visible outliers.

Q13) What is the nature of skewness when mean, median of data are equal?

Ans.
Skewness is symmetrical when mean, median of data are equal.

Q14) What is the nature of skewness when mean >median ?

Ans.
Skewness is right skewed when mean>median.

Q15) What is the nature of skewness when median > mean?

Ans.
Skewness is left skewed when median>mean.

Q16) What does positive kurtosis value indicates for adata ?

Ans.
Positive kurtosis value indicates normal distribution and kurtosis value is 0.

Q17) What does negative kurtosis value indicates for a data?

Ans.
The distribution of the data has lighter tails and a flatter peaks than
the normaldistribution.

Q18) Answer the below questions using the below boxplot visualization.

1) What can we say about the distribution of the data?


Ans.
If the boxplot is age of students in a school, 50% are above 10 years old and
approximately 40% are above 15 years old.

2) What can we say about the distribution of the data?


Ans.
It is left skewed, median>mean.

3) What will be the IQR of the data (approximately)?


Ans.
Approximately -8

Q19) Comment on the below Boxplot visualizations?

Draw an Inference from the distribution of data for Boxplot 1 with respect Boxplot
2
Ans.
By observing both the plots whisker’s level is high in boxplot 2, mean and
median are equal hence distribution is symetrical.

Q 20) Calculate probability from the given dataset for the below cases

Data _set: Cars.csv

Calculate the probability of MPG ofCars for the below cases.

MPG<- Cars$MPG

a. P(MPG>38)
b. P(MPG<40)
c. P (20<MPG<50)

Ans.
By using filter command
and installing the dplyr package into the ‘R’.
a) There are 33 observations in MPG which are greater than 38.
b) 61 observations in MPG which are lesser than 40.
c) P (20<MPG<50) = 69/81
Rcode:
MPG <-c(Cars$MPG)
MPGsample(MPG)
a=subset(MPG,MPG>38)
b=subset(MPG,MPG<40)
c=subset(MPG,MPG>20 & MPG <50)
21) Check whether the data follows normal distribution
a) Check whether the MPG of Cars follows Normal Distribution
Dataset: Cars.csv

Ans.

We can interpret that the data of MPG of Cars follows the normal distribution by:

1) Conducting shapiro test (w=0,97797; p value =0,1764)


2) Evaluating kurtosis value which is -0,7054604
3) Finding of mean value (34,42208) which is not so far difference from
median value (35, 15273)

b) Check Whether the Adipose Tissue (AT) and Waist Circumference(Waist)


fromwc-at data set follows Normal Distribution
Dataset: wc-at.csv
Ans.

We can interpret that the data of Weight of WC_AT follows the normal distribution
by:
1) Conducting shapiro test (w=0,95586; p value =0,00117).
2) Evaluating kurtosis value which is -1,141846 .
3) Finding of mean value (91.902) which is not so far difference from median
value (90.8),

We can interpret that the data of AT of WC_AT follows the non-normal distribution
by:
1) Conducting shapiro test (w=0,95234; p value =0,000654) which is significant
lower than 0,05
2) Evaluating kurtosis value which is -0,37600593.Finding of mean value
(101,894) which is quite far difference from median value (96.54)
Q 22) Calculate the Z scoresof 90% confidence interval,94% confidence interval,
60% confidence interval

Ans.
Z score of 90% confidence interval is 1.65
Z score of 94% confidence interval is 1.55
Z score of 60% confidence interval is 0.85

Q 23) Calculate the t scores of 95% confidence interval, 96% confidence interval,
99% confidence interval for sample size of 25

Ans.
For 95%= 1.96
For 96%= 2.5
For 99% = 2.47

Q 24)A Government companyclaims that an average light bulb lasts 270 days. A
researcher randomly selects 18 bulbs for testing. The sampled bulbs last an average
of 260 days, with a standard deviation of 90 days. If the CEO's claim were true,
what is the probability that 18 randomly selected bulbs would have an average life
of no more than 260 days

Hint:

rcodept(tscore,df)

df degrees of freedom

Ans.
Mean = 270 days
Sample size = 18
Sample mean = 260
Deviation sample = 90 days

You might also like