0% found this document useful (0 votes)

16 views12 pages

DA Answer-Key

The document consists of multiple-choice questions (MCQs) and descriptive questions (DES) related to data analysis, statistics, and hypothesis testing. It covers topics such as derived variables, variability in histograms, median calculations, kurtosis, standard deviation implications, hypothesis testing, chi-square tests, outlier detection using IQR, exploratory data analysis, skewness effects, and statistical tests for evaluating the effectiveness of interventions. The questions require calculations, explanations, and interpretations of statistical concepts and methods.

Uploaded by

khushpatel1222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views12 pages

DA Answer-Key

Uploaded by

khushpatel1222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Type: MCQ

Q1. In reviewing metadata that describes a dataset's variables, you notice several variables are
marked as "derived." What does this indicate about those variables? (0.5)

1. **They are calculated from other variables in the dataset

2. They are collected directly from external sources
3. They are manually entered by the data analyst
4. They are irrelevant to the data analysis

Q2. You compare two histograms: Histogram X is wider with more spread in data, and
Histogram Y is narrower. What can you infer about the variability of data in each histogram?
(0.5)
1. ** Histogram X shows higher variability compared to Histogram Y
2. Histogram Y shows higher variability compared to Histogram X
3. Both histograms have identical variability
4. Histogram X and Histogram Y have no data variability

Q3. Analyze how the presence of multiple duplicate entries of numerical values in a dataset
impacts the calculation of the median (0.5)
1. The median is always one of the duplicate values
2. The median is the average of the duplicate values
3. ** The median is unaffected by the duplicates
4. The median cannot be determined if duplicates are present.

Q4. Given the following data set: 4, 8, 8, 10, 12, 14, 14, 14, 18, 20, determine which of the
following statements accurately describes the data. (0.5)
1. The mean, mode and median are equal.
2. The mean is greater than the median, but mode is less than the median.
3. ** The mode is greater than both the median and the mean.
4. The mean is less than the mode, but greater than the median.
Q5. Evaluate the following situation: You have a dataset with a kurtosis value of 0. What does
this indicate about the spread of data points in comparison to a normal distribution? (0.5)
1. ** The data is more spread out and has a flatter peak
2. The data is less widespread and has a sharper peak
3. The data is normally distributed
4. The data has the same spread as a normal distribution but with more extreme
values
Q6. You are working with a dataset that has a large standard deviation. What does this imply
about the values in the dataset relative to the mean? (0.5)
1. Most of the values are clustered closely around the mean
2. ** The values are widely spread out around the mean
3. The dataset is likely to have no outliers
4. The mean and standard deviation are equal
Q7. A researcher wants to test whether a new teaching method improves student test scores. What
would be the null hypothesis in this case? (0.5)

1. The new teaching method has less effective on test scores.

2. **The new teaching method improves test scores.
3. The old teaching method is better than the new one.
4. The test scores are not related to the teaching method.

Q8. When conducting a hypothesis test, if the p-value is less than the chosen level of significance
(alpha), what should you do? (0.5)

1. Fail to accept the null hypothesis.

2. **Accept the null hypothesis.
3. Reject the null hypothesis.
4. Modify the null hypothesis.
Q9. A researcher is conducting a chi-square test to determine whether there is a significant
association between gender (male, female) and preference for a new product (prefer, do not
prefer). The data is summarized in a contingency table and the chi-square test is performed.
Which of the following steps should the researcher take to apply the chi-square test correctly?
(0.5)

1. Calculate the mean and standard deviation of the data.

2. **Compare the observed frequencies to the expected frequencies under the
assumption of no association.
3. Ensure that the sample size is greater than 30 before applying for the chi-square
test.
4. Use the chi-square test only if the data is normally distributed.

Q10. A scientist wants to test whether a new drug has a different effect on blood pressure
compared to a placebo. The null hypothesis states that there is no difference in blood pressure
between the drug and placebo groups. A hypothesis test is performed at a significance level
of 0.05. Which of the following steps is appropriate when conducting the hypothesis test?
(0.5)

1. **Increases
2. Decreases
3. Remains same
4. None of the above
Type: DES

Q11. A company tests the efficiency of three different advertising strategies (Ad A, Ad B, Ad C) by
measuring the number of sales (in thousands) generated by each strategy over 4 days:. (4)

Day Ad A Ad B Ad C

1 30 22 25

2 35 28 30

3 40 32 35

4 42 30 40

Use one-way ANOVA to determine if there is a significant difference in the mean number of sales
across the three advertising strategies at a 5% significance level.
Solution: Calculations of Group Mean and overall mean = 1M, Sum of Squares = 1M, Mean of
Squares = 1M, F- Statistic & Conclusion = 1M
Q12. Given the dataset with following values
Values=[5,15,25,35,45,55,65,75,85,95]

a. Explain the need for data transformation in data analytics and how Min-Max
Normalization and Decimal Scaling help in preparing data for analysis.
b. Apply Min-Max Normalization to the dataset to transform the values to a range of [0, 1].
Show your calculations and results.
c. Apply Decimal Scaling to the dataset using a scaling factor of 100. Show your
calculations and results. (3)
Solution: Explanation on Normalization, min-max and decimal scaling (1M), b) Min max-
normalization (1M) and c) Decimal Scaling (1M).

(a) Normalization uses a mathematical function to transform numeric columns to a new range.
Normalization is important in preventing certain data analysis methods from giving some
variables undue influence over others because of differences in the range of their values.

The min–max transformation maps the values of a variable to a new range, such as from 0 to
1. The decimal scaling transformation moves the decimal point to ensure the range is
between 1 and −1.

Original Value Transformed Value

5 0

15 0.11

25 0.22

35 0.33

45 0.44

55 0.55

65 0.66

75 0.77

85 0.88

95 1

c)
Original Value Transformed Value

5 0.05

15 0.15

25 0.25

35 0.35

45 0.45

55 0.55

65 0.65

75 0.75

85 0.85

95 0.95

Q13. You are analyzing a dataset of customer transaction amounts for a retail company. The
dataset contains the following transaction values:

Values=[10,12,14,18,22,24,25,28,30,50]

a. As part of your analysis, you need to evaluate the role of the Interquartile Range (IQR) in
identifying outliers. Calculate the IQR for this dataset and determine if there are any
outliers.
b. Analyze how identifying and addressing outliers using the IQR can impact on the overall
quality of your analysis and the insights you can derive about customer spending
behavior. (3)

Solution

Calculation of Q1 and Q2 is (0.5 M each) 1M. Upper Bound (0.5M) Lower Bound (0.5M) Analysis
(1M)

a) Arrange the data in ascending order

Values=10,12,14,18,22,24,25,28,30,50
Q1(First Quartile)=lower half=[10,12,14,18,22]
The 25th percentile
Calculate median= 14

Q3(Third Quartile)
The 75th percentile
Upper half: [24, 25, 28, 30, 50]
Median=28

IQR=Q3-Q1

IQR =28-14=14
Determine any outliers
Lower bound= Q1-1.5 X IQR
=14-1.5 X14
=-7

Upper bound=Q3+1.5 X IQR

28+1.5 X 14
=28+21
=49

Values outside the range [-7,49] are considered as outliers.

In the dataset 50 is the outlier

B ) Inclusion of outliers inflate or deflate the mean transaction value, may contains variability in
customer spending patterns. Identifying and addressing outliers using the IQR improves the
quality of analysis by preventing distortion in key metrics like the mean and variance, leading to
more accurate insights about typical customer behavior

Q14. Consider a dataset containing the following information about a set of customers:

• Age
• Annual Income
• Spending Score (a measure of customer behaviour)
Using this dataset, perform an Exploratory Data Analysis to answer the following:
a. Identify the basic summary statistics (mean, median, and mode) for the Age and Annual
Income columns.
b. Identify any outliers in the Annual Income column using the Interquartile Range (IQR)
method.
c. Interpret the relationship between Age and Spending Score using a scatter plot or
correlation analysis. . (3)

Solution: Basic summary = 1M, Outliers =1M, Relationship analysis =1M

1. Basic Summary Statistics (Age and Annual Income) for the considered sample dataset
For each column, calculate:
• Mean: Average value.
• Median: Middle value.
• Mode: Most frequent value.

2. Outliers in Annual Income (IQR Method)

The IQR method to detect outliers involves:
1. Calculate the first (Q1) and third (Q3) quartiles of the Annual Income data.
o Q1 = Calculated value will be as per considered data
o Q3 = Calculated value will be as per considered data
2. IQR = Q3 - Q1
3. Identify the lower and upper bounds for outliers using:
o Lower bound = Q1 - 1.5 * IQR
o Upper bound = Q3 + 1.5 * IQR
o Identification of outlier based on considered dataset.
3. Relationship Between Age and Spending Score
To assess the relationship:
• Scatter Plot: Plot Age on the x-axis and Spending Score on the y-axis. If the points (as per
considered data), based on the trend (upward or downward), it can be concluded the
positive or negative correlation
OR
• Correlation Analysis: Calculate the correlation coefficient., derive the conclusion (conclusion
will be based on considered dataset)

Q15. A company is analyzing the distribution of delivery times for their products. After collecting
data, they notice that most deliveries happen around 2-3 days, but a few deliveries take much longer
due to unexpected delays.

a. Explain how skewness affects the distribution of delivery times and its impact on the mean,
median, and mode of the dataset . (3)

Solution:
• 1 mark for explaining skewness and identifying it as positive skewness.
• 1 mark for describing the impact on the mean (being higher due to outliers).
• 1 mark for explaining the impact on the median and mode, with the median being less
affected and the mode representing the most frequent value.
• Skewness: The distribution of delivery times is positively skewed since most deliveries
happen within a short time (2-3 days), but a few take much longer, pulling the tail to the
right.
• Impact on Mean, Median, and Mode:
o Mean: Since the data is positively skewed, the mean will be higher than the median because
the longer delivery times (outliers) increase the average.
o Median: The median, being the middle value, is less affected by the outliers and will be
closer to the bulk of the data (around 2-3 days).
o Mode: The mode will represent the most frequent delivery time, likely around 2-3 days,
unaffected by the skew.

Q16. A research study aimed to assess the effectiveness of a six-month high-intensity interval
training (HIIT) program in lowering heart rates. For adults in China, the average heart rate is
typically 72 beats per minute. After participating in HIIT, a sample of 25 individuals recorded
an average heart rate of 69 beats per minute, with a standard deviation of 6.5 beats per
minute. Using a statistical test, determine if there is significant evidence to suggest that the
HIIT successfully reduced heart rates
H0:μ=72

HA:μ<72

Where

Xˉ=69 (sample mean)

μ0=72 (population mean under the null hypothesis)

s=6.5 (sample standard deviation)

n=25n = 25n=25 (sample size)

The calculated t-statistic is approximately −2.31.

. (3)
Q17. A wildlife biologist is studying the alertness levels (arousal) of a population of "chill penguins"
living in a tropical zoo. The arousal levels in this population are normally distributed, with a known
standard deviation of 6. The biologist collects a sample of 49 "chill penguins" and measures their
arousal, finding a sample mean arousal level of 46.44 and a sample standard deviation of 5.6968.
Under normal conditions, the expected arousal level of these penguins is 47. Using a significance
level of α = 0.01, test whether the observed sample mean of 46.44 is significantly less than the
expected population mean of 47.
State the Hypotheses
• Null Hypothesis (H0): H0:μ=47
• Alternative Hypothesis (HA): HA:μ<47
This is a one-tailed test.
z-statistic: −0.653
The critical value for a one-tailed z-test at α=0.01 approximately −2.33
we fail to reject the null hypothesis.
Scheme:

State Hypothesis: 0.5 Mark

Calculation z stat: 0.5 Mark

Find critical value: 0.5 Mark

Decision about accept and Reject Hypothesis: 0.5 Mark

. (2)
Q18. The following data represents hemoglobin values in gm/dl for 10 patients:

10.5 9 6.5 8 11 7 7.5 8.5 9.5 12

Is the mean value for patients significantly differ from the mean value of general population (12
gm/dl)? Evaluate the role of chance. ( = 0.05)

Scheme

Calculation of mean and SD: 0.5 Mark

Calculation of Standerd Error: 0.5 Mark

Calculation of T Value: 0.5 Mark

Decision about accept and Reject Hypothesis: 0.5 Mark

. (2)

Q19. A company is evaluating the impact of two distinct advertising strategies (Strategy A and
Strategy B) across three regions (Region 1, Region 2, and Region 3) to understand how they influence
sales. The marketing team gathers sales performance data after applying both strategies in all three
regions.
1. Based on the given scenario, what is the appropriate statistical technique the company
should use to determine if there is a significant effect of the advertising strategy, region, or
their on sales?
2. After selecting the appropriate statistical technique, what kinds of conclusions could the
company expect from the analysis of the data?

1. Appropriate Statistical Technique: Two-Way ANOVA

In this scenario, the company is evaluating the impact of two factors (advertising strategy and
region) on sales (a continuous dependent variable). Since there are two independent variables
(advertising strategy and region) and the goal is to see if either of these factors, or their interaction,
significantly influences sales, the appropriate statistical technique to use is a Two-Way ANOVA
(Analysis of Variance).
• Factor 1: Advertising strategy (Strategy A vs. Strategy B)
• Factor 2: Region (Region 1, Region 2, Region 3)
This technique allows the company to assess:
1. The main effect of advertising strategy (A vs. B) on sales.
2. The main effect of region (Region 1, 2, 3) on sales.
3. The interaction effect between advertising strategy and region (i.e., whether the effect of
the advertising strategy depends on the region).
2. Expected Conclusions from the Analysis
After performing a Two-Way ANOVA, the company could draw the following types of conclusions:
• Main effect of advertising strategy: The company will determine whether there is a
significant difference in sales between Strategy A and Strategy B, irrespective of the region.
For example, they may conclude that one strategy consistently leads to higher sales across
all regions.
• Main effect of region: The company will assess whether there are significant differences in
sales between the regions, regardless of the advertising strategy used. This could help them
understand if certain regions generally perform better in terms of sales.
• Interaction effect: The analysis will show whether the effectiveness of an advertising
strategy varies by region. This would suggest that the best advertising strategy might depend
on the region. For example, Strategy A might perform better in Region 1 but worse in Region
3, indicating a need for a tailored approach.
Possible conclusions from the Two-Way ANOVA:
• If there’s no significant interaction but significant main effects: The company might
conclude that one strategy is generally better and that certain regions perform better
regardless of the strategy used.
• If there’s a significant interaction effect: The company would likely conclude that the best
advertising strategy depends on the specific region, and a "one size fits all" approach may
not work.
• If neither main effects nor interaction effects are significant: The company might conclude
that neither the advertising strategy nor the region has a significant impact on sales, and
other factors should be investigated.
Scheme
Appropriate statistical technique: 1 Mark
Expected conclusions: 1 Mark
. (2)

Business Statistics Final Exam Solutions
100% (4)
Business Statistics Final Exam Solutions
10 pages
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Multivariate Analysis – The Simplest Guide in the Universe: Bite-Size Stats, #6
From Everand
Multivariate Analysis – The Simplest Guide in the Universe: Bite-Size Stats, #6
Lee Baker
No ratings yet
Formulation of The Research Chapter 1
No ratings yet
Formulation of The Research Chapter 1
4 pages
FS2-Episode 3-AGT
72% (18)
FS2-Episode 3-AGT
11 pages
DA Script
No ratings yet
DA Script
52 pages
Reviewer For 4th Quarter
No ratings yet
Reviewer For 4th Quarter
41 pages
Sheet 2
No ratings yet
Sheet 2
2 pages
Allama Iqbal Open University, Islamabad Warning: Department of Statistics
No ratings yet
Allama Iqbal Open University, Islamabad Warning: Department of Statistics
3 pages
Soc 212
No ratings yet
Soc 212
14 pages
FINAL EXAMINATION (Math in Modern World) Questionnaire
No ratings yet
FINAL EXAMINATION (Math in Modern World) Questionnaire
4 pages
Mock Test Official
No ratings yet
Mock Test Official
15 pages
BUSINESS STATS - Consolidated
No ratings yet
BUSINESS STATS - Consolidated
10 pages
Stats Mcqs Calculations
No ratings yet
Stats Mcqs Calculations
21 pages
Quantitative Analysis: Dr. Basheer Ahmad Samim
No ratings yet
Quantitative Analysis: Dr. Basheer Ahmad Samim
71 pages
PDF MCQs Creation Request
No ratings yet
PDF MCQs Creation Request
9 pages
MB0040 MQP Answer Keys
No ratings yet
MB0040 MQP Answer Keys
19 pages
DS 5-Marks Semeseter Suggestion
No ratings yet
DS 5-Marks Semeseter Suggestion
56 pages
CS001-B03 - Exploratory Data Analysis 20
No ratings yet
CS001-B03 - Exploratory Data Analysis 20
7 pages
đề trí
No ratings yet
đề trí
7 pages
Q2 Quiz 1 and 2 - 2023-2024
No ratings yet
Q2 Quiz 1 and 2 - 2023-2024
2 pages
DA Practice Questions - Unit - 2
No ratings yet
DA Practice Questions - Unit - 2
6 pages
Chapter 3 Solutions
No ratings yet
Chapter 3 Solutions
17 pages
MBA Integrated WINTER 2020
No ratings yet
MBA Integrated WINTER 2020
3 pages
Nmims Decision Science Applicable For June 2020 Exams
No ratings yet
Nmims Decision Science Applicable For June 2020 Exams
10 pages
Statistic Homework (Assignment) 9.0
No ratings yet
Statistic Homework (Assignment) 9.0
12 pages
Part A: Mcqs (10 Questions) : Gen-Z Iitian Stats 1 Most Important Qs
No ratings yet
Part A: Mcqs (10 Questions) : Gen-Z Iitian Stats 1 Most Important Qs
4 pages
BSA - PUT - SEM I - 21-22 Solution
No ratings yet
BSA - PUT - SEM I - 21-22 Solution
16 pages
Business Statistics Final Exam Solutions
No ratings yet
Business Statistics Final Exam Solutions
10 pages
MNUR 108 Statistics
100% (1)
MNUR 108 Statistics
8 pages
Bam 212
No ratings yet
Bam 212
7 pages
DS Assignment COMPLETED
No ratings yet
DS Assignment COMPLETED
11 pages
Tutorial 2 - Asnwer Key
No ratings yet
Tutorial 2 - Asnwer Key
14 pages
Exam Question
100% (1)
Exam Question
8 pages
Stats - The Theory 2
No ratings yet
Stats - The Theory 2
25 pages
Chapter 3. Descriptive Statistics - Numerical Measures - Aug 2023
No ratings yet
Chapter 3. Descriptive Statistics - Numerical Measures - Aug 2023
4 pages
This Study Resource Was: Assignment Number - 5
No ratings yet
This Study Resource Was: Assignment Number - 5
9 pages
QTPR Quiz 1 19score of 30
No ratings yet
QTPR Quiz 1 19score of 30
11 pages
NM
No ratings yet
NM
18 pages
Business Statistics - Final Exam (70 Questions)
No ratings yet
Business Statistics - Final Exam (70 Questions)
5 pages
Worksheets-Importance of Mathematics
No ratings yet
Worksheets-Importance of Mathematics
38 pages
Review Mid Term Exam 2 Answer Keys
No ratings yet
Review Mid Term Exam 2 Answer Keys
11 pages
CCW331 Set4
No ratings yet
CCW331 Set4
5 pages
Complete Pastpaper
No ratings yet
Complete Pastpaper
82 pages
2018 Medical Statistics Paper A
No ratings yet
2018 Medical Statistics Paper A
8 pages
Ch7 L2 CharacteristicsOfProduct&Types
No ratings yet
Ch7 L2 CharacteristicsOfProduct&Types
5 pages
Copy-Sta 131&132 Study Questions by Premier (MR - Humble PDF Archive)
No ratings yet
Copy-Sta 131&132 Study Questions by Premier (MR - Humble PDF Archive)
9 pages
Ds 5 Marks Final
No ratings yet
Ds 5 Marks Final
11 pages
PracticeTest 1
No ratings yet
PracticeTest 1
5 pages
BCOM 209 Business Statistics
No ratings yet
BCOM 209 Business Statistics
12 pages
Statistics - Sample Qualifier Soution (Mock Test)
No ratings yet
Statistics - Sample Qualifier Soution (Mock Test)
25 pages
Phython Assignment
No ratings yet
Phython Assignment
30 pages
Unit 1 Review 2017
No ratings yet
Unit 1 Review 2017
4 pages
Maths Apt
No ratings yet
Maths Apt
11 pages
20 MASTER Exercise Answers MBA
No ratings yet
20 MASTER Exercise Answers MBA
38 pages
Week 5 Worksheet Answers
No ratings yet
Week 5 Worksheet Answers
6 pages
QNT 351 Final Exam Correct Answers 100%
100% (1)
QNT 351 Final Exam Correct Answers 100%
4 pages
Exercises
100% (1)
Exercises
37 pages
Mock Exam Midterm Statistics I
No ratings yet
Mock Exam Midterm Statistics I
24 pages
BS - Internal Exam - 2023
No ratings yet
BS - Internal Exam - 2023
4 pages
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
From Everand
Certified Lean Six Sigma Green Belt (ICGB) Practice Questions And Exam Tests ICGB Exam Guidebook And Updated Questions
Idea Link
No ratings yet
Statistical Analysis and Visualization
From Everand
Statistical Analysis and Visualization
Mohit Chatterjee
No ratings yet
2.7 Eigenvalues and Eigenvectors
No ratings yet
2.7 Eigenvalues and Eigenvectors
9 pages
2.4 Inverse Using Elementary Operations
No ratings yet
2.4 Inverse Using Elementary Operations
3 pages
1.13 Higher Order Linear Differential Equations
No ratings yet
1.13 Higher Order Linear Differential Equations
13 pages
1.16 Legendre's Linear Equation
No ratings yet
1.16 Legendre's Linear Equation
2 pages
Digital Communication
No ratings yet
Digital Communication
9 pages
1.15 Euler-Cauchy Linear Equation
No ratings yet
1.15 Euler-Cauchy Linear Equation
2 pages
1.9 First Order Linear Equations
No ratings yet
1.9 First Order Linear Equations
2 pages
Lesson Plan - July 2023 - BET - ELE 1071
No ratings yet
Lesson Plan - July 2023 - BET - ELE 1071
1 page
1.3 Families of Curves
No ratings yet
1.3 Families of Curves
2 pages
1.5 Variable Separable Equations
No ratings yet
1.5 Variable Separable Equations
2 pages
1.14 Variation of Parameters
No ratings yet
1.14 Variation of Parameters
3 pages
Tutorial-5 1
No ratings yet
Tutorial-5 1
1 page
Tutorial-6 1
No ratings yet
Tutorial-6 1
6 pages
End Sem QP Format and Sample QP - Communication Skills in English
No ratings yet
End Sem QP Format and Sample QP - Communication Skills in English
3 pages
1.2 Formulation of Differential Equations by Eliminating Arbitrary Constants
No ratings yet
1.2 Formulation of Differential Equations by Eliminating Arbitrary Constants
3 pages
1.7 Differential Equations With Linear Coefficients
No ratings yet
1.7 Differential Equations With Linear Coefficients
2 pages
Unit IV-Communications
No ratings yet
Unit IV-Communications
3 pages
DSE 2224 21 Mar 2024
No ratings yet
DSE 2224 21 Mar 2024
7 pages
Case Study 1
No ratings yet
Case Study 1
9 pages
Example Problem-3
No ratings yet
Example Problem-3
7 pages
Tutorial-3 2
No ratings yet
Tutorial-3 2
10 pages
MIE 1071 Bme
No ratings yet
MIE 1071 Bme
7 pages
Example Problem-1
No ratings yet
Example Problem-1
10 pages
DPS PYQs
No ratings yet
DPS PYQs
5 pages
Lecture-4-Carbohydrates & ATP
No ratings yet
Lecture-4-Carbohydrates & ATP
5 pages
Student Answer Script View: MIT MPL - 2nd-4th and 6th Semester - Midterm Examination - Mar 2024 Answer Sheet
No ratings yet
Student Answer Script View: MIT MPL - 2nd-4th and 6th Semester - Midterm Examination - Mar 2024 Answer Sheet
49 pages
Sessional - 2, April 2023 Machine Learning (DSE 2254), IV Sem, DSE Date: 19/04/2023 Max. Marks: 15
No ratings yet
Sessional - 2, April 2023 Machine Learning (DSE 2254), IV Sem, DSE Date: 19/04/2023 Max. Marks: 15
7 pages
ENGLISH Solutions
No ratings yet
ENGLISH Solutions
4 pages
MID SEM QP 2024 MARCH Final
No ratings yet
MID SEM QP 2024 MARCH Final
4 pages
Mid Semester Examination - Scheme of Evaluation: Vi - Semester B.Tech (Data Science and Engineering)
No ratings yet
Mid Semester Examination - Scheme of Evaluation: Vi - Semester B.Tech (Data Science and Engineering)
13 pages
IM-Final Module For THC 102 Risk Management
No ratings yet
IM-Final Module For THC 102 Risk Management
90 pages
Thesis Bar Exam
100% (3)
Thesis Bar Exam
4 pages
Satellite Image Classification Using Dec PDF
No ratings yet
Satellite Image Classification Using Dec PDF
7 pages
GFPP3113 Politik Ekonomi Antarabangsa
No ratings yet
GFPP3113 Politik Ekonomi Antarabangsa
11 pages
1 Kates Etal Sustainability Science 01
No ratings yet
1 Kates Etal Sustainability Science 01
3 pages
Chem PSP Here
No ratings yet
Chem PSP Here
6 pages
Enhancing The Students' Comprehension On DNA Replication and Transcription Through The Use of Strategic Intervention Material
No ratings yet
Enhancing The Students' Comprehension On DNA Replication and Transcription Through The Use of Strategic Intervention Material
4 pages
5 - Reliability
100% (1)
5 - Reliability
14 pages
Public Policy Notes Nigeria Notes
No ratings yet
Public Policy Notes Nigeria Notes
156 pages
2022 Improved Cone Penetration Test Predictions of The State Parameter of Loose Mine Tailings
No ratings yet
2022 Improved Cone Penetration Test Predictions of The State Parameter of Loose Mine Tailings
12 pages
Specialized Movement Skills
No ratings yet
Specialized Movement Skills
7 pages
Evidence-Based Medicine
No ratings yet
Evidence-Based Medicine
69 pages
Me Chapter 4 Demand Estimation Covid 19 Student Version
No ratings yet
Me Chapter 4 Demand Estimation Covid 19 Student Version
18 pages
Article - Designing Out Construction Waste Using BIM Technology
No ratings yet
Article - Designing Out Construction Waste Using BIM Technology
11 pages
Group 2, Jai Ambe Rudra Auto Garage
No ratings yet
Group 2, Jai Ambe Rudra Auto Garage
48 pages
Impact of Employee Motivation On Business Performance
100% (1)
Impact of Employee Motivation On Business Performance
13 pages
Strategic Leadership - Chapter-2
No ratings yet
Strategic Leadership - Chapter-2
29 pages
NWRB Bicol Tor
No ratings yet
NWRB Bicol Tor
9 pages
Organizationa L Culture: Presented by
100% (1)
Organizationa L Culture: Presented by
67 pages
Homework 1 Ishchenko M ХД-21мп
No ratings yet
Homework 1 Ishchenko M ХД-21мп
3 pages
AHS 2012-13 (Final Report)
No ratings yet
AHS 2012-13 (Final Report)
88 pages
A Hybrid Deep Learning Model For Consumer Credit Scoring: Bing Zhu, Wenchuan Yang, Huaxuan Wang, Yuan Yuan
No ratings yet
A Hybrid Deep Learning Model For Consumer Credit Scoring: Bing Zhu, Wenchuan Yang, Huaxuan Wang, Yuan Yuan
4 pages
Abroad News Paper 11 May
No ratings yet
Abroad News Paper 11 May
6 pages
Don Honorio Ventura State University: College of Business Studies
No ratings yet
Don Honorio Ventura State University: College of Business Studies
13 pages
Int. Geodetic Research Projects: - Fully Integrated in Lectures and Thesis of Geomatics (MSC) Study Program
No ratings yet
Int. Geodetic Research Projects: - Fully Integrated in Lectures and Thesis of Geomatics (MSC) Study Program
1 page
Debrah Bibliometric-Qualitative Literature Review
No ratings yet
Debrah Bibliometric-Qualitative Literature Review
66 pages
Assessment Centre Guide
100% (3)
Assessment Centre Guide
35 pages
BAC 107 Week 9 Results and Discussion and Research Writing
No ratings yet
BAC 107 Week 9 Results and Discussion and Research Writing
11 pages

DA Answer-Key

Uploaded by

DA Answer-Key

Uploaded by

Type: MCQ

1. **They are calculated from other variables in the dataset

1. The new teaching method has less effective on test scores.

1. Fail to accept the null hypothesis.

1. Calculate the mean and standard deviation of the data.

Original Value Transformed Value

a) Arrange the data in ascending order

Upper bound=Q3+1.5 X IQR

Values outside the range [-7,49] are considered as outliers.

In the dataset 50 is the outlier

Solution: Basic summary = 1M, Outliers =1M, Relationship analysis =1M

2. Outliers in Annual Income (IQR Method)

Xˉ=69 (sample mean)

μ0=72 (population mean under the null hypothesis)

s=6.5 (sample standard deviation)

n=25n = 25n=25 (sample size)

The calculated t-statistic is approximately −2.31.

State Hypothesis: 0.5 Mark

Calculation z stat: 0.5 Mark

Find critical value: 0.5 Mark

Decision about accept and Reject Hypothesis: 0.5 Mark

10.5 9 6.5 8 11 7 7.5 8.5 9.5 12

Calculation of mean and SD: 0.5 Mark

Calculation of T Value: 0.5 Mark

Decision about accept and Reject Hypothesis: 0.5 Mark

1. Appropriate Statistical Technique: Two-Way ANOVA

You might also like