Questions

These are questions to judiciary

Uploaded by

ashutosh

The next 8 questions ask you to analyze a dataset of student performance for 1,000 Freshmen
(1st-year students) at a specific SDUSD high school, where performance is measured across 11
courses taken by those students in the 2023-2024 school year.

Please use the scoresdat.csv dataset.

Lily is the district-wide Manager of Student Achievement and also your direct boss. She
would like to cluster students into cohorts in order to tailor the academic schedules
separately for each cohort.

However, Lily does not want to simply average the 11 test scores together, as certain
scores are likely highly correlated (eg, Math and Physics, English and Writing) and so a
simple average doesn't feel like the right approach to her. She is also wary that using 11
test scores in a clustering algorithm would mean that the algorithm operates in
11-dimensional space, which to her feels high.

You advise her that one option could be to use Principal Components Analysis on the set of
11 test scores to see if there is a lower-dimensional representation of the data that would
suffice.

Q7) Run PCA on the 11 test scores. How many principal components are required to
represent 90% of the variance in the data?
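
One way to approach Q7 in base R is sketched below. It is only an illustration: the column names of scoresdat.csv are not given here, so the block simulates 11 hypothetical score columns (`score1` to `score11`), and standardizing the scores before PCA is an assumption rather than something the question specifies.

```r
# Synthetic stand-in for scoresdat.csv: 1,000 students x 11 scores.
# The real column names are unknown, so score1..score11 are hypothetical.
set.seed(1)
n <- 1000
ability_stem <- rnorm(n)
ability_lang <- rnorm(n)
scores <- sapply(1:11, function(j) {
  base <- if (j <= 6) ability_stem else ability_lang
  70 + 10 * base + rnorm(n, sd = 4)
})
colnames(scores) <- paste0("score", 1:11)

# PCA on centered, standardized scores (standardizing is an assumption).
pca <- prcomp(scores, center = TRUE, scale. = TRUE)

# Share of variance per component and the running total.
var_share <- pca$sdev^2 / sum(pca$sdev^2)
cum_share <- cumsum(var_share)

# Smallest number of components whose cumulative share reaches 90%.
n_components_90 <- which(cum_share >= 0.90)[1]
print(round(cum_share, 3))
print(n_components_90)
```

With the real data, `scores` would be replaced by the 11 score columns read via `read.csv("scoresdat.csv")`; `summary(pca)` reports the same cumulative proportions.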

Q8) Lily thanks you for your excellent work and asks you to use the k-means algorithm to
cluster the students, using the first 3 principal components as the "input data" to the
algorithm.

However, she is unsure of how many clusters might exist in this student population. Using
the "Elbow Method" with a Scree Plot, how many clusters best represent these data?
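
The elbow plot described in Q8 can be sketched as follows. The input here is simulated (three hypothetical groups in 3 dimensions standing in for the first 3 principal components), and the range K = 1 to 8 and `nstart = 25` are arbitrary choices for illustration.

```r
set.seed(2)
# Synthetic stand-in for the first 3 principal components (hypothetical data):
# points scattered around three different centers.
centers <- matrix(c(-3, 0, 0,
                     3, 0, 0,
                     0, 4, 0), ncol = 3, byrow = TRUE)
pcs <- centers[sample(1:3, 600, replace = TRUE), ] +
  matrix(rnorm(600 * 3), ncol = 3)
colnames(pcs) <- c("PC1", "PC2", "PC3")

# Total within-cluster sum of squares for each candidate K.
k_values <- 1:8
wss <- sapply(k_values, function(k) {
  kmeans(pcs, centers = k, nstart = 25)$tot.withinss
})

# Plot WSS against K and look for the bend (the "elbow").
plot(k_values, wss, type = "b",
     xlab = "Number of clusters K",
     ylab = "Total within-cluster sum of squares")
```

With the real data, `pcs` would be the first three columns of the PCA scores (for example `pca$x[, 1:3]` from `prcomp`).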

Q9) For the number of clusters selected in the last question, what is the total within-cluster
sum of squares?
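
For Q9, `kmeans()` in R reports this quantity as `tot.withinss`. The sketch below uses simulated data and an arbitrary K = 4 (both assumptions) and checks the reported value against a manual computation.

```r
set.seed(3)
# Hypothetical 3-column input standing in for the first 3 principal components.
pcs <- matrix(rnorm(300 * 3), ncol = 3)
fit <- kmeans(pcs, centers = 4, nstart = 25)  # K = 4 is an arbitrary example

# Reported total within-cluster sum of squares.
print(fit$tot.withinss)

# Same quantity by hand: squared distance from each point to its own centroid.
manual_wss <- sum((pcs - fit$centers[fit$cluster, ])^2)
print(all.equal(fit$tot.withinss, manual_wss))
```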

Q10) Lily would like to visualize the result. Add the cluster assignment as a categorical (ie
"factor") variable to the original dataset. Create a scatterplot of mathematics scores on the
horizontal axis and English scores on the vertical axis. Color the points according to their
cluster.

After looking at your plots, Lily remarks that some of the colors overlap, ie, that there is not
a "hard boundary" between clusters. When she was a student and took MGT 100, the
in-class example had "hard boundaries" without overlap. She feels like something is
wrong. You assure her that everything is correct. Select the reason(s) that explain why this
"color overlap" happens in this particular case.
a) K is larger here than the in-class example
b) K-means was fit on 3 variables rather than 2
c) Students generally scored lower in Mathematics than in English
d) The plot uses "original" variables whereas K-means was fit on Principal Components
e) Student test-score data is fundamentally different than smartphone ownership data
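
The plotting step in Q10 could look roughly like the sketch below. The data frame and its column names (`math`, `physics`, `english`, `writing`) are hypothetical stand-ins for the real dataset, and K = 3 is chosen only for illustration.

```r
set.seed(4)
# Synthetic stand-in for the original dataset; column names are hypothetical.
n <- 300
students <- data.frame(
  math    = rnorm(n, 70, 10),
  physics = rnorm(n, 68, 10),
  english = rnorm(n, 75, 8),
  writing = rnorm(n, 74, 8)
)

# K-means on the first 3 principal components (K = 3 for illustration only).
pca <- prcomp(students, center = TRUE, scale. = TRUE)
fit <- kmeans(pca$x[, 1:3], centers = 3, nstart = 25)

# Store the cluster assignment as a factor on the original data.
students$cluster <- factor(fit$cluster)

# Scatterplot in the original variables, colored by cluster.
plot(students$math, students$english,
     col = as.integer(students$cluster), pch = 19,
     xlab = "Mathematics score", ylab = "English score")
legend("topright", legend = levels(students$cluster),
       col = seq_along(levels(students$cluster)), pch = 19,
       title = "Cluster")
```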

Question 11
Satisfied with your work, Lily takes your result to a Senior Councilwoman for
SDUSD, Xiaotong, in order to propose amending next year's academic schedule to
accommodate the K clusters you identified above. Xiaotong is curious about the
methods used.

Lily explains that "PCA and K-Means are both unsupervised algorithms that look
for structure in data." Is Lily's statement correct?

Yes, correct

No, incorrect

Question 12

Lily also mentions that "segmenting students into cohorts is both an art and a
science". Is Lily's statement correct?

Yes, correct

No, incorrect

Question 13

Xiaotong, being quite informed of advanced analytic methods, inquires about the
certainty of the result. Lily assures her that "K-means is guaranteed to find the
global minimum value for the within-cluster sum of squares." Is Lily's statement
correct?

Yes, correct

No, incorrect

Question 14

Xiaotong is persuaded by your results, as presented by Lily, and agrees to
re-work the 2024-2025 academic schedule for the Sophomore, Junior, and Senior
(ie, 2nd year, 3rd year, and 4th year) students at this high school to accommodate
the K cohorts of students that you have presented. However, Xiaotong is
uncertain what to do with the incoming Freshman students.
Explain (1) how you, Lily, and Xiaotong can assign Sophomore, Junior, and
Senior students into the identified cohorts, and (2) why you will have difficulty
assigning incoming Freshman students into the identified cohorts.
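
As a rough illustration of the mechanics behind Question 14, the sketch below projects students with observed scores onto an existing PCA and assigns each to the nearest existing centroid. All data are simulated, the `score1` to `score11` names are hypothetical, and nearest-centroid assignment is one possible approach rather than a prescribed method. It relies on every student having all 11 scores available.

```r
set.seed(5)
# Hypothetical scores for students who already took the 11 courses.
train <- matrix(rnorm(200 * 11, mean = 70, sd = 10), ncol = 11,
                dimnames = list(NULL, paste0("score", 1:11)))
pca <- prcomp(train, center = TRUE, scale. = TRUE)
fit <- kmeans(pca$x[, 1:3], centers = 3, nstart = 25)

# Students with observed scores can be projected onto the same components...
new_scores <- matrix(rnorm(5 * 11, mean = 70, sd = 10), ncol = 11,
                     dimnames = list(NULL, colnames(train)))
new_pcs <- predict(pca, newdata = new_scores)[, 1:3, drop = FALSE]

# ...and assigned to the nearest existing centroid (squared Euclidean distance).
# Rows of dist_to_centers are students, columns are clusters.
dist_to_centers <- apply(fit$centers, 1, function(ctr) {
  rowSums(sweep(new_pcs, 2, ctr)^2)
})
assigned <- apply(dist_to_centers, 1, which.min)
print(assigned)
```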

While you were working for the SDUSD, your friend Aslan got hired at Rivian, a relatively
new firm that manufactures high-end electric vehicles in the SUV and Truck categories.
You sign a non-disclosure agreement with Rivian, enabling you to work alongside Aslan.

The next 10 questions ask you to help Aslan analyze conjoint data from consumers making
vehicle choices on a survey.

Please use the conjointdat.RData dataset. Use load() to get the data in R. The data are
ready to be used with the mlogit() command (ie, I already did the dfidx / mlogit.data thing so
you don't need to do that).
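
Because the name of the object stored inside conjointdat.RData is not stated, it can help to capture what `load()` returns. The sketch below demonstrates this on a small temporary file; the file name, object name, and columns are all hypothetical stand-ins.

```r
# Stand-in for conjointdat.RData: save a small hypothetical object to a temp file.
tmp_file <- file.path(tempdir(), "conjoint_example.RData")
example_dat <- data.frame(choice = c(TRUE, FALSE), price = c(70, 90))
save(example_dat, file = tmp_file)
rm(example_dat)

# load() restores the saved objects under their original names and
# invisibly returns those names, which reveals what the file contained.
loaded_names <- load(tmp_file)
print(loaded_names)
str(get(loaded_names[1]))
```

With the real file, the object named in `loaded_names` would then be passed as the `data` argument to `mlogit()`; the model formula depends on column names that are not shown here.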

Q16

Q17
Q18

Q19

Q20

Q21
Q22
Q23

Q24

Q25
Q26

Inspired by using "customer" analytic techniques in examples including combating
homelessness in Utah, you take a job with San Diego's local government to study adoption
of "clean" energy products such as solar panels.

The next 4 questions ask you to analyze a dataset of first-time adopters of solar panels for
residential homes.

Please use the sundat.csv dataset, which has the following 2 variables:

● month - an integer counter of the first 20 months of solar panel sales in San
Diego
● NFTAs - the number of first-time adopters of solar panels (in 1,000's)

Q28
Q29

Q33
