COM2007 CaseStudy Sample
COM2007 CaseStudy Sample
ON STUDENT
DATA
TABLE OF CONTENTS
01 DATASET INTRO
02 METHODOLOGY
03 ANALYSIS
RESULTS
04 CONCLUSIONS
THE DATASET
DERIVED FROM…
The dataset is published on the website of Center for Machine
Learning and Intelligent Systems of University of California.
THE DATASET IS
ABOUT…
The data was collected from school reports and questionnaires,
the research objects are 395 students in secondary education
of two Portuguese schools.
Access Power BI
To create datatables that To visualize key insights
could highlight the key that we found with Access
insights SQL
WHAT
WE HAVE
FOUND
INSIGHT
#1
Performance difference in two different schools,
whether extra educational support and absences are two attributes that impact the
students’ grades.
Student Grade in different
School
<SELECT StudentInfo.School, COUNT(School) AS NoOfStudents, Avg(StudentInfo.G1) AS AvgGradeIn1stPeriod,
Max(G1) AS HighestGradeIn1stPeriod, Min(G1) AS LowestGradeIn1stPeriod, Avg(StudentInfo.G2) AS
AvgGradeIn2ndPeriod, Max(G2) AS aHighestGradeIn2ndPeriod, Min(G2) AS LowestGradeIn2ndPeriod,
Avg(StudentInfo.G3) AS AvgFinalGrade, Max(G3) AS HighestFinalGrade, Min(G3) AS LowestFinalGrade
FROM StudentInfo GROUP BY StudentInfo.School;>
Student Grade in School Of
“GP” is better
We found that in every column, the grade of GP’s students are better than the
MS Students.
Academic Result of School of GP is better!
Whether absence affects the
grade?
<SELECT absences, COUNT(Absences) AS NoOfStudent, AVG(G1) AS AvgOf1st, AVG(G2) AS AVGof2nd,
AVG(G3) AS AVGoffinalgd
FROM StudentInfo GROUP BY Absences;>
Whether absence affects the
grade?
Maybe Yes.
Using the final grade and
absences data into scatter plot.
Avg Grade
SchScp: 9.43 NoSchSup: 10.67
We estimate that “school support” is for students whose result is not ideal for
making improvement.
Edu_Gardian
133
NonEdu_Gardian
262 It is not always the
parent who has a
higher education level
who takes the role of
the guardian.
SELECT COUNT(S.ID) AS Job_Guardian, COUNT(A.ID) AS All_Guardian, All_Guardian -
Job_Guardian AS NonJob_Guardian
FROM
(SELECT ID, Mjob, Fjob, guardian
FROM StudentInfo
WHERE (Mjob = "at_home" AND guardian = "mother") OR (Fjob = "at_home" AND
guardian = "father")) AS S RIGHT JOIN (SELECT *
FROM StudentInfo
WHERE Mjob = "at_home" OR Fjob = "at_home") AS A ON S.ID = A.ID
;
In a family, if there is
one parent that is “at
home”, then this
parent has higher
chance to be the
guardian of the
student.
SELECT COUNT(T.ID) AS Teacher_Guardian, COUNT(A.ID) AS All_Guardian,
All_Guardian - Teacher_Guardian AS NonTeacher_Guardian
FROM
(SELECT *
FROM (SELECT ID, Mjob, Fjob, guardian FROM StudentInfo WHERE Mjob = "teacher"
OR Fjob = "teacher") AS [%$##@_Alias]
WHERE (Mjob = "teacher" AND guardian = "mother") OR (Fjob = "teacher" AND
guardian = "father")) AS T RIGHT JOIN (SELECT *
FROM StudentInfo
WHERE Mjob = "teacher" OR Fjob = "teacher") AS A ON T.ID = A.ID;
In a family, if there is
one parent that is
“teacher”, then this
parent has higher
chance to be the
guardian of the
student.
SELECT COUNT(A.ID) AS Apart_other, COUNT (B.ID) AS All_other, All_other -
Apart_other AS NonApart_other
FROM
(SELECT ID, Pstatus, guardian
FROM StudentInfo
WHERE Pstatus = "A" AND guardian = "other") AS A RIGHT JOIN
(SELECT ID, Medu, Fedu, Mjob, Fjob, Pstatus, guardian
FROM StudentInfo
WHERE guardian = "other") AS B ON A.ID = B.ID;
If there is no
internet &
extra-curricular
activities, there
can have better
performance by
putting more
time on
studying.
Internet & Activities
If there has
both internet &
extra-curricular
activities, there
can have better
performance
than average.
Internet & Activities
(Mjob=teacher) By <SELECT
COUNT(*)
FROM
Studentinfo
WHERE
Fjob='teacher'
and
internet='yes'
and
activities='yes';
>
There are 12
records and
shows the
representation
of reliability.
INSIGHT
#4
How student behavior affects student’s performance.
Including attributes:
Studytime, goout, dalc, romantic.
How studytime affect the
grade
SELECT studytime, COUNT(studytime) AS numberofstudent, AVG(G1)+AVG(G2)+AVG(G3)
AS TotalAvgGrade FROM StudentInfo GROUP BY studytime;
the grade
AVG(G1)+AVG(G2)+AVG(G3) AS TotalAvgGrade FROM StudentInfo GROUP BY
romantic;