Instructor:: Semester Project Mam. Yella Mehroze

This document describes a student project to predict student engagement levels using a dataset containing student performance data. The dataset has 13 attributes including class label and will be used to classify students as having high or low engagement. Classification techniques in Weka, such as J48 decision trees, will be applied for model training and prediction. Feature selection will also be performed to identify the most important attributes for the model. The trained model will then be tested on unlabeled data to predict engagement levels.

Uploaded by

Bilal Sheikh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views7 pages

Instructor:: Semester Project Mam. Yella Mehroze

Uploaded by

Bilal Sheikh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Data Warehouse Data Mining

Semester Project

Instructor: Mam. Yella Mehroze

Student Details
Name Bilal Ahmad
Course Data mining
Reg No. FA19-BCS-136
Section C
Date 18/12/2021
Student performance
prediction

Dataset details
Description:
Student performance prediction database will be use in this project
to predict the engagement level of student with learning resource.
Predicted class label will show high or low based on the interaction
of student with study material. Dataset contains 13 attributes
including class attribute and almost 500 records.
It includes:
 Login: How many times a student login to portal.
 Content Read: How many times a student read the study
material content.
 Forum Reads: How many times a student read the
problem/issue on community forum.
 Forum post: How many times a student posts the
problem/issue on community forum.
 Review: How many times a student reviews the quiz before
submission.
 Lateness indicator: Is student submit the late assignment.
There are multiple lateness indicator attributes.
 Average assignment submission time: How many hours
student takes to submit the assignment. There are multiple
lateness indicator attributes.
 Engagement level: This is a class label that will show the level
of student engagement to study materials based on previous
attributes.

Source:
Source of the dataset is from github (open source platform) and you
can get this dataset from here.

Problem type:
I want to predict class labels, so I will use classification because
classification is used to predict the nominal value of class attribute.

Techniques:
I will use weka tool to classify to train model and predict the class
labels. I will also use feature selection to select the relevant
attributes for model training.
Process
Preprocessing:
Dataset is already preprocessed. There is no negative, null, or empty
value. Class attribute label is nominal and contains H (high) or L (low)
labels.

Classification:
I will use classification because classification is used to predict the
nominal value of class attribute. I will train the model using different
algorithm and test options.
Here are some steps to perform classification:
 Load training data into weka.
 In classify tab, choose tree and select J48 algorithm.
 Select cross validation (10 folds) in test option.
 Select class attribute
 Start the process.
Result is showing that dataset is 99% correctly classified. There are
other details like total instances, root mean, absolute error,
confusion matrix, accuracy details etc.

Right click on the result item and select “save the model” for test
data label prediction.
Now, create the copy of training data, shuffle the order, and remove
all the labels of class attribute to check either trained model can
predict class labels or not.
Here are some steps to perform prediction:
 In classify tab, choose supplied test data in test options.
 Load the test data file (without labels).
 Choose class attribute.
 Click on more options, choose Plain text in output prediction.
 Reevaluate the model on current dataset.
 Result will show the predicted class labels.

Result is showing the predicted class labels as high or low based on

trained model.
Association rules:
Association rules are not applicable on this type of dataset and this
feature is out of scope. Association rules are only applicable if
dataset contain itemset.
Feature selection:
The attribute selection task essentially consists in selecting a subset
of originally available attributes to be subsequently used for model
creation. For this purpose, I will use selectAttribute tab to select top
10 attribute in dataset that will be use in training model.
Here are some steps to perform feature selection:
 Load training data into weka.
 In classify tab, choose info gain in attribute evaluator.
 Set number option to 10 in Ranker.
 Select cross validation (10 folds) in test option.
 Select class attribute
 Start the process.

Result is showing relevant attributes which are necessary for model

training. Attributes are short on the base of average rank. Select top
rank attributes for training model. Class attribute will be mandatory.

HCIA-AI V3.0 Training Material
100% (2)
HCIA-AI V3.0 Training Material
474 pages
Detection of Parkinson's Disease Using Machine Learning
75% (4)
Detection of Parkinson's Disease Using Machine Learning
91 pages
CB Insights Tech Market Map AI Lifecycle Management in Enterprise IT
No ratings yet
CB Insights Tech Market Map AI Lifecycle Management in Enterprise IT
22 pages
CE802 Pilot
No ratings yet
CE802 Pilot
2 pages
Capstone 2 Corizo
No ratings yet
Capstone 2 Corizo
2 pages
Assignment 1-Preprocessing Handon
No ratings yet
Assignment 1-Preprocessing Handon
6 pages
Predicting Employees Performance Using Data Mining Techniques
No ratings yet
Predicting Employees Performance Using Data Mining Techniques
12 pages
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
No ratings yet
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
13 pages
DWM Lab Manual
No ratings yet
DWM Lab Manual
92 pages
Machine Learning Project Checklist
No ratings yet
Machine Learning Project Checklist
30 pages
ML Checklist PDF
No ratings yet
ML Checklist PDF
4 pages
Employee Performance Analysis
No ratings yet
Employee Performance Analysis
3 pages
CE802 Report
No ratings yet
CE802 Report
7 pages
19-Introduction Classification Algorithm-18-09-2024
No ratings yet
19-Introduction Classification Algorithm-18-09-2024
102 pages
Credit Risk Project
No ratings yet
Credit Risk Project
11 pages
Mini Project Report
No ratings yet
Mini Project Report
10 pages
DM Unit - 3
No ratings yet
DM Unit - 3
21 pages
2023 Its665 - Isp565 - Group Project
No ratings yet
2023 Its665 - Isp565 - Group Project
6 pages
Credit Card Approval Prediction Report-Final
No ratings yet
Credit Card Approval Prediction Report-Final
27 pages
DM Lab Record PDF
No ratings yet
DM Lab Record PDF
32 pages
S 11
No ratings yet
S 11
7 pages
07 ML Classificaion Advanced Kappa
No ratings yet
07 ML Classificaion Advanced Kappa
18 pages
Draft Xai
No ratings yet
Draft Xai
16 pages
Case Study - Churn Mdel Prediction
No ratings yet
Case Study - Churn Mdel Prediction
77 pages
Its665 Isp565 Group Project Mac2024
No ratings yet
Its665 Isp565 Group Project Mac2024
9 pages
17 Ensemble Techniques Problem Statement
No ratings yet
17 Ensemble Techniques Problem Statement
28 pages
German Dataset Tasks
No ratings yet
German Dataset Tasks
6 pages
Machine Learning Lecture1 - 26-27 Aug
No ratings yet
Machine Learning Lecture1 - 26-27 Aug
30 pages
EC9560 Data Mining: Lab 02: Classification and Prediction Using WEKA
No ratings yet
EC9560 Data Mining: Lab 02: Classification and Prediction Using WEKA
5 pages
Introduction To Data Mining & Classification
No ratings yet
Introduction To Data Mining & Classification
58 pages
6.034 Design Assignment 2: 1 Data Sets
No ratings yet
6.034 Design Assignment 2: 1 Data Sets
6 pages
2022ucd2164 1 2
No ratings yet
2022ucd2164 1 2
35 pages
Research Paper
No ratings yet
Research Paper
5 pages
TE ML LAB Mannual
No ratings yet
TE ML LAB Mannual
21 pages
01 Apply Data Preprocessing On Heart Dataset and Evaluate Performance Using Confusion Matrix
No ratings yet
01 Apply Data Preprocessing On Heart Dataset and Evaluate Performance Using Confusion Matrix
19 pages
DM Assignment 2
No ratings yet
DM Assignment 2
2 pages
Data Science Checklist
No ratings yet
Data Science Checklist
22 pages
Minor Project
No ratings yet
Minor Project
21 pages
ClassificationandPrediction Module3
No ratings yet
ClassificationandPrediction Module3
88 pages
Its665 Isp565 Group Project March 2023
No ratings yet
Its665 Isp565 Group Project March 2023
10 pages
MLT 1 - 7 Kanish
No ratings yet
MLT 1 - 7 Kanish
24 pages
Each Stage of A Data Mining Project
No ratings yet
Each Stage of A Data Mining Project
5 pages
CSC4316 9
No ratings yet
CSC4316 9
40 pages
DWDM Lab Tasks
No ratings yet
DWDM Lab Tasks
13 pages
Assignment 1-Preprocessing Handon
No ratings yet
Assignment 1-Preprocessing Handon
13 pages
Features Selection and Featurs Generation
No ratings yet
Features Selection and Featurs Generation
5 pages
7 Classification
100% (3)
7 Classification
63 pages
Semester 2, 2020 Week 8: Data Mining in WEKA Tutorial/Lab Session - 7
No ratings yet
Semester 2, 2020 Week 8: Data Mining in WEKA Tutorial/Lab Session - 7
13 pages
Course Project Report: Indian Institute of Technology, Kanpur
No ratings yet
Course Project Report: Indian Institute of Technology, Kanpur
15 pages
Da Lab Mannual
No ratings yet
Da Lab Mannual
25 pages
حل المشروع
No ratings yet
حل المشروع
13 pages
DMBI
No ratings yet
DMBI
15 pages
ICAICT 2016 Paper 26
No ratings yet
ICAICT 2016 Paper 26
8 pages
Machine Learning Team Coursework
No ratings yet
Machine Learning Team Coursework
7 pages
35 Cse DWM
No ratings yet
35 Cse DWM
41 pages
DM Manual-Min
No ratings yet
DM Manual-Min
100 pages
Project Synopsis
33% (3)
Project Synopsis
4 pages
Hasnain Saeed Lab Task # 11
No ratings yet
Hasnain Saeed Lab Task # 11
11 pages
Project Report-Micro Credit Loan
No ratings yet
Project Report-Micro Credit Loan
8 pages
WEKA
No ratings yet
WEKA
81 pages
Review 3
No ratings yet
Review 3
25 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
From Everand
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
Manish Soni
No ratings yet
Chemistry: Short Questions
No ratings yet
Chemistry: Short Questions
1 page
Lab Task No. 02 Instructor: Allah Bux Sargano: Student Details
No ratings yet
Lab Task No. 02 Instructor: Allah Bux Sargano: Student Details
8 pages
COMSATS University Islamabad, Lahore Campus: Legend For Seat #, Room Location
No ratings yet
COMSATS University Islamabad, Lahore Campus: Legend For Seat #, Room Location
1 page
Assignment No. 01 Instructor: Samiya Shahzad: Student Details
No ratings yet
Assignment No. 01 Instructor: Samiya Shahzad: Student Details
3 pages
Assignment No. 01 Instructor: Dr. Abdul Sattar: Student Details
No ratings yet
Assignment No. 01 Instructor: Dr. Abdul Sattar: Student Details
3 pages
Quaid
No ratings yet
Quaid
2 pages
Write About War of Independence 1857
No ratings yet
Write About War of Independence 1857
2 pages
Assignment No. 02 Instructor: Iqra Obaid: Student Details
No ratings yet
Assignment No. 02 Instructor: Iqra Obaid: Student Details
3 pages
Admin Panel Bypass
No ratings yet
Admin Panel Bypass
3 pages
Individualized Indicator For All: Stock-Wise Technical Indicator Optimization With Stock Embedding
No ratings yet
Individualized Indicator For All: Stock-Wise Technical Indicator Optimization With Stock Embedding
9 pages
Dengue Fever Prediction A Data Mining Problem 2153 0602 1000181 PDF
No ratings yet
Dengue Fever Prediction A Data Mining Problem 2153 0602 1000181 PDF
5 pages
Deep Speech - Scaling Up End-To-End Speech Recognition
No ratings yet
Deep Speech - Scaling Up End-To-End Speech Recognition
12 pages
TSA Business Report
0% (1)
TSA Business Report
27 pages
Introduction To Machine Learning Top-Down Approach - Towards Data Science
No ratings yet
Introduction To Machine Learning Top-Down Approach - Towards Data Science
6 pages
Credit Risk Modeling in R - ch1 - PDF
No ratings yet
Credit Risk Modeling in R - ch1 - PDF
45 pages
An Enlightenment To Machine Learning - Resp
No ratings yet
An Enlightenment To Machine Learning - Resp
22 pages
LP I ML Viva Questions
100% (1)
LP I ML Viva Questions
9 pages
55-Julia-Large Dimension Parametrization With Convolutional Variational Autoencoder
No ratings yet
55-Julia-Large Dimension Parametrization With Convolutional Variational Autoencoder
20 pages
P M S W P M U D L D T R
No ratings yet
P M S W P M U D L D T R
26 pages
Constantinou 2018 ML PDF
No ratings yet
Constantinou 2018 ML PDF
27 pages
Internship Report
No ratings yet
Internship Report
38 pages
EAST: An Efficient and Accurate Scene Text Detector: April 2017
No ratings yet
EAST: An Efficient and Accurate Scene Text Detector: April 2017
11 pages
Pointnet: A 3D Convolutional Neural Network For Real-Time Object Class Recognition
No ratings yet
Pointnet: A 3D Convolutional Neural Network For Real-Time Object Class Recognition
8 pages
User Guide of GARCH-MIDAS and DCC-MIDAS MATLAB Programs
No ratings yet
User Guide of GARCH-MIDAS and DCC-MIDAS MATLAB Programs
12 pages
10.1007@978 981 13 7123 3
No ratings yet
10.1007@978 981 13 7123 3
628 pages
Real Estate Project PDF
No ratings yet
Real Estate Project PDF
8 pages
Lecture13 Ngrams With SRILM
No ratings yet
Lecture13 Ngrams With SRILM
6 pages
Seminar On Artificial Neural Network
No ratings yet
Seminar On Artificial Neural Network
17 pages
Estimate Furnace Temp
No ratings yet
Estimate Furnace Temp
10 pages
Plant Disease Detection
No ratings yet
Plant Disease Detection
3 pages
Water Fraud REPORT
0% (2)
Water Fraud REPORT
63 pages
Detect Depression From Communication How Computer Vision Signal Processing and Sentiment Analysis Join Forces
No ratings yet
Detect Depression From Communication How Computer Vision Signal Processing and Sentiment Analysis Join Forces
14 pages
A Major Project Report ON "Mnist (Digit Recognisation) " Submitted To (M.P.)
No ratings yet
A Major Project Report ON "Mnist (Digit Recognisation) " Submitted To (M.P.)
21 pages
6.036 Notes
No ratings yet
6.036 Notes
99 pages
Forecasting Selling Price of Petrol in India Post Covid-19 Using Radial Basis Function Technique
No ratings yet
Forecasting Selling Price of Petrol in India Post Covid-19 Using Radial Basis Function Technique
11 pages
Green AI
No ratings yet
Green AI
12 pages

Instructor:: Semester Project Mam. Yella Mehroze

Uploaded by

Instructor:: Semester Project Mam. Yella Mehroze

Uploaded by

Data Warehouse Data Mining

Instructor: Mam. Yella Mehroze

Result is showing the predicted class labels as high or low based on

Result is showing relevant attributes which are necessary for model

You might also like