
Machine Learning Project Presentation

This document summarizes Samuel Odulaja's machine learning project to predict house prices using a Kaggle dataset containing information on over 1400 homes. Various regression techniques were applied to transform, engineer, and select features from the original 79 variables. Models like Ridge, Lasso, SVR, LightGBM, and Gradient Boosting were trained and evaluated on the task. The best performing model was Gradient Boosting, achieving an RMSE of 0.3848232 on the test set.


Machine Learning Project
Samuel Odulaja
Background

● Kaggle Dataset
○ Contains around 1400 house prices and associated predictors
● 79 explanatory variables describing aspects of residential homes in Ames, Iowa
● Using advanced regression techniques, predict the final price of each home
Concatenate training and testing features

● Concatenated the training and test feature sets so that missing-value imputation, feature transformations, etc. only have to be done once for both sets
● Removed houses with above-grade living area greater than 4,500 sq. ft. from the training set
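These two steps can be sketched as below; the toy frames stand in for the Kaggle train.csv/test.csv files, and the column names follow the Ames data dictionary.

```python
import pandas as pd

# Toy stand-ins for the Kaggle train.csv / test.csv tables
train = pd.DataFrame({"GrLivArea": [1500, 5000, 2000],
                      "SalePrice": [200000, 180000, 250000]})
test = pd.DataFrame({"GrLivArea": [1600, 1700]})

# Drop training houses with above-grade living area over 4,500 sq. ft.
train = train[train["GrLivArea"] <= 4500].reset_index(drop=True)

# Stack the feature columns so imputation/transforms only happen once
features = pd.concat([train.drop(columns="SalePrice"), test],
                     ignore_index=True)
```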
SalePrice Distribution
SalePrice Transformation

● Transformed the target variable
○ y_train = np.log(train["SalePrice"])
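The same one-liner in context, with the inverse transform that maps log-scale predictions back to dollar prices (np.exp undoes np.log):

```python
import numpy as np
import pandas as pd

train = pd.DataFrame({"SalePrice": [100000, 150000, 300000]})

# Log-transform the target to reduce right skew
y_train = np.log(train["SalePrice"])

# Predictions made on the log scale are mapped back with np.exp
prices = np.exp(y_train)
```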
Impute missing values

● The plot shows the number of missing values in columns with at least one missing value
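The counts behind such a plot can be computed like this (toy columns chosen for illustration):

```python
import pandas as pd

# Toy frame with a few Ames-style columns containing missing values
df = pd.DataFrame({"LotFrontage": [65.0, None, 80.0, None],
                   "GarageType": ["Attchd", None, "Detchd", "Attchd"],
                   "YrSold": [2008, 2007, 2009, 2006]})

# Count missing values, keeping only columns with at least one
missing = df.isnull().sum()
missing = missing[missing > 0].sort_values(ascending=False)
```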
Engineer features

● Created new features for the dataset
○ TotalSF, TotalPorchSF, TotalBath
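A sketch of how these might be built; the exact component columns are an assumption (the deck doesn't show the formulas), but these are the usual Ames candidates.

```python
import pandas as pd

# One toy house; the component columns chosen here are an assumption
# about how TotalSF / TotalPorchSF / TotalBath were constructed
df = pd.DataFrame({"TotalBsmtSF": [800], "1stFlrSF": [900], "2ndFlrSF": [700],
                   "OpenPorchSF": [50], "EnclosedPorch": [0],
                   "3SsnPorch": [0], "ScreenPorch": [20], "WoodDeckSF": [100],
                   "FullBath": [2], "HalfBath": [1],
                   "BsmtFullBath": [1], "BsmtHalfBath": [0]})

df["TotalSF"] = df["TotalBsmtSF"] + df["1stFlrSF"] + df["2ndFlrSF"]
df["TotalPorchSF"] = (df["OpenPorchSF"] + df["EnclosedPorch"]
                      + df["3SsnPorch"] + df["ScreenPorch"] + df["WoodDeckSF"])
df["TotalBath"] = (df["FullBath"] + 0.5 * df["HalfBath"]
                   + df["BsmtFullBath"] + 0.5 * df["BsmtHalfBath"])
```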
Categorize MSSubClass and YrSold

● From the MSSubClass description, the levels don't seem to have a natural ordering
○ Represented MSSubClass as a categorical feature rather than a numerical one
● Also represented YrSold as a categorical feature
○ This allowed for a more flexible relationship with SalePrice
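Casting these numeric codes to strings is one minimal way to get this behavior, since pd.get_dummies then one-hot encodes them like any other categorical column:

```python
import pandas as pd

df = pd.DataFrame({"MSSubClass": [20, 60, 20], "YrSold": [2007, 2008, 2007]})

# Cast the numeric codes to strings so they are treated as categories,
# not magnitudes, and get one-hot encoded by pd.get_dummies
df["MSSubClass"] = df["MSSubClass"].astype(str)
df["YrSold"] = df["YrSold"].astype(str)

dummies = pd.get_dummies(df)
```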
Transform features

● To better highlight any recurring patterns in SalePrice, MoSold was transformed

● Also transformed highly skewed features

● Used pd.get_dummies to convert all categorical values into dummy variables

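A sketch of the skew transform plus dummy encoding; the 0.75 skewness cutoff and log1p choice are common conventions for this Kaggle task, not values taken from the deck:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"LotArea": [8000, 9000, 150000, 7000],
                   "Neighborhood": ["NAmes", "CollgCr", "NAmes", "Edwards"]})

# Log1p-transform numeric columns whose skewness exceeds a threshold
# (0.75 is a common cutoff; the deck's exact threshold isn't shown)
num_cols = df.select_dtypes(include="number").columns
skewed = [c for c in num_cols if abs(df[c].skew()) > 0.75]
df[skewed] = np.log1p(df[skewed])

# One-hot encode the remaining categorical columns
df = pd.get_dummies(df)
```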

Removing outliers from training data

● Fitted a linear model to the training data and removed examples with a studentized residual greater than 3
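A numpy-only sketch of the idea, using internally studentized residuals (the deck may well have used a library routine such as statsmodels' externally studentized version instead):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50
x = rng.normal(size=n)
y = 2.0 * x + rng.normal(scale=0.5, size=n)
y[0] += 10.0  # plant one obvious outlier

# Ordinary least squares fit with an intercept
X = np.column_stack([np.ones(n), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ beta

# Internally studentized residuals: r_i / (sigma * sqrt(1 - h_ii))
h = np.diag(X @ np.linalg.inv(X.T @ X) @ X.T)  # leverage values
sigma2 = resid @ resid / (n - X.shape[1])
student = resid / np.sqrt(sigma2 * (1 - h))

keep = np.abs(student) <= 3  # drop rows exceeding the cutoff
```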
Define random search

● Used random search to optimize the hyperparameters of each model

● Used 5-fold cross-validation to score each iteration
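With scikit-learn this looks roughly like the following; the parameter grid and toy data here are illustrative, not the settings from the deck:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import RandomizedSearchCV

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = 3.0 * X[:, 0] + rng.normal(size=100)

param_dist = {"n_estimators": [50, 100, 200],
              "learning_rate": [0.01, 0.05, 0.1],
              "max_depth": [2, 3, 4]}

search = RandomizedSearchCV(GradientBoostingRegressor(random_state=0),
                            param_distributions=param_dist,
                            n_iter=5,   # try 5 random settings
                            cv=5,       # 5-fold CV scores each setting
                            scoring="neg_root_mean_squared_error",
                            random_state=0)
search.fit(X, y)
```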
Trained Models

● Overall the models did well, with Gradient Boosting performing best.
○ Ridge: 0.0778
○ Lasso: 0.0796
○ SVR: 0.0712
○ LGBM: 0.0640
○ GBM: 0.0436
Creating Predictions and RMSE

● Stored the predictions of the base learners and the stacked ensemble in a list
● Averaged the predictions, giving a weight of 0.13 to each base learner and 0.35 to the stacked ensemble
● RMSE: 0.3848232
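The blend can be sketched as below; the prediction values are made up, and note that the weights sum to one (5 × 0.13 + 0.35 = 1.00):

```python
import numpy as np

# Made-up log-scale predictions from the five base learners
base_preds = [np.array([12.0, 12.5]), np.array([12.1, 12.4]),
              np.array([11.9, 12.6]), np.array([12.2, 12.3]),
              np.array([12.0, 12.5])]
stacked_pred = np.array([12.1, 12.45])  # stacked-ensemble predictions

# 0.13 per base learner plus 0.35 for the stack (weights sum to 1.00)
final = 0.13 * sum(base_preds) + 0.35 * stacked_pred
```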
Conclusions

● Overall the models seemed to perform well
● However, the RMSE seemed a little high
○ Most likely an error in the code
● In the future, would improve on the RMSE by using different methods