0% found this document useful (0 votes)
2 views11 pages

Data Mining Project

Uploaded by

bushemanthony
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views11 pages

Data Mining Project

Uploaded by

bushemanthony
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 11

Data Mining Final Project

By: Reese Martin, Anthony Bushem, and Vince Dipaolo


CarData.csv File Image
Project Description

● The data set we used are the following:


○ CarData.csv (From Week 2)

● Project Questions:
1. Finding the best model to determine
the average MPG with horsepower

2. Finding the best model for


determining the average acceleration
per cylinders

3. Find a model for predicting the weight


of a vehicle based on all the factors
Project Question
#1 – Finding the
best model to
determine the
average MPG with
horsepower
Project Question #1 – Graph Image
Project
Question #2 –
Finding the
best model for
determining the
average
acceleration
per cylinders
Project Question #2 – Graph Image
Project Question
#3 – Find a model
for predicting the
weight of a vehicle
based on all the
factors
Project Question #3 – Graph Image
Code/Explanation for the applied model
Question 1 and 2-
● Makes the model based on significance
● Set a seed to split the data into a training and testing set
● Make models singularly and multivariate for the main variable
● Create predictive models to see numerically what an output would look like
● View the predictive models along with their correlation

Question 3-
● Set up the predictors and response models
● Check for any missing values
● Set a seed to split the data into a training and testing set
● Fit a linear model and a random forest model to get predictions for the variable
● Set up predictive models for outlay
● View the predicted outlay
Conclusion & Reflection

● Choosing the right dataset and formulating research questions required focus
as the foundation of the project.

● Running the code and understanding its application allowed for a challenging
insight into problem solving and execution of code.

● Evaluating the model performance as a critical part of the data mining


process.

You might also like