A

Uploaded by

Houssam Alrifaii

# Import the necessary libraries

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OneHotEncoder
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# Load the dataset from CSV file named "house.csv" into a pandas DataFrame
df = pd.read_csv("house.csv")

# Check the DataFrame to see if there are any duplicate records and print the value
duplicates = df.duplicated().sum()
print(f"Number of duplicate records: {duplicates}")

# Drop unnecessary fields (House_Id in this case) and determine the features and
# the target fields
df = df.drop(columns=["House_Id"])
features = ['Area', 'Bedrooms', 'Bathrooms', 'Neighborhood']
target = 'Price'

# Check for any missing values

missing_values = df.isnull().sum()
print(f"Missing values in each column:\n{missing_values}")

# Calculate the average area of all houses in the dataset

average_area = df['Area'].mean()
print(f"Average area of all houses: {average_area}")

# Perform one-hot encoding on the 'Neighborhood' feature

# scikit-learn >= 1.2: sparse_output replaces the deprecated sparse argument
encoder = OneHotEncoder(sparse_output=False)
neighborhood_encoded = pd.DataFrame(
    encoder.fit_transform(df[['Neighborhood']]),
    columns=encoder.get_feature_names_out(['Neighborhood']),
    index=df.index,  # keep rows aligned with df for pd.concat
)
df = pd.concat([df.drop(columns=['Neighborhood']), neighborhood_encoded], axis=1)

# Split the data into training and testing sets

X = df.drop(columns=[target])
y = df[target]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Initialize the ML model and train it using Linear Regression

model = LinearRegression()
model.fit(X_train, y_train)

# Make predictions
y_pred = model.predict(X_test)

# Evaluate the model using 3 metrics

mae = mean_absolute_error(y_test, y_pred)
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

print(f"Mean Absolute Error: {mae}")
print(f"Mean Squared Error: {mse}")
print(f"R-Squared Score: {r2}")

# Display evaluation metrics
metrics = pd.DataFrame({
    "Metric": ["Mean Absolute Error", "Mean Squared Error", "R-Squared Score"],
    "Value": [mae, mse, r2]
})
print(metrics)

# Create a scatter plot to visualize the relationship between the number of
# bathrooms and the price
plt.figure(figsize=(8, 6))
plt.scatter(df['Bathrooms'], df['Price'], alpha=0.5, color='blue')
plt.title("Bathrooms vs Price")
plt.xlabel("Number of Bathrooms")
plt.ylabel("Price")
plt.grid(True)
plt.show()

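# Find the house with the highest number of bedrooms and print its neighborhood.
# 'Neighborhood' was one-hot encoded above, so recover the name from the dummy columns.
max_bedrooms_house = df[df['Bedrooms'] == df['Bedrooms'].max()]
neighborhood_cols = [c for c in df.columns if c.startswith('Neighborhood_')]
neighborhoods = (max_bedrooms_house[neighborhood_cols].idxmax(axis=1)
                 .str.replace('Neighborhood_', '', regex=False))
print(f"Neighborhood of the house with the most bedrooms: {neighborhoods.values}")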

# Plot the performance metrics calculated above on a heatmap graph

metrics_values = np.array([[mae, mse, r2]])
plt.figure(figsize=(8, 4))
sns.heatmap(metrics_values, annot=True, fmt=".2f", cmap="Blues",
            xticklabels=["MAE", "MSE", "R2"], yticklabels=["Model"])
plt.title("Model Performance Metrics")
plt.show()
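
For reference, the three metrics reported by the script can also be computed by hand. The following is a minimal sketch in plain Python, not part of the original script: the function name `regression_metrics` is hypothetical, and the prices are made-up illustration values, not taken from house.csv.

```python
def regression_metrics(y_true, y_pred):
    """Return (MAE, MSE, R-squared) for two equal-length sequences of numbers."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]

    # Mean Absolute Error: average size of the errors, in the target's units
    mae = sum(abs(r) for r in residuals) / n

    # Mean Squared Error: average squared error, penalizes large misses more
    mse = sum(r * r for r in residuals) / n

    # R-squared: 1 minus (residual sum of squares / total sum of squares)
    mean_true = sum(y_true) / n
    ss_res = sum(r * r for r in residuals)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot

    return mae, mse, r2


# Made-up example prices (illustration only)
example_true = [200.0, 300.0, 400.0]
example_pred = [210.0, 290.0, 420.0]
example_mae, example_mse, example_r2 = regression_metrics(example_true, example_pred)
print(example_mae, example_mse, example_r2)
```

MAE is expressed in the same units as the price, MSE in squared units, and R-squared is unitless, which is worth keeping in mind when the three values are shown together on one heatmap color scale.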
