0% found this document useful (0 votes)

24 views4 pages

Python

Python code

Uploaded by

Gowtham Yv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views4 pages

Python

Python code

Uploaded by

Gowtham Yv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Predicting House Prices

Step 1: Exploratory Data Analysis (EDA)

# Load the Boston Housing dataset
from sklearn.datasets import load_boston
boston = load_boston()
df = pd.DataFrame(boston.data, columns=boston.feature_names)
df['PRICE'] = boston.target

# Display the first few rows

print("Initial Data:")
print(df.head())

# EDA: Summary statistics

print("\nSummary Statistics:")
print(df.describe())

# Visualize correlations
plt.figure(figsize=(12, 8))
sns.heatmap(df.corr(), annot=True, fmt='.2f', cmap='coolwarm')
plt.title('Correlation Heatmap')
plt.show()

# Distribution of target variable

plt.figure(figsize=(8, 5))
sns.histplot(df['PRICE'], bins=30, kde=True)
plt.title('Price Distribution')
plt.xlabel('Price')
plt.ylabel('Frequency')
plt.show()
Step 2: Data Preprocessing
We'll handle missing values and normalize the features.
python
Copy code
# Check for missing values
print("\nMissing Values:")
print(df.isnull().sum())

# Since there are no missing values in the Boston dataset, we can proceed to normalization
from sklearn.preprocessing import StandardScaler

# Normalize/Standardize features
scaler = StandardScaler()
features = df.drop('PRICE', axis=1)
features_scaled = scaler.fit_transform(features)

# Create a new DataFrame with scaled features

df_scaled = pd.DataFrame(features_scaled, columns=features.columns)
df_scaled['PRICE'] = df['PRICE'].values
Step 3: Data Splitting
We'll split the dataset into training and testing sets.
python
Copy code
from sklearn.model_selection import train_test_split

# Define features (X) and target variable (y)

X = df_scaled.drop('PRICE', axis=1)
y = df_scaled['PRICE']

# Split the dataset into training (80%) and testing (20%) sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"\nTraining set shape: {X_train.shape}, Testing set shape: {X_test.shape}")

Step 4: Model Implementation
We will implement three regression algorithms: Linear Regression, Decision Trees, and Random
Forests.
python
Copy code
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor

# Initialize models
linear_model = LinearRegression()
tree_model = DecisionTreeRegressor(random_state=42)
forest_model = RandomForestRegressor(random_state=42)

# Train models
linear_model.fit(X_train, y_train)
tree_model.fit(X_train, y_train)
forest_model.fit(X_train, y_train)

# Make predictions
y_pred_linear = linear_model.predict(X_test)
y_pred_tree = tree_model.predict(X_test)
y_pred_forest = forest_model.predict(X_test)
Step 5: Model Evaluation
We'll evaluate the models using Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE).
python
Copy code
from sklearn.metrics import mean_absolute_error, mean_squared_error

# Evaluate models
def evaluate_model(y_true, y_pred, model_name):
mae = mean_absolute_error(y_true, y_pred)
rmse = np.sqrt(mean_squared_error(y_true, y_pred))
print(f"\n{model_name} Performance:")
print(f"Mean Absolute Error (MAE): {mae:.2f}")
print(f"Root Mean Squared Error (RMSE): {rmse:.2f}")

evaluate_model(y_test, y_pred_linear, "Linear Regression")

evaluate_model(y_test, y_pred_tree, "Decision Tree")
evaluate_model(y_test, y_pred_forest, "Random Forest")

Regression Analysis - Cheatsheet
No ratings yet
Regression Analysis - Cheatsheet
9 pages
Document
0% (1)
Document
136 pages
ML Project Part A 1
No ratings yet
ML Project Part A 1
6 pages
ML Record
No ratings yet
ML Record
19 pages
House Pricing
No ratings yet
House Pricing
15 pages
Project
No ratings yet
Project
10 pages
House Price Prediction Using Machine Learning in Python
No ratings yet
House Price Prediction Using Machine Learning in Python
13 pages
Integrated System Lab
No ratings yet
Integrated System Lab
25 pages
P05 The Regression Pipeline - Training and Testing Ans
No ratings yet
P05 The Regression Pipeline - Training and Testing Ans
13 pages
Wa0009.
No ratings yet
Wa0009.
4 pages
Machine Learning Presentaion
No ratings yet
Machine Learning Presentaion
15 pages
Set 2
No ratings yet
Set 2
19 pages
Coding Question
No ratings yet
Coding Question
6 pages
New Opendocument Text
No ratings yet
New Opendocument Text
7 pages
Explain Me Every Code Written in It With Deep Know
No ratings yet
Explain Me Every Code Written in It With Deep Know
7 pages
Data Mining Final Assignment
No ratings yet
Data Mining Final Assignment
4 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
22 pages
ML Record
No ratings yet
ML Record
21 pages
ML Manual
No ratings yet
ML Manual
24 pages
Lab ML
No ratings yet
Lab ML
26 pages
AIML
No ratings yet
AIML
5 pages
Predicting Housin main project ediglobe
No ratings yet
Predicting Housin main project ediglobe
4 pages
ML Practical 04
No ratings yet
ML Practical 04
19 pages
Lab 1. Boston House
No ratings yet
Lab 1. Boston House
7 pages
Machine Learning Problem-Solving Steps: 1. Look at The Big Picture
No ratings yet
Machine Learning Problem-Solving Steps: 1. Look at The Big Picture
41 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
Pa Da1
No ratings yet
Pa Da1
17 pages
Report
No ratings yet
Report
40 pages
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
127 pages
Data Analysis Project MAIN
No ratings yet
Data Analysis Project MAIN
6 pages
Ads Lab8
No ratings yet
Ads Lab8
5 pages
ML Manual
No ratings yet
ML Manual
30 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
ML
No ratings yet
ML
17 pages
Unit 5
No ratings yet
Unit 5
18 pages
Lab 3 - Linear Regression
No ratings yet
Lab 3 - Linear Regression
15 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
Ayush File 1
No ratings yet
Ayush File 1
37 pages
Real-Estate Property
No ratings yet
Real-Estate Property
11 pages
Docu 4
No ratings yet
Docu 4
3 pages
Regression Dataset
No ratings yet
Regression Dataset
3 pages
Kaggle Course Notes
No ratings yet
Kaggle Course Notes
87 pages
Machine Learning Project: TITLE: Predicting The Sale Price of A House Using Linear Regression
No ratings yet
Machine Learning Project: TITLE: Predicting The Sale Price of A House Using Linear Regression
20 pages
ML Lab-1
No ratings yet
ML Lab-1
32 pages
End To End Machine Learning Project-2
No ratings yet
End To End Machine Learning Project-2
10 pages
DT As Regressor-Follow
No ratings yet
DT As Regressor-Follow
4 pages
Ese Lab File
No ratings yet
Ese Lab File
30 pages
CP4252 Lab Manual
No ratings yet
CP4252 Lab Manual
13 pages
Document From Jahnavi
No ratings yet
Document From Jahnavi
20 pages
S 10
No ratings yet
S 10
11 pages
Data Analytics I
No ratings yet
Data Analytics I
4 pages
R22 ML Lab Manual
No ratings yet
R22 ML Lab Manual
25 pages
Phase 5
No ratings yet
Phase 5
5 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
ML Book Notes
No ratings yet
ML Book Notes
9 pages
Predicting House Prices
No ratings yet
Predicting House Prices
9 pages
20BCP021 Assignment 6
No ratings yet
20BCP021 Assignment 6
15 pages
Housing Prices AI
No ratings yet
Housing Prices AI
10 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
Fundamentals of Computer Programming
No ratings yet
Fundamentals of Computer Programming
12 pages
Audit Answers CA Final May 2024
No ratings yet
Audit Answers CA Final May 2024
4 pages
MY DREAM WORLDB ALICIA Fix
No ratings yet
MY DREAM WORLDB ALICIA Fix
2 pages
Lc5296-At Lc5248e-At User Manual
No ratings yet
Lc5296-At Lc5248e-At User Manual
44 pages
Baker 2 Phase Flow
No ratings yet
Baker 2 Phase Flow
2 pages
Bio-Soft N-Series PDF
No ratings yet
Bio-Soft N-Series PDF
9 pages
Building A Home Freezer 2004
No ratings yet
Building A Home Freezer 2004
3 pages
Countable and Uncountable Nouns
No ratings yet
Countable and Uncountable Nouns
3 pages
Desert Dust in The Global System PDF
100% (1)
Desert Dust in The Global System PDF
287 pages
Car Rental System (Group 6)
No ratings yet
Car Rental System (Group 6)
6 pages
Fun Facts About Christmas
No ratings yet
Fun Facts About Christmas
2 pages
OPCRF 2024 Template For School Heads
No ratings yet
OPCRF 2024 Template For School Heads
9 pages
Q.1: Explain The Difference Between Connectionless Unacknowledged Service and
No ratings yet
Q.1: Explain The Difference Between Connectionless Unacknowledged Service and
17 pages
KS3 Science 2008 Paper 1 Level 5-7
No ratings yet
KS3 Science 2008 Paper 1 Level 5-7
28 pages
Resume Vincent Wang 2021
No ratings yet
Resume Vincent Wang 2021
1 page
Rammer Hydraulic Hammer Spec Sheet
No ratings yet
Rammer Hydraulic Hammer Spec Sheet
1 page
Journal of Physiotherapy
No ratings yet
Journal of Physiotherapy
13 pages
Safari - 1 Nov 2023 at 15:20
No ratings yet
Safari - 1 Nov 2023 at 15:20
1 page
Envi325 HW 8
No ratings yet
Envi325 HW 8
3 pages
Survey
No ratings yet
Survey
20 pages
(Ebook) The Golden Age of Video Games: The Birth of A Multibillion Dollar Industry by Roberto Dillon ISBN 9781439873236, 1439873232
No ratings yet
(Ebook) The Golden Age of Video Games: The Birth of A Multibillion Dollar Industry by Roberto Dillon ISBN 9781439873236, 1439873232
60 pages
Facility Location Models: CTL - SC2x - Supply Chain Design
No ratings yet
Facility Location Models: CTL - SC2x - Supply Chain Design
41 pages
Hcispp Healthcare Information Security and Privacy Practitioner All-In-One Exam Guide Sean P. Murphy
No ratings yet
Hcispp Healthcare Information Security and Privacy Practitioner All-In-One Exam Guide Sean P. Murphy
51 pages
Practical Exam
No ratings yet
Practical Exam
25 pages
Payment Plan Agreement
No ratings yet
Payment Plan Agreement
2 pages
Experiment No. 1 Reducing Aggregate Field Samples To Test Samples
No ratings yet
Experiment No. 1 Reducing Aggregate Field Samples To Test Samples
5 pages
Factors Influencing The Uptake of Cervical Cancer Screening Services Among Women Attending Gynecological OPD at KIUTH, Ishaka-Bushenyi, South Western Uganda
No ratings yet
Factors Influencing The Uptake of Cervical Cancer Screening Services Among Women Attending Gynecological OPD at KIUTH, Ishaka-Bushenyi, South Western Uganda
12 pages
Accounting English Medium: Paper Based Revision Programme Marking Guide - Revision Paper - 34
No ratings yet
Accounting English Medium: Paper Based Revision Programme Marking Guide - Revision Paper - 34
6 pages
(Project) Answers
No ratings yet
(Project) Answers
28 pages

Python

Uploaded by

Python

Uploaded by

Predicting House Prices

Step 1: Exploratory Data Analysis (EDA)

# Display the first few rows

# EDA: Summary statistics

# Distribution of target variable

# Create a new DataFrame with scaled features

# Define features (X) and target variable (y)

print(f"\nTraining set shape: {X_train.shape}, Testing set shape: {X_test.shape}")

evaluate_model(y_test, y_pred_linear, "Linear Regression")

You might also like