0% found this document useful (0 votes)

27 views6 pages

Coding Question

This are some of coding questions which I hope would be helpful for someone learning programing

Uploaded by

zain.bsba90

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views6 pages

Coding Question

This are some of coding questions which I hope would be helpful for someone learning programing

Uploaded by

zain.bsba90

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

### Coding Question: Building a Machine Learning Model to Predict Housing Prices

**Problem Statement:**

You are given a dataset containing various features of houses along with their prices. Your task is to
build a machine learning model to predict the prices of houses based on their features. You will use the
popular Boston Housing dataset for this task.

**Dataset:**

The dataset consists of the following features:

1. CRIM: per capita crime rate by town

2. ZN: proportion of residential land zoned for lots over 25,000 sq. ft.

3. INDUS: proportion of non-retail business acres per town

4. CHAS: Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)

5. NOX: nitric oxides concentration (parts per 10 million)

6. RM: average number of rooms per dwelling

7. AGE: proportion of owner-occupied units built prior to 1940

8. DIS: weighted distances to five Boston employment centres

9. RAD: index of accessibility to radial highways

10. TAX: full-value property tax rate per $10,000

11. PTRATIO: pupil-teacher ratio by town

12. B: 1000(Bk - 0.63)^2 where Bk is the proportion of Black residents by town

13. LSTAT: % lower status of the population

14. MEDV: Median value of owner-occupied homes in $1000s

**Tasks:**

1. Load and explore the dataset.

2. Preprocess the data.

3. Split the data into training and testing sets.

4. Train a machine learning model (e.g., Linear Regression).

5. Evaluate the model.

6. Make predictions using the trained model.

### Step-by-Step Solution

#### 1. Load and Explore the Dataset

```python

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

import seaborn as sns

from sklearn.datasets import load_boston

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

boston = load_boston()

boston_df = pd.DataFrame(boston.data, columns=boston.feature_names)

boston_df['MEDV'] = boston.target

# Display the first few rows of the dataset

print(boston_df.head())

# Summary statistics

print(boston_df.describe())
# Check for missing values

print(boston_df.isnull().sum())

# Correlation matrix

plt.figure(figsize=(12, 10))

sns.heatmap(boston_df.corr(), annot=True, cmap='coolwarm')

plt.show()

```

#### 2. Preprocess the Data

```python

# Features and target variable

X = boston_df.drop('MEDV', axis=1)

y = boston_df['MEDV']

# Standardize the data

scaler = StandardScaler()

X_scaled = scaler.fit_transform(X)

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X_scaled, y, test_size=0.2, random_state=42)

```

#### 3. Train a Machine Learning Model (Linear Regression)

```python

# Train the model

model = LinearRegression()

model.fit(X_train, y_train)

# Model coefficients

print("Coefficients:", model.coef_)

print("Intercept:", model.intercept_)

```

#### 4. Evaluate the Model

```python

# Make predictions on the testing set

y_pred = model.predict(X_test)

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)

r2 = r2_score(y_test, y_pred)

print("Mean Squared Error:", mse)

print("R-squared:", r2)

# Plot the results

plt.figure(figsize=(10, 6))

plt.scatter(y_test, y_pred, color='blue')

plt.plot([min(y_test), max(y_test)], [min(y_test), max(y_test)], color='red', linewidth=2)

plt.xlabel('Actual')

plt.ylabel('Predicted')

plt.title('Actual vs Predicted')

plt.show()
```

#### 5. Make Predictions Using the Trained Model

```python

# Predicting on new data (example)

new_data = np.array([[0.1, 18.0, 2.31, 0.0, 0.538, 6.575, 65.2, 4.0900, 1, 296.0, 15.3, 396.90, 4.98]])

new_data_scaled = scaler.transform(new_data)

predicted_price = model.predict(new_data_scaled)

print("Predicted price:", predicted_price)

```

### Explanation of the Code

1. Loading and Exploring the Dataset:

- The Boston Housing dataset is loaded using `load_boston()` from `sklearn.datasets`.

- The dataset is converted into a DataFrame for easier exploration and manipulation.

- Summary statistics and correlation matrix are generated to understand the data better.

2. Preprocessing the Data:

- Features (`X`) and target variable (`y`) are separated.

- The features are standardized using `StandardScaler`.

- The dataset is split into training and testing sets using `train_test_split`.

3. Training the Model:

- A Linear Regression model is instantiated and trained on the training data.

- Model coefficients and intercept are printed.

4. **Evaluating the Model**:

- Predictions are made on the testing set.

- Mean Squared Error (MSE) and R-squared (R²) are calculated to evaluate the model's performance.

- A scatter plot is generated to visualize the actual vs predicted values.

5. **Making Predictions**:

- An example of making a prediction on new data is provided. The new data is scaled using the same
scaler used during training, and the model predicts the house price.

This extensive example covers the entire process of building a machine learning model to predict
housing prices, from data loading and preprocessing to model training, evaluation, and prediction.

House Pricing
No ratings yet
House Pricing
15 pages
Experiment 4
No ratings yet
Experiment 4
6 pages
Practice Exercise 4
No ratings yet
Practice Exercise 4
2 pages
DL Lab Prog 2
No ratings yet
DL Lab Prog 2
2 pages
Boston House Price Prediction Guide
No ratings yet
Boston House Price Prediction Guide
7 pages
Explain Me Every Code Written in It With Deep Know
No ratings yet
Explain Me Every Code Written in It With Deep Know
7 pages
House Price Prediction: Project Description
No ratings yet
House Price Prediction: Project Description
11 pages
DL Assignment 1ms24rai03
No ratings yet
DL Assignment 1ms24rai03
10 pages
EXPNO5
No ratings yet
EXPNO5
2 pages
PRJ Housuing Price
No ratings yet
PRJ Housuing Price
14 pages
Predicting Housin Main Project Ediglobe
No ratings yet
Predicting Housin Main Project Ediglobe
4 pages
Boston Housing Price Prediction
No ratings yet
Boston Housing Price Prediction
3 pages
House Pridiction Analysis
No ratings yet
House Pridiction Analysis
3 pages
Machinelearning Project
No ratings yet
Machinelearning Project
3 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
22 pages
ML Manual
No ratings yet
ML Manual
30 pages
ML Manual
No ratings yet
ML Manual
24 pages
Python
No ratings yet
Python
4 pages
1 - Lab Manual (ML)
No ratings yet
1 - Lab Manual (ML)
42 pages
Project
No ratings yet
Project
10 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
2 pages
ML Practical 04
No ratings yet
ML Practical 04
19 pages
ML Record
No ratings yet
ML Record
19 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
wvcg0mt7pkASSI 3 ML 16
No ratings yet
wvcg0mt7pkASSI 3 ML 16
4 pages
House Price Prediction Full Report-2
No ratings yet
House Price Prediction Full Report-2
5 pages
Wa0009.
No ratings yet
Wa0009.
4 pages
T2 Summary VHA
No ratings yet
T2 Summary VHA
14 pages
Predicting House Prices
No ratings yet
Predicting House Prices
9 pages
Housing Price Prediction with Regression
No ratings yet
Housing Price Prediction with Regression
5 pages
Ds 4 Linears Boston
No ratings yet
Ds 4 Linears Boston
2 pages
Regression Analysis On The Boston House Price Dataset For House Price Prediction
No ratings yet
Regression Analysis On The Boston House Price Dataset For House Price Prediction
2 pages
Solution Methodology
No ratings yet
Solution Methodology
5 pages
ML Record
No ratings yet
ML Record
21 pages
DSBDA Practical 4 Tutorial
No ratings yet
DSBDA Practical 4 Tutorial
8 pages
Week 6 LAB
No ratings yet
Week 6 LAB
13 pages
Lab ML
No ratings yet
Lab ML
26 pages
Lab (Work) Experiment File Priyanka Rajak 0901MC221056
No ratings yet
Lab (Work) Experiment File Priyanka Rajak 0901MC221056
19 pages
DNN Tutorial for Data Scientists
No ratings yet
DNN Tutorial for Data Scientists
9 pages
Docu 4
No ratings yet
Docu 4
3 pages
Data Mining Final Assignment
No ratings yet
Data Mining Final Assignment
4 pages
ML PDF
No ratings yet
ML PDF
30 pages
Boston Housing Price Prediction
No ratings yet
Boston Housing Price Prediction
3 pages
CP4252 Lab Manual
No ratings yet
CP4252 Lab Manual
13 pages
New Opendocument Text
No ratings yet
New Opendocument Text
7 pages
Machine Learning for House Price Prediction
No ratings yet
Machine Learning for House Price Prediction
15 pages
7 A
No ratings yet
7 A
2 pages
Import Library Python
No ratings yet
Import Library Python
10 pages
De Assignment 3
No ratings yet
De Assignment 3
2 pages
Deep Learning Practical Exercises
No ratings yet
Deep Learning Practical Exercises
34 pages
Experiment 4 ML
No ratings yet
Experiment 4 ML
9 pages
ML Assignment FDP BIT Mesra
No ratings yet
ML Assignment FDP BIT Mesra
1 page
2a DL
No ratings yet
2a DL
4 pages
Housepriceprediction ML 221104055342 Fb5109ae
No ratings yet
Housepriceprediction ML 221104055342 Fb5109ae
17 pages
ML
No ratings yet
ML
17 pages
ML Recordjp
No ratings yet
ML Recordjp
35 pages
Experiment 1
No ratings yet
Experiment 1
19 pages
Exp 2 (Multiple Linear Regression)
No ratings yet
Exp 2 (Multiple Linear Regression)
6 pages
Residual Analysis and Test - 02
No ratings yet
Residual Analysis and Test - 02
37 pages
Logistic vs Linear Regression Explained
No ratings yet
Logistic vs Linear Regression Explained
16 pages
Understanding Simple Linear Regression
No ratings yet
Understanding Simple Linear Regression
2 pages
Ridge and Lasso Regresssion
No ratings yet
Ridge and Lasso Regresssion
22 pages
Trend Lines Problem Solving Worksheet
No ratings yet
Trend Lines Problem Solving Worksheet
13 pages
Lecture - Regression Analysis - Case Study Demo
No ratings yet
Lecture - Regression Analysis - Case Study Demo
20 pages
1999 - A Statistical Method For Practical Assessment of Sawability With Diamond Wire Cutting Machine of Ankara-Cubuk Andesites
No ratings yet
1999 - A Statistical Method For Practical Assessment of Sawability With Diamond Wire Cutting Machine of Ankara-Cubuk Andesites
4 pages
Applied Linear Regression 4th Edition Sanford Weisberg - Own The Complete Ebook Set Now in PDF and DOCX Formats
100% (3)
Applied Linear Regression 4th Edition Sanford Weisberg - Own The Complete Ebook Set Now in PDF and DOCX Formats
46 pages
Ordinary Least Squares Explained
No ratings yet
Ordinary Least Squares Explained
30 pages
Statistics Exam Practice Questions
No ratings yet
Statistics Exam Practice Questions
19 pages
Trendlines and Regression Analysis
No ratings yet
Trendlines and Regression Analysis
17 pages
Econometrics 005
No ratings yet
Econometrics 005
6 pages
PM278 ch20
No ratings yet
PM278 ch20
15 pages
Analysis of Tax Revenue Factors
No ratings yet
Analysis of Tax Revenue Factors
4 pages
CausalInference w7 Panel
No ratings yet
CausalInference w7 Panel
30 pages
Chapter 09 Assessing Studies Based On Multiple Regression
No ratings yet
Chapter 09 Assessing Studies Based On Multiple Regression
80 pages
Notes 1024 Part1
No ratings yet
Notes 1024 Part1
35 pages
Lecture 08 Dummy Variables
No ratings yet
Lecture 08 Dummy Variables
6 pages
Fidia Oktarisa, 2023
No ratings yet
Fidia Oktarisa, 2023
14 pages
Stat7220001
No ratings yet
Stat7220001
6 pages
Linear Regression
No ratings yet
Linear Regression
7 pages
Creep Curve Fitting in MAPDL: ANSYS Mechanical Nonlinear Materials
No ratings yet
Creep Curve Fitting in MAPDL: ANSYS Mechanical Nonlinear Materials
16 pages
1 4 Multilevel and Longitudinal Mode PDF
No ratings yet
1 4 Multilevel and Longitudinal Mode PDF
1,503 pages
Dummy Variables
No ratings yet
Dummy Variables
8 pages
Unit 4 Regression Analysis
No ratings yet
Unit 4 Regression Analysis
28 pages
Student Database STD ID STD Name Marks Percentage Grade Remark Grade Description Marks
No ratings yet
Student Database STD ID STD Name Marks Percentage Grade Remark Grade Description Marks
18 pages
ODD - Solutions Chapter 5
No ratings yet
ODD - Solutions Chapter 5
9 pages
Linear Regression for Stat Students
No ratings yet
Linear Regression for Stat Students
11 pages
Activity
No ratings yet
Activity
10 pages

Coding Question

Uploaded by

Coding Question

Uploaded by

### Coding Question: Building a Machine Learning Model to Predict Housing Prices

The dataset consists of the following features:

1. CRIM: per capita crime rate by town

3. INDUS: proportion of non-retail business acres per town

4. CHAS: Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)

5. NOX: nitric oxides concentration (parts per 10 million)

6. RM: average number of rooms per dwelling

7. AGE: proportion of owner-occupied units built prior to 1940

8. DIS: weighted distances to five Boston employment centres

9. RAD: index of accessibility to radial highways

10. TAX: full-value property tax rate per $10,000

11. PTRATIO: pupil-teacher ratio by town

12. B: 1000(Bk - 0.63)^2 where Bk is the proportion of Black residents by town

13. LSTAT: % lower status of the population

14. MEDV: Median value of owner-occupied homes in $1000s

1. Load and explore the dataset.

2. Preprocess the data.

3. Split the data into training and testing sets.

5. Evaluate the model.

6. Make predictions using the trained model.

### Step-by-Step Solution

#### 1. Load and Explore the Dataset

import matplotlib.pyplot as plt

import seaborn as sns

from sklearn.datasets import load_boston

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

boston_df = pd.DataFrame(boston.data, columns=boston.feature_names)

# Display the first few rows of the dataset

sns.heatmap(boston_df.corr(), annot=True, cmap='coolwarm')

#### 2. Preprocess the Data

# Features and target variable

# Standardize the data

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X_scaled, y, test_size=0.2, random_state=42)

#### 3. Train a Machine Learning Model (Linear Regression)

# Train the model

#### 4. Evaluate the Model

# Make predictions on the testing set

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)

print("Mean Squared Error:", mse)

# Plot the results

plt.scatter(y_test, y_pred, color='blue')

plt.plot([min(y_test), max(y_test)], [min(y_test), max(y_test)], color='red', linewidth=2)

#### 5. Make Predictions Using the Trained Model

# Predicting on new data (example)

print("Predicted price:", predicted_price)

### Explanation of the Code

1. **Loading and Exploring the Dataset**:

- The Boston Housing dataset is loaded using `load_boston()` from `sklearn.datasets`.

2. **Preprocessing the Data**:

- Features (`X`) and target variable (`y`) are separated.

- The features are standardized using `StandardScaler`.

3. **Training the Model**:

- A Linear Regression model is instantiated and trained on the training data.

- Model coefficients and intercept are printed.

- Predictions are made on the testing set.

- A scatter plot is generated to visualize the actual vs predicted values.

You might also like

1. Loading and Exploring the Dataset:

2. Preprocessing the Data:

3. Training the Model: