AI Lab 7
Practical No. 7
To perform Linear Regression using Python
Student’s Roll no: _______________ Points Scored: __________________________
OBJECTIVES: Upon successful completion of this practical, the students will be able to:
Understand linear regression and its basic concepts.
Understand types of linear regression.
Build a machine learning model to solve a linear regression problem for a given dataset.
Regression
What Is Regression?
Regression searches for relationships among variables. For example, you can observe several
employees of some company and try to understand how their salaries depend on their features,
such as experience, education level, role, city of employment, and so on.
This is a regression problem where data related to each employee represents one observation. The
presumption is that the experience, education, role, and city are the independent features, while the
salary depends on them.
Similarly, you can try to establish the mathematical dependence of housing prices on area, number
of bedrooms, distance to the city center, and so on.
Generally, in regression analysis, you consider some phenomenon of interest and have a number of
observations. Each observation has two or more features. Following the assumption that at least
one of the features depends on the others, you try to establish a relation among them.
In other words, you need to find a function that maps some features or variables to others
sufficiently well.
The dependent features are called the dependent variables, outputs, or responses. The independent
features are called the independent variables, inputs, regressors, or predictors.
Regression problems usually have one continuous and unbounded dependent variable. The inputs,
however, can be continuous, discrete, or even categorical data such as gender, nationality, or brand.
It’s a common practice to denote the outputs with 𝑦 and the inputs with 𝑥. If there are two or more
independent variables, then they can be represented as the vector 𝐱 = (𝑥₁, …, 𝑥ᵣ), where 𝑟 is the
number of inputs.
Regression is also useful when you want to forecast a response using a new set of predictors. For
example, you could try to predict electricity consumption of a household for the next hour given the
outdoor temperature, time of day, and number of residents in that household.
Linear Regression
Linear regression is an algorithm that models a linear relationship between an independent
variable and a dependent variable to predict the outcome of future events. It is a statistical method
used in data science and machine learning for predictive analysis.
The figure above shows the relationship between the quantity of apples and the cost price. How
much do you need to pay for 7 kg of apples? It's easy: if 1 kg costs $5, then 7 kg cost
7 * 5 = $35. Alternatively, you can draw a perpendicular line from the point 7 on the x-axis until it
touches the line and read the corresponding value on the y-axis, as shown by the green
dotted line on the graph. But we are going to solve it using the formula of a linear equation, y = mx + b.
Now, if we have to find the price of 9.5 kg of apples, then according to our model mx + b = 5 * 9.5 + 0
= $47.5 is the answer. By now you might have understood that m and b are the main ingredients of
the linear equation; in other words, m and b are called the parameters.
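As a quick sanity check, the same calculation can be written in a few lines of Python. The slope of 5 ($/kg) and zero intercept are simply the values from the example above:

# Apple-price model from the example: y = m*x + b with m = 5 ($/kg), b = 0
m, b = 5, 0

def price(kg):
    return m * kg + b

print(price(7))    # 35.0  -> $35 for 7 kg
print(price(9.5))  # 47.5  -> $47.5 for 9.5 kg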
A company named ABC provides you with data on house sizes and their prices. The company
requires a machine learning model that can predict the house price for any given size. Let's
say, what would be the best-estimated price for an area of 3000 square feet? If you are thinking of fitting a
line somewhere through the dataset, drawing a vertical line from 3000 on the x-axis until it
touches the line, and taking the corresponding value on the y-axis, i.e. 470, as the answer, then
you are on the right track; it is represented by the green dotted line in the figure below.
Let's do it another way: if we can find the equation of the line y = mx + b that fits the data,
represented by the blue inclined line, then we can easily build a model that predicts the housing
price for any given area. In machine learning lingo, the function y = mx + b is also called a hypothesis
function, where m and b can be represented by theta1 and theta0 respectively. theta0 is also
called the bias term, and theta1, theta2, ... are called weights.
See the blue line in the picture above. By taking any two samples that touch or lie very close to the line,
we can find theta1 (slope) = 0.132 and theta0 (intercept) = 80, as shown in the figure. Now we can
use our hypothesis function to predict the housing price for a size of 3000 square feet, i.e. 80 + 3000 * 0.132 =
476. So $476,000 could be the best-estimated price for a house of 3000 square feet, and this could
be a reasonable way to prepare a machine learning model when you have just 50 samples and
only one feature (size).
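The same hand-fitted hypothesis can be expressed directly in Python; the parameter values below (theta0 = 80, theta1 = 0.132, prices in thousands of dollars) are just the ones read off the figure above:

# Hand-fitted hypothesis h(x) = theta0 + theta1 * x (price in $1000s, size in sq. ft)
theta0, theta1 = 80, 0.132

def predict_price(size_sqft):
    return theta0 + theta1 * size_sqft

print(predict_price(3000))  # 476.0 -> about $476,000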
Simple linear regression
Simple linear regression reveals the correlation between a dependent variable (output) and an
independent variable (input). Primarily, this regression type describes the following:
Relationship strength between the given variables.
Example: The relationship between pollution levels and rising temperatures.
The value of the dependent variable based on the value of the independent variable.
Example: The value of the pollution level at a specific temperature.
Multiple linear regression
Multiple linear regression establishes the relationship between independent variables (two or
more) and the corresponding dependent variable. Here, the independent variables can be either
continuous or categorical. This regression type helps foresee trends, determine future values, and
predict the impacts of changes.
Example: Consider the task of calculating blood pressure. In this case, height, weight, and amount
of exercise can be considered independent variables. Here, we can use multiple linear regression to
analyze the relationship between the three independent variables and one dependent variable, as
all the variables considered are quantitative.
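A minimal sketch of multiple linear regression with scikit-learn, using made-up blood-pressure data purely to illustrate the idea (the feature values and units below are hypothetical, not part of the handout):

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data: height (cm), weight (kg), exercise (hours/week) -> blood pressure
X = np.array([[170, 65, 3],
              [160, 70, 1],
              [180, 90, 0],
              [175, 75, 5],
              [165, 60, 4]])
y = np.array([118, 125, 140, 120, 115])

model = LinearRegression()
model.fit(X, y)

print(model.coef_)                    # one weight (theta) per independent variable
print(model.intercept_)               # the bias term theta0
print(model.predict([[172, 68, 2]]))  # predicted blood pressure for a new person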
The MSE, MAE, RMSE, and R-Squared metrics are mainly used to evaluate the prediction error rates
and model performance in regression analysis.
MAE (Mean Absolute Error) represents the difference between the original and predicted values,
computed by averaging the absolute differences over the data set.
MSE (Mean Squared Error) represents the difference between the original and predicted values,
computed by averaging the squared differences over the data set.
RMSE (Root Mean Squared Error) is the square root of the MSE.
R-squared (coefficient of determination) represents how well the predicted values fit
the original values. It ranges from 0 to 1 and can be interpreted as a percentage: the higher the
value, the better the model.
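All four metrics are available in (or easily derived from) sklearn.metrics. A small sketch, assuming y_test holds the true values and y_pred the model's predictions (the numbers below are made up):

import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_test = np.array([3.0, 2.5, 4.0, 5.1])   # hypothetical true values
y_pred = np.array([2.8, 2.9, 4.2, 4.8])   # hypothetical predictions

mae = mean_absolute_error(y_test, y_pred)
mse = mean_squared_error(y_test, y_pred)
rmse = np.sqrt(mse)                        # RMSE is simply the square root of MSE
r2 = r2_score(y_test, y_pred)

print(mae, mse, rmse, r2)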
Linear Regression using Python.
Steps:
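The exact steps depend on the dataset used in the lab, but a typical scikit-learn workflow looks roughly like the sketch below. The file name housing.csv and the column names size and price are placeholders, not part of the original handout:

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

# 1. Load the dataset (placeholder file and column names)
df = pd.read_csv("housing.csv")
X = df[["size"]]          # independent variable(s)
y = df["price"]           # dependent variable

# 2. Split into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# 3. Fit the linear regression model
model = LinearRegression()
model.fit(X_train, y_train)

# 4. Predict and evaluate
y_pred = model.predict(X_test)
print("Coefficients:", model.coef_, "Intercept:", model.intercept_)
print("MSE:", mean_squared_error(y_test, y_pred))
print("R-squared:", r2_score(y_test, y_pred))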
Lab Tasks
1. Perform linear regression on the Boston house price prediction dataset. Add screenshots of all steps.
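Note: load_boston was removed from scikit-learn in version 1.2, so depending on the installed version you may need to load the raw data yourself. One possible sketch, assuming the dataset is still hosted at its original CMU URL:

import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression

# Load the Boston housing data from its original source (assumes the URL is still reachable)
url = "http://lib.stat.cmu.edu/datasets/boston"
raw = pd.read_csv(url, sep=r"\s+", skiprows=22, header=None)
X = np.hstack([raw.values[::2, :], raw.values[1::2, :2]])  # 13 feature columns
y = raw.values[1::2, 2]                                    # median house value (target)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
model = LinearRegression().fit(X_train, y_train)
print("R-squared on the test set:", model.score(X_test, y_test))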
The End