0% found this document useful (0 votes)

14 views

Assignment 4

Uploaded by

arun raghu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views

Assignment 4

Uploaded by

arun raghu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Assignment-4

1. What is Linear Regression, and how is it used in Machine Learning?

Linear Regression is one of the fundamental supervised learning algorithms used in machine
learning for predictive modeling. It establishes a relationship between an independent
variable (input) and a dependent variable (output) by fitting a straight line to the data. This
line, known as the regression line, is represented by the equation:

Y=mX+CY = mX + C

where:

 YY is the predicted output,

 mm is the slope (coefficient),
 XX is the input variable, and
 CC is the intercept.

Linear Regression is mainly used for predicting continuous values, such as stock prices,
house prices, or temperature trends. The model learns by minimizing the difference between
the actual and predicted values using techniques like Ordinary Least Squares (OLS) or
Gradient Descent.

2. How do we implement Linear Regression using a programming language like

Python?

Linear Regression can be implemented in Python using libraries such as scikit-learn.

Below is a simple example:

from sklearn.linear_model import LinearRegression

from sklearn.model_selection import train_test_split
import numpy as np

# Sample data
X = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)
Y = np.array([2, 4, 5, 4, 5])
# Splitting data into training and testing sets
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.2,
random_state=42)

# Creating and training the model

model = LinearRegression()
model.fit(X_train, Y_train)

# Making predictions
predictions = model.predict(X_test)
print("Predictions:", predictions)

This script trains a linear regression model on a small dataset and makes predictions on
unseen data.

3. What are some real-world applications of Linear Regression?

Linear Regression has a wide range of real-world applications, including:

 Finance: Predicting stock prices based on historical trends.

 Healthcare: Estimating patient recovery time based on health indicators.
 Marketing: Understanding the impact of advertising budgets on sales revenue.
 Real Estate: Predicting house prices based on features like location and size.
 Manufacturing: Forecasting product demand based on past sales data.

These applications demonstrate how Linear Regression is an essential tool for making data-
driven decisions.

4. What are the key performance parameters used to evaluate a Linear Regression
model?

Key performance metrics for evaluating a Linear Regression model include:

 Mean Absolute Error (MAE): Measures the average absolute difference between
actual and predicted values.
 Mean Squared Error (MSE): Penalizes larger errors by squaring the differences.
 Root Mean Squared Error (RMSE): The square root of MSE, providing error in
original units.
 R-Squared (R²): Represents the proportion of variance in the dependent variable
explained by the model. A value close to 1 indicates a good fit.

These metrics help in assessing how well the model generalizes to unseen data.

5. What is a Decision Tree Classifier, and how does it work?

A Decision Tree Classifier is a supervised learning algorithm used for classification tasks. It
works by recursively splitting the dataset based on feature values to create a tree-like
structure of decision rules.

At each node, the algorithm selects the best feature to split the data by minimizing impurity
(measured using Gini Index or Entropy). The process continues until all samples in a node
belong to the same class or another stopping criterion is met.

Decision Trees are widely used because they are easy to interpret and handle both numerical
and categorical data effectively.

6. What are the key differences between Classification and Regression Trees?

Aspect Classification Trees Regression Trees

Categorical labels (e.g., spam/non- Continuous values (e.g., house
Output Type
spam) prices)
Splitting
Gini Index, Entropy Mean Squared Error (MSE)
Criterion
Application Used for classification tasks Used for regression tasks

While both trees use recursive partitioning, classification trees focus on predicting categories,
whereas regression trees predict continuous values.

7. How does the Gini Index help in creating a Decision Tree?

The Gini Index is a measure of impurity used to split nodes in a Decision Tree. It is
calculated as:
Gini=1−∑pi2Gini = 1 - \sum p_i^2

where pip_i is the probability of a class appearing in the node.

A lower Gini Index indicates a purer node. The algorithm selects splits that minimize
impurity, leading to better classification performance.

8. What is the ID3 algorithm, and how does it use Information Gain?

The ID3 (Iterative Dichotomiser 3) algorithm is a Decision Tree learning algorithm that
builds trees using the concept of Information Gain. Information Gain measures the reduction
in entropy (randomness) after a dataset split.

Formula for entropy:

Entropy=−∑pilog 2piEntropy = - \sum p_i \log_2 p_i

The feature with the highest Information Gain is chosen for splitting, as it provides the most
informative split.

9. What is a Random Forest Classifier, and how does it improve accuracy?

A Random Forest Classifier is an ensemble learning method that combines multiple Decision
Trees to improve accuracy and reduce overfitting. It works by:

1. Creating multiple Decision Trees using different subsets of data and features.
2. Aggregating predictions from all trees (majority vote for classification, average for
regression).

This method increases robustness, generalization, and reduces sensitivity to individual noisy
features.

10. Can you explain a real-world case study where regression and classification models
are used to solve a problem?

A real-world example of using both regression and classification is loan approval prediction
and risk assessment in banking.
1. Classification Model (Decision Tree/Random Forest):
o Used to classify loan applicants as "Approved" or "Rejected" based on factors
like credit score, income, and employment status.
o Helps automate loan processing, improving efficiency.
2. Regression Model (Linear Regression):
o Used to predict loan default probability based on factors like past loan history,
outstanding debts, and economic trends.
o Helps banks decide interest rates and loan limits.

This combination of regression and classification ensures accurate decision-making,

minimizing financial risk while improving customer experience.

ETC07402 Communication Switching Systems - Question Bank - With Answers Rev 03
100% (4)
ETC07402 Communication Switching Systems - Question Bank - With Answers Rev 03
45 pages
DL DL2 DL3 Merged
No ratings yet
DL DL2 DL3 Merged
11 pages
Week 4 Q&A
No ratings yet
Week 4 Q&A
7 pages
ML 2
No ratings yet
ML 2
6 pages
SemVII_MachineLearning
No ratings yet
SemVII_MachineLearning
22 pages
Machine Learning Viva Questions
No ratings yet
Machine Learning Viva Questions
6 pages
Machine Learning Most Important Question For Mid Term Ipu University
No ratings yet
Machine Learning Most Important Question For Mid Term Ipu University
36 pages
Unit 3
No ratings yet
Unit 3
18 pages
ML Viva Questions
No ratings yet
ML Viva Questions
8 pages
ML Viva
No ratings yet
ML Viva
28 pages
Distinguish between decision trees
No ratings yet
Distinguish between decision trees
2 pages
Regression
No ratings yet
Regression
13 pages
00. April27 Revision LR DT Boosting Student Copy
No ratings yet
00. April27 Revision LR DT Boosting Student Copy
33 pages
AIML-QB- UNIT 3
No ratings yet
AIML-QB- UNIT 3
6 pages
ML Model Paper 2 Solution
No ratings yet
ML Model Paper 2 Solution
15 pages
ML Short
No ratings yet
ML Short
11 pages
ML Unit-2 Final
No ratings yet
ML Unit-2 Final
32 pages
LP I ML Viva Questions
100% (1)
LP I ML Viva Questions
9 pages
Machine Learning Questions
No ratings yet
Machine Learning Questions
21 pages
ML QB WITH ANSWER
No ratings yet
ML QB WITH ANSWER
20 pages
ML 2 marks
No ratings yet
ML 2 marks
7 pages
Types of Regression
No ratings yet
Types of Regression
8 pages
Machine learning
No ratings yet
Machine learning
62 pages
unit iii
No ratings yet
unit iii
57 pages
SEM MLOps
No ratings yet
SEM MLOps
58 pages
Sem Rpa
No ratings yet
Sem Rpa
61 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
32 pages
UNIT II Machine Learning
No ratings yet
UNIT II Machine Learning
118 pages
Unit 4 - Machine Learning PDF
No ratings yet
Unit 4 - Machine Learning PDF
49 pages
practice_paper_4
No ratings yet
practice_paper_4
9 pages
Supervised Learning
No ratings yet
Supervised Learning
187 pages
Data Science Interview Questions
100% (1)
Data Science Interview Questions
68 pages
ML Cheatsheet Final
No ratings yet
ML Cheatsheet Final
32 pages
Aiml 4
No ratings yet
Aiml 4
107 pages
Supervised and Unsupervised Learning
No ratings yet
Supervised and Unsupervised Learning
92 pages
M2 - Supervised Machine Learning
No ratings yet
M2 - Supervised Machine Learning
79 pages
ML points
No ratings yet
ML points
13 pages
Fam QB Ans
No ratings yet
Fam QB Ans
9 pages
Sarcia - Judd Michael - AS3
No ratings yet
Sarcia - Judd Michael - AS3
5 pages
ML ASSIGNMENT-01
No ratings yet
ML ASSIGNMENT-01
7 pages
ML & DL Notes
No ratings yet
ML & DL Notes
30 pages
Introduction To AI and ML
No ratings yet
Introduction To AI and ML
22 pages
PWC
No ratings yet
PWC
24 pages
Accenture
No ratings yet
Accenture
3 pages
ML 2 nd Unit
No ratings yet
ML 2 nd Unit
50 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
unit 6 questions and answers
No ratings yet
unit 6 questions and answers
4 pages
Machine learning notes
No ratings yet
Machine learning notes
12 pages
ML_Theory
No ratings yet
ML_Theory
10 pages
Linear Regression for ML ass
No ratings yet
Linear Regression for ML ass
99 pages
Here are some possible questions and answers based on the uploaded documents
No ratings yet
Here are some possible questions and answers based on the uploaded documents
8 pages
ML Interview Ques
No ratings yet
ML Interview Ques
12 pages
Solved With ChatGPT
No ratings yet
Solved With ChatGPT
3 pages
UNIT II Machine Learning
No ratings yet
UNIT II Machine Learning
118 pages
Machine Learning Questions and Answers For Interview
No ratings yet
Machine Learning Questions and Answers For Interview
20 pages
Machine Learning Question Bank-Unit 3
No ratings yet
Machine Learning Question Bank-Unit 3
6 pages
MLP Question Bank of AI and ML and NLP
No ratings yet
MLP Question Bank of AI and ML and NLP
7 pages
Final_AIP_Spring_24(Sloution)
No ratings yet
Final_AIP_Spring_24(Sloution)
16 pages
ML final
No ratings yet
ML final
92 pages
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Unix and OS Lab Programs
No ratings yet
Unix and OS Lab Programs
25 pages
Twisted Pair Patch Cable, Raw - Category 5e Class D - SF/UTP
No ratings yet
Twisted Pair Patch Cable, Raw - Category 5e Class D - SF/UTP
2 pages
On the Internet 1st Edition Hubert L. Dreyfus pdf download
100% (1)
On the Internet 1st Edition Hubert L. Dreyfus pdf download
52 pages
Assignment 2 IT Infrastructure
No ratings yet
Assignment 2 IT Infrastructure
2 pages
Mod Menu Log - JP - Co.ponos - Battlecatsen
No ratings yet
Mod Menu Log - JP - Co.ponos - Battlecatsen
433 pages
Developing Android
No ratings yet
Developing Android
11 pages
Linux Commands With Examples
No ratings yet
Linux Commands With Examples
11 pages
Tugas Pertemuan 4 PUTRI SALSABILA 18043138 A. Baldzan
No ratings yet
Tugas Pertemuan 4 PUTRI SALSABILA 18043138 A. Baldzan
3 pages
Ad of Civil Judge
No ratings yet
Ad of Civil Judge
20 pages
Waf Logs
No ratings yet
Waf Logs
5 pages
uCOS-II Kernel Structure
No ratings yet
uCOS-II Kernel Structure
43 pages
Turbo HD 4.0 Solution - 1
No ratings yet
Turbo HD 4.0 Solution - 1
24 pages
AP Mode Switching - Overview, Configuration Examples, and Troubleshooting
No ratings yet
AP Mode Switching - Overview, Configuration Examples, and Troubleshooting
43 pages
Technology Report 2021: The 20s Roar
No ratings yet
Technology Report 2021: The 20s Roar
92 pages
Shift Registers Notes
No ratings yet
Shift Registers Notes
146 pages
IEC-IM05 Series: Key Features
No ratings yet
IEC-IM05 Series: Key Features
1 page
Reviews Scam, Legit or Safe Check Scamadviser
No ratings yet
Reviews Scam, Legit or Safe Check Scamadviser
1 page
14:332:231 Digital Logic Design: Ivan Marsic, Rutgers University Electrical & Computer Engineering Fall 2013
No ratings yet
14:332:231 Digital Logic Design: Ivan Marsic, Rutgers University Electrical & Computer Engineering Fall 2013
5 pages
RAB Pengadaan Peralatan Praktik
No ratings yet
RAB Pengadaan Peralatan Praktik
2 pages
Chapter 05 Short Workplace Messages and Digital Media
No ratings yet
Chapter 05 Short Workplace Messages and Digital Media
45 pages
Placement Management System
No ratings yet
Placement Management System
10 pages
ESP Registration Details
No ratings yet
ESP Registration Details
1 page
ZigBee+3.0+module+AT+command+standard+specification EN V1.0
No ratings yet
ZigBee+3.0+module+AT+command+standard+specification EN V1.0
16 pages
Ch03 Step Wise
No ratings yet
Ch03 Step Wise
36 pages
Coca Cola
No ratings yet
Coca Cola
9 pages
WeatherLink Console Guide
No ratings yet
WeatherLink Console Guide
29 pages
English Practice Test Number 6 (Code Vtmp)
No ratings yet
English Practice Test Number 6 (Code Vtmp)
4 pages
M3660idn M3655idn M3145idn M3645idnENRMR20
No ratings yet
M3660idn M3655idn M3145idn M3645idnENRMR20
48 pages
Scalability in Cloud Computing
No ratings yet
Scalability in Cloud Computing
6 pages