Aih Exp 1

Uploaded by

sanket.pingale2004


Sardar Patel Institute of Technology, Mumbai

Department of Electronics and Telecommunication Engineering

B.E. Sem-VII- PE-IV (2024-2025)
IT 24 - AI in Healthcare

Experiment 1: Regression in Healthcare Dataset

Name: Prithvi Singh
Date: 16/08/2024
UID: 2022301014

Objective:
● Write a program for regression analysis of a healthcare dataset.
● Demonstrate the working principle of regression techniques on a medical data set by
building a model that can classify or predict a new sample.
Outcomes:
● Explore a medical dataset suitable for a linear or logistic regression problem.
● Explore the patterns in the dataset and apply a suitable algorithm.

System Requirements:
Linux OS with Python and libraries, or R, or Windows with MATLAB

• Theory:

What is regression with a mathematical approach?

Regression is a statistical method for modeling and analyzing the relationship between a dependent
variable (often denoted y) and one or more independent variables (x). Regression analysis finds the
equation that best fits the data, so that the dependent variable can be predicted from the
independent variables.

Mathematical Approach to Regression

1. Linear Regression

Linear regression analysis is used to predict the value of a variable based on the value of another
variable. The variable you want to predict is called the dependent variable. The variable you are
using to predict the other variable's value is called the independent variable.
The linear regression formula is:
y = mx + b
Where:
• y is the dependent variable (response variable)
• x is the independent variable (predictor variable)
• m is the slope (coefficient) of the regression line
• b is the intercept (constant) of the regression line
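
The slope and intercept above can be estimated from data by ordinary least squares. The following is a minimal sketch in plain Python; the function name fit_line and the sample points are made up for illustration and are not part of the experiment.

```python
def fit_line(xs, ys):
    """Return (m, b) that minimize the sum of squared errors for y = m*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = (sum of cross deviations) / (sum of squared deviations of x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    # The fitted line always passes through the point (mean_x, mean_y)
    b = mean_y - m * mean_x
    return m, b


# Made-up points that lie exactly on y = 2x + 1
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
m, b = fit_line(xs, ys)
print(f"slope m = {m}, intercept b = {b}")
```

Because the sample points lie exactly on a line, the fit recovers that line; with noisy medical data the same formulas give the best-fitting line in the least-squares sense.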

2. Logistic Regression

Logistic regression estimates the probability of an event occurring, such as voted or didn't vote,
based on a given data set of independent variables.

This type of statistical model (also known as a logit model) is often used for classification and
predictive analytics. Since the outcome is a probability, the dependent variable is bounded between
0 and 1. In logistic regression, a logit transformation is applied to the odds, that is, the probability
of success divided by the probability of failure. The result is commonly known as the log odds, or
the natural logarithm of the odds. The logistic function and the logit are given by the following
formulas, where p_i is the predicted probability and z_i is the linear combination of the predictors:
p_i = 1/(1 + exp(-z_i))

logit(p_i) = ln(p_i/(1-p_i)) = z_i = Beta_0 + Beta_1*X_1 + … + Beta_k*X_k
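
The link between the log odds and the probability can be checked numerically: the logistic (sigmoid) function turns a linear score into a probability, and the logit turns that probability back into the score. This is a small sketch; the coefficient values and the input x are hypothetical.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real score z to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def logit(p):
    """Log odds: natural logarithm of p / (1 - p)."""
    return math.log(p / (1.0 - p))


# Hypothetical coefficients for a model with a single predictor
beta_0, beta_1 = -1.0, 0.5
x = 4.0

z = beta_0 + beta_1 * x   # linear score (the log odds)
p = sigmoid(z)            # predicted probability of the event
print(f"log odds = {z}, probability = {p:.4f}, logit(p) = {logit(p):.4f}")
```

Applying logit to the output of sigmoid returns the original linear score, which is why the coefficients of a logistic regression are read as changes in log odds.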

What are the different forms of regression, and what is their significance?
Regression analysis falls into several categories, each serving a specific function depending on the
nature of the independent and dependent variables. A summary of the common types of
regression and their applicability follows.

1. Linear Regression

Types:

1. Simple Linear Regression: Uses a straight line to represent the relationship between a
single independent variable and a dependent variable.
2. Multiple Linear Regression: Models the relationship between a
dependent variable and two or more independent variables.

Significance: Helpful for understanding the relationship between factors and for predicting a
continuous outcome. It supports forecasting and trend identification and rests on the assumption of
a linear relationship.
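
Multiple linear regression extends the single-predictor line to several predictors plus an intercept. The sketch below solves the least-squares problem with NumPy; the data are made up so that the true coefficients are known (intercept 1, coefficients 2 and 3).

```python
import numpy as np

# Made-up data: two predictors, five samples
X = np.array([
    [1.0, 2.0],
    [2.0, 1.0],
    [3.0, 4.0],
    [4.0, 3.0],
    [5.0, 5.0],
])
# Response generated exactly as y = 1 + 2*x1 + 3*x2 (no noise)
y = 1.0 + 2.0 * X[:, 0] + 3.0 * X[:, 1]

# Prepend a column of ones so the first coefficient acts as the intercept
A = np.column_stack([np.ones(len(X)), X])

# Least-squares solution of A @ coef ≈ y
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
intercept, b1, b2 = coef
print(f"intercept = {intercept:.3f}, b1 = {b1:.3f}, b2 = {b2:.3f}")
```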

2. Logistic Regression

Based on the categories of the dependent variable, logistic regression can be classified into three types:

Types:

1. Binomial: In binomial logistic regression, the dependent variable can take only two
possible values, such as 0 or 1, Pass or Fail, etc.

2. Multinomial: In multinomial logistic regression, the dependent variable can take three or
more unordered values, such as "cat," "dog," or "sheep."

3. Ordinal: In ordinal logistic regression, the dependent variable can take three or more
ordered values, such as "low," "medium," or "high."

Significance: Critical for estimating the probability of a categorical outcome, especially when
dealing with binary or multinomial response variables. It provides information about the factors
influencing categorical outcomes.
• Dataset for Logistic Regression:

• ALGORITHM:
Load the Original Dataset:

● Read the original dataset from a CSV file.

Separate Features and Target:

● Identify and separate the target variable (Growing_Stress) and the feature variables.
● Split the features into numerical and categorical columns.
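
The separation step can be sketched with pandas as follows. Only the column name Growing_Stress comes from this report; the small in-memory table stands in for the CSV file, and its other columns and values are hypothetical.

```python
import pandas as pd

# Stand-in for the loaded CSV; in practice this would come from pd.read_csv(...).
# Only "Growing_Stress" is taken from the report, the rest is hypothetical.
df = pd.DataFrame({
    "Age": [21, 35, 42, 29],
    "Gender": ["Female", "Male", "Male", "Female"],
    "Occupation": ["Student", "Corporate", "Business", "Student"],
    "Growing_Stress": ["Yes", "No", "Yes", "No"],
})

target_col = "Growing_Stress"
y = df[target_col]                 # target variable
X = df.drop(columns=[target_col])  # feature variables

# Split the feature columns by data type
numeric_cols = X.select_dtypes(include="number").columns.tolist()
categorical_cols = X.select_dtypes(exclude="number").columns.tolist()

print("numeric:", numeric_cols)
print("categorical:", categorical_cols)
```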

Generate Synthetic Data:

● For Numeric Columns:

○ Generate synthetic data using a normal distribution based on the mean and standard
deviation of the original data for each numeric feature.
● For Categorical Columns:
○ Generate synthetic data by randomly sampling from the unique values in the
original dataset. Preserve the original distribution using probabilities.
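
One possible way to implement the synthetic data step is sketched below with NumPy and pandas. The helper name make_synthetic, the feature table, and the seed are assumptions made for illustration; the report does not show its own code.

```python
import numpy as np
import pandas as pd


def make_synthetic(features, n_rows, rng):
    """Sample n_rows synthetic rows, one column at a time, from `features`."""
    columns = {}
    for col in features.columns:
        series = features[col]
        if pd.api.types.is_numeric_dtype(series):
            # Normal distribution with the column's own mean and standard deviation
            columns[col] = rng.normal(series.mean(), series.std(), size=n_rows)
        else:
            # Sample the observed categories with their observed relative frequencies
            freqs = series.value_counts(normalize=True)
            columns[col] = rng.choice(
                freqs.index.to_numpy(), size=n_rows, p=freqs.to_numpy()
            )
    return pd.DataFrame(columns)


# Hypothetical feature table standing in for the original dataset
features = pd.DataFrame({
    "Age": [21, 35, 42, 29, 50, 23],
    "Occupation": ["Student", "Corporate", "Business",
                   "Student", "Business", "Student"],
})

rng = np.random.default_rng(42)  # fixed seed so the run is repeatable
synthetic = make_synthetic(features, n_rows=10, rng=rng)
print(synthetic.head())
```

Each column is sampled independently here, so any correlation between features in the original data is not carried over to the synthetic rows.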

Model the Target Variable (Growing_Stress):

● Train a logistic regression model using the original dataset to predict the
Growing_Stress variable.
● Use this model to predict Growing_Stress values for the synthetic data.
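
A sketch of this step with scikit-learn is shown below: a logistic regression is fitted on the original rows and then used to label the synthetic rows. The preprocessing choices (scaling and one-hot encoding) and all data values are assumptions; only the Growing_Stress column name comes from the report.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical original data with a known target
original = pd.DataFrame({
    "Age": [21, 35, 42, 29, 50, 23],
    "Occupation": ["Student", "Corporate", "Business",
                   "Student", "Business", "Student"],
    "Growing_Stress": ["Yes", "No", "No", "Yes", "No", "Yes"],
})
# Hypothetical synthetic features that still need a target value
synthetic_features = pd.DataFrame({
    "Age": [22.0, 47.5],
    "Occupation": ["Student", "Business"],
})

X = original.drop(columns=["Growing_Stress"])
y = original["Growing_Stress"]

# Scale numeric columns and one-hot encode categorical columns
preprocess = ColumnTransformer([
    ("num", StandardScaler(), ["Age"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["Occupation"]),
])
model = Pipeline([
    ("preprocess", preprocess),
    ("clf", LogisticRegression(max_iter=1000)),
])
model.fit(X, y)

# Label the synthetic rows with the trained model
synthetic = synthetic_features.copy()
synthetic["Growing_Stress"] = model.predict(synthetic_features)
print(synthetic)
```

Because the synthetic labels come from the model itself, a model retrained on the combined data will tend to agree with them, which should be kept in mind when interpreting the later accuracy figure.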
Combine Original and Synthetic Data:

● Append the synthetic data to the original dataset.

Save the Combined Dataset:

● Write the combined dataset (original + synthetic) to a new CSV file.
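
Combining and saving can be done with pandas as sketched below. The two small tables and the output file name combined_dataset.csv are hypothetical; the file is written to the system temporary directory so the example has no side effects on a project folder.

```python
import os
import tempfile

import pandas as pd

# Hypothetical original and synthetic tables with identical columns
original = pd.DataFrame({
    "Age": [21.0, 35.0],
    "Growing_Stress": ["Yes", "No"],
})
synthetic = pd.DataFrame({
    "Age": [27.4, 44.1, 31.9],
    "Growing_Stress": ["Yes", "No", "No"],
})

# Append the synthetic rows and renumber the index from 0
combined = pd.concat([original, synthetic], ignore_index=True)

# Write the combined data to a new CSV file without the index column
out_path = os.path.join(tempfile.gettempdir(), "combined_dataset.csv")
combined.to_csv(out_path, index=False)

# Read the file back to confirm that it round-trips
reloaded = pd.read_csv(out_path)
print(len(original), len(synthetic), len(reloaded))
```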

Validate Model Accuracy:

● Train a logistic regression model on the combined dataset to check the model's accuracy.
Ensure the accuracy is around 95%.
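
The validation step can be sketched with a held-out test split, as below. The data here are randomly generated stand-ins (the Stress_Score feature is hypothetical), so the printed accuracy illustrates the procedure only and does not reproduce the 95% figure mentioned above.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Generated stand-in for the combined dataset
rng = np.random.default_rng(0)
n = 200
score = rng.normal(0.0, 1.0, n)   # hypothetical numeric feature
noise = rng.normal(0.0, 0.3, n)   # makes the label imperfectly predictable
combined = pd.DataFrame({
    "Stress_Score": score,
    "Growing_Stress": np.where(score + noise > 0, "Yes", "No"),
})

X = combined[["Stress_Score"]]
y = combined["Growing_Stress"]

# Hold out 20% of the rows so accuracy is measured on unseen data
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
accuracy = accuracy_score(y_test, clf.predict(X_test))
print(f"Held-out accuracy: {accuracy:.2f}")
```

Measuring accuracy on rows the model has not seen gives a more honest estimate than scoring it on its own training data.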

● Logistic Regression Dataset:

https://www.kaggle.com/datasets/bhavikjikadara/mental-health-dataset

Output:
● Testing it again on new dataset

● Conclusion:
We performed logistic regression on a mental health dataset. First, we used the original dataset to train
the model. Next, we constructed a new dataset with comparable properties and used our trained
model to predict its values with a comparable level of accuracy. To sum up, we have successfully
written a regression analysis program for a healthcare dataset and illustrated how regression techniques
operate on a mental health data set to build a model that can predict or classify a new
sample.
