Oel 02

The document outlines a data analysis process involving health insurance data, including steps for data preprocessing, clustering, and predictive modeling. It utilizes various Python libraries for data manipulation and analysis, such as pandas, numpy, and sklearn. Key recommendations include targeting interventions for high-risk clusters and suggesting loyalty benefits for low-risk customers to enhance retention.

Uploaded by

shapparhay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

0% found this document useful (0 votes)

4 views4 pages

Oel 02

Uploaded by

shapparhay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

You are on page 1/ 4

¥ Minal Fatima 21-CP-55 OEL-02 DWM 1 step 2: Tapert Necessary Librantes Anoort pandas a6 94 Snport nunpy 3 99 ‘ngort seaborn at ane ingot natplotlio.pyplot as plt ‘from sklearn.nedel selection ingort train_test split ‘fron Sklearn-ensenble inport RandonForestRegressor ‘from sklearn.cluster import Means from skleorn metrics Seport nean_squoredercor, aceuracy_score, classification report from sidearm Sapte seport Sinpleinputer ‘rom sklearn prearocessing import stareardscaler 4 step 2: Load the Dataset ‘from google. cola> inport files loaded = Files-uploae() # Upload the CSV Fite here ‘le_nane = List (uploaded. keys())12] macifed:13/12/2024- 100% done Saving health-Snsurance-cata.cev to health insurance data.cev 4 Step 3: Data Preprocessing 4 Inspect the dataset Printtatases Overview") prine(a-neaet)) (ustonerI0 Age Gender Region HealthCondition ClainsCoure ClainAnount \ ° 1 'S6 “ale ease tat ere : 2 6 Male north eth 7 ‘ow 2 246 Female “east cardiac to astp.ee : 4S “Wale north Garotae 2 a 560 Male ease "oN a InsurancePrentun Layaltyvears 1 da03. 5 Ba 2 a 2 4 andle nissing values {noutar ~ Sinoleteputer(strategy="nedsan') af[ Clatnsoount') = inputer. i, transfora(dé{[“Clasntnount“}]) ft = def “Chatnanount”J-quantte(@.25) 1 = dfl“Chatmanount”J-qvaneile(@.75) ign > B= at lower pound = qi - 2.5 * tar oper pound = 43 + 2.5 * Lar {F[‘Clainsoount') = np.wbere(dé[ ‘Clatntrount'} > upper_bound, upper_bound, df{Clatanount'})4 Encode eategorteal varfables [of = pd.get_cmnses(af, colunnse[ Gender’, ‘Region’, ‘Helthcondstion'], drop firstetrue) 1 Standardize nunerical features Scaler = Standardscaler() um_cols = (‘age', ‘ClainsCount', “Clatnanount", “InsurancePseniun’, “Loyaltyvears') [oun cols] = sealen.#it_teanstorn(et(un_cols]) Print "ata after preprocessing:") print af-heoed)) ‘Fe oata after preprocessing: ‘ustoneri9 Age’ Clainscount ClaieAnount InsurancePrentun 3 44 0.996292 1.340602 9.803815 9.452705 4 5 eiscone “elsssm6 “a.sarie 0.320586 loyaityvears Genéer_tale Region North Region_south Region Mest \ oe savesies9 trie False false False a “eleatsse Tre ve False False 4 -b328i6 True false False false eaLthcoraition Cardise HealthConaition psabetes \ e alse else 1 False False 2 Troe False 2 tre False 4 alse False eatthcorat tion Hypertension 1 False 2 False 2 False 4 False 1 step 4: Patter Discovery # clustersng Ineans = Kheans(n_clusterse2, randon_state-s2) af ‘Cluster’] = eneans.f8¢_predict(@F[nur_cols]) sns.painplot(a, huew'Clusser’, varscrom-cols) ple-show() 1 step 5: Predictive Modeling 4 Split aataset X= dF-drop({CustonerZ0", “Claindnount'], anise) y= df{ 'Cainawount"] A.rain, ALeest, y_tain, y_test ~ train_test splittK, ys test_size-e.3, randon_staters2)a 2 *‘feature Seportances = pé.Sertes(nodel feature snportances_, SndexsX. colons) feature_inporeances.nlangest (10) plot{kings2arh") plt.title("reature Ieportances") pit.show() Feature Inportances ‘entnconeonabetes veathconton cardiac felon south conser Mate region North conmscount Levan ae ssurnceeium custer oo cl) eee eecomentatons) Frinetchterey highrise clusters sing ters for sarstedinerventons.”) Print ros on castors with hnish conditions coneriuting to Nghe clta.”) Print". Suggest Loyalty benesits for long-tern low-risk custoners to retain thet.") Print(é. Use predictive nodeling for proactive cost managenent by anticipating Nigh elaine.

Pandas: Reference Sheet
No ratings yet
Pandas: Reference Sheet
9 pages
4-10 Aiml
No ratings yet
4-10 Aiml
25 pages
Health Insurance Lead Prediction
No ratings yet
Health Insurance Lead Prediction
21 pages
Datascience 2 PDF
No ratings yet
Datascience 2 PDF
24 pages
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
No ratings yet
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
40 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
Module2.1 Feature Selection
No ratings yet
Module2.1 Feature Selection
38 pages
utf-8''C2M1 Assignment
No ratings yet
utf-8''C2M1 Assignment
24 pages
Healthcare Insurance Prediction Main
No ratings yet
Healthcare Insurance Prediction Main
74 pages
Python Sklearn Linear Regression
No ratings yet
Python Sklearn Linear Regression
45 pages
'Name-Piyush Tiwari''/n' 'Section - C'/N' 'Roll - No-2001610100142'
No ratings yet
'Name-Piyush Tiwari''/n' 'Section - C'/N' 'Roll - No-2001610100142'
28 pages
MACHINE LEARNING Manual
No ratings yet
MACHINE LEARNING Manual
36 pages
Certificate
No ratings yet
Certificate
33 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Aquif Ibrar 1212
No ratings yet
Aquif Ibrar 1212
9 pages
ML All Projectpdf Removed
No ratings yet
ML All Projectpdf Removed
41 pages
DA Assignment
No ratings yet
DA Assignment
18 pages
Chapter 5 - Classification Problems
100% (1)
Chapter 5 - Classification Problems
25 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
Sla4a 21im30005
No ratings yet
Sla4a 21im30005
11 pages
Komal ML Assg1
No ratings yet
Komal ML Assg1
9 pages
AIML Record Batch 9
No ratings yet
AIML Record Batch 9
88 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
Building Logistic Regression Model in Python
No ratings yet
Building Logistic Regression Model in Python
24 pages
PA LAb 3
No ratings yet
PA LAb 3
6 pages
Rapport
No ratings yet
Rapport
21 pages
ML 6 7 8
No ratings yet
ML 6 7 8
10 pages
Machine Learning Laboratory (BTCS619-18) B.Tech Cse 6Th 2024 EVEN
No ratings yet
Machine Learning Laboratory (BTCS619-18) B.Tech Cse 6Th 2024 EVEN
29 pages
Decision Support
No ratings yet
Decision Support
21 pages
Naive Bayes
No ratings yet
Naive Bayes
5 pages
Heart Disease Diagnosis Using Machine Learning
No ratings yet
Heart Disease Diagnosis Using Machine Learning
26 pages
Step by Step Data Processing For ML Project
No ratings yet
Step by Step Data Processing For ML Project
16 pages
Data Mining Lab Manual CSE VII Sem
No ratings yet
Data Mining Lab Manual CSE VII Sem
63 pages
17.11.24 - Jupyter Notebook - Doc
No ratings yet
17.11.24 - Jupyter Notebook - Doc
6 pages
AI and ML Lab Ex3 To 12
No ratings yet
AI and ML Lab Ex3 To 12
27 pages
Healthcare-Project-Simplilearn - Week3
No ratings yet
Healthcare-Project-Simplilearn - Week3
7 pages
Bank Marketing Targets 1724510938
No ratings yet
Bank Marketing Targets 1724510938
13 pages
Advance Python
No ratings yet
Advance Python
5 pages
Aiml Programs
No ratings yet
Aiml Programs
12 pages
Machine Learning With PySpark and MLlib - Solving A Binary Classification Problem - by Susan Li - Towards Data Science
No ratings yet
Machine Learning With PySpark and MLlib - Solving A Binary Classification Problem - by Susan Li - Towards Data Science
10 pages
Da Lab Mannual
No ratings yet
Da Lab Mannual
25 pages
ML Assigmengt Rishu Ranjan 12212221: # Import Necessary Libraries
No ratings yet
ML Assigmengt Rishu Ranjan 12212221: # Import Necessary Libraries
3 pages
Titanic Akshaya
No ratings yet
Titanic Akshaya
12 pages
ML Complete Notes Hridoy
No ratings yet
ML Complete Notes Hridoy
5 pages
Kartik MLP 4-9prg
No ratings yet
Kartik MLP 4-9prg
10 pages
Data Analytics
No ratings yet
Data Analytics
10 pages
Python Cod1
No ratings yet
Python Cod1
3 pages
Logistic Regression
No ratings yet
Logistic Regression
2 pages
Medical
No ratings yet
Medical
4 pages
Mi PR 5
No ratings yet
Mi PR 5
4 pages
Home Work
No ratings yet
Home Work
12 pages
Group Work Assignment Supervised and Unsupervised Learning
No ratings yet
Group Work Assignment Supervised and Unsupervised Learning
10 pages
Step 1
No ratings yet
Step 1
10 pages
Python 1
No ratings yet
Python 1
3 pages
PROJECTS
No ratings yet
PROJECTS
6 pages
Asset-V1 VIT+MBA109+2020+type@asset+block@Introductio To ML Using Python
No ratings yet
Asset-V1 VIT+MBA109+2020+type@asset+block@Introductio To ML Using Python
7 pages
ML Model Report
No ratings yet
ML Model Report
8 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages
Iii Aid - ML
No ratings yet
Iii Aid - ML
30 pages
OEL01
No ratings yet
OEL01
8 pages
Presentation Report
No ratings yet
Presentation Report
8 pages
DBMS Lab 06 - 21-CP-055
No ratings yet
DBMS Lab 06 - 21-CP-055
6 pages
Lab 9
No ratings yet
Lab 9
3 pages
Lab 06
No ratings yet
Lab 06
3 pages
CEPM Assignment01 Mobile Brand Comprision
No ratings yet
CEPM Assignment01 Mobile Brand Comprision
6 pages

Oel 02

Uploaded by

Oel 02

Uploaded by

You might also like