Final Presentation

Uploaded by

Jamila Hamdi

Drug Persistency Project

Virtual Internship: Final Presentation

Group Name: Health+

Group Members:
Mohammad Odeh (United Arab Emirates)
Sakib Mahmud (Qatar)
Date: 11-May-2021
Background – Drug Persistency case study

• One of the challenges for all pharmaceutical companies is to understand the persistency of a drug as per the physician's prescription. To solve this problem, ABC pharma company approached an analytics company to automate the process of identifying persistency.
• Objective: Gather insights on the factors that impact persistency and build a classification model for the given dataset.
The analysis has been divided into three parts:
• Data Understanding
• Data insights and visualization
• Recommendations
Data Exploration
• 68 features, including:
  • General features (demographics, provider attributes)
  • Disease/drug factors
  • Clinical factors

• Total number of patients: 3424

Assumptions:

• The data follow a normal distribution.

• Patients' history data were recorded accurately, without any errors in testing or examination.
Demographics Analysis

[Figure: Gender Proportion; Gender Proportion vs. Persistency Flag]
[Figure: Age Proportion; Age Bucket vs. Persistency Flag]
[Figure: Ethnicity Proportion; Ethnicity vs. Persistency Flag]
[Figure: Region Proportion; Region vs. Persistency Flag]
[Figure: Race Proportion; IDN Indicator Ratio]

Disease Type and Responsible Physician Specialty Analysis

Drug Factor Analysis
[Figure: Concomitancy of Drugs]

Diseases Factor Analysis
[Figure: Comorbidity of Diseases]

Risk Factor Analysis
[Figure: Risk Factors]
Risk Factor Analysis

• A high number of non-persistent patients have a risk count of less than 3.
• Patients with a risk count of more than 3 have the highest percentage of non-persistent cases relative to the total registered cases in that group.

[Figure: Risk Counts vs. Persistency Flag]
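
The two observations above compare different quantities: absolute counts of non-persistent patients and the share of non-persistent patients within each risk-count group. A minimal sketch of that distinction follows, assuming each patient is reduced to a (risk count, persistency flag) pair. The records, the flag labels and the function name are hypothetical and are not taken from the project dataset.

```python
from collections import Counter

# Hypothetical (risk_count, persistency_flag) pairs, NOT the project's data.
records = (
    [(1, "Non-Persistent")] * 4 + [(2, "Persistent")] * 6    # risk count < 3
    + [(3, "Non-Persistent"), (3, "Persistent")]             # risk count = 3
    + [(5, "Non-Persistent")] * 3 + [(4, "Persistent")]      # risk count > 3
)

def non_persistent_summary(rows):
    """Return {bucket: (non_persistent, total, share)} per risk-count bucket."""
    totals, non_persistent = Counter(), Counter()
    for risk_count, flag in rows:
        bucket = "<3" if risk_count < 3 else "3" if risk_count == 3 else ">3"
        totals[bucket] += 1
        if flag == "Non-Persistent":
            non_persistent[bucket] += 1
    return {
        b: (non_persistent[b], totals[b], non_persistent[b] / totals[b])
        for b in totals
    }

summary = non_persistent_summary(records)
for bucket, (bad, total, share) in summary.items():
    print(f"risk count {bucket}: {bad}/{total} non-persistent ({share:.0%})")
```

In this toy data the "<3" bucket holds the most non-persistent patients in absolute terms, while the ">3" bucket has the highest non-persistent share, which mirrors the pattern the slide describes.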


Dominance Analysis
• Dominance analysis shows the most influential features in the dataset (the 15 most influential factors).

• Clinical parameters were the most influential factors behind drug persistency.
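
For readers unfamiliar with the technique, here is a minimal sketch of general dominance: each feature's score is its average incremental contribution to model fit over all subsets of the other features. The sketch assumes ordinary least-squares R² as the fit statistic and uses synthetic data with three features. The deck does not state which fit statistic or tooling the team used, so every name and value here is illustrative only.

```python
from itertools import combinations
import numpy as np

def r_squared(X, y, cols):
    """R^2 of an ordinary least-squares fit of y on the chosen columns."""
    if not cols:
        return 0.0
    A = np.column_stack([np.ones(len(y)), X[:, cols]])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    centered = y - y.mean()
    return 1.0 - float(resid @ resid) / float(centered @ centered)

def general_dominance(X, y):
    """Average incremental R^2 of each feature, averaged within and then
    across subset sizes of the remaining features."""
    p = X.shape[1]
    scores = []
    for j in range(p):
        others = [c for c in range(p) if c != j]
        size_means = []
        for k in range(p):  # subset sizes 0 .. p-1
            gains = [
                r_squared(X, y, list(s) + [j]) - r_squared(X, y, list(s))
                for s in combinations(others, k)
            ]
            size_means.append(sum(gains) / len(gains))
        scores.append(sum(size_means) / p)
    return scores

# Synthetic data (an assumption): feature 0 matters most, feature 2 not at all.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = 2.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=500)

scores = general_dominance(X, y)
full_r2 = r_squared(X, y, [0, 1, 2])
print([round(s, 3) for s in scores], round(full_r2, 3))
```

The scores add up to the R² of the full model, so each can be read as that feature's share of the explained variance. The number of subsets grows exponentially with the number of features, so exhaustive enumeration like this is only practical for a small candidate set.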
Recommendations
From the exploratory data analysis (EDA) of the dataset, the following recommendations are given to the ABC company's technical team:
• The demographic factors provided in the dataset are not strongly related to the "Persistency Level" of the patients.
• Neither NTM Specialist type nor Specialist Flag showed any correlation with the target variable.
• Dominance analysis identified some important parameters, which can be used to reduce the dataset to a subset for quantitative analysis.
• Clinical factors such as "Concomitancy of Drugs", "Comorbidity of Various Diseases" and "Risk Factors" show some correlation with the target variable "Persistency Level", which needs to be investigated further through quantitative analysis such as machine learning.
Recommendations
Model performance by feature pipeline (Original, Autoencoder, Dominance Analysis)

Original
ML Algorithm                        MAE   Accuracy  Precision  Recall  F1-Score  AUC
Logistic Regression                 0.19  0.81      0.81       0.79    0.81      0.88
K-Nearest Neighbour (KNN)           0.22  0.78      0.78       0.73    0.76      0.84
Support Vector Machine (SVM)        0.21  0.79      0.78       0.76    0.78      0.86
Stochastic Gradient Descent (SGD)   0.24  0.76      0.76       0.75    0.76      0.81
Decision Tree                       0.27  0.73      0.73       0.71    0.73      0.71
Gradient Boosting                   0.19  0.81      0.81       0.78    0.81      0.88
Random Forest                       0.19  0.81      0.80       0.78    0.80      0.88
Extra Trees                         0.21  0.79      0.79       0.77    0.79      0.87
AdaBoost                            0.19  0.81      0.81       0.79    0.81      0.87
XGBoost                             0.21  0.79      0.80       0.75    0.78      0.87
Multilayer Perceptron (MLP)         0.25  0.75      0.75       0.74    0.75      0.82

Autoencoder
ML Algorithm                        MAE   Accuracy  Precision  Recall  F1-Score  AUC
Logistic Regression                 0.20  0.80      0.79       0.78    0.79      0.87
K-Nearest Neighbour (KNN)           0.21  0.79      0.80       0.78    0.79      0.83
Support Vector Machine (SVM)        0.20  0.80      0.80       0.79    0.80      0.82
Stochastic Gradient Descent (SGD)   0.21  0.79      0.80       0.79    0.79      0.86
Decision Tree                       0.25  0.75      0.75       0.73    0.75      0.73
Gradient Boosting                   0.20  0.80      0.80       0.79    0.80      0.86
Random Forest                       0.22  0.78      0.78       0.76    0.78      0.84
Extra Trees                         0.23  0.77      0.77       0.75    0.77      0.84
AdaBoost                            0.21  0.79      0.79       0.78    0.79      0.86
XGBoost                             0.20  0.80      0.80       0.79    0.80      0.86
Multilayer Perceptron (MLP)         0.21  0.79      0.79       0.77    0.79      0.86

Dominance Analysis
ML Algorithm                        MAE   Accuracy  Precision  Recall  F1-Score  AUC
Logistic Regression                 0.27  0.73      0.73       0.68    0.72      0.76
K-Nearest Neighbour (KNN)           0.31  0.69      0.68       0.63    0.67      0.70
Support Vector Machine (SVM)        0.28  0.72      0.72       0.69    0.72      0.71
Stochastic Gradient Descent (SGD)   0.28  0.72      0.71       0.67    0.71      0.74
Decision Tree                       0.35  0.65      0.63       0.59    0.63      0.62
Gradient Boosting                   0.28  0.72      0.71       0.68    0.71      0.76
Random Forest                       0.33  0.67      0.66       0.63    0.66      0.69
Extra Trees                         0.34  0.66      0.64       0.60    0.64      0.67
AdaBoost                            0.28  0.72      0.72       0.67    0.71      0.75
XGBoost                             0.28  0.72      0.71       0.66    0.70      0.76
Multilayer Perceptron (MLP)         0.32  0.68      0.68       0.65    0.68      0.70

ANN Developed with Keras (metric columns not labelled in the source): 0.80 0.78 0.79 0.79
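
As a reading aid for the metric columns, the sketch below computes MAE, accuracy, precision, recall, F1 and AUC for a binary classifier in plain Python. The labels and scores are made up, and the 0.5 decision threshold is an assumption. The deck does not say how precision, recall and F1 were averaged across classes, so the code reports them for the positive class only.

```python
def binary_metrics(y_true, y_score, threshold=0.5):
    """Compute common binary-classification metrics from labels and scores."""
    y_pred = [1 if s >= threshold else 0 for s in y_score]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    n = len(pairs)

    accuracy = (tp + tn) / n
    # With 0/1 labels and 0/1 predictions, MAE is simply the error rate.
    mae = sum(abs(t - p) for t, p in pairs) / n
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)

    # AUC as the chance that a random positive outscores a random negative
    # (ties count as half a win).
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    auc = wins / (len(pos) * len(neg))

    return {"mae": mae, "accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "auc": auc}

# Made-up labels (1 = persistent) and predicted scores, for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_score = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.7, 0.1]
metrics = binary_metrics(y_true, y_score)
print(metrics)
```

Because MAE on 0/1 predictions equals 1 minus accuracy, the MAE and Accuracy columns in the tables carry the same information, which is why they sum to 1.00 in every row.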
Recommendations
Based on the provided data (mostly categorical) and the previous analysis, we recommend two types of model for this problem:
• Neural networks
• Boosting ensembles (Adaptive Boosting and Gradient Boosting)

Both should be complex enough to learn the data well and provide high accuracy.

Selected Pipeline: Autoencoder based feature extraction
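
To illustrate what autoencoder-based feature extraction means, here is a minimal NumPy sketch: a network is trained to reconstruct its input through a narrow hidden layer, and the hidden activations are then used as compressed features for a downstream classifier. The architecture (one tanh hidden layer with two units), the learning rate and the synthetic binary data are all assumptions. The deck mentions Keras but does not describe the team's actual network.

```python
import numpy as np

def train_autoencoder(X, n_hidden=2, lr=0.5, epochs=300, seed=0):
    """Train a one-hidden-layer autoencoder with plain gradient descent.

    Returns an `encode` function and the per-epoch reconstruction loss.
    """
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]
    W1 = rng.normal(0.0, 0.1, (n_features, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, n_features))
    b2 = np.zeros(n_features)
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)        # encoder
        X_hat = H @ W2 + b2             # linear decoder
        err = X_hat - X
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the mean squared reconstruction error.
        g_out = 2.0 * err / err.size
        g_hid = (g_out @ W2.T) * (1.0 - H ** 2)
        W2 -= lr * (H.T @ g_out)
        b2 -= lr * g_out.sum(axis=0)
        W1 -= lr * (X.T @ g_hid)
        b1 -= lr * g_hid.sum(axis=0)

    def encode(A):
        return np.tanh(A @ W1 + b1)

    return encode, losses

# Synthetic binary features driven by two hidden factors (illustration only).
rng = np.random.default_rng(1)
latent = rng.integers(0, 2, size=(200, 2))
X = np.column_stack([
    latent[:, 0], latent[:, 0], latent[:, 1], latent[:, 1],
    latent[:, 0] * latent[:, 1], 1 - latent[:, 0],
]).astype(float)

encode, losses = train_autoencoder(X)
codes = encode(X)   # compressed features to feed a classifier
print(codes.shape, round(losses[0], 4), round(losses[-1], 4))
```

The `codes` array plays the role of the extracted features: a classifier would be trained on it instead of on the original columns.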

Thank You
