Lecture 1
Lecture 1
HST.956, 6.S897
Dialogue interface
system for identifying bacteria I amready
causing severe infections ** THISIS A 26 YEAR
OLDMALE
PATIENT
Myunderstanding
is:
• Proposed a good therapy in Theageof thepatient
is 26
Thesexof thepatient
is male
infectious
The Contextof the MYCIN
disease experts
Experiments ** JO
Myunderstanding
is:
Thename
of thepatientis Jo
EXPERTSYSTEM Respiratory-tract
is oneof thesymptoms
that thepatienthad
** A COUPLE OFDAYS BEFORE THEADMISSION,
HEHADA MALAISE
Description User Please
give methedateof admission
I
=~ inter-
of newcase qp~ Inference ** MARCH12, 1979
Engine
face
USER Myunderstanding
is:
Advice& t Thepatient
Malaise
wasadmittedat thehospital
is oneof thesymptoms
3 daysago
thatthepatient
had5 daysago
Explanation qp_~ Knowledge
Base [
(inputs)
work a layer at a time. Because MLPs are trained
with an algorithm called error back-propagation,
chart review
in node models and patterns of connectivity,34
3’,‘~’4
FIGURE 2. A multilayer perceptron. This is a two-layer percep-
but the MLP is the network used in nearly all med- tron with four inputs, four hidden units, and one output unit.
[Penny & Frost, Neural Networks in Clinical Medicine. Med Decis Making, 1996]
395
§A single integer in the accuracy column denotes percentage overall classification rate and a single real number between 0 and 1 indicates the
AUROCC value Neural = accuracy of neural net, Other accuracy of best other method
=
Outline for today’s class
1. Brief history of AI and ML in healthcare
2. Why now?
3. Examples of how ML will transform
healthcare
4. What is unique about ML in healthcare?
5. Overview of class syllabus
The Opportunity:
Adoption of Electronic Health Records
(EHR) has increased 9x in US since 2008
00000
85.2%* 96.9%*
96%
Certi ed EHR 94%*
75.5%* 83.8%*
71.9%
Basic EHR
Percentage 59.4%*
of hospitals
44.4%*
in the US
27.6%*
15.6%
12.2%
9.4%
De-identified
health data from
~40K critical care
patients
Demographics,
vital signs,
laboratory tests,
medications,
notes, …
Large datasets
“Data on nearly
230 million
unique patients
since 1995”
$$$
Large datasets
President Obama’s initiative to create a 1 million
person research cohort Core data set:
• Baseline health exam
THE PRECISION MEDICINE INITIATIVE • Clinical data derived
from electronic health
records (EHRs)
• Healthcare claims
• Laboratory data
WHAT IS IT?
[Precision Medicine Initiative (PMI) working Group Report, Sept. 17 2015]
Precision medicine is an emerging approach for disease
prevention and treatment that takes into account people’s
individual variations in genes, environment, and lifestyle.
The Precision Medicine Initiative will generate the scientific
Diversity of digital health data
proteomics
lab tests
imaging
social media
phone
…
… [https://blog.curemd.com/the-most-bizarre-
[https://en.wikipedia.org/wiki/Lis
t_of_ICD-9_codes] icd-10-codes-infographic/]
Standardization
• Diagnosis codes: ICD-9 and
ICD-10 (International
Classification of Diseases)
• Laboratory tests: LOINC
codes
• Pharmacy: National Drug
Codes (NDCs)
• Unified Medical Language
System (UMLS): millions of
medical concepts
[http://oplinc.com/newsletter/index_May08.htm]
Standardization
Standardization
OMOP
Common
Data
Model v5.0
Breakthroughs in machine learning
Why now?
• Big data
• Algorithmic advances
• Open-source software
Breakthroughs in machine learning
• Major advances in ML & AI
– Learning with high-dimensional features (e.g., l1-
regularization)
– Semi-supervised and unsupervised learning
– Modern deep learning techniques (e.g. convnets,
variants of SGD)
• Democratization of machine learning
– High quality open-source software, such as
Python’s scikit-learn, TensorFlow, Torch, Theano
Industry interest in ML & healthcare
Industry interest in ML & healthcare
• Major acquisitions to get big data for ML:
– Merge ($1 billion purchase by IBM, 2015)
medical imaging
– Truven Health Analytics ($2.6 billion purchase by
IBM, 2016)
health insurance claims
– Flatiron Health ($1.9 billion purchase by Roche,
2018)
electronic health records (oncology)
Outline for today’s class
1. Brief history of AI and ML in healthcare
2. Why now?
3. Examples of how ML will transform
healthcare
4. What is unique about ML in healthcare?
5. Overview of class syllabus
ML will transform every aspect of healthcare
The stakeholders:
ect
ex-
go-
nal
ur-
X-
tal-
our
e a Input
Chest X-Ray Image
for-
sts.
adi- CheXNet
121-layer CNN
ion
end
tX- Output
on Pneumonia Positive (85%)
Arrhythmia?
Triage note
Predicted
chief
complaints Contextual
auto-
complete
ML will transform every aspect of healthcare
The stakeholders:
Time
Disease burden
Undiagnosed
condition
Time
Treatment A
Progression on VRd
Response to treatment A
Patient w.
condition X Treatment B
Response to treatment B
Time
What is the future of how we treat
chronic disease?
• Early diagnosis, e.g. of diabetes, Alzheimer's,
cancer
Liquid biopsy