Chapter 7 Healthcare Data Analytics
Chapter 7 Healthcare Data Analytics
7 Healthcare Data
Analytics
Bill Hersh MD
Learning Objectives
After reviewing these slides, the viewer should be able to:
▸ Discuss the difference between descriptive, predictive and prescriptive
analytics
▸ Describe the characteristics of “Big Data”
▸ Enumerate the necessary skills for a worker in the data analytics field
▸ List the limitations of healthcare data analytics
▸ Discuss the critical role electronic health records play in healthcare data
analytics
2
Important
Introduction
▸ One of the promises of the growing clinical data in electronic health record
(EHR) systems is secondary use (or re-use) of the data for other purposes,
such as quality improvement and clinical research
▸ Interest in healthcare data has grown exponentially due to EHR incentives
after the HITECH Act and the addition of genomic information that will
eventually be integrated with EHRs
3
Important
Terminology of Analytics
▸ The term analytics is achieving wide use both in and out of healthcare. A
leader in the field defines analytics as “the extensive use of data,
statistical and quantitative analysis, explanatory and predictive models,
and fact-based management to drive decisions and actions”
▸ IBM defines analytics as “the systematic use of data and related business
insights developed through applied analytical disciplines to drive fact-
based decision making for planning, management, measurement and
learning
4
Different Types of Analytics
Increasing functionality and value very important
• standard types of reporting that describe current
Descriptive situations and problems (how many uninsured patients
should do we have with type 2 diabetes?)
6
Read
Analytics Concepts (2-2)
▸ Business intelligence, which in healthcare refers to the “processes and
technologies used to obtain timely, valuable insights into business and
clinical data”
▸ Learning health system, where data can be used for continuous learning
to allow the healthcare system to better carry out disease surveillance and
response, targeting of healthcare services, improving decision-making,
managing misinformation, reducing harm, avoiding costly errors, and
advancing clinical research
7
Big Data The definition is important
▸ Another related term is big data, which describes large and ever-
increasing volumes of data that adhere to the following attributes:
▹ Volume – ever-increasing amounts
▹ Velocity – quickly generated
▹ Variety – many different types
▹ Veracity – from trustable sources
▸ While big data is considered a buzz word by some, we are having to deal
with terabytes and petabytes of information today. With the addition of
genomics big data will escalate
8
Important
&*
Big Data
▸ Healthcare organizations are generating an ever-increasing amount of
data. In all healthcare organizations, clinical data takes a variety of forms,
from structured (e.g., images, lab results, etc.) to unstructured (e.g.,
textual notes including clinical narratives, reports, and other types of
documents)
▸ For example, it was estimated by Kaiser-Permanente in 2013 that its
current data store for its 9+ million members exceeds 30 petabytes
(petabyte = 1024 terabytes) of data
9
&
Big Data Important
▸ Another example is CancerLinQ that will provide a comprehensive system
for clinicians and researchers consisting of EHR data collection,
application of clinical decision support, data mining and visualization, and
quality feedback
▸ Lastly, IBM’s Watson is now focusing on healthcare, specifically Oncology
so that massive amounts of cancer information/research can be analyzed
and applied to individual patient decision making
10
The Analytics Big
Data Pipeline
According to Kumar et al
▸ One begins with multiple data
sources, that are extracted and
cleansed and normalized
▸ Statistical processing prepares the
data for output
▸ Finally, the data helps generate
descriptive, predictive and
prescriptive analytics
11
Big Data - Big Data will Drive ACOs
▸ Accountable care organizations (ACOs) provide incentives to deliver high-
quality care in cost-efficient ways that will require a robust IT architecture,
health information exchange (HIE) plus analytics. This approach would be
used to predict and quickly act on excess costs
▸ As a well-known informatics blogger put it: ACOs = HIE + Analytics
12
Important
Challenges to Data Analytics (1-2)
▸ Data generated in the routine care of patients may be limited in its use for
analytical purposes. For example, data may be inaccurate or incomplete. It
may be transformed in ways that undermine its meaning (e.g., coding for
billing priorities)
▸ It may exhibit the well-known statistical phenomenon of censoring, i.e.,
the first instance of disease in record may not be when it was first
manifested (left censoring), or the data source may not cover a sufficiently
longtime interval (right censoring)
▸ Data may also incompletely adhere to well-known standards, which makes
combining it from different sources more difficult
13
Important
Challenges to Data Analytics (2-2)
▸ Clinical data mostly allows observational and not experimental studies,
thus raising issues of cause-and-effect of findings discovered
▸ Research questions asked of the data tend to be driven by what can be
answered, as opposed to prospective hypotheses
▸ Data are not always as objective as one might like, and “bigger” is not
necessarily better
▸ There are ethical concerns over how the data of individuals is used, how it
is collected, and the possible divide between those who have access to
data and those who do not
▸ Who owns the data and who can use it? 14
Research and Application of Analytics
▸ There is an emerging base of research that demonstrates how data from
operational clinical systems can be used to identify critical situations or
patients whose costs are outliers
▸ There is less research, however, demonstrating how this data can be used
to improve clinical outcomes or reduce costs. Studies using EHR data for
clinical prediction have been proliferating
15
The Most Important Use Cases for
Application of Data Analytics
▸ High-cost patients – looking for ways to intervene early
▸ Readmissions – preventing
▸ Triage – appropriate level of care
▸ Decompensation – when patient’s condition worsens
▸ Adverse events – awareness
▸ Treatment optimization – especially for diseases affecting multiple organ
systems
16
Important
Research and Application of Analytics
▸ One common area of focus has been the use of data analytics to identify
patients at risk for hospital readmission within 30 days of discharge. The
importance of this factor comes from the US Centers for Medicare and
Medicaid Services (CMS) Readmissions Reduction Program that penalizes
hospitals for excessive numbers of readmissions
▸ This has led to research using EHR data to predict hospital readmissions.
Thus far, the results are mixed, and several examples of trials are included
in the textbook chapter
17
Research and Application of Analytics
Identifying Patients for Research Using EHR Data
18
Research and Application of Analytics
Scenarios for EHR Data Analysis to predict healthcare utilization outcomes
20
Research and Application of Analytics
Using Genomic Information and EHRs
21
Research and Application of Analytics
Using Genomic Information and EHRs
▸ More recent work has “inverted” the paradigm to carry out phenome-wide
association studies (PheWAS) that associated multiple phenotypes with
varying genotypes
▸ Genome-wide and phenome-wide association studies are also discussed in
the chapter on bioinformatics
22
Important
Role of Informaticians in Analytics
▸ There has been little focus on the human experts who will carry out
analytics, to say nothing of those who will support their efforts in building
systems to capture data, put it into usable form, and apply the results of
analysis
▸ Where will these workers come from and what will be the education of
those who work in this emerging area, that some call data science?
▸ We do know that data analytics experts are in high demand
23
Important
Role of Informaticians in Analytics
*
▸ From basic biomedical scientists to clinicians and public health workers,
those who are researchers and practitioners are drowning in data, needing
tools and techniques to allow its use in meaningful and actionable ways
▸ Dr. Hersh believes that a strong background in Health Informatics or
Biomedical Informatics is the best preparation for the healthcare data
analytics field
24
Role of Informaticians in Analytics
very important
▸ Data science is more than statistics or computer science applied in a
-
25
X
The Need for Data Analytics Experts
▸ A report by McKinsey consulting states that there will soon be a need in
the US for 140,000-190,000 individuals who have “deep analytical talent”
and an additional 1.5 million “data-savvy managers needed to take full
advantage of big data”
▸ An analysis by SAS estimated that by 2018, there will be over 6400
organizations that will hire 100 or more analytics staff
▸ Another report found that data scientists currently comprise less than 1%
of all big data positions, with more common job roles consisting of
developers (42% of advertised positions), architects (10%), analysts (8%)
and administrators (6%)
26
Important
The Need for Data Analytics Experts
▸ The technical skills most required for big data positions were NoSQL,
Oracle, Java and SQL
▸ PriceWaterhouseCoopers noted that healthcare organizations need to
acquire talent in systems and data integration, data statistics and analytics,
technology and architecture support, and clinical informatics
▸ Business knowledge is also useful
27
Important !
The Need for Data Analytics Experts
What Skill Sets Should Universities Train For?
28
Conclusions
▸ Healthcare data has proliferated greatly, in large part due to the
30
Reference
▸ Hersh WR, Hoyt RE. Health Informatics: Practical Guide Seventh Edition.
Informatics Education; 2018.
https://books.google.com.kw/books?id=UnxeDwAAQBAJ
Chapter 7: Healthcare Data Analytics (pp 149-160)
31