A Review of Big Data and Its
Applications in Healthcare and Public
Sector
Abstract Big Data has been a buzzword in the IT sector for a few years now. It has
attracted attention from researchers, industry and academia around the world. This
chapter introduces Big Data and its related technologies and traces the associated
challenges. We discuss the applications of Big Data technologies in the fields of
healthcare and the public sector. Over the preceding few years, computing power
has increased substantially while storage costs have fallen significantly, enabling
businesses to produce and store huge volumes of data. The increasing penetration of
hand-held, internet-enabled devices has also led to an explosion in data generation;
social media exemplifies this phenomenon. Such huge volumes of data cannot be
handled using existing frameworks and require new and innovative techniques. In
this chapter, we briefly discuss the use of Big Data in healthcare and its potential
use cases, such as preventive healthcare planning and predictive analytics. We also
discuss the potential use of Big Data in the public sector and its applications, such
as urban management and inclusive decision making. We further highlight the
challenges that hinder the adoption of Big Data technologies in these areas.
1 Introduction
Everyone is talking about Big Data these days. Leading organisations have started to
recognise it as a strategic asset [1]. Let us start with a commonly agreed definition:
Big Data is any data so large and complex that it becomes difficult to process with
traditional storage and processing paradigms, loosely approximated by practitioners
as data-sets from around 30–50 terabytes up to petabytes [2].
A. Shastri (B)
Lovely Professional University, Phagwara 144411, Punjab, India
e-mail: [email protected]
Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
M. Deshpande
School of Information Studies, Syracuse University, Syracuse, NY 13244-1190, USA
e-mail: [email protected]
© Springer Nature Switzerland AG 2020
A. J. Kulkarni et al. (eds.), Big Data Analytics in Healthcare,
Studies in Big Data 66, https://doi.org/10.1007/978-3-030-31672-3_4
For example, the Large Hadron Collider’s 150 million sensors generate a data flow
of about 15 petabytes (about 15,000,000 GB) per year [3]. Traditional tools and
techniques are therefore unable to store, process and visualize such data within a
stipulated amount of time and extract competitive insights. Big Data applications
can be seen everywhere, from the scientific community, marketing, banking and
telecom to healthcare, public services and beyond. Big Data has allowed organisations
to make informed decisions based on insights derived from transactional data created
at various points. Big Data has been described as a set of 3 V’s, 4 V’s and even 5 V’s
by various researchers, as shown in Fig. 1.
(i) Volume
It refers to the enormous scale of data. The amount of data being generated
has been increasing exponentially over the past few years and is expected to
continue to do so, partly due to falling storage costs. By 2020, there will be
around 6.1 billion smartphones and our accumulated digital universe will be
around 44 trillion gigabytes [5]. Google processes around 40,000 search queries
every single second [5]. The mammoth scale of data being generated requires
innovative data infrastructure, data management and processing techniques.
(ii) Velocity
Velocity measures the rate at which data flows. Facebook handles around 900
million photographs every day [6]; it must absorb, process and later be able to
retrieve them. A data management infrastructure that can keep up with such
high-speed data flows is a vital part of the Big Data paradigm. Time-sensitive
processes such as banking transactions or social media streams are examples
where data is generated, processed and stored in a matter of seconds.
(iii) Variety
Variety refers to the different forms in which data exists, ranging from tradi-
tional structured enterprise data to semi-structured and unstructured data such
as images, text, audio and video. A Big Data paradigm must deal with endless
heterogeneous data types and sources.
(iv) Veracity
Veracity concerns the quality of Big Data: data may be inconsistent, incom-
plete, deceptive or ambiguous. It is also concerned with the reliability and
authenticity of the data used for analyses.
(v) Value
Value is the intrinsic worth that Big Data holds relative to its size. Analyzing
large volumes of imprecise data yields low value, while analyzing large
volumes of precise data yields high value.
“Data, I look at it as the new oil. It’s going to change most industries across the board,”
said Intel CEO Brian Krzanich [7]. Plenty of tools and technologies for Big Data
processing and storage are available today. Early developments like the Google File
System, which allowed large-scale distributed data-intensive applications to run on
inexpensive commodity hardware using a fault-tolerant mechanism, paved the way
for further developments in distributed computing [8]. Later, Google developed
MapReduce, a programming model (commonly implemented in Java) for writing
applications that process huge amounts of data in parallel on clusters of commodity
hardware. Hadoop and Spark are the latest buzzwords in the Big Data universe
these days. The Apache Hadoop project develops open-source software for reliable,
scalable, distributed computing in a fault-tolerant manner. The Hadoop framework
is a collection of software that enables distributed processing of large data sets
across clusters of computers using simple programming models; it includes the
MapReduce model as one of its modules. Several related projects that run on top
of the Hadoop architecture have been developed and are available for use. Spark is
yet another distributed processing framework that shares similarities with Hadoop
but is generally considered much faster and more efficient, especially when dealing
with queries that are iterative in nature. A few of these technologies will be further
elaborated later in the chapter.
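The MapReduce model described above can be sketched in plain Python. This is a
minimal single-process illustration of the map, shuffle and reduce phases applied to
the classic word-count problem, not the Hadoop API itself; in a real cluster each
phase runs distributed across many nodes, and all function and variable names here
are illustrative only.

```python
# A minimal single-process sketch of the MapReduce word-count pattern.
# In a real Hadoop job, the map and reduce phases run on separate cluster
# nodes and the framework performs the shuffle between them.
from collections import defaultdict

def map_phase(documents):
    """Map: emit a (word, 1) pair for every word in every document."""
    for doc in documents:
        for word in doc.lower().split():
            yield word, 1

def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework does."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Reduce: sum the counts collected for each word."""
    return {word: sum(counts) for word, counts in groups.items()}

docs = ["big data needs big infrastructure", "data drives decisions"]
counts = reduce_phase(shuffle(map_phase(docs)))
print(counts)  # {'big': 2, 'data': 2, 'needs': 1, ...}
```

Because each map call and each reduce call depends only on its own input, the
framework can parallelise both phases freely across commodity machines, which is
the property that makes the model scale.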