Big Data in Cloud Computing

Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/349786524

Big Data in Cloud Computing

Chapter · January 2021


DOI: 10.4018/978-1-7998-6673-2.ch005

CITATIONS READS
3 548

2 authors:

Jayashree Kanniappan Swaminathan B.


Panimalar Engineering College REC
52 PUBLICATIONS 149 CITATIONS 17 PUBLICATIONS 53 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Jayashree Kanniappan on 18 April 2021.

The user has requested enhancement of the downloaded file.


Applications of Big Data
in Large- and Small-Scale
Systems

Sam Goundar
British University Vietnam, Vietnam

Praveen Kumar Rayani


National Institute of Technology, Durgapur, India

A volume in the Advances in Data Mining and


Database Management (ADMDM) Book Series
Published in the United States of America by
IGI Global
Engineering Science Reference (an imprint of IGI Global)
701 E. Chocolate Avenue
Hershey PA, USA 17033
Tel: 717-533-8845
Fax: 717-533-8661
E-mail: [email protected]
Web site: http://www.igi-global.com

Copyright © 2021 by IGI Global. All rights reserved. No part of this publication may be reproduced, stored or distributed in
any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher.
Product or company names used in this set are for identification purposes only. Inclusion of the names of the products or
companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark.
Library of Congress Cataloging-in-Publication Data

Names: Goundar, Sam, 1967- editor. | Rayani, Praveen Kumar, 1989- editor.
Title: Applications of big data in large and small-scale systems / Sam
Goundar and Praveen Kumar Rayani, editors.
Description: Hershey, PA : Engineering Science Reference, [2021] | Includes
bibliographical references and index. | Summary: “This book addresses
the newest innovative and intelligent applications related to utilizing
the large amounts of big data being generated that is increasingly
driving decision making and changing the landscape of business
intelligence, from governments to private organizations, from
communities to individuals”-- Provided by publisher.
Identifiers: LCCN 2020026763 (print) | LCCN 2020026764 (ebook) | ISBN
9781799866732 (hardcover) | ISBN 9781799866749 (paperback) | ISBN
9781799866756 (ebook)
Subjects: LCSH: Big data--Industrial applications. | Data mining. |
Decision making.
Classification: LCC QA76.9.B45 A668 2021 (print) | LCC QA76.9.B45 (ebook)
| DDC 005.7--dc23
LC record available at https://lccn.loc.gov/2020026763
LC ebook record available at https://lccn.loc.gov/2020026764

This book is published in the IGI Global book series Advances in Data Mining and Database Management (ADMDM)
(ISSN: 2327-1981; eISSN: 2327-199X)

British Cataloguing in Publication Data


A Cataloguing in Publication record for this book is available from the British Library.

All work contributed to this book is new, previously-unpublished material. The views expressed in this book are those of the
authors, but not necessarily of the publisher.

For electronic access to this publication, please contact: [email protected].


77

Chapter 5
Big Data in Cloud Computing
Jayashree K.
Rajalakshmi Engineering College, India

Swaminathan B.
https://orcid.org/0000-0002-0822-3087
Rajalakshmi Engineering College, India

ABSTRACT
The huge size of data that has been produced by applications that spans from social network to scientific
computing is termed big data. Cloud computing as a delivery model for IT services enhances business
productivity by reducing cost. It has the intention of achieving solution for managing big data such as
high dimensional data sets. Thus, this chapter discusses the background of big data and cloud comput-
ing. It also discusses the various application of big data in detail. The various related work, research
challenges of big data in cloud computing, and the future direction are addressed in this chapter.

1. INTRODUCTION

In the past few years cloud computing has been developing rapidly and it is a novel computing archetype
that has the ability to provide various services on request (Alfazi et al, 2017). It supports self-service
through no or slight retailer facilitation and it offers an efficacy archetypal of resources where companies
merely pay for their utilization. Sharing of resources leads to cost of computing much lower (Gupta et al,
2012). The elasticity, low upfront investment, pay per use are few of the foremost facilitating features that
makes the cloud computing the universal platform for installing parsimoniously reasonable enterprise
organization settings (Venkatesh et al, 2015).
Big data is a data exploration approach supported by inventive tools which provision high-velocity
data seizure, storage, and exploration. Data production rate has been growing rapidly during the recent
years. Certain corporate examples of big data are social network content, cell phone particulars, trans-
actional information, fitness archives, commercial official papers, and weather data. (Balachandran &
Prasad, 2017). It can be useful in smart cities to mine and analyse the data from an enormous size of
data (Jayashree et al, 2019). Data are produced from many origins such as medical devices, sensors or

DOI: 10.4018/978-1-7998-6673-2.ch005

Copyright © 2021, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.

Big Data in Cloud Computing

associated instruments. To store and analyze the data that has been generated big data technologies can
utilize cloud computing (Gholami & Laure, 2016).
Cloud computing has several inherent capabilities such as quantifiability, flexibility, metered pay-
per-use capability, sharing, data dependability, easier preservation that offer real opportunities for big
data (Hanan Elazhary 2014).
The rest of this chapter is structured as: Section 2 delivers a broad-spectrum summary of big data and
its applications and cloud computing, related works are discussed in Section 3. The challenges of big
data in cloud computing is deliberated in Section 4. Future research directions are described in Section
5 and the conclusion of the chapter is briefed in Section 6.

2. BACKGROUND

2.1 Bigdata

Neves describes the five aspects such as Volume, Variety, Velocity, Value and Veracity. Volume defines
the dimensions of datasets that a big data method convention with. Variety deals with that data arises in
all kinds of presentations such as from organized, numeric data in customary databases to unstructured
text documents, electronic mail, video, audio, and business contacts (Wadhwani K & Wang, 2017).
Velocity denotes to the period in which big data can be processed (Hadi et al, 2016). Value deals with
the accurate value of information. Veracity denotes to the reliability of the data, addressing data privacy,
consistency, and accessibility.

2.2 Big Data and its Applications (kiran et al 2015)

Big data are classified such as structured and unstructured.

1. Structured Data

Words and numbers that can be certainly categorized and examined belongs to structured data.
Structured data are produced by things like network sensors, smart phones, trades data, and global
positioning system devices.

2. Unstructured Data

Unstructured data comprise further multifarious data, such as consumer analyses from merchandis-
able websites, photos and other multimedia, and remarks on social networking sites. Separation of these
data and grouping are not easy and numerical analysis are also difficult.
Some areas of big data computing are portrayed in the subsequent texts (Kune et al, 2016).
Scientific surveys: Data obtained from different sensors are studied to extract the suitable informa-
tion for communal profits.
Health care: Medical care groups might figure the localities from where the infections are spreading in
order to avoid more spreads (Mayer & Cukier 2013). Clinical decision support methods, specific analytics
applied for patient summary, custom-made medicine, examine disease patterns, improve public health.

78

Big Data in Cloud Computing

Governance: In transport sectors by means of real-time transportation data to calculate traffic pat-
terns, and modernize communal transport schedules.
Stock: A private stock trade in Asia utilizes indatabase analytics to build up an exhaustive framework
to detect abusive trading patterns to detect fraud in private stock trade.
Web analytics: Several websites are experiencing millions of unique visitors per day, thus creating
a large range of content. Increasingly, companies want to be able to mine this data to understand limi-
tations of their sites, improve response time, offer more targeted ads, and so on. This requires tools to
perform complicated analytics on data that far exceed the memory of a single machine or even in cluster
of machines.

2.3. Cloud Computing

Cloud paradigms include various services such as Software as a Service (SaaS), Platform as a Service
(PaaS), and Infrastructure as a Service (IaaS) (Agrawal et al, 2011).
There is extensive amount of replacements for commerce by means of the cloud for PaaS (Purcell
et al, 2013). PaaS is used to deliver platforms for the improvement and custom of concord applications.
PaaS solutions contain application project and enhancement tools, versioning, incorporation, deploy-
ment and hosting, state running, and other associated enhancement tools (Geczy et al, 2012). Trades
achieve price redeeming by means of PaaS from end to end regularization and great consumption of the
cloud-based phase through an amount of applications. Further benefits of utilizing PaaS are endorsing
shared services, refining software security, and bringing down ability prerequisites required for new
frameworks improvement.
SaaS offers companies by means of applications that are kept and run on simulated servers within
the cloud. Advantages of utilizing SaaS are simpler programming, programmed updates and fix organi-
zation, programming similarity through the trade, simpler cooperation, and worldwide openness. SaaS
offers companies examining big data demonstrated software elucidations meant for statistics analysis.
Services offered to enterprises over the IaaS prototypical incorporate disaster rescue, data center and
storing as a provision, virtual desktop organization, and cloud bursting. Virtualization is generally utilized
in IaaS cloud in order to blend/break down actual resources in an ad-hoc mode to encounter evolving
or diminishing resource request from cloud clients (Santosh Kumar and Goudar, 2012). Advantages
of IaaS comprises expanded monetary adaptability, selection of services, business liveliness, practical
versatility, and expanded security.

2.4 Big Data in Cloud

Big data in clouds is an innovative data-intensive platform for rapidly creating the analytics and installing
in excess of an elastically accessible organization. They are generally categorized as:
Public big data clouds: Wide-ranging data association and handling over the flexibly accessible
clouds substructure. The resources are functioned over Internet as pay-as-go computing prototypes. The
examples are Amazon big data computing in clouds, Windows Azure HDInsight, RackSpace Cloudera
Hadoop, and Google cloud platform of big data computing.
Private big data clouds: Arrangement of big data policy within the venture above a virtualized frame-
work, with a more prominent mechanism and security to the particular organizations.

79

Big Data in Cloud Computing

Hybrid big data clouds: Incorporation of public and private big data clouds for versatility, catastro-
phe rescue, and abundant accessibility. In this arrangement, the private chores can be transferred to the
communal arrangement through uttermost loads.

2.5 ADVANTAGES OF BIG DATA AND CLOUD


COMPUTING (ISLAM & REZA 2019)

Agility

It is likely to offer several arrangements with all the essential resources rapidly.

Elasticity

A cloud platform can dynamically increase to deliver storage for continually growing data.

Reducing Expense with Big Data in the Cloud

With the cloud computing, the accountability moves to the cloud suppliers and the enterprise merely
devise to emolument for the storing space and power consumption.

Compact Intricacy

Several process of big data solution comprises many constituents and incorporations. Cloud computing
delivers the likelihood to establish these constituents, therefore decreasing intricacy and expanding the
profitability of the big data investigation group.

3. RELATED WORK

The combination of big data and cloud computing has long term profits of together acuities and perfor-
mance (Chandrashekar et al, 2015). Cloud services can deal wide extents of data through fast latency and
real time handling of the data that has been collected. There are presently a small number of incorporated
cloud environments for big data investigation.
In digital domain, data are produced from several bases and the quick change as of digital innovations
devises to the evolution of big data (Acharjya & Ahmed, 2016). It delivers progressive innovations in
various arenas through assortment of huge datasets. Talia, 2013 indicated that finding appropriate data
as of from huge amounts of data involves adaptable analysis procedures to produce suitable outcomes.
Effective data investigation tools in addition to skills are necessary to deal with specific data. Every
procedure enactment stops to rise directly through collective computational means. Several challenges
of big data comprise Heterogeneity, Incompleteness, versatility, Timeliness, Privacy and Security have
been addressed in this chapter (Jayashree & Abirami, 2018).
As researchers keep on examining the concerns of big data in cloud computing, different issues in big
data handling emerge through the interim data study methods. The speediness of torrent data received

80

Big Data in Cloud Computing

as of dissimilar data sources need be managed and associated through historic information within a spe-
cific timeframe. Specific data bases might comprise dissimilar strategies that creates the coordination
of numerous bases meant for examination a multifarious stint.
Research exertions formulated toward making a big data management framework for the cloud.
Khan et al, 2015 have suggested a data exemplary as well as offers a schema for big data in the cloud
and endeavors for the facilitation toward requesting data on behalf of the consumer. Ortiz et al, 2015
reconnoitered the utilization of a suggested combined Hadoop and MPI/OpenMP system and in what
way the accompanying can expand speed and performance.
Cohen et al, 2009 provided a parallel database design aimed at analytics that provisions SQL and
MapReduce scripting arranged in the top of a DBMS towards incorporating numerous data bases. Data
handling as well as analytics abilities stay stirring in the direction of Enterprise Data Haylofts, otherwise
organized in data centers to ease reprocess through several data collections (Jensen et al, 2012).

4. CHALLANGES OF BIG DATA IN CLOUD COMPUTING

Hashem et al, 2015 have depicted several significant research challenges, contains versatility, acces-
sibility, convenience, data reliability, data conversion, data value, data diversity, secrecy and authorized
concerns, and regulatory governance.

4.1 Data Staging

Utmost significant exposed research concern about data staging stays associated towards the assorted
nature of data. Statistics collected as of diverse bases will not have an organized arrangement. Chang-
ing and cleaning aforesaid indistinct data beforehand stocking them into the warehouse for analysis are
motivating tasks.

4.2 Distributed Storage Methods

A number of elucidations was suggested towards storing and recovering huge volumes of data. Specific
elucidations were pragmatic in a cloud computing environs. Though, some concerns obstruct the effica-
cious execution of specific elucidations, containing the ability of recent cloud technologies offering needed
size and great enactment to treatise enormous aggregates of data, development of current file structures
in place of the capacities required through data mining solicitations, in addition to, by what means data
can remain kept in such a way that they be able to be simply recovered and migrated amongst servers.

4.3 Data Analysis

The determination of a suitable exemplary used in place of extensive data exploration remains critical.

Data Security

The safety coercions are expanded through the volume, velocity, and variety of big data. Additionally,
some coercions and concerns, such as secrecy, concealment, reliability, and accessibility of data, occur

81

Big Data in Cloud Computing

in big data by means of cloud computing platforms. Hence, data safety need remain stately after data are
subcontracted towards the cloud provision suppliers. The cloud need furthermore evaluation by fixed
interims to secure the aforementioned beside coercions.
Manogaran et al 2016 have portrayed the safety tasks related by means of big data in cloud computing.
Big data safety in the cloud computing is vital owing towards the accompanying concerns as:

1. To safeguard and avert vast mass about trusted commerce, government, or controlling data from
malevolent invaders and progressive coercions
2. Absence of responsiveness and standards almost in what way cloud provision suppliers firmly
preserving the massive disk space plus removal of remaining big data,
3. Absence of guidelines approximately examining and recording of enormous data in public cloud
4. Consumers who does not even graft for the association, yet can take complete control in addition
to perceptibility into past of business data.

5. FUTURE RESEARCH DIRECTIONS

Skourletopoulos et al 2017, have deliberated several open research concerns comprising seizure, storing,
handling, cleaning, investigation, gathering information, examine, distribution, conception, demanding
and secrecy of the precise huge capacities of data. The research future direction could be

• Data storage and management


• Data broadcast and curation
• Data handling and exploration
• Data secrecy and security

6. CONCLUSION

Cloud computing environments remain made for wide-ranging perseverance workloads plus resource
sharing that remains used towards deliver elasticity on request. Thus, the cloud computing environment
appears to stay well appropriate for big data. Data storage with cloud computing stays a reasonable choice
for trivial to moderate sized industries in view of the usage of Big Data analytic techniques (Zanoon et
al, 2016). Therefore, this chapter deliberates the background of big data and cloud computing. It likewise
addresses the challenges associated to big data in cloud computing.

REFERENCES

Acharjya, D.P., & Ahmed, K.P. (2016). A Survey on Big Data Analytics: Challenges, Open Research
Issues and Tools. International Journal of Advanced Computer Science and Applications, 7(2).
Agrawal, D., Das, S., & Abbadi, A.E. (2011). Big Data and Cloud Computing: Current State and Future
Opportunities. EDBT.

82

Big Data in Cloud Computing

Alfazi, Abdullah, Sheng, Quan, Babar, Ali, Ruan, Wenjie, & Qin. (2017). Toward Unified Cloud Service
Discovery for Enhanced Service Identification. The 6th Australasian Symposium on Service Research
and Innovation (ASSRI’17).
Balachandran, B., & Prasad, S. (2017). Challenges and Benefits of Deploying Big Data Analytics in the
Cloud for Business Intelligence International Conference on Knowledge Based and Intelligent Information
and Engineering Systems. Procedia Computer Science, 112, 1112–1122. doi:10.1016/j.procs.2017.08.138
Chandrashekar, R., Kala, M., & Mane, D. (2015). Integration of Big Data in Cloud computing environ-
ments for enhanced data processing capabilities. International Journal of Engineering Research and
General Science.
Cohen, J., Dolan, B., Dunlap, M., Hellerstein, J. M., & Welton, C. (2009). MAD skills: New analysis
practices for big data. Proceedings of the VLDB Endowment International Conference on Very Large
Data Bases, 2(2), 1481–1492. doi:10.14778/1687553.1687576
Elazhary. (2014). Cloud Computing for Big Data MAGNT Research Report. Academic Press.
Geczy, P., Izumi, N., & Hasida, K. (2012). Cloudsourcing: Managing cloud adoption. Global Journal
of Business Research, 6(2), 57–70.
Gholami, A., & Laure, E. (2016). Big data security and privacy issues in the cloud. International Journal
of Network Security & Its Applications, 8(1).
Gupta, R., Gupta, H., & Mohania, M. (2012). Cloud Computing and Big Data Analytics: What Is New
from Databases Perspective? LNCS, 7678, 42–61.
Hadi, H.J., Shnain, A.H., Hadishaheed, S., & Ahmad, A.H. (2015). Big Data and Five V’S Character-
istics. International Journal of Advances in Electronics and Computer Science, 2(1).
Hashem, I. A. T., Yaqoob, I., Anuar, N. B., Mokhtar, S., Gani, A., & Ullah Khan, S. (2015). The rise
of “big data” on cloud computing: Review and open research issues. Information Systems, 47, 98–115.
doi:10.1016/j.is.2014.07.006
Islam, M., & Reza, M. (2019). The Rise of Big Data and Cloud Computing. Internet of Things and Cloud
Computing., 7(2), 45–53. doi:10.11648/j.iotcc.20190702.12
Jayashree, K., & Abirami, R. (2018). Big Data Technologies and Management Innovative in Applications
of Knowledge Discovery and Information Resources Management. IGI Global Publisher.
Jayashree, K., Abirami, R., & Babu, R. (2018). A Collaborative Approach of IoT, Big Data, and Smart
City in Big Data analytics for Smart and Connected Cities. IGI Global Publisher.
Jensen, D., Konkel, K., Mohindra, A., Naccarati, F., & Sam, E. (2012). Business Analytics in the Cloud.
White paper IBW03004-USEN-00, IBM.
Khan, I., Naqvi, S. K., Alam, M., & Rizvi, S. N. A. (2015). Data model for Big Data in cloud environment.
Computing for Sustainable Global Development (INDIACom), 2nd International Conference, 582 – 585.
Kiran, J. S., Sravanthi, M., Preethi, K., & Anusha, M. (2015). Recent Issues and Challenges on Big Data
in Cloud Computing. IJCST, 6(2).

83

Big Data in Cloud Computing

Kumar & Goudar. (2012). Cloud Computing – Research Issues, Challenges, Architecture. Platforms and
Applications: A Survey International Journal of Future Computer and Communication, 1(4), 356-360.
Kune, R., Konugurthi, P. K., Agarwal, A., Chillarige, R. R., & Buyya, R. (2016). The anatomy of big
data computing Journal of Software. Practice and Experience, 46(1), 79–105. doi:10.1002pe.2374
Manogaran, G., Thota, C., & Kumar, M. V. (2016). MetaCloudDataStorage Architecture for Big Data
Security in Cloud Computing. Procedia Computer Science, 87, 128–133. doi:10.1016/j.procs.2016.05.138
Mayer, V. V., & Cukier, K. (2013). Big Data: A Revolution That Will Transform How We Live, Work
and Think. John Murray Press.
Neves, Schmerl, Camara, & Bernardino. (2016). Big Data in Cloud Computing: Features and Is-
sues. Proceedings of the International Conference on Internet of Things and Big Data, 1, 307-314.
10.5220/0005846303070314
Purcell, M.B. (2013). Big data using cloud computing. Journal of Technology Research, 1-7.
Reyes-Ortiz, J., Oneto, L., & Anguita, D. (2015). Big Data Analytics in the Cloud: Spark on Hadoop vs
MPI/OpenMP on Beowulf. Procedia Computer Science, 53(1), 121–130. doi:10.1016/j.procs.2015.07.286
Skourletopoulos, G., Mavromoustakis, C.X., Mastorakis, G., Batalla, J.M., Dobre, C., Panagiotakis,
S., & Pallis, E. (2016). Big Data and Cloud Computing: A Survey of the State-of-the-Art and Research
Challenges. In Advances in Mobile Cloud Computing and Big Data in the 5G Era. Studies in Big Data
(Vol. 22). Springer.
Talia, D. (2013). Clouds for scalable big data analytics. Computer, 46(5), 98–101. doi:10.1109/MC.2013.162
Venkatesh, H., Perur, D.S., & Jalihal, N. (2015). A Study on Use of Big Data in Cloud Computing En-
vironment. International Journal of Computer Science and Information Technologies, 6(3), 2076-2078.
Wadhwani, K., & Wang, Y. (2017). Big Data Challenges and solutions. Technical Report.
Zanoon, N., Al-Haj, N., & Khwaldeh, S. (2017). M Cloud Computing and Big Data is there a Relation
between the Two: A Study. International Journal of Applied Engineering Research, 12(17), 6970–6982.

84

View publication stats

You might also like