Introduction To Smart Systems
Big data and Cloud services

The pros of big data in the cloud
Scalability
A typical business data center faces limits in physical space, power, cooling and the budget to
purchase and deploy the sheer volume of hardware it needs to build a big data infrastructure. By
comparison, a public cloud manages hundreds of thousands of servers spread across a fleet of
global data centers. The infrastructure and software services are already there, and users can
assemble the infrastructure for a big data project of almost any size.
Agility
Not all big data projects are the same. One project may need 100 servers, and another project
might demand 2,000 servers. With the cloud, users can employ as many resources as needed to
accomplish a task and then release those resources when the task is complete.
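That elasticity can be sketched with AWS's boto3 SDK; this is a hedged example, and the cluster size, EMR release label and role names are illustrative, not a recommendation.

import boto3

emr = boto3.client("emr", region_name="us-east-1")

# Request only as many nodes as this particular task needs.
response = emr.run_job_flow(
    Name="nightly-analytics",
    ReleaseLabel="emr-6.15.0",
    Instances={
        "MasterInstanceType": "m5.xlarge",
        "SlaveInstanceType": "m5.xlarge",
        "InstanceCount": 100,                  # sized to the task at hand
        "KeepJobFlowAliveWhenNoSteps": False,  # release the nodes when the steps finish
    },
    JobFlowRole="EMR_EC2_DefaultRole",
    ServiceRole="EMR_DefaultRole",
)
print("Launched cluster:", response["JobFlowId"])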
Cost
A business data center is an enormous capital expense. Beyond hardware, businesses must also
pay for facilities, power, ongoing maintenance and more. The cloud rolls all of those costs into a
flexible rental model in which resources and services are available on demand and billed on a
pay-per-use basis.
Accessibility
Many clouds provide a global footprint, which enables resources and services
to deploy in most major global regions. This lets data storage and processing
take place close to where the big data task originates. For example, if the
bulk of the data is stored in a certain region of a cloud provider, it's
relatively simple to deploy the resources and services for a big data project
in that specific cloud region -- rather than sustaining the cost of moving
that data to another region.
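As a small illustration, assuming boto3 and a data set that already lives in one AWS region, the compute-side clients can simply be pinned to that same region; the region and service choices here are illustrative.

import boto3

data_region = "eu-west-1"  # region where the bulk of the data already lives
s3 = boto3.client("s3", region_name=data_region)          # access the storage in place
athena = boto3.client("athena", region_name=data_region)  # query the data where it sits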
Resilience
Data is the real value of big data projects, and the chief benefit of cloud
resilience is reliable data storage. Clouds replicate data as a matter of
standard practice to maintain high availability in storage resources, and
even more durable storage options are available in the cloud.
The cons of big data in the cloud
Public clouds and many third-party big data services have proven their value in big data use
cases. Despite the benefits, businesses must also consider some of the potential pitfalls. Some
major disadvantages of big data in the cloud can include the following.
Network dependence
Cloud use depends on end-to-end network connectivity from the LAN, across the internet, to the
cloud provider's network. Outages along that network path can result in increased latency at best
or complete cloud inaccessibility at worst. While an outage might not impact a big data project in
the same ways that it would affect a mission-critical workload, the effect of outages should still be
considered in any big data use of the cloud.
Storage costs
Data storage in the cloud can present a substantial long-term cost for big data projects. The three
principal issues are data storage, data migration and data retention. It takes time to load large
amounts of data into the cloud, and then those storage instances incur a monthly fee. If the data is
moved again, there may be additional fees. Also, big data sets are often time-sensitive, meaning
that some data may have no value to a big data analysis even hours into the future. Retaining
unnecessary data costs money, so businesses must employ comprehensive data retention and
deletion policies to manage cloud storage costs around big data.
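One way to enforce such a policy, sketched here under the assumption of AWS S3 with boto3 (the bucket name, prefix and retention window are illustrative), is a lifecycle rule that expires time-sensitive raw data automatically:

import boto3

s3 = boto3.client("s3")
s3.put_bucket_lifecycle_configuration(
    Bucket="example-bigdata-bucket",           # illustrative bucket name
    LifecycleConfiguration={
        "Rules": [
            {
                "ID": "expire-stale-raw-data",
                "Filter": {"Prefix": "raw/"},  # apply only to the raw landing data
                "Status": "Enabled",
                "Expiration": {"Days": 7},     # delete once the data loses analytical value
            }
        ]
    },
)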
Security
Big data projects can involve proprietary or personally identifiable
information that is subject to data protection and other industry- or
government-driven regulations. Cloud users must take the steps needed to
maintain security in cloud storage and computing through adequate
authentication and authorization, encryption for data at rest and in flight,
and copious logging of how data is accessed and used.
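As a minimal sketch of encryption at rest, assuming AWS S3 with boto3 (the bucket, key alias and object names are illustrative): the upload below requests server-side encryption with a KMS key, while the SDK's TLS endpoints cover the data in flight.

import boto3

s3 = boto3.client("s3")  # boto3 talks to HTTPS endpoints, protecting data in flight
with open("customers.parquet", "rb") as body:
    s3.put_object(
        Bucket="example-bigdata-bucket",
        Key="datasets/customers.parquet",
        Body=body,
        ServerSideEncryption="aws:kms",           # encrypt at rest with a KMS key
        SSEKMSKeyId="alias/example-bigdata-key",  # illustrative key alias
    )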
Lack of standardization
There is no single way to architect, implement or operate a big data
deployment in the cloud. This can lead to poor performance and expose the
business to possible security risks. Business users should document their big
data architecture along with any policies and procedures related to its use.
That documentation can become a foundation for future optimizations and
improvements.
Choose the right cloud deployment model
Hybrid cloud
A hybrid cloud is useful when a business needs to split resources between environments. For example, a
hybrid cloud might enable big data storage in the local private cloud --
effectively keeping data sets local and secure -- and use the public cloud for
compute resources and big data analytical services. However, hybrid clouds
can be more complex to build and manage, and users must deal with all of
the issues and concerns of both public and private clouds.
Multi-cloud
With multiple clouds, users can improve availability and take advantage of cost differences between providers.
However, resources and services are rarely identical between clouds, so
multiple clouds are more complex to manage. This cloud model also has
more risks of security oversights and compliance breaches than single public
cloud use. Considering the scope of big data projects, the added complexity
of multi-cloud deployments can add unnecessary challenges to the effort.
Private cloud
Private clouds give businesses control over their cloud environment, often to
accommodate specific regulatory, security or availability requirements.
However, a private cloud is more costly because the business must own and
operate the entire infrastructure. Thus, a private cloud might only be
justified for sensitive, small-scale big data projects.
Public cloud
The combination of on-demand resources and scalability makes public cloud
ideal for almost any size of big data deployment. However, public cloud users
must manage the cloud resources and services they use. In a shared
responsibility model, the public cloud provider handles the security of the
cloud, while users must configure and manage security in the cloud.
Providers
AWS
• Amazon SageMaker
Microsoft Azure
• Azure HDInsight
• Azure Databricks
Google Cloud
• Google BigQuery
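As one hedged example of these services in use, a Google BigQuery query from Python might look like the sketch below; the project, dataset and table names are assumptions for illustration.

from google.cloud import bigquery

client = bigquery.Client(project="example-project")  # illustrative project ID
query = """
    SELECT region, COUNT(*) AS events
    FROM `example-project.analytics.events`
    GROUP BY region
"""
for row in client.query(query).result():  # submits the job and waits for rows
    print(row.region, row.events)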
Application of Smart systems
Enrichment: Data enrichment combines data that is in flight with data at rest
from a tertiary source as a means of augmenting the data.
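A minimal sketch of that idea in plain Python, with an illustrative lookup table standing in for the tertiary source at rest:

reference = {"BER": "Berlin", "LHR": "London"}  # data at rest (tertiary lookup table)

def enrich(record):
    # Augment the in-flight record with an attribute from the data at rest.
    record["city"] = reference.get(record["airport"], "unknown")
    return record

stream = [{"airport": "BER", "delay": 12}, {"airport": "LHR", "delay": 3}]
enriched = [enrich(r) for r in stream]  # each record now carries the looked-up city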
Archiving: A data lake provides a distributed data store (e.g., Hadoop) that can host
structured (relational data sets), semi-structured (XML, JSON) or unstructured data
(e.g., PDF documents). A data archive enables future data mining and machine
learning. How data is stored goes a long way in dictating how the data is used.
There are many trade-offs, as shown by the CAP theorem conjectured by Eric
Brewer, which illustrates that a distributed data store cannot provide more than
two of the following three guarantees: consistency, availability and partition
tolerance.
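For instance, here is a hedged PySpark sketch of archiving semi-structured data into an HDFS-backed lake as Parquet; the paths and application name are illustrative.

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("archive").getOrCreate()

events = spark.read.json("hdfs:///landing/events/")          # semi-structured JSON input
events.write.mode("append").parquet("hdfs:///lake/events/")  # columnar archive for future mining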
Analyzing: How smart can a system be without analysis? Through
combinations of data mining and machine learning (e.g., Spark
ML), smart systems can become increasingly cognitive and
autonomous. Through iteration, systems can discover new
patterns and new meanings and use them to find new
opportunities and capabilities to automate.
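A minimal sketch of that discovery loop with Spark ML, assuming the illustrative lake path and numeric feature columns below: k-means clustering groups the records so that previously unseen patterns can be inspected.

from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.clustering import KMeans

spark = SparkSession.builder.appName("discover-patterns").getOrCreate()
df = spark.read.parquet("hdfs:///lake/events/")  # archived data from the lake

# Assemble numeric columns into the single feature vector Spark ML expects.
assembled = VectorAssembler(
    inputCols=["delay", "duration"], outputCol="features"
).transform(df)

model = KMeans(k=5, seed=42).fit(assembled)     # discover five candidate clusters
clustered = model.transform(assembled)          # adds a 'prediction' column per record
clustered.groupBy("prediction").count().show()  # inspect the newly found groupings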