- [Raf] You may already know that the cloud is more flexible, scalable, secure, distributed, and resilient. But I want to take a more data-focused approach to why cloud computing is relevant for data analytics. In this section, I will explain why the cloud is the best place to perform data analytics nowadays, and why it has proven so solid for operating big data workloads. So, let's get started. Before we start talking about the cloud, allow me to go back in time, maybe a decade, and tell you a brief story. After that trip back in time, it will be natural for you to understand why everybody loves doing data analytics in the cloud. Ready for the journey? Get your beverage of choice, and buckle up! (cup hitting the floor) (whirring sound)

Years ago, the most common approach for companies to get compute infrastructure, big data included, was to buy servers and install them in data centers. This is usually called colocation, or colo. The thing is, servers used for data operations are not cheap, because they need lots of storage, consume lots of electricity, and require careful maintenance to guarantee data durability. Hence, entire dedicated infrastructure teams. And trust me, I've been one of those infrastructure analysts working with data centers. It is expensive and overwhelming. In that scenario, only big companies were able to work with big data, and consequently, data analytics was not popular. It was very common for those servers to have a RAID storage controller that replicated data across the disks, increasing the cost and the maintenance burden even more. In the early 2000s, big data operations were closely tied to the underlying hardware, such as mainframes and server clusters. Although this was extremely profitable for the ones selling the hardware, it was expensive and inflexible for the consumers.

Then, something fantastic started to happen. And the name of this fantastic thing is Apache Hadoop. Essentially, what Hadoop does is replace all that fancy hardware with software installed on plain operating systems. Yeah, that's right. With the help of Hadoop and its computing frameworks, data could be distributed and replicated across multiple servers as a distributed system, eliminating the need for expensive data-replication hardware just to start working with big data. All you needed was efficient network equipment, and the data would be synchronized over the network to the other servers. By embracing failures instead of trying to avoid them, Hadoop helped reduce hardware complexity. And when you reduce hardware complexity, you reduce cost. And by reducing cost, you start to democratize big data, because smaller companies could start leveraging it as well. Welcome to the big data boom. I brought up Hadoop because it is the most popular open-source big data ecosystem. There are others, but what I wanted to highlight here is the concept, not specific frameworks or vendors.

The thing is, by stripping the hardware back to a basic level and moving the big data concepts, such as data replication, into software, we can start thinking about running big data operations on any provider capable of offering virtual machines with storage and a network card attached. We can start thinking about using the cloud to build entire data lakes, data warehouses, and data analytics solutions. Since then, cloud computing has emerged as an attractive alternative, because that is exactly what it offers: you can get virtual machines, install the software that handles data replication, distributed file systems, and entire big data ecosystems, and be happy without having to spend lots of money on hardware.

The advantage is that the cloud does not stop there. Many cloud providers, such as Amazon Web Services, noticed that customers were spinning up virtual machines just to install big data tools and frameworks. Based on that, Amazon started to create offerings with everything already installed, configured, and ready to use. That's why you have AWS services such as Amazon EMR, Amazon S3, Amazon RDS, Amazon Athena, and many others. Those are what we call managed services, and all of them operate in the data scope. In a later lesson, I will talk more about some of the services we will need to build our basic data analytics solution.

Another big advantage of running data analytics in the cloud is the ability to stop paying for infrastructure resources when you don't need them anymore. This is very common in data analytics because, due to the nature of big data operations, you may only need to run reports once in a while. You can easily do that in the cloud by spinning up servers or services, using them, getting the report you need, saving it, and then turning everything off. In addition, you can temporarily spin up more servers to speed up your jobs and turn them off when you're done.

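For example, using the boto3 SDK, a transient Amazon EMR cluster that runs a single Spark job and then shuts itself down might look roughly like this. Treat it as a minimal sketch: the bucket, the script path, the instance types, and the default EMR roles are placeholders you would adapt to your own account.

    import boto3

    emr = boto3.client("emr", region_name="us-east-1")

    # Transient cluster: it starts, runs one Spark step, and terminates itself,
    # so you only pay while the report job is actually running.
    response = emr.run_job_flow(
        Name="once-in-a-while-report",
        ReleaseLabel="emr-6.15.0",
        Applications=[{"Name": "Spark"}],
        Instances={
            "InstanceGroups": [
                {"InstanceRole": "MASTER", "InstanceType": "m5.xlarge", "InstanceCount": 1},
                {"InstanceRole": "CORE", "InstanceType": "m5.xlarge", "InstanceCount": 2},
            ],
            # When there are no more steps to run, shut the cluster down automatically.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        Steps=[
            {
                "Name": "generate-report",
                "ActionOnFailure": "TERMINATE_CLUSTER",
                "HadoopJarStep": {
                    "Jar": "command-runner.jar",
                    # Hypothetical script and bucket; the results land in S3,
                    # so they survive after the cluster is gone.
                    "Args": ["spark-submit", "s3://my-analytics-bucket/jobs/report.py"],
                },
            }
        ],
        JobFlowRole="EMR_EC2_DefaultRole",
        ServiceRole="EMR_DefaultRole",
    )

    print("Started transient cluster:", response["JobFlowId"])
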
And since you mostly pay for the time and resources you actually use, 10 servers running for 1 hour tends to cost about the same as one server running for 10 hours.

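To make that concrete, here is a quick back-of-the-envelope check, assuming a hypothetical on-demand rate per server-hour:

    # Hypothetical on-demand price per server-hour, in USD.
    hourly_rate = 0.20

    scale_out = 10 * 1 * hourly_rate  # 10 servers running for 1 hour
    scale_up = 1 * 10 * hourly_rate   # 1 server running for 10 hours

    # Both come to 2.00 USD, but the scaled-out run finishes roughly 10x sooner.
    print(scale_out, scale_up)
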
Basically, with the cloud, you get access to hardware without having to carry all the burden of running data center operations. It is like the best of both worlds. Stay with me to learn more about the AWS services I will use to build my descriptive data analysis solution using Amazon S3, CloudTrail, Amazon Athena, and QuickSight.
