
CASE STUDY
D.V SRIJA - 321506402090
4/6 CSE-2
UNIT 7 - INSIDE CLOUD
 INTRODUCTION TO CLOUD COMPUTING
 INTRODUCTION TO MAP REDUCE
 BIG DATA AND ITS IMPACT ON CLOUD
COMPUTING
 HADOOP-OVERVIEW OF BIG DATA
 BUSINESS IMPACT OF CLOUD COMPUTING
1. Introduction to Cloud Computing:
What is Cloud Computing?
Cloud computing is the on-demand access to
computing resources—physical servers or virtual servers,
data storage, networking capabilities, application
development tools, software, AI-powered analytic tools and
more—over the internet with pay-per-use pricing.
The cloud computing model offers customers greater
flexibility and scalability compared to traditional on-premises
infrastructure.
Cloud computing plays a pivotal role in our everyday
lives, whether we are accessing a cloud application like Google
Gmail, streaming a movie on Netflix or playing a
cloud-hosted video game.
Types of Cloud Computing:
Public:
A public cloud is a type of cloud computing in which a
cloud service provider makes computing resources available
to users over the public internet. These resources include
SaaS applications and individual virtual machines (VMs).
Private:
A private cloud is a cloud environment where all cloud
infrastructure and computing resources are dedicated to one
customer only. Private cloud combines many benefits of
cloud computing—including elasticity, scalability and ease of
service delivery—with the access control, security and
resource customization of on-premises infrastructure.
Hybrid:
A hybrid cloud is just what it sounds like: a combination
of public cloud, private cloud and on-premises environments.
Specifically (and ideally), a hybrid cloud connects a
combination of these three environments into a single,
flexible infrastructure for running the organization’s
applications and workloads.

Cloud computing services:


IaaS (Infrastructure-as-a-Service):
IaaS (Infrastructure-as-a-Service) provides on-demand
access to fundamental computing resources—physical and
virtual servers, networking and storage—over the internet on
a pay-as-you-go basis.
PaaS (Platform-as-a-Service):
PaaS provides software developers with an on-demand
platform—hardware, complete software stack, infrastructure
and development tools—for running, developing and
managing applications without the cost, complexity and
inflexibility of maintaining that platform on-premises. The
platform typically includes servers, networks, storage,
operating system software, middleware and databases.
SaaS (Software-as-a-Service):
SaaS (Software-as-a-Service), also known as cloud-based
software or cloud applications, is application software hosted
in the cloud.
SaaS is the primary delivery model for most commercial
software today, ranging from CRM applications (for example,
Salesforce) to robust enterprise database and artificial
intelligence (AI) software.
2. MapReduce Overview:
 WHAT IS MAPREDUCE?
MapReduce is a data processing model used to process
data in parallel in a distributed environment. It was introduced
in 2004 in the paper "MapReduce: Simplified Data Processing
on Large Clusters," published by Google.

Steps in MapReduce:
The map phase takes input data in the form of pairs and
returns a list of <key, value> pairs. The keys are not
necessarily unique at this stage.
Using the output of the map phase, sort and shuffle are
applied by the Hadoop framework. Sort and shuffle act on the
list of <key, value> pairs and emit each unique key together
with the list of values associated with it: <key, list(values)>.
The output of sort and shuffle is sent to the reducer phase.
The reducer applies a user-defined function to the list of
values for each unique key, and the final <key, value> output
is stored or displayed.
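The flow above can be simulated in plain Python. The sketch below is only an illustration (the function names and the sample documents are made up for the example and are not part of Hadoop itself); it walks a small word count through the map, sort-and-shuffle, and reduce phases:

```python
from itertools import groupby
from operator import itemgetter

def map_phase(documents):
    # Map: emit a <word, 1> pair for every word in every document.
    return [(word, 1) for doc in documents for word in doc.split()]

def shuffle_phase(pairs):
    # Sort and shuffle: sort by key, then group values so each unique
    # key maps to the list of values emitted for it.
    pairs = sorted(pairs, key=itemgetter(0))
    return [(key, [value for _, value in group])
            for key, group in groupby(pairs, key=itemgetter(0))]

def reduce_phase(grouped):
    # Reduce: apply the user-defined function (here, sum) to each key's values.
    return {key: sum(values) for key, values in grouped}

documents = ["cloud computing and big data", "big data on the cloud"]
print(reduce_phase(shuffle_phase(map_phase(documents))))
# {'and': 1, 'big': 2, 'cloud': 2, 'computing': 1, 'data': 2, 'on': 1, 'the': 1}
```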
Let us take a real-world example to comprehend the
power of MapReduce. Twitter receives around 500 million
tweets per day, which is nearly 6,000 tweets per second. The
following steps show how Twitter could manage its tweets
with the help of MapReduce.
Tokenize: Tokenizes the tweets into maps of tokens and
writes them as key-value pairs.
Filter: Filters unwanted words from the maps of
tokens and writes the filtered maps as key-value pairs.
Count: Generates a token counter per word.
Aggregate Counters: Prepares an aggregate of similar
counter values into small manageable units.
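As a rough sketch of these four steps, the following Python fragment applies them to a couple of made-up tweets (the tweets and the stop-word list are invented for the example, and the final aggregation step is interpreted loosely as bucketing words by their counts):

```python
from collections import Counter

tweets = ["cloud computing is the future", "big data needs the cloud"]
stop_words = {"is", "the"}  # hypothetical list of unwanted words

# Tokenize: split each tweet into (token, 1) key-value pairs.
tokens = [(word, 1) for tweet in tweets for word in tweet.split()]

# Filter: drop unwanted words from the token maps.
filtered = [(word, one) for word, one in tokens if word not in stop_words]

# Count: generate a counter per word.
counts = Counter()
for word, one in filtered:
    counts[word] += one

# Aggregate Counters: group the per-word counts into small,
# manageable units (here, bucketed by count value).
aggregated = {}
for word, total in counts.items():
    aggregated.setdefault(total, []).append(word)

print(counts)      # Counter({'cloud': 2, 'computing': 1, ...})
print(aggregated)  # {2: ['cloud'], 1: ['computing', 'future', ...]}
```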

3. Big Data and Its Impact on Cloud Computing:


Big Data is a term that describes extremely large and
complex sets of structured and unstructured data that are
too cumbersome to process through traditional database
management tools. The true power of Big Data lies in the
opportunity for deep analysis it provides. Analyzing Big Data
can lead to uncovering patterns, correlations, and insights
that are invaluable in making data-driven decisions and
automating various aspects of a business.
One of the most popular frameworks for understanding
Big Data is the concept of the Three Vs: Volume, Velocity, and
Variety.
Volume refers to the sheer size of the data, often
ranging from terabytes to petabytes.
Velocity indicates the speed at which new data is
generated and processed. Businesses like social media
platforms may deal with real-time or near-real-time
information that requires rapid processing.
Variety stands for the different types of data; in addition
to traditional structured data, Big Data can include text,
images, sound, video, and more.

Challenges of implementing big data


The most commonly reported big data challenges
include:
1.Lack of data talent and skills:
Data scientists, data analysts, and data engineers are
in short supply—and are some of the most highly sought
after (and highly paid) professionals in the IT industry. Lack of
big data skills and experience with advanced data tools is one
of the primary barriers to realizing value from big data
environments.
2.Speed of data growth:
Big data, by nature, is always rapidly changing and
increasing. Without a solid infrastructure in place that can
handle your processing, storage, network, and security
needs, it can become extremely difficult to manage.
3.Problems with data quality:
Data quality directly impacts the quality of decision-
making, data analytics, and planning strategies. Raw data is
messy and can be difficult to curate. Having big data doesn’t
guarantee results unless the data is accurate, relevant, and
properly organized for analysis. Poor-quality data can slow
down reporting and, if not addressed, can lead to misleading
results and worthless insights.
4.Security concerns:
Big data contains valuable business and customer
information, making big data stores high-value targets for
attackers. Since these datasets are varied and complex, it can
be harder to implement comprehensive strategies and
policies to protect them.
4. Hadoop: Overview and Its Role in Cloud Computing:
Hadoop is an open source framework based on Java that
manages the storage and processing of large amounts of data
for applications. Hadoop uses distributed storage and parallel
processing to handle big data and analytics jobs, breaking
workloads down into smaller tasks that can be run at the
same time.

How does Hadoop work?


Hadoop allows for the distribution of datasets across a
cluster of commodity hardware. Processing is performed in
parallel on multiple servers simultaneously.
Software clients input data into Hadoop. HDFS handles
metadata and the distributed file system. MapReduce then
processes and converts the data. Finally, YARN divides the
jobs across the computing cluster.
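As a concrete (and simplified) illustration of this flow, a client can submit a job through Hadoop Streaming, which lets ordinary scripts act as the mapper and reducer while HDFS and YARN handle storage and scheduling. The word-count mapper below is only a sketch; the file names and the location of the streaming jar shown in the comments are assumptions that vary between installations:

```python
#!/usr/bin/env python3
# mapper.py: reads raw text lines from stdin and emits "<word>\t1" pairs.
# Hadoop Streaming pipes each HDFS input split into this script and collects
# whatever it prints to stdout as intermediate key-value pairs. A matching
# reducer.py would read the sorted pairs and sum the counts per word.
#
# A typical, installation-dependent submission command looks like:
#   hadoop jar $HADOOP_HOME/share/hadoop/tools/lib/hadoop-streaming-*.jar \
#       -input /data/tweets -output /data/wordcount \
#       -mapper mapper.py -reducer reducer.py \
#       -file mapper.py -file reducer.py
import sys

for line in sys.stdin:
    for word in line.strip().split():
        # One tab-separated key-value pair per line.
        print(f"{word}\t1")
```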

Modules of Hadoop:
HDFS: Hadoop Distributed File System. Google published its
GFS (Google File System) paper, and HDFS was developed on
that basis. Files are broken into blocks and stored on nodes
across the distributed architecture.
YARN: Yet Another Resource Negotiator, used for job
scheduling and for managing the cluster.
MapReduce: A framework that helps Java programs perform
parallel computation on data using key-value pairs. The Map
task takes input data and converts it into a data set that can
be computed as key-value pairs. The output of the Map task
is consumed by the Reduce task, and the output of the
reducer gives the desired result.
Hadoop Common: Java libraries that are used to start Hadoop
and are used by the other Hadoop modules.
Hadoop Architecture:
The Hadoop architecture is a package of the file system,
MapReduce engine and the HDFS (Hadoop Distributed File
System). The MapReduce engine can be MapReduce/MR1 or
YARN/MR2.
A Hadoop cluster consists of a single master and multiple
slave nodes. The master node runs the Name Node and Job
Tracker, whereas each slave node runs a Data Node and Task
Tracker.
Role of Hadoop in Cloud Computing:
Hadoop plays a significant role in cloud computing by
enhancing data storage, processing, and analysis capabilities.
Here are some key aspects:
Scalability: Cloud environments can quickly scale resources
up or down based on demand. Hadoop’s ability to add nodes
easily aligns well with cloud elasticity, allowing organizations
to handle large datasets efficiently.
Cost Efficiency: Using commodity hardware in cloud
environments reduces costs significantly. Organizations can
leverage Hadoop on cloud platforms without the need for
expensive infrastructure.
Data Storage and Management: Hadoop can store vast
amounts of structured and unstructured data in the cloud,
making it easier for organizations to manage and analyze
diverse data sources.
Integration with Other Cloud Services: Hadoop can
integrate with various cloud services, including data lakes,
analytics tools, and machine learning platforms, providing a
comprehensive ecosystem for big data solutions.
Flexibility and Accessibility: Cloud-based Hadoop
deployments allow users to access data and analytics tools
from anywhere, facilitating collaboration and real-time data
processing.
Disaster Recovery and Backup: Cloud providers often
offer robust backup and disaster recovery options, ensuring
that Hadoop data is secure and recoverable.

5. Business Impact of Cloud Computing:


1. Cost Efficiency:
Reduced Capital Expenditure: Businesses can avoid the
high costs of purchasing and maintaining hardware and
software by utilizing cloud services, converting fixed costs
into variable costs.
Pay-as-You-Go Model: Organizations can pay only for the
resources they use, leading to better cost management and
budgeting.
2. Enhanced Data Security and Compliance:
Robust Security Measures: Many cloud providers offer
advanced security features and compliance certifications,
which can be more effective than traditional in-house
solutions.
Regular Updates: Cloud services often include automatic
updates and patches, ensuring systems are secure and up to
date.
3. Business Continuity and Disaster Recovery:
Data Backup Solutions: Cloud computing simplifies
data backup and recovery processes, reducing downtime and
ensuring business continuity in case of disasters.
Geographic Redundancy: Data can be replicated
across multiple locations, enhancing resilience and
availability.
4. Enhanced Customer Experience:
Personalization: Businesses can analyze customer data
more effectively, enabling personalized services and better
customer engagement.
Faster Service Delivery: Cloud solutions can improve
response times and service delivery, enhancing overall
customer satisfaction.
 Conclusion:
Cloud computing will affect a large part of the computer
industry, including software companies and Internet service
providers. Cloud computing makes it very easy for companies
to provide their products to end users without worrying about
hardware configurations and other server requirements.
Cloud computing and virtualization are distinguished by the
fact that, in the cloud, the control-plane activities centered on
the creation, management, and maintenance of the virtual
environment are outsourced to an automated layer of APIs
and management servers.
In simple words, virtualization is a building block of cloud
computing in which interaction with the hypervisor is
managed manually. In cloud computing, by contrast, these
activities are self-managing: an API (Application Programming
Interface) is exposed so that users can consume the cloud
service on their own.
