0% found this document useful (0 votes)
76 views52 pages

ISM 2024 - Session 5

Uploaded by

kashid.khan15
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
76 views52 pages

ISM 2024 - Session 5

Uploaded by

kashid.khan15
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 52

Information Systems for Managers (ISM)

Cloud and Big Data

Session 5
Cloud Computing

Image Source: D. Fletcher for cloudTweaks.com

2
https://www.youtube.com/watch?v=AnxrJiS5uKU 3
What is
Cloud
Computing?
4
Some Statistics

Cloud market projected to reach $864 bn. in 2025

5
Cloud Computing
• Cloud computing is a model for
enabling convenient, on-demand
network access to a shared pool of
configurable computing resources (e.g., CLOUD:
C – Common Infrastructure
networks, servers, storage, L – Location Independence
O – Online Accessibility
applications, and services) that can be U – Utility Pricing
D – on Demand resources
rapidly provisioned and released with
minimal management effort or service
provider interaction. [NIST]
• Cloud Computing enables computing
facilities and infrastructure to be made
available over internet. 6
But Why?

Capacity expansion
through IT asset
purchases results in large
capital investments

7
Key technologies behind cloud computing

Virtualization Multitenancy Web Services


• Virtualization is technology Web Service is a strategy to
• Virtualization + Resource-
that allows you to create sharing = Multitenancy make the services of one
multiple simulated application available to
environments or dedicated • Multitenancy allows other applications using
resources from a single, multiple users (tenants) to messaging protocols (read
physical hardware system. work in the same “no user interface).
• Software called a software environment at
hypervisor connects the same time at their Microservice is a completely
directly to that hardware own user interfaces autonomous application that
(skins). performs a specific service in
and allows you to split one
system into separate, • SaaS (Software-as-a- a larger application
distinct, and secure Service) uses a highly architecture.
environments known as multi-tenant architecture.
virtual machines (VMs). E.g. Google Apps
Capabilities
Scalability Elasticity Measured Use Usage-Based Pricing
• Ability address needs of • Ability to expand and • Includes metrics and all • Can be based on usage and
increase demand contract depending on the services are measured needs of client
demand throughout

Managed Services Service Levels Ubiquitous Access Heterogeneity


• Building, maintain and • Duration of time an infra • Anytime-anywhere access • Can be managed easily
upgrading the system is done service to be available w/o through virtualization
by provider interruption or the speed or
response level of service

Reduced Setup Time Resource Pooling


• No need to purchase • Infra is created with hundred
hardware or do civil works to or thousands of CPUs and
setup memory device and only a
slice of it is allotted to client

9
Application Infrastructure

10
Service Models

Operating Environment with


Applications
End Users
e.g., CRM, ERP
SaaS

Development platforms
e.g., Google App Engine Application Developers
PaaS

Storage and Compute


e.g., Server, Database, Network Infra. Admins
IaaS

11
Cloud Computing Service Models
Infrastructure Platform Software
(as a Service) (as a Service) (as a Service)

You Manage
You Manage

Applications Applications Applications


Security and Security and Security and
Integration Integration Integration

Managed by vendor
Databases Databases Databases

Managed by vendor
Vertualization Vertualization Vertualization
Managed by vendor

Server OS Server OS Server OS

Storage Storage Storage

Networking Networking Networking

Server HW Server HW Server HW


SaaS v/s PaaS v/s IaaS
SaaS PaaS IaaS
• On demand service • This service is made up of a OS, a • Offers computing architecture
• Platform independent web server, and a database and infrastructure in virtual
• No need to install the s/w • Users build, compile, and run environment
on your PC programs without worrying • Multiple users can access four
• Used by multiple users about underlying infrastructure resources, managed by vendors:
• Managed by vendor • You manage data and application Data Storage, virtualization,
resources Servers and Networking
User: End users Users: Developers Users: System Administrators
Examples: Gmail, MS office Examples: Google App Engine, Examples: Amazon S3 storage
365, Google Drive, CRM, Microsoft’s Azure Services Platform, service, Rackspace cloud servers; EC2
ZOHO, Force.com LAMP platform
Adv: Adv: Adv:
• Universally accessible • Cost effective rapid development • Enhanced Scalability
from any platform • Easy development of web app • IaaS is flexible
• Excellent for collaboration
Disadv: Disadv: Disadvantages
• Performance based on • Migration issues: the risk of • Security issues
internet speed vendor lock-in • Network and service delays
• Browser issues
Cloud Computing Deployment Models

Private Cloud Public Cloud Hybrid Cloud Community Cloud


• A truly shared • Private Cloud with • Shared Computing
• Single Client
facility public resources infra
• Leverages Existing
• Shifts CapEx to • Address problem • Maybe setup by a
CapEx
OpEx of Cloud ‘bursts’ group of companies
• Can Reduce OpEx
• Offers a Pay as you • Infra is provided in sharing similar
• Though security is real time from
go model specifications
key feature lacks public cloud
core advantages of • Supports Multiple • E.g.: Bank, Hospitals
CC Tenants etc
Public vs. Private Cloud Deployment

Public Private

Economies of Scale Control

Cost Security

Flexibility Regulation

Op-ex Cap-ex
Players in the Cloud Industry

Provider Enablers Consumers Regulators

• Own and • Companies • Corporate or • Government or


manage their who facilitate individuals international
own systems – the cloud bodies
deliver ‘cloud’ client-provider
needs through partnership
cloud services through
migration,
monitoring,
maintenance,
etc

16
Cloud Providers

Gartner Magic Quadrant CSP 2009 Gartner Magic Quadrant CSP 202217
Amazon Web Services (AWS)
Market leader in cloud services (IaaS,
PaaS)
Offers more than 250 cloud-based services
Provide cloud solutions to millions of users

30 regions
96 availability zones
245 countries and territories served

Computing
EC2 – Micro, small, medium, large
On-demand ($0.05/hour), reserved (72%
discount), spot (90% discount)
18
Salesforce (SaaS)

It provides CRM software and applications focused on sales, customer


service, marketing automation, analytics, and application development.

#1 CRM: More than 1,50,000 clients use Salesforce’s CRM application

Sales Cloud, Service Cloud, Platform, Marketing Cloud, Commerce


Cloud, Community Cloud

Unique aspects of Salesforce.com services include ease of use,


customizability, continuous improvement & innovation

19
CRM

20
BUSINESS VALUE OF CLOUD
COMPUTING
21
Business value of cloud computing

Financial Drivers Operational Driver Strategic Drivers

• Reduced total • Agility • Develop


cost of • Scalability capabilities
ownership • Centralized • Rapid product
• Opportunity cost management development
• Productivity • Increased access • Customer
intimacy

22
When not to move to cloud?
• Tight coupling between hardware and
software
• Applications that do not scale
• The cost not justified
• IT staff may not have the relevant skills
• Security is a key concern

23
Applications that have volatile demand

Low business risk and commodity


application

When to
move to Applications that require collaboration

cloud?
Big data storage and analysis

Capital available for infrastructure is low


19-01-2024 ITS S12 | Vimal Kumar M | IIM Calcutta 24
Challenges
Privileged User Access

Regulatory Compliance

Data Location

Recovery

Investigative Support

Long-term Viability
Benefits
Significantly lower the cost of entry for newer firms that lack resources

Immediate access to hardware resources without capital investment leads to faster


time to market

Lowers IT barriers to innovation, helps organizations in digital transformation

Capabilities of cloud enables scale their business capabilities based on demand

Facilitates the development of new class of applications – mobile interactive


applications and batch processing

26
BIG DATA AND ANALYTICS

27
Big Data Definition
• No single standard definition…

“Big Data” is data whose scale, diversity, and


complexity require new technologies,
algorithms, and analytics to manage it and
extract value and hidden knowledge from it…

28
Characteristics of Big Data:
1-Volume
• Data Volume
– 44x increase from 2009 2020
– From 0.8 zettabytes to 35zb
• Data volume is increasing
exponentially

500 million tweets per day


350,000 tweets per minute
6,000 tweets per second

Exponential increase in
collected/generated data 29
30
Characteristics of Big Data:
2-Variety
• Various formats, types, and
structures
• Text, numerical, images,
audio, video, sequences,
time series, social media
data, multi-dim arrays, etc…
• Static data vs. streaming data
• A single application can be
generating/collecting many
types of data

To extract knowledge all these types of data need to linked together


31
Characteristics of Big Data:
3-Velocity
• Data is generated fast and need to be
processed fast
• Online Data Analytics
• Late decisions  missing opportunities
• Examples
– E-Promotions: Based on your current location, your purchase history,
what you like  send promotions right now for store next to you
– Healthcare monitoring: sensors monitoring your activities and body 
any abnormal measurements require immediate reaction
– LinkedIn: People You May Know, Groups You May Like, Jobs You May Be
Interested In

32
Big Data: 3V’s

33
Data Vs Big Data

Structure
d
Some Make it 4V’s

35
Or (5V)
Customer BD: 1 TB = 220 DVDs
sentiments are 16m FB photos
not verifiable

Text, photo, audio,


video, clickstream,
sensor data

LinkedIn, Amadeus (travel), GE


gas turbine
Batch  Real-time Revenue , Operational costs 
20b IoT devices by 2020 Serve customers better
Additional Dimensions…
• Variability: Variation in the data flow rate
• Complexity: Number of data sources
• Big data are collected from numerous data sources
• Complexity makes it difficult to collect, clean, store, and
process heterogeneous data
• Decay: Declining value of data over time
• In a time of high velocity, the timely processing and action is
the key
• Example: Data generated by wearable medical devices
Sources of Big Data

38
Numerous possibilities

Style of data Source Industry affected Function affected

Large volume Online Financial services Marketing

Unstructured Video Health care Supply chain

Continuous flow Sensor Manufacturing Human resources

Multiple formats Genomic Travel/transport Finance

Activity: Identify one application of big data by combining the dimensions


39
Applications Starbucks introducing new Drilling for Oil Chevron
coffee products

Monitoring trucks at US Target targeting customers


express
https://www.youtube.com/watch?v=XH1wQEgROg4 40
The Model Has Changed…
• The Model of Generating/Consuming Data has Changed

Old Model: Few companies are generating data, all others are consuming data

New Model: all of us are generating data, and all of us are consuming data

41
Difference between traditional data
and big data?
Continuous approach to sampling, analyzing
and action taking is necessary

Data is constant state of change; discovery and


agility is critical, not stability

42
Difference between traditional data
and big data?
Big data Traditional data
Type of data Unstructured format Formatted in rows
and columns
Volume of data 100 TB to PB Tens of TB or less
Flow of data Continuous flow of Static flow of data
data
Analysis methods Machine learning Hypothesis-based
Primary purpose Data-based products Internal decision
support and analysis
43
Analyzing data
Term Time frame Meaning

DSS 1970-1985 Use of data analysis to support decision making

EIS 1980-1990 Focus on data analysis for decisions by senior executives

OLAP 1990-2000 Software for analyzing multi-dimensional data tables

BI 1989-2005 Tools to support data driven decisions with emphasis on


reporting
Analytics 2005-2010 Focus on statistical and mathematical analysis for
decisions
Big data 2010- Focus on very large unstructured, fast-moving data

44
An Integrated View of Big Data
Processes for extracting insights from
Big Data

Stage 1: Acquisition Extraction, Integration,


Data and Cleaning and Aggregation and
Management Recording Annotation Representation

Stage 2: Modeling
Interpretation
Analytics and Analysis
Big Data Technology

47
Map Reduce Data Flow

48
Big Data Technology
Technology Definition
Hadoop Open-source software utilities for
processing big data across multiple
servers
MapReduce The architectural framework on which
Hadoop is based
Scripting languages Programming languages that work well
with big data (e.g., Python, Pig, Hive)
Machine learning Algorithm for rapidly finding the best
model fit of data set
Visual analytics Data Visualization
Natural language processing (NLP) Software for analyzing the text-
frequencies, meanings, etc.

49
Stage 2: Analytics

1
• Text Analytics

2
• Audio Analytics

3
• Video Analytics

4
• Social Media Analytics

5
• Predictive Analytics
Big Data Analytics Cycle

Data

Action Insights

Decision
Thank You

52

You might also like