VC Unit IV
VC Unit IV
Computing
Unit IV
Cloud Service Providers: Introduction - EMC -
Google - Sales force - Amazon Web Services: S3 -
EBS - EC2 - Dynamo DB - Microsoft - IBM
Cloud Service Providers
• Cloud computing is one of the most popular
buzzwords used these days.
• It is the upcoming technology provisioning
resources to the consumers in the form of
different services like software, infrastructure,
platform and security.
EMC
• EMC is one of the leading global enterprises
that require dynamic scalability and
infrastructure agility to meet changing
applications as well as business needs.
• EMC chose cloud computing as the ideal
solution to reduce the complexity and
optimize the infrastructure.
EMC IT
• Virtualization is the main concept behind the
success of EMC IT.
• By virtualizing the infrastructure, allocation of
the resources on demand is possible.
• EMC IT provides its business process units
with IaaS, PaaS, and SaaS.
Services
• IaaS offers EMC business units the ability to provision
– infrastructure components such as
• network, storage, computing, and operating
systems individually or as integrated services .
• PaaS provides the secure application and
information frameworks on top of
– application server, web server, database, unstructured
content management and security components as a
service to business units from which to develop solutions.
Continued..
• SaaS provides applications and tools in a service model
for business enablement.
• EMC IT brought together several existing business
solutions under the unified architecture named as
Business Intelligence as a Service.
• It also offers Enterprise Resource Planning (ERP) and
Customer Relationship Management (CRM) as a Service.
• User Interface as a Service (UIaaS) provisions user and
interface experience, rather than provisioning the actual
device used.
Captiva Cloud Toolkit
• EMC offers a tool called Captiva Cloud Toolkit
to help in the development of softwares.
• EMC Captiva Cloud Toolkit is a Software
Development Kit (SDK) comprised of modules
that help web application developers to
quickly add scanning and imaging
functionality directly to their web-based
business applications.
Continued..
• Using Captiva Cloud Toolkit, developers can
quickly create a working scan-enabled web-
based business application in as early as 1
week.
• As a result, time to market is shortened and
development, testing and support costs are
greatly reduced.
A few of these modules are as follows
• Scan:
– Scanning is importing activity of documents into Captiva from a scanner.
– Basically, scanning happens at page level to bring images page by page
into Captiva.
– Scanning is the entry point to Captiva where one can import any kind of
document like pdf, tiff, and jpg.
• MDW:
– Multi Directory Watch is another entry point to Captiva.
– MDW can be pointed to any folder/repository from where Captiva
could import documents directly.
– MDW is very useful if business is getting documents in the form of a soft
copy, for example, as an attached file in an e-mail.
– MDW also acts as a scan module except it does not interlock with the
scanner.
A few of these modules are as follows
• IE:
– Image enhancement is a kind of filter or repairing tool for
images that are not clear.
– It enhances the image quality so it could be processed easily
through Captiva.
– The functionalities of IE are deskew, noise removal, etc.
• Index:
– Indexing is a data capturing activity in Captiva through which
one can capture key data from various fields.
– For example, if bank form is being processed, the A/C no. and
sort code could be the indexing field.
– Indexing could be added as per requirement of business.
A few of these modules are as follows
• Export:
– Export is the exit point of Captiva where images/data are sent to various
repositories like file, net, document, or data.
– The exported data are used for business requirements of various
business divisions.
– For example, if we are capturing the A/C no. and sort code for a bank
application, this could be mapped to any department where it is
needed.
• Multi:
– Multi is the last process in Captiva to delete batches that have gone
through all modules and exported value successfully.
– In the case when it is required to take a backup of batches, this module
could be avoided.
Google
• Google is one among the leading cloud providers that offer
secure storage of user’s data.
• It provides
– cloud platform,
– app engine,
– cloud print,
– cloud connect and many more features that are scalable, reliable,
as well as secure.
• Cloud Platform
– Google Cloud Platform enables developers to build, test, and
deploy applications on Google’s highly scalable and reliable
infrastructure.
– Google has one of the largest and most advanced networks across
the globe.
Cloud Platform
• Google Cloud Platform includes virtual
machines, block storage, NoSQL datastore and
big data analytics.
• It provides a range of storage services that
allow easy maintenance and quick access of
user’s data.
• Applications hosted on the cloud platform can
automatically scale up to handle the most
demanding workloads and scale down when
traffic subsides.
Cloud Storage
• Google Cloud Storage is a RESTful online file
storage web service for storing and accessing
one’s data on Google’s infrastructure.
• Representational state transfer (REST) is an
architectural style consisting of a coordinated
set of architectural constraints applied to
components, connectors, and data elements
within a distributed system.
Continued..
• The following are the few tools for Google
Cloud Storage:
• Google Developers Console
– web application where one can perform simple
storage management tasks on the Google Cloud
Storage system.
• gsutil
– is a Python application that lets the user access
Google Cloud Storage from the command line.
Google Cloud Print
• Google Cloud Print
– service that extends the printer’s function to any
device that can connect to the Internet.
– To use Google Cloud Print, the user needs to have
a free Google profile, an app, a program, or a
website that incorporates the Google Cloud Print
feature, a cloud-ready printer or printer connected
to a computer logged on to the Internet.
Google Cloud Connect
• Google Cloud Connect is a feature provided by Google Cloud by
integrating cloud and the application programming interface
(API) for Microsoft Office.
• After installing a plug-in for the Microsoft Office suite of
programs, one can save files to the cloud.
• Google Cloud Connect assigns each file a unique URL that can be
shared to let others view the document.
• The back end is similar to the Google File System and relies on
the Google Docs infrastructure.
• As the documents sync to the master file, Google Cloud Connect
sends the updated data out to all downloaded copies of the
document using the metadata to guide updates to the right files.
• When the user uploads a document to Google Cloud Connect,
the service inserts some metadata into the file.
Google Cloud Print
• To use Google Cloud Print, the user needs to have a
free Google profile, an app, a program, or a website
that incorporates the Google Cloud Print feature, a
cloud-ready printer or printer connected to a computer
logged on to the Internet
• When Google Cloud Print is used through an app or
website, the print request goes through the Google
servers.
• Google Cloud Print is an extension built into the Google
Chrome Browser, but it should be enabled explicitly.
Google App Engine
• Google App Engine lets the user run web
applications on Google’s infrastructure.
• App Engine applications are easy to build, easy
to maintain, and easy to scale as traffic and
data storage needs grow.
• Google App Engine supports apps written in
several programming languages.
Google App Engine
• User has to only pay for what he or she uses.
• There are no setup costs and no recurring fees.
• The resources used by the application such as storage and
bandwidth are measured in gigabyte and billed at competitive rates.
• One has to control the maximum amount of resources one’s app can
consume, so it always stays within one’s budget.
• App Engine costs nothing to get started.
• All applications can use up to 1 GB of storage and enough CPU and
bandwidth to support an efficient app serving around five million
page views a month, absolutely free.
• When billing is enabled for the application, free limits are raised,
and one has to only pay for resources one uses above the free
levels.
Amazon Web Services
• Amazon Web Services (AWS) is a collection of
remote computing services that together make up a
cloud computing platform, offered over the Internet
by Amazon.com.
• The most central and well known of these services
are Amazon Elastic Compute Cloud (Amazon EC2),
Amazon Simple Queue Service (Amazon SQS), and
Amazon S3.
• Amazon EC2 is a computing service, whereas
Amazon SQS and Amazon S3 are support services.
Amazon Elastic Compute Cloud
• Amazon EC2 is an IaaS offered by AWS and is the
leading provider of IaaS in the current market.
• Powered by a huge infrastructure that the
company has built to run its retail business,
Amazon EC2 provides a true virtual computing
environment.
• The user can change the capacity and
characteristics of the virtual machine by using
the web service interfaces, hence named elastic.
Amazon Elastic Compute Cloud
• Computing capacity is provided in the form of virtual
machines or server instances by booting Amazon Machine
Images (AMI), which can be instantiated by the user.
• An AMI contains all the necessary information needed to
create an instance.
• The primary Graphical User Interface (GUI) interface is the
AWS Management Console (point and click) and a web
service API that supports both Simple Object Access
Protocol and Query Requests.
• The API provides programming libraries and resources for
Java, PHP, Python, Ruby, Windows, and .Net.
Amazon Elastic Compute Cloud
The infrastructure is virtualized by using Xen hypervisor, and
different instance types are provided as follows:
• Standard instances—suitable for most applications
• Micro instances—suitable for low-throughput applications
• High-memory instances—suitable for high-throughput
applications
• High-CPU instances—suitable for compute-intensive
applications
• Cluster compute instances—suitable for high-
performance computing (HPC) applications
Amazon Simple Storage Service
• Amazon Simple Storage Service known as Amazon S3, is
the storage for the Internet.
• It is designed to make web-scale computing easier for
developers.
• Amazon S3 provides a simple web service interface that
can be used to store and retrieve any amount of data, at
any time, from anywhere on the web.
• It gives any developer access to the same highly scalable,
reliable, secure, fast, inexpensive infrastructure that
Amazon uses to run its own global network of websites.
Microsoft
• Cloud computing provides a new way of looking
at IT at Microsoft called Microsoft IT (MSIT).
• Cloud computing is now the preferred and
default environment for new and migrated
applications at Microsoft.
• MSIT has captured these best practices and
documented them for other Microsoft
customers who wish to migrate their
organizations to cloud computing.
Windows Azure
• Windows Azure Cloud Services (web and worker
roles/PaaS) allow developers to easily deploy and manage
application services.
• The Migration Assessment Tool (MAT) for Windows Azure
encapsulates all the information to be aware of before
attempting the application migration to Windows Azure.
• Based on the response to a series of simple binary
questions, the tool generates a report that outlines the
amount of development effort involved to migrate the
application, or the architecture considerations for a new
application.
Windows Azure
• The Windows Azure Pricing Calculator
analyzes an application’s potential public
cloud requirements against the cost of the
application’s existing infrastructure.
• This tool can help to compare current
operational costs for an application, against
what the operating costs would be on
Windows Azure and SQL Azure.
Continued..
• Windows Azure Pack for Windows Server is a
collection of Windows Azure technologies
available to Microsoft customers at no
additional cost for installation into their data
center.
Microsoft Assessment and Planning Toolkit