0% found this document useful (0 votes)
3 views

Database in AWS

The document provides an overview of various database services offered by AWS, including relational databases like Amazon RDS and Aurora, NoSQL databases such as DynamoDB and DocumentDB, and analytics services like Amazon Redshift and Athena. It highlights the benefits of these services, including scalability, performance, and ease of management. Additionally, it covers specialized services like Amazon ElastiCache, AWS Glue, and Amazon Managed Blockchain for specific use cases.

Uploaded by

tapas202420
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Database in AWS

The document provides an overview of various database services offered by AWS, including relational databases like Amazon RDS and Aurora, NoSQL databases such as DynamoDB and DocumentDB, and analytics services like Amazon Redshift and Athena. It highlights the benefits of these services, including scalability, performance, and ease of management. Additionally, it covers specialized services like Amazon ElastiCache, AWS Glue, and Amazon Managed Blockchain for specific use cases.

Uploaded by

tapas202420
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

Databases in AWS

• Introduction to Database
• Amazon Relational Database Service
• Amazon Elasticache
Agenda • Amazon Aurora
• Amazon DynamoDB
• Amazon EMR
• Amazon Redshift
• Amazon Neptune
• Amazon DocumentDB
• Summary - Databases & Analytics
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Introduction to Database
• Storing data on disk such as EFS, EBS, EC2 Instance Store, S3.
• Helps you to structure the data and store it into a database.
• Helps you to build indexes to efficiently query and search through the data
from database.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Relational Databases
• A table-based database which have links between them
• Use the SQL language to perform queries / lookups.
• Relational databases are designed for transactional and online transaction
processing (OLTP) applications.
AWS Services on Relational Databases:
• Amazon Relational Database Service (RDS)
• Amazon Aurora

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon Relational Database Service
• RDS stands for Relational Database Service
• A managed DB service and you can use SQL as a query language.
• Allows you to create databases in the cloud, managed by AWS:
• Postgres
• MySQL
• MariaDB
• Oracle
• Microsoft SQL Server
• Aurora (AWS Proprietary database)

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Benefits of RDS over EC2
• An AWS managed service
• Handles automated provisioning and OS patching.
• Provides continuous backups and restore to specific timestamp (Point in
Time Restore)
• Monitoring dashboards
• Maintenance windows for upgrades
• Provides vertical and horizontal scalability.
• Provides read replicas for improved read performance
• Provides multi availability zone (AZ) setup for Disaster Recovery.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
RDS Deployments : Read Replicas, Multi AZ
Read Replicas:
• Scale the read workload of your DB
• Can create up to 5 Read Replicas
• Data is only written to the main DB
Multi-AZ:
• Failover in case of AZ outage (high availability)
• Data is only read/written to the main database
• Can only have 1 other AZ as failover
RDS Deployments:Multi-Region (Read Replicas)
• Disaster recovery in case of region issue
• Local performance for global reads
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon ElastiCache
• A fully managed in-memory caching service.
• Used for caching, which accelerates application and database performance
or as a primary data store.
• Compatible with Redis and Memcached.
• Reduce load off databases for read intensive workloads.
• AWS takes care of OS maintenance/patching, optimizations, setup,
configuration, monitoring, failure recovery and backups.
Use Case:
• Session stores
• Gaming leaderboards
• Streaming and analytics.
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon Aurora
• A fully managed relational database engine from AWS.
• Compatible with MySQL and PostgreSQL.
• Aurora is AWS cloud optimized and provides 5x performance over MySQL
on RDS, 3x performance of Postgres on RDS.
• Aurora storage automatically grows in increments of 10GB and can go up
to 64 TB.
• Aurora costs more than RDS.
• Amazon CloudWatch metrics lets you monitor the health and
performance of your Aurora DB cluster.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
NoSQL databases
• A Non-relational databases
• Supports document, key-value, graph, or wide-column stores.
• Built for specific data models and have flexible schemas for building
modern applications.
Benefits:
• Flexibility: Store unstructured, semi-structured, or structured data.
• Scalability: designed to scale-out by handling large volumes of data at high
speed with a scale-out architecture.
• High-performance: optimized for a specific data model

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
NoSQL Databases Offered by AWS
• JSON is a common form of data that fits into a NoSQL model
• Data can be nested and fields can be changed over time.
AWS NoSQL Databases:
• Amazon DynamoDB
• Amazon DocumentDB
• Amazon Keyspaces (for Apache Cassandra)
• Amazon Neptune
• Amazon Timestream
• Amazon Quantum Ledger Database (QLDB)

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon DynamoDB
• A fully managed serverless NoSQL database service that provides fast and
predictable performance with seamless scalability with single-digit
millisecond latency.
• A highly available with replication across 3 AZ
• Handle millions of requests per seconds, trillions of rows, 100s of TB of
storage
• Integrated with IAM for security, authorization and administration
• Allows you to delete expired items from tables automatically to help you
reduce storage usage and cost of storing data that is no longer needed by
using DynamoDB Time to Live(TTL).
• Low cost and auto scaling capabilities
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
DynamoDB Accelerator – DAX
• A fully managed in-memory cache for DynamoDB.
• Provides 10x performance improvement (i.e.)single-digit millisecond latency
to microseconds latency when accessing your DynamoDB tables.
• Secure, highly scalable & highly available
• DynamoDB Accelerator (DAX) delivers fast response times for accessing
eventually consistent data.
Use Case:
• Applications that require repeated reads against a large set of data.
• Real-time bidding, social gaming, and trading applications.
• DAX delivers fast, in-memory read performance.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon EMR (Elastic MapReduce)
• Helps to create Hadoop clusters to analyze and process big data.
• EMR clusters can be made of hundreds of EC2 instances
• Supports Apache Spark, HBase, Presto, Flink
• Takes care of all the provisioning and configuration Auto-scaling and
integrated with Spot instances
• EMR integrates with Amazon CloudWatch to monitor cluster performance
and configure alarms
• EMR integrates with AWS Data Pipeline to schedule and start your clusters
Use cases: Log analysis, web indexing, data warehousing, machine learning,
financial analysis, scientific simulation.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon Redshift
• Used for Analytics and Data Warehousing.
• Based on PostgreSQL, but it’s not used for OLTP
• Works well for OLAP – online analytical processing.
Amazon Redshift Features:
• Provides 10x better performance than other data warehouses, scale to PBs
of data.
• Load data once every hour, not every second
• Columnar storage of data
• Massively Parallel Query Execution (MPP), highly available
• Pay as you go based on the instances provisioned
• Supports a SQL interface for performing the queries.
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon Athena
• An interactive query service that makes it easy to analyze data directly in
Amazon S3 using standard SQL.
• Uses standard SQL language to query the files.
• Supports CSV, JSON, columnar data formats such as Apache Parquet and
Apache ORC.
Pricing:
• $5.00 per TB of data scanned
• Use compressed or columnar data for cost-savings (less scan)
• Use cases: Business intelligence, analytics, reporting, analyze & query VPC
Flow Logs, ELB Logs, CloudTrail trails.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
AWS Glue
• A managed extract, transform, and load (ETL) service.
• Fully serverless service so there is no infrastructure to manage.
• Useful to prepare and transform data for analytics
• Designed to work with semi-structured data.
• Glue Data Catalog: A central repository of datasets.
• Used by Athena, Redshift, EMR.
Use Case:
• Used to organize, cleanse, validate, and format data for storage in a data
warehouse or data lake.
• To create event-driven ETL pipelines with AWS Glue.
• To understand your data assets.
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon QuickSight
• A serverless cloud-scale business intelligence service to create
interactive dashboards.
• A fast, automatically scalable with pay per-session pricing
• In a single data dashboard, QuickSight can include AWS data, third-
party data, big data, spreadsheet data, SaaS data, B2B data.
Use cases:
• Integrated with Amazon RDS, Amazon Aurora, Amazon Athena,
Amazon Redshift, Amazon S3.
• Business analytics.
• Building visualizations.
• Get business insights using data
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon Neptune
• A fully managed graph database.
• Highly available, with read replicas, point-in-time recovery,
continuous backup to Amazon S3.
• Provides data security features, with support for encryption at rest and in
transit.
• Highly available across 3 AZ, with up to 15 read replicas
• Build and run applications working with highly connected
• Optimized for these complex and hard queries
• Can store up to billions of relations and query the graph with
milliseconds latency

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon Managed Blockchain
• Blockchain makes it possible to build applications where multiple parties
can execute transactions without the need for a trusted, central authority.
• A fully managed service
• Makes it easy to create and manage scalable blockchain networks using the
Ethereum or Hyperledger Fabric open-source frameworks.
• Blockchain technology can be beneficial for several other use cases,
including supply chain management, asset transfer and tokenization,
customer loyalty programs, and digital identity.

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Amazon QLDB (Quantum Ledger Database)
• A fully managed ledger database.
• Ledgers are typically used to record a history of economic and financial
activity in an organization.
• Fully Managed, Serverless, High available, Replication across 3 AZ.
• Used to review history of all the changes made to your application data
over time.
• Immutable system: Keeps a full record of all changes to your data that
cannot be deleted, modified, or overwritten.
• 2-3 timesbetter performance than common ledger blockchain frameworks,
manipulate data using SQL.
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
Amazon DocumentDB
• A reliable and fully managed database service and is compatible with
MongoDB
• Easy to set up, operate and scale MongoDB-compatible databases in the
cloud.
• Automatically grows the size of your storage volume as your database
storage needs grow.
• Amazon DocumentDB storage volume grows in increments of 10 GB, up to a
maximum of 64 TB.
• No need to provision any excess storage for your cluster to handle future
growth.
Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights
Reserved.
AWS Database Migration Service
• A service that makes it easy to migrate relational databases, data
warehouses, NoSQL databases, and other types of data stores.
• Used to migrate your data into the AWS Cloud or from on-premises.
Types of Migrations
• Homogeneous migrations
• Eg: MySQL to MySQL
• Heterogeneous migrations
• Eg: SQL Server to Aurora

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Summary – Databases and Analytics
• RDS & Aurora (SQL): Relational Databases - OLTP
• Amazon ElastiCache: In-memory Database.
• Key/Value Database: DynamoDB & DAX (cache for DynamoDB).
• Amazon Redshift: Data Warehouse - OLAP (SQL).
• Amazon EMR: Hadoop Cluster.
• Amazon Athena: query data on Amazon S3 (serverless & SQL).
• Amazon QuickSight: dashboards on your data (serverless)

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.
Summary – Databases and Analytics Contd…
• Amazon DocumentDB: Compatible with MongoDB(NoSQL database)
• Amazon QLDB: Financial Transactions Ledger
• Amazon managed Blockchain & managed Hyperledger Fabric &
Ethereum blockchains.
• AWS Glue: Managed ETL and Data Catalog service
• AWS DMS: Database Migration
• Neptune: Graph database

Copyright Dot Net Tricks Innovation Pvt. Ltd. | All rights


Reserved.

You might also like