Data Engineering 101 Redshift
Data Engineering 101 Redshift
Engineering 101
Amazon Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Cluster
Creating a cluster:
aws redshift create-cluster --cluster-identifier my-
cluster --node-type dc2.large --master-username
admin --master-user-password Password123 --
number-of-nodes 2
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Node Types
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Leader Node
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Compute Node
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Columnar Storage
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Sort Keys
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Distribution Keys
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Compression
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Vacuum
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Analyze
ANALYZE sales;
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Materialized Views
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Snapshots
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Concurrency Scaling
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Elastic Resize
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Redshift Spectrum
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
External Tables
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
WLM (Workload
Management)
WLM allows you to define queues that allocate
resources based on query priority, enabling
better management of multiple workloads.
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
RA3 Instances
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Automatic Table
Optimization
Redshift automatically chooses the best sort
and distribution keys for tables based on usage
patterns, optimizing query performance.
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Stored Procedures
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Data Sharing
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Column-Level Encryption
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Data API
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
UNLOAD Command
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
COPY Command
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Concurrency Scaling
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Redshift ML
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Partitioning in Spectrum
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Concurrency Limits
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Amazon S3 Integration
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Security Groups
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Audit Logging
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Automated Snapshots
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Event Notifications
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Reserved Instances
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Elastic IP Address
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Cluster Resizing
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Enhanced Logging
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Encryption at Rest
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Query Caching
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Manual Snapshots
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Federated Authentication
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Cluster Maintenance
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Database Auditing
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Cross-Region Snapshots
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Performance Insights
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Cluster Security
Configuration
Redshift clusters can be configured with
security features such as SSL encryption, VPC
security groups, and cluster parameter groups
to ensure secure access and operation.
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Query Optimizer
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Shwetank Singh
GritSetGrow - GSGLearn.com
Data Engineering 101: Redshift
Lambda Integration
Shwetank Singh
GritSetGrow - GSGLearn.com