Unite Real-Time and Batch Analytics With AWS Glue
Melody Yang
Data Specialist Solutions Architect, AWS
• Demo
Data pipeline stages: Generate → Collect → Store → Extract/Transform/Load → Analyze → Visualize/Report

Generate (data sources):
• IoT device
• Web log
• Social media
• Transaction
• ERP
• Polling application
Collect:
• Amazon Kinesis
• Amazon Managed Streaming for Apache Kafka (Amazon MSK)
• AWS DMS
• Amazon S3
• AWS DataSync
• AWS Snowball
Store:
• Amazon S3
• Amazon RDS
• Database on EC2
Analyze:
• Amazon Athena
• Amazon EMR
• Amazon Kinesis Analytics
Visualize/Report (consumers):
• Data scientists
• Data analysts
• Business users
• Engagement platforms
• Automation/events
Extract/Transform/Load:
• Amazon EMR
• AWS Lambda
• Amazon Kinesis Client Library (KCL)
• Layer labels on the slide: ( EMR ), Serving Layer, Speed Layer (Lambda & Kinesis)
Extract/Transform/Load:
• AWS Glue
You can create streaming extract, transform, and load (ETL) jobs that run continuously and consume data from streaming sources such as:
• Amazon Kinesis
• Amazon MSK
• Apache Kafka

Steps shown on the slide:
1. Create an Apache Kafka connection (AWS Glue Connection)
2. Create a catalog table (AWS Glue Data Catalog)
3. Create an ETL streaming job (AWS Glue Streaming)
5. Output to Amazon S3 or a JDBC data store
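
The job-creation step can be sketched in Python as the request that would be passed to the Glue `create_job` API through boto3. This is a minimal sketch, not the presenter's code: the function name `build_streaming_job_request`, the job name, the role ARN, the script path, and the Glue version are all hypothetical placeholders. The one detail taken from the Glue API itself is the command name `gluestreaming`, which marks the job as a streaming ETL job.

```python
import json


def build_streaming_job_request(job_name, role_arn, script_s3_path, connection_name=None):
    """Build keyword arguments for boto3's Glue `create_job` call.

    This helper is illustrative only; every value passed in is a placeholder.
    """
    request = {
        "Name": job_name,
        "Role": role_arn,
        # "gluestreaming" selects a streaming ETL job
        # (batch Spark jobs use "glueetl" instead).
        "Command": {
            "Name": "gluestreaming",
            "ScriptLocation": script_s3_path,
            "PythonVersion": "3",
        },
        "GlueVersion": "4.0",  # assumed version, adjust as needed
        "DefaultArguments": {"--job-language": "python"},
    }
    if connection_name:
        # Attach the Glue connection created for a Kafka or Amazon MSK source.
        request["Connections"] = {"Connections": [connection_name]}
    return request


request = build_streaming_job_request(
    job_name="iot-log-streaming-etl",  # hypothetical
    role_arn="arn:aws:iam::123456789012:role/ExampleGlueRole",  # hypothetical
    script_s3_path="s3://example-bucket/scripts/streaming_job.py",  # hypothetical
)
print(json.dumps(request, indent=2))

# To submit the request (requires boto3 and AWS credentials):
#   import boto3
#   boto3.client("glue").create_job(**request)
```

Building the request as plain data keeps the sketch runnable without AWS access; the commented lines show where the real API call would go.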
• Speed of implementation
Demo architecture (components):
• IoT log stream → Amazon Kinesis Data Stream → AWS Glue Streaming ETL Job
• Upload → Glue ETL Job
• Data Catalog
• Amazon Athena
• Amazon QuickSight
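
To make the streaming half of the demo concrete: a streaming ETL job of this kind typically handles incoming records in small time windows (micro-batches) and applies the same sort of transform a batch job would. The sketch below imitates that idea in plain Python. It does not use the AWS Glue API; the function names, the 100-second window, and the IoT readings are all made up for illustration.

```python
from collections import defaultdict


def micro_batches(records, window_seconds):
    """Group (epoch_seconds, payload) records into fixed, non-overlapping windows.

    Returns a dict mapping each window's start time to its payloads,
    ordered by window start.
    """
    batches = defaultdict(list)
    for timestamp, payload in records:
        window_start = timestamp - (timestamp % window_seconds)
        batches[window_start].append(payload)
    return dict(sorted(batches.items()))


def average_temperature(payloads):
    """A stand-in transform: average one numeric field across a batch."""
    return sum(p["temperature"] for p in payloads) / len(payloads)


# Made-up IoT log records: (event time in seconds, reading)
records = [
    (0, {"device": "a", "temperature": 20.0}),
    (30, {"device": "b", "temperature": 22.0}),
    (100, {"device": "a", "temperature": 21.0}),
    (130, {"device": "b", "temperature": 25.0}),
]

batches = micro_batches(records, window_seconds=100)
for window_start, payloads in batches.items():
    print(window_start, average_temperature(payloads))
```

The same `average_temperature` function could be run over a full day's uploaded file in one pass, which is the sense in which one transform can serve both the real-time and the batch path.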
https://aws.amazon.com/training/
twitter.com/AWSCloud
facebook.com/AmazonWebServices
youtube.com/user/AmazonWebServices
slideshare.net/AmazonWebServices
twitch.tv/aws