AWS Summary
AWS Summary
● Amazon Redshift: A fully managed data warehousing service for large-scale data
analytics. Optimized for OLAP workloads and complex queries on large datasets.
○ Uses MPP (Massively Parallel Processing) for fast query execution.
○ Integrates with S3, Kinesis, and AWS Glue for data pipelines.
● Amazon Athena: An interactive query service to analyze data in S3 using SQL.
Serverless and pay-per-query, making it cost-effective for ad-hoc analysis.
○ Supports various data formats like CSV, JSON, Parquet, and ORC.
○ No infrastructure management needed; scales automatically with demand.
● AWS Glue: A fully managed ETL service that automates data preparation for analytics.
Connects to diverse data sources and supports visual or code-based transformations.
○ Automatically discovers schemas and supports various data formats.
○ Serverless and integrates with other AWS services like S3 and Redshift.
● AWS Lake Formation: Simplifies building and managing data lakes on Amazon S3.
Provides centralized data cataloging, fine-grained access control, and security.
○ Supports structured, semi-structured, and unstructured data.
○ Integrates with analytics and ML services like Redshift, Athena, and SageMaker.
● Amazon Comprehend: An NLP service for finding insights and relationships in text.
Performs entity recognition, sentiment analysis, key phrase extraction, and topic
modeling.
○ Can be used to analyze customer feedback, social media posts, and documents.
○ Offers custom classification to categorize text according to specific needs.
● Amazon Textract: Extracts text, forms, and tables from scanned documents and
images. Goes beyond OCR to understand document structure and relationships.
○ Automates data entry from invoices, receipts, and forms.
○ Can be used for document archiving, search, and analysis.
● Amazon Translate: Provides high-quality language translation using neural machine
translation. Supports over 70 languages and allows for custom terminology.
○ Useful for translating websites, documents, and applications.
○ Can be integrated with other AWS services like Comprehend and Transcribe.
● Amazon Polly: Converts text into lifelike speech using deep learning. Offers a variety of
voices, languages, and supports SSML for fine-grained control.
○ Used in applications like chatbots, e-learning platforms, and accessibility tools.
○ Can create custom voice models for unique brand identities.
● Amazon Lex: Builds conversational interfaces (chatbots) using NLU and ASR. Supports
text and voice interactions, dialog management, and integration with Lambda.
○ Can be used to automate customer service, provide information, or collect data.
○ Offers pre-built bots and integrates with AWS Connect for contact center
solutions.
● Amazon Rekognition: Analyzes images and videos for object detection, facial
recognition, text extraction, and content moderation.
○ Can be used for security, image search, and media analysis.
○ Offers custom labels to train models for specific objects or scenes.
● AWS DeepLens: A deep learning-enabled video camera for edge computing. Runs ML
models locally for real-time image and video analysis.
○ Ideal for applications like surveillance, retail analytics, and industrial automation.
○ Integrates with AWS services like Lambda and IoT for broader edge computing
solutions.
● AWS Panorama: An ML appliance for running computer vision models at the edge.
Enables real-time analysis on-premises for applications like retail and manufacturing.
○ Reduces latency and allows for processing in environments with limited
connectivity.
○ Supports pre-built and custom models, and integrates with AWS IoT and
Lambda.
● AWS Bedrock: A service that provides access to foundation models (FMs) from
Amazon and leading AI startups.
○ Allows users to build and scale generative AI applications.
○ Provides FMs for text and images, with different strengths and capabilities.
● AWS CodeWhisperer: An AI coding companion that generates code suggestions in
real-time. Helps developers increase productivity and focus on higher-level tasks.
○ Trained on billions of lines of code and supports multiple programming
languages.
○ Can be used within popular IDEs like VS Code, IntelliJ IDEA, and AWS Cloud9.
● Amazon SageMaker: A fully managed service for building, training, and deploying
machine learning models. Provides tools for the entire ML workflow.
○ Offers a variety of built-in algorithms and supports popular frameworks.
○ Enables model deployment to various platforms and provides monitoring
capabilities.
● AWS Deep Learning AMIs: Pre-configured Amazon Machine Images (AMIs) optimized
for deep learning. Provides a quick and easy way to get started with deep learning in the
cloud.
○ Includes popular deep learning frameworks and tools pre-installed.
○ Available for various instance types and operating systems.
● AWS Deep Learning Containers: Docker containers pre-installed with deep learning
frameworks and libraries. Offers flexibility and portability for deep learning applications.
○ Can be used on-premises, in the cloud, or at the edge.
○ Simplifies deployment and dependency management for deep learning projects.
● Amazon Forecast: A fully managed service for time series forecasting. Uses machine
learning to generate accurate forecasts for various business metrics.
○ Can be used to forecast demand, sales, web traffic, and other time-dependent
data.
○ Requires no ML expertise and automates model selection and training.
● AWS Inferentia: A custom-designed machine learning inference chip. Provides high
performance and cost-effective inference for deep learning models.
○ Optimized for running large-scale deep learning models in production.
○ Available in EC2 Inf1 instances and supports popular frameworks.
Analytics Services
● S3 (Simple Storage Service): Object storage for data of all types. Highly scalable,
durable, and cost-effective.
○ Used for data lakes, backups, archives, and content distribution.
○ Integrates with other AWS services for analytics, machine learning, and more.
● EBS (Elastic Block Store): Block storage for EC2 instances. Provides persistent
storage for applications and databases.
○ Offers different volume types for various performance needs.
○ Can be used as boot volumes or for additional storage.
● EFS (Elastic File System): A fully managed, scalable file system for EC2 instances.
Provides shared file storage for applications.
○ Supports NFS protocol and can be accessed by multiple instances concurrently.
○ Suitable for content management, web serving, and media workflows.
● DynamoDB: A NoSQL database service for high-performance applications. Offers low
latency, scalability, and flexibility.
○ Handles key-value and document data models.
○ Used for mobile gaming, ad tech, and IoT applications.
● DocumentDB: A scalable, highly available document database service. Compatible with
MongoDB and offers global replication.
○ Suitable for content management, catalogs, and user profiles.
○ Provides ACID transactions and supports MongoDB drivers and tools.
● MemoryDB: A durable, in-memory database service. Provides ultra-fast performance for
real-time applications.
○ Compatible with Redis and offers microsecond latency.
○ Used for caching, session management, and real-time analytics.