Final Internship Report Content
in
Submitted by
Professor
April-2024
Table of Contents
CHAPTER 1 AWS CLOUD FOUNDATIONS
1.1 INTRODUCTION TO THE COURSE
1.2 TOPICS COVERED IN THE COURSE
1.3 CASE STUDIES FROM THIS COURSE
1.3.1 CASE STUDY A: EXPLORING AMAZON EC2 WITH HANDS-ON LABS
1.3.2 CASE STUDY B: LEVERAGING AMAZON RDS FOR RELATIONAL DATABASE MANAGEMENT
CHAPTER 2 AWS MACHINE LEARNING FOUNDATIONS
2.1 INTRODUCTION TO THE COURSE
2.2 TOPICS COVERED IN THE COURSE
2.3 CASE STUDIES FROM THIS COURSE
2.3.1 CASE STUDY C: FACIAL RECOGNITION WITH AMAZON REKOGNITION
2.3.2 CASE STUDY D: NATURAL LANGUAGE PROCESSING WITH AMAZON LEX
CHAPTER 3 AWS DATA ENGINEERING
3.1 INTRODUCTION TO THE COURSE
3.2 TOPICS COVERED IN THE COURSE
3.3 CASE STUDIES FROM THIS COURSE
3.3.1 CASE STUDY E: OPTIMIZING DATA ANALYTICS WORKFLOW WITH AMAZON S3
3.3.2 CASE STUDY F: STREAMLINING E-COMMERCE DATA ANALYTICS WITH AWS GLUE
CERTIFICATES
CHAPTER 1
AWS CLOUD FOUNDATIONS
1.1 INTRODUCTION TO THE COURSE
AWS Academy Cloud Foundations is intended for students who seek an overall
understanding of cloud computing concepts, independent of specific technical roles. It
provides a detailed overview of cloud concepts, AWS core services, security, architecture,
pricing, and support.
The course covers a wide range of topics, including cloud computing concepts, AWS core
services, security, architecture, pricing, and billing. Students will learn about essential AWS
services such as Amazon EC2, Amazon S3, Amazon RDS, and Amazon VPC, among others.
Hands-on labs and exercises are often included to give students practical experience working
with AWS services and tools.
One of the key benefits of the AWS Academy Cloud Foundations course is that it equips
students with the knowledge and skills needed to pursue careers in cloud computing and
AWS. AWS certifications are highly valued in the industry, and completing this course can
help students prepare for certification exams such as the AWS Certified Cloud Practitioner.
Furthermore, the course provides a solid foundation for students who wish to pursue more
advanced AWS training and certifications in areas such as cloud architecture, DevOps,
security, and machine learning.
Overall, the AWS Academy Cloud Foundations course offers a comprehensive introduction
to cloud computing and AWS services, preparing students for careers in the rapidly growing
field of cloud technology. By gaining proficiency in AWS, students can enhance their job
prospects and contribute to the success of organizations that use cloud computing for
innovation and growth.
1.2 TOPICS COVERED IN THE COURSE
1. Introduction: An introductory overview of the course objectives, structure, and the
importance of cloud computing and AWS services in today's technology landscape.
2. Cloud Concepts Overview: Covers fundamental cloud computing concepts such as
elasticity, scalability, on-demand resource provisioning, and shared responsibility
model.
3. Cloud Economics and Billing: Explores the economic aspects of cloud computing,
including pay-as-you-go pricing models, cost optimization strategies, and
understanding AWS billing.
4. AWS Global Infrastructure Overview: Provides an overview of AWS's global
infrastructure, including regions, availability zones, edge locations, and the benefits of
geographic redundancy.
5. AWS Cloud Security: Focuses on security best practices in AWS, covering topics such
as AWS Identity and Access Management (IAM), encryption, compliance, and
implementing security controls.
6. Networking and Content Delivery: Covers networking fundamentals in AWS,
including virtual private cloud (VPC) setup, subnets, route tables, and content
delivery using services like Amazon CloudFront.
7. Compute: Introduces AWS compute services such as Amazon EC2 for virtual servers,
Amazon ECS for container management, and AWS Lambda for serverless computing.
8. Storage: Discusses the storage options available in AWS, including Amazon S3
for object storage, Amazon EBS for block storage, and Amazon S3 Glacier for archival
storage.
9. Databases: Explores AWS database services such as Amazon RDS for relational
databases, Amazon DynamoDB for NoSQL databases, and Amazon Redshift for data
warehousing.
10. Cloud Architecture: Covers architectural principles for designing scalable, highly
available, and fault-tolerant applications on AWS, including best practices and design
patterns.
11. Auto Scaling and Monitoring: Introduces auto-scaling concepts for dynamically
adjusting resources based on demand, and monitoring tools like Amazon CloudWatch
for tracking performance and health metrics of AWS resources.
Together, these modules provide a comprehensive understanding of cloud computing
fundamentals and AWS services, preparing students for further exploration of, or
certification in, cloud technologies.
1.3 CASE STUDIES FROM THIS COURSE
1.3.1 CASE STUDY A: EXPLORING AMAZON EC2 WITH HANDS-ON LABS
Introduction:
In today's rapidly evolving digital landscape, cloud computing has emerged as a critical
enabler for businesses seeking scalability, flexibility, and cost-efficiency in their IT
infrastructure. Among the leading cloud service providers, Amazon Web Services (AWS)
stands out, offering a comprehensive suite of cloud services tailored to diverse organizational
needs. As businesses increasingly adopt AWS solutions, understanding key services like
Amazon Elastic Compute Cloud (Amazon EC2) becomes imperative.
Objective:
This case study aims to provide a practical exploration of Amazon EC2 through hands-on
labs, equipping participants with essential skills in launching, managing, and monitoring EC2
instances within the AWS cloud environment.
Lab Overview:
The lab consists of a series of tasks that guide participants through EC2 instance
management. Participants launch an EC2 instance with termination and stop protection
enabled, monitor instance performance using built-in tools, modify security group settings
to allow HTTP access, resize the instance type and its attached storage volume, explore
EC2 service limits, and test stop protection, which guards against accidentally stopping
the instance.
The tasks are carried out through the AWS Management Console, ensuring hands-on
interaction with EC2 services in a controlled setting.
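The lab itself is performed in the AWS Management Console. As a rough illustration only, the same launch and security group settings could be expressed as request parameters for the AWS SDK for Python (boto3). The AMI ID, security group ID, and instance type below are hypothetical placeholders, and the client is assumed to be a boto3 EC2 client created by the caller.

```python
"""Sketch: EC2 lab settings as request parameters (all IDs are placeholders)."""


def build_launch_params(ami_id, instance_type="t3.micro"):
    """Return keyword arguments for the EC2 RunInstances call.

    DisableApiTermination and DisableApiStop switch on termination
    protection and stop protection at launch time.
    """
    return {
        "ImageId": ami_id,  # hypothetical AMI ID supplied by the caller
        "InstanceType": instance_type,  # assumed instance type
        "MinCount": 1,
        "MaxCount": 1,
        "DisableApiTermination": True,
        "DisableApiStop": True,
    }


def build_http_ingress(group_id, cidr="0.0.0.0/0"):
    """Return keyword arguments that open TCP port 80 on a security group."""
    return {
        "GroupId": group_id,
        "IpPermissions": [
            {
                "IpProtocol": "tcp",
                "FromPort": 80,
                "ToPort": 80,
                "IpRanges": [{"CidrIp": cidr}],
            }
        ],
    }


def launch_web_server(ec2_client, ami_id, group_id):
    """Apply both steps using an EC2 client passed in by the caller."""
    ec2_client.authorize_security_group_ingress(**build_http_ingress(group_id))
    params = build_launch_params(ami_id)
    params["SecurityGroupIds"] = [group_id]
    response = ec2_client.run_instances(**params)
    return response["Instances"][0]["InstanceId"]
```

With a real client, `launch_web_server(boto3.client("ec2"), ami_id, group_id)` would return the new instance ID; keeping the parameter builders separate from the call makes the settings easy to inspect without touching an AWS account.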
Key Learnings:
1. Launching EC2 Instances: Participants gain proficiency in launching EC2 instances with
essential configurations, including termination and stop protection settings.
4. Instance Resizing: Participants explore the flexibility of EC2 instance types and storage
volumes, optimizing resource allocation to meet workload demands.
5. Exploring Service Limits: By examining EC2 service limits and quotas, participants
develop awareness of resource constraints and scalability considerations.
Conclusion:
The hands-on labs offer a practical and immersive learning experience, empowering
participants with essential skills in Amazon EC2 utilization. By mastering EC2 instance
management and monitoring techniques, participants are better equipped to leverage AWS
cloud services effectively, driving innovation and scalability within their organizations.
Future Directions:
Continued exploration of AWS services, including advanced EC2 features and integration
with complementary services like Amazon RDS and AWS Lambda, can further enhance
participants' cloud computing expertise. Additionally, pursuing AWS certification pathways
provides formal recognition of skills and expertise, opening doors to career advancement
opportunities in the cloud computing domain.
1.3.2 CASE STUDY B: LEVERAGING AMAZON RDS FOR
RELATIONAL DATABASE MANAGEMENT
Introduction:
Objective:
The primary objective of this case study is to provide participants with practical experience in
deploying, configuring, and interacting with an Amazon RDS DB instance. By the end of the
lab, participants will have acquired essential skills in launching a Multi-AZ RDS deployment,
configuring security groups and DB subnet groups, and integrating the database with web
applications.
Lab Overview:
The lab is structured into four tasks, each focusing on a distinct aspect of Amazon RDS
deployment and utilization:
1. Creating a Security Group: Participants create a security group to permit access from the
web server to the RDS DB instance, ensuring secure communication between components.
3. Creating an Amazon RDS DB Instance: Participants configure and launch a Multi-AZ
MySQL database instance, leveraging Amazon RDS's managed services for enhanced
availability and durability.
4. Interacting with the Database: Participants interact with a web application connected to the
RDS database, performing CRUD operations to test data persistence and replication across
Availability Zones.
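The lab configures the database through the console. As a minimal sketch, the Multi-AZ MySQL settings described in the tasks could be written as request parameters for the boto3 RDS client. The instance class, storage size, and all identifiers are assumptions chosen for illustration, not values taken from the lab.

```python
"""Sketch: Multi-AZ MySQL settings as RDS request parameters (assumed values)."""


def build_multi_az_mysql_params(identifier, subnet_group, security_group_id,
                                username, password):
    """Return keyword arguments for the RDS CreateDBInstance call."""
    return {
        "DBInstanceIdentifier": identifier,
        "Engine": "mysql",
        "DBInstanceClass": "db.t3.micro",  # assumed instance class
        "AllocatedStorage": 20,  # assumed size in GiB
        "MasterUsername": username,
        "MasterUserPassword": password,
        "MultiAZ": True,  # standby replica in a second Availability Zone
        "DBSubnetGroupName": subnet_group,
        "VpcSecurityGroupIds": [security_group_id],
    }


def create_database(rds_client, **settings):
    """Create the instance with a client supplied by the caller."""
    params = build_multi_az_mysql_params(**settings)
    response = rds_client.create_db_instance(**params)
    return response["DBInstance"]["DBInstanceStatus"]
```

The security group passed here is the one created in the first task, so only the web server is allowed to reach the database port.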
Key Learnings:
4. Testing Data Persistence and Replication: Through CRUD operations on the web
application, participants validate the functionality of data persistence and replication across
multiple Availability Zones, ensuring data integrity and availability.
Conclusion:
The hands-on labs provide participants with practical experience in leveraging Amazon RDS
for relational database management, empowering them with essential skills for deploying and
managing database instances in the cloud. By mastering Amazon RDS concepts and best
practices, participants are better equipped to architect scalable and reliable data solutions,
driving innovation and efficiency within their organizations.
Future Directions:
Continued exploration of Amazon RDS features and integration with other AWS services,
such as Amazon Aurora and AWS Lambda, can further enhance participants' database
management expertise. Additionally, pursuing AWS certification pathways in database
specialization offers formal recognition of skills and knowledge, opening avenues for career
advancement in cloud computing and database administration.
CHAPTER 2
AWS MACHINE LEARNING FOUNDATIONS
2.2 TOPICS COVERED IN THE COURSE
6. Introducing Natural Language Processing (NLP): Covers fundamentals of natural
language processing, such as sentiment analysis and language modeling.
7. Introducing Generative AI: Introduces generative AI concepts and applications,
including content generation.
8. Course Wrap-Up: Recaps key learnings, discusses practical applications, and offers
guidance on further learning.
2.3 CASE STUDIES FROM THIS COURSE
2.3.1 CASE STUDY C: FACIAL RECOGNITION WITH AMAZON REKOGNITION
Background:
A leading security firm wanted to enhance its surveillance system with facial recognition
capabilities to improve security measures. They sought to integrate Amazon Rekognition, a
powerful image analysis service, into their existing infrastructure to detect known faces in
real-time.
Objective:
The objective was to implement facial recognition using Amazon Rekognition to identify
known faces captured by surveillance cameras.
Implementation:
- The security firm created a custom collection in Amazon Rekognition to store images of
known individuals.
- This involved setting up the collection using the AWS Management Console and
configuring the necessary permissions.
- Each image was carefully labeled with metadata to facilitate accurate identification.
- Using a Jupyter notebook instance in Amazon SageMaker, the team accessed the facial
detection notebook provided by Amazon.
- They followed the instructions in the notebook to execute facial detection on images
captured by surveillance cameras.
- The system leveraged Amazon Rekognition to detect known faces within the surveillance
footage.
- Detected faces were cross-referenced with the images stored in the custom collection to
identify known individuals.
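The cross-referencing step above can be sketched with the Rekognition SearchFacesByImage operation, which compares the largest face in an image against a collection. This is an illustration under assumptions, not the notebook used in the lab: the collection name, the threshold, and the use of ExternalImageId as the label for each known person are all hypothetical choices.

```python
"""Sketch: matching a captured face against a Rekognition collection."""


def build_search_params(collection_id, image_bytes, threshold=90.0):
    """Return keyword arguments for the SearchFacesByImage call."""
    return {
        "CollectionId": collection_id,  # hypothetical collection name
        "Image": {"Bytes": image_bytes},
        "FaceMatchThreshold": threshold,  # assumed similarity cut-off (percent)
        "MaxFaces": 5,
    }


def best_match(response):
    """Return (label, similarity) for the strongest match, or None."""
    matches = response.get("FaceMatches", [])
    if not matches:
        return None
    top = max(matches, key=lambda match: match["Similarity"])
    # ExternalImageId is the label attached when the face was indexed.
    return top["Face"].get("ExternalImageId"), top["Similarity"]


def identify(rekognition_client, collection_id, image_bytes):
    """Search the collection using a client supplied by the caller."""
    params = build_search_params(collection_id, image_bytes)
    response = rekognition_client.search_faces_by_image(**params)
    return best_match(response)
```

A result of `None` means no stored face met the threshold, which a surveillance workflow would treat as an unknown individual.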
Results:
- The facial recognition capabilities enhanced the overall effectiveness and efficiency of the
surveillance operations.
Conclusion:
By leveraging Amazon Rekognition for facial recognition, the security firm successfully
implemented a robust surveillance solution that provided advanced threat detection
capabilities. The seamless integration of Amazon Rekognition with their existing
infrastructure enabled them to strengthen security measures and mitigate potential risks
effectively.
2.3.2 CASE STUDY D: NATURAL LANGUAGE PROCESSING WITH AMAZON LEX
Background:
A dental clinic aimed to streamline their appointment booking process by implementing a
chatbot solution. They sought to leverage Natural Language Processing (NLP) capabilities to
allow patients to schedule appointments conveniently.
Objective:
The objective was to create a chatbot using Amazon Lex that would enable patients to
interact naturally to schedule dental appointments.
Implementation:
- The dental clinic utilized Amazon Lex to create a chatbot using the ScheduleAppointment
blueprint.
- They configured the bot to understand natural language input related to scheduling
appointments.
- Testing was conducted within the Amazon Lex console to ensure the bot's accuracy and
responsiveness.
- An AWS Lambda function was created to handle initialization, validation, and fulfillment
tasks related to appointment scheduling.
- The Lambda function was integrated with Amazon Lex to perform backend processing of
user requests.
- The MakeAppointment intent of the bot was updated to use the AWS Lambda function as a
code hook.
- After configuring the intent, the bot was rebuilt to incorporate the changes made.
- Amazon Cognito was utilized to add security to the webpage, ensuring secure access to the
bot.
- IAM roles were configured to grant necessary permissions for the webpage to interact with
Amazon Lex.
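To illustrate how a Lambda code hook fits into the MakeAppointment intent, the following is a minimal sketch of a handler using the Amazon Lex (V1) event and response format. It is not the function used in the project: the slot names (AppointmentType, Date, Time), the list of valid appointment types, and the message wording are assumptions made for the example.

```python
"""Sketch: Lambda code hook for a Lex (V1) appointment intent."""

VALID_TYPES = {"cleaning", "whitening", "root canal"}  # hypothetical list


def _message(text):
    return {"contentType": "PlainText", "content": text}


def delegate(session, slots):
    """Let Lex decide the next step based on the bot configuration."""
    return {"sessionAttributes": session,
            "dialogAction": {"type": "Delegate", "slots": slots}}


def elicit_slot(session, intent_name, slots, slot_name, text):
    """Ask the user again for one specific slot."""
    return {"sessionAttributes": session,
            "dialogAction": {"type": "ElicitSlot", "intentName": intent_name,
                             "slots": slots, "slotToElicit": slot_name,
                             "message": _message(text)}}


def close(session, text):
    """End the conversation and report the intent as fulfilled."""
    return {"sessionAttributes": session,
            "dialogAction": {"type": "Close", "fulfillmentState": "Fulfilled",
                             "message": _message(text)}}


def lambda_handler(event, context):
    intent = event["currentIntent"]
    slots = intent["slots"]
    session = event.get("sessionAttributes") or {}

    # Initialization and validation: called on each turn of the dialog.
    if event["invocationSource"] == "DialogCodeHook":
        chosen = slots.get("AppointmentType")
        if chosen and chosen.lower() not in VALID_TYPES:
            cleared = dict(slots, AppointmentType=None)
            return elicit_slot(session, intent["name"], cleared,
                               "AppointmentType",
                               "Sorry, which type of appointment would you like?")
        return delegate(session, slots)

    # Fulfillment: called once all required slots are filled.
    text = "Your {} appointment is booked for {} at {}.".format(
        slots["AppointmentType"], slots["Date"], slots["Time"])
    return close(session, text)
```

Separating the dialog branch from the fulfillment branch mirrors the two places where the console lets a Lambda function be attached to an intent.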
Results:
- Patients could now schedule appointments seamlessly through natural language interactions
with the chatbot.
- Hosting the bot on a webpage provided accessibility to patients, allowing them to interact
with the bot conveniently from any device with internet access.
Conclusion:
By leveraging Amazon Lex for Natural Language Processing, the dental clinic successfully
deployed a chatbot solution that revolutionized their appointment booking process. The
integration of AWS Lambda and Amazon S3 provided a robust backend infrastructure for
efficient bot operation. The project showcased the transformative potential of NLP
technology in enhancing customer service and operational efficiency in the healthcare
industry.
CHAPTER 3
AWS DATA ENGINEERING
3.1 INTRODUCTION TO THE COURSE
The AWS Academy Data Engineering course equips learners with the essential skills and
knowledge required to design, build, and maintain scalable data processing solutions on the
AWS Cloud platform. This comprehensive course covers a range of topics essential for
aspiring data engineers, including data modeling, data warehousing, ETL (Extract,
Transform, Load) processes, and big data analytics.
3.2 TOPICS COVERED IN THE COURSE
7. Ingesting by Batch or by Stream: Comparing batch and real-time data processing
methods and their use cases.
8. Storing and Organizing Data: Exploring AWS storage options and best practices for
data organization.
9. Processing Big Data: Delving into distributed computing frameworks like Apache
Hadoop and Spark for large-scale data processing.
10. Processing Data for ML: Preparing data for machine learning applications, including
feature engineering and preprocessing.
11. Analyzing and Visualizing Data: Techniques for analyzing data and creating
impactful visualizations to communicate insights.
12. Automating the Pipeline: Automation strategies using workflow orchestration tools to
improve efficiency.
13. Bridging to Certification: Preparing for AWS certification in data engineering,
including exam preparation and practice tests.
3.3 CASE STUDIES FROM THIS COURSE
3.3.1 CASE STUDY E: OPTIMIZING DATA ANALYTICS WORKFLOW WITH AMAZON S3
Background:
Sofia, Paulo, and Mary need a faster, more secure way to analyze large .csv files. They want
to use Amazon S3's features like S3 Select, encryption, and storage class modification to
streamline their workflow.
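As a rough sketch of what querying a .csv file with S3 Select can look like, the example below builds the request for the SelectObjectContent operation and reads the matching rows from the returned event stream. The bucket name, object key, and SQL expression are hypothetical and would be supplied by the caller; the client is assumed to be a boto3 S3 client.

```python
"""Sketch: querying a CSV object with S3 Select (names are placeholders)."""


def build_select_params(bucket, key, sql):
    """Return keyword arguments for the SelectObjectContent call."""
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": sql,
        # Treat the first row as column names so the SQL can refer to them.
        "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
        "OutputSerialization": {"CSV": {}},
    }


def read_records(response):
    """Join the Records events from the response event stream into text."""
    chunks = []
    for event in response["Payload"]:
        if "Records" in event:
            chunks.append(event["Records"]["Payload"])
    return b"".join(chunks).decode("utf-8")


def query_csv(s3_client, bucket, key, sql):
    """Run the query with a client supplied by the caller."""
    response = s3_client.select_object_content(
        **build_select_params(bucket, key, sql))
    return read_records(response)
```

Because only the selected rows and columns are returned, less data leaves Amazon S3 than when the whole file is downloaded and filtered locally.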
Objective:
Implementation:
Result:
Conclusion:
3.3.2 CASE STUDY F: STREAMLINING E-COMMERCE DATA ANALYTICS WITH AWS GLUE
Background:
ElectroMart, an e-commerce company, faces challenges managing and analyzing diverse data
sources. With data stored in disparate formats and locations, the need for a scalable ETL
solution is evident to unlock insights crucial for business growth.
Objectives:
ElectroMart aims to streamline data integration and analysis by leveraging AWS Glue. The
objectives include automating data discovery, transforming raw data into a usable format, and
loading it into a centralized data warehouse for analytics.
Implementation:
1. Data Discovery: AWS Glue's crawler scans and catalogs metadata from various data
sources like Amazon S3, RDS, and Redshift.
2. ETL Jobs: ElectroMart designs ETL jobs using AWS Glue, employing PySpark scripts to
cleanse, transform, and enrich data.
3. Scheduled Workflows: ElectroMart schedules ETL jobs for periodic execution, ensuring
data freshness and accuracy.
4. Data Loading: Transformed data is loaded into the data warehouse, utilizing AWS storage
options like Amazon Redshift or S3.
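The discovery and scheduling steps above can be sketched as request parameters for the boto3 Glue client. This is an illustration under assumptions rather than ElectroMart's configuration: the crawler, job, and database names, the IAM role, the S3 path, and the nightly cron schedule are all hypothetical.

```python
"""Sketch: Glue crawler and scheduled trigger parameters (assumed names)."""


def build_crawler_params(name, role_arn, database, s3_path):
    """Return keyword arguments for the Glue CreateCrawler call."""
    return {
        "Name": name,
        "Role": role_arn,  # IAM role the crawler assumes
        "DatabaseName": database,  # Data Catalog database for the metadata
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def build_schedule_trigger(name, job_name, cron="cron(0 2 * * ? *)"):
    """Return keyword arguments for a scheduled CreateTrigger call.

    The default expression (an assumption) runs the job daily at 02:00 UTC.
    """
    return {
        "Name": name,
        "Type": "SCHEDULED",
        "Schedule": cron,
        "Actions": [{"JobName": job_name}],
        "StartOnCreation": True,
    }


def set_up_pipeline(glue_client, crawler, trigger):
    """Create and start the crawler, then schedule the ETL job."""
    glue_client.create_crawler(**crawler)
    glue_client.start_crawler(Name=crawler["Name"])
    glue_client.create_trigger(**trigger)
```

The ETL job named in the trigger would hold the PySpark script that cleanses and transforms the data; that script is outside the scope of this sketch.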
Result:
1. Scalability: AWS Glue's automatic resource scaling enables efficient processing of large
datasets.
2. Cost-effectiveness: Pay-as-you-go pricing eliminates upfront infrastructure costs.
3. Time-saving: Automated data discovery and ETL reduce manual effort, speeding up
insights delivery.
4. Data Quality: Standardized and cleansed data improves the accuracy of analytics
outcomes.
Conclusion:
By implementing AWS Glue for ETL, ElectroMart achieves a streamlined data pipeline,
empowering informed decision-making and driving business growth. The company now
harnesses advanced analytics capabilities to stay competitive in the dynamic e-commerce
landscape.
CERTIFICATES
COHORT-7
AWS Course Certificates