Tcobza
[email protected]
LinkedIn
GitHub
Cell: +923212377314
OBJECTIVE:
To obtain a challenging role that leverages my programming skills in Python and my passion for
Computer Science, while providing opportunities to learn and grow in the field of Data Science and
Engineering.
QUALIFICATION:
Intermediate in Computer Science from Dhacss Degree College (2024)
Matriculation from Chiniot Islamia Public School (2022)
Python Programming Language
Data Engineering Course
SKILLS:
Python
Databases: SQL Server, Snowflake, RDS cluster
Data Analysis: Pandas, NumPy, Matplotlib, Power BI
Tools: Snowflake, GitHub, AWS (Amazon Web Services)
MS Word, MS Excel, MS PowerPoint
ETL Processes
PROJECTS (GitHub):
ETL Data Pipeline for Global Bank: Developed an ETL pipeline to streamline data
processing and enhance data accessibility. LINK
End-to-End Data Analysis and Visualization using Kaggle API, Pandas, and SQL:
The project involved data cleaning, transformation, and visualization to uncover
valuable insights and answer complex business questions. LINK
Real-time Weather Data ETL Pipeline with Apache Airflow and AWS: This project
demonstrates the implementation of an automated ETL (Extract, Transform,
Load) pipeline for real-time weather data using Apache Airflow and AWS. LINK
Real-time Parallel Processing Using AWS: Developed a pipeline that
fetches weather data for Houston from the OpenWeatherMap API, transforms it,
and loads it into an AWS RDS PostgreSQL database. The pipeline is orchestrated
using Apache Airflow, which allows for the scheduling and monitoring of data
workflows. The project also includes steps to manage city data from an S3 bucket,
enabling parallel processing of tasks. LINK
Slowly Changing Dimensions Real-time Data Streaming: A real-time data pipeline
for continuous data ingestion and transformation into a Snowflake data
warehouse. It showcases the implementation of Change Data Capture (CDC) and
Slowly Changing Dimensions (SCD) for historical data management using various
cloud technologies. LINK
Redfin Data Analysis ETL: This project implements a robust data analysis pipeline
for real estate data sourced from Redfin. Utilizing a combination of Python,
Apache Airflow, AWS S3, Snowpipe, and Power BI, the pipeline automates the
extraction, transformation, and loading (ETL) of data, enabling insightful
visualizations and analytics. LINK
Real-time Stock Market Streaming Using Kafka: An end-to-end data engineering
pipeline for real-time stock market data using Apache Kafka and various AWS
services. The goal is to build a robust data pipeline that ingests, processes, and
analyzes stock market data in real time. LINK
Spotify Data ETL Pipeline using AWS: This project implements an ETL pipeline for
Spotify data using AWS services. LINK
Superstore Data Analysis Pipeline (Serverless): This project showcases a data
analysis pipeline designed to analyze data from a fictional superstore. The
pipeline uses various AWS services to streamline data processing and analysis.
LINK