Prashant Kumar CV PDF
Prashant Kumar CV PDF
ETL/ELT Developer
Mob: +91-8076656908/9891822705
Email: [email protected]
Data Modeling Data profiling, Understanding the data, creating data model (LDM & PDM), Deriving
insights from the data for Business/client.
Project Profiles
Client Name Deliverable:
America’s largest media and ▪ To create a warehouse through which we can track all the cloud cost at the most granular
Entertainment level possible for the media house.
conglomerate.
End result:
▪ The solution we created is not only loved by the VPs and CXOs of the client but also was
awarded with the Data Breakthrough's "Cross infrastructure Analytics Solution of the
Year"!
Client Name Deliverable:
America’s largest media and ▪ To create migrate the complete Talend ETL using AWS services on ETL framework and
Entertainment side by side migrating database from snowflake hosted on Azure to Snowflake hosted
conglomerate. in AWS in a SOW of 60 days.
Project Name
Talend to AWS Migration Approach:
▪ We decided to migrate the ETL and database simultaneously and with one ETL resource
Role and with one offshore architect.
ETL Pipeline Developer
Complexity:
Technology Stack ▪ The key challenges were that we have to again come up with an approach to dynamically
Talend, AWS, Snowflake, create a file, but now this time with the python and also pull the snowflake data for the
Python 6 different instances directly from the metadata and do ETL on top of that.
▪ We were also told that, during this migration project, we have to move away from the
existing snowflake instance on Azure, and migrate the complete Database for all the
three env to a different Snowflake instance hosted on AWS with all the historical data.
▪ During this time the team were hit with COVID and the complete offshore were shut
down for a week.
End result:
▪ We managed to deliver the project not on time but 5 days before our committed date and
a transition so smooth that the users didn't even get to know that something changed in the
backend.
End Result:
▪ So, in a nutshell, we provided the analysis each and every employee; When an employee
is scheduled to come in office in next wave, the executives and CXO's level people have
access to his Covid tests, his health, once he comes to office, where did he swiped, what
was his health status at the time he entered the premises, is he following the protocols
or not and how and where is the space utilized.
▪ This solution is only basis by which the client is not kicking off their return to office for
their 727k employees across 36 different countries.
Approach:
▪ We created a data warehouse for the Operational Metrics review which contains the
data at lowest grain possible for the review for upper management.
End Result:
▪ As of now, this report is used by every CIOs of each BU for the media house to analyze
their legacy system usage and operational metrics.
▪ This report is also used by the VP and CEO to decide on the decommissioning of the
legacy systems.
Client Name Deliverable
America’s largest media and ▪ To provide the insights in the YouTube data for their two YouTube channels for the client
Entertainment ranging from a single metrics that tells how their YouTube is performing in terms of
conglomerate. revenue to details as granular as per video analysis.
Approach:
▪ We used a python script to download all the API files which is schedules every day on
Fargate cluster task which is pointing to a docker image as a supporting image and push
the files retrieved from API to our S3 buckets.
End Result:
▪ We created the complete ETL pipeline and now the reports based on this warehouse is
used by the Director of Social Engagements for the client to take the necessary action
and analyze the YouTube performance for their two channels.