Databricks Developer Resume

My passion is working with data.

By that I mean ETL processes, data science, big data processing, and cloud computations. I am a fan of football, the used-car market, and Polish cuisine. My big hobbies are my dog and the continuous growth of my Big Data knowledge.

Skills
• Python (environment management, testing, project flow, dev) ~ 6 years
• Apache Spark (deployment, resource optimization, config optimization, application efficiency monitoring, dev) ~ 4 years
• Hadoop ecosystem (Hadoop on-premises and EMR, HDFS management, YARN, Sqoop, Hive, Impala) ~ 2 years
• Databricks (dev, admin, integration, migration) ~ 4 years
• SQL (SQL Server, PostgreSQL) ~ 6 years
• Scala and Java for Data Engineering
• Microsoft Azure
• AWS
• Docker, git, bash, Jira, Linux
• Power BI, MS Office
• English – C1, German – A2

Experience
APRIL 2021 – present

Contractor
Advised on Data Lake maintenance and expansion (banking sector, EU, as Lead
Data Engineer):
• Developed Apache Spark / Airflow / AWS processes, code, and architecture for the Data Lake (orchestration sketch below).
• Built an analytical platform on Databricks on AWS, resolving scalability, cloud data security, and IaC automation concerns.
• Reduced prod AWS EMR processing costs by 25% and decreased downtime by 37%.

Environment: AWS (S3, IAM, Lambda, EC2, RDS, DynamoDB, Kinesis, Glue, EMR), Databricks
Tools: Airflow, Terraform, Python, Scala, bash, git, docker, GitHub, GitHub Actions, Apache Spark
Other: scrum methodology
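For illustration, a minimal sketch of the kind of Airflow orchestration this involved, assuming a hypothetical DAG name, job location, and spark-submit options; the real DAGs and AWS wiring are not reproduced here.

```python
# Minimal Airflow DAG sketch: schedules a daily Spark batch for the Data Lake.
# The DAG id, script path, and spark-submit options are hypothetical placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="data_lake_daily_batch",       # hypothetical name
    start_date=datetime(2021, 4, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    # Submit the PySpark job; in practice this could target EMR or Databricks.
    run_batch = BashOperator(
        task_id="run_spark_batch",
        bash_command=(
            "spark-submit --deploy-mode cluster "
            "s3://example-bucket/jobs/daily_batch.py "   # hypothetical location
            "--run-date {{ ds }}"
        ),
    )
```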

Built a data processing framework for FHIR-format-compliant data (medical sector, US).
• Developed a FHIR format – Azure – Databricks integration framework, including an automated Cucumber / pytest-bdd test framework (test sketch below).
• Troubleshot Delta Live Tables jobs.

Environment: Azure (ADLS, EventHubs, ACR), Databricks


Tools: Airflow, Python, git, docker, bitbucket, Jenkins, Apache Spark
Other: scrum methodology
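For illustration, a sketch of what one pytest-bdd scenario in such an automated test framework could look like; the feature text, step wording, and validation rule are assumptions, not the project's actual tests.

```python
# tests/test_fhir_ingest.py - illustrative pytest-bdd step definitions.
# Assumes a feature file tests/features/fhir_ingest.feature containing:
#
#   Feature: FHIR ingestion
#     Scenario: Accept a well-formed Patient resource
#       Given a raw FHIR Patient resource
#       When the resource is validated
#       Then it is accepted
from pytest_bdd import scenarios, given, when, then

scenarios("features/fhir_ingest.feature")


@given("a raw FHIR Patient resource", target_fixture="resource")
def patient_resource():
    # Hypothetical minimal payload; real data arrived via Event Hubs / ADLS.
    return {"resourceType": "Patient", "id": "example-1"}


@when("the resource is validated", target_fixture="validation_result")
def validate(resource):
    # Placeholder rule; the real framework applied full FHIR schema checks.
    return resource.get("resourceType") == "Patient" and "id" in resource


@then("it is accepted")
def accepted(validation_result):
    assert validation_result
```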

Implemented a PoC for an Azure Databricks-based Data Lake (e-commerce, PL).

• Designed ELT processes (PySpark, Databricks Workflows); see the ELT sketch below.
• Created CI/CD processes for schema migrations, workflows, cluster pools, etc.

Environment: Azure, Databricks
Tools: Python, git, Azure Repos, Azure Pipelines, Apache Spark
Other: scrum methodology
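For illustration, a minimal sketch of a bronze-to-silver ELT step of the kind described above; the ADLS path, key column, and target table are assumptions.

```python
# Illustrative PySpark ELT step: raw JSON from ADLS -> cleaned Delta table.
# Paths, column names, and the target schema/table are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

raw_path = "abfss://raw@examplestorage.dfs.core.windows.net/orders/"  # assumption

orders = (
    spark.read.json(raw_path)
    .withColumn("ingested_at", F.current_timestamp())
    .dropDuplicates(["order_id"])          # hypothetical business key
)

(
    orders.write.format("delta")
    .mode("overwrite")
    .saveAsTable("silver.orders")          # assumes the "silver" schema exists
)
```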

Designed Apache Airflow architecture for an MFT business case (energy sector, PL).

JULY 2020 – MARCH 2021

Big Data Developer – Lingaro


Developed custom Apache Spark listeners (FMCG)
• Led the project.
• Gathered logs produced by Spark jobs on Databricks (log-aggregation sketch below).
• Visualized and pointed out weak spots, cost generators, and suboptimal queries.

Environment: Azure (ADLS, EventHubs), Databricks


Tools: Python, Java, git, docker, bitbucket, Jenkins, Apache Spark, ELK, Power BI, SQL Server
Other: scrum methodology
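The listeners themselves ran on the JVM side; the sketch below only illustrates the log-aggregation idea in Python, summarizing Spark event-log-style JSON lines to surface the heaviest stages. The file path and field names are assumptions about the captured logs.

```python
# Illustrative aggregation over Spark event-log-style JSON lines.
import json
from collections import Counter, defaultdict

event_counts = Counter()
stage_runtime_ms = defaultdict(int)

with open("eventlog.json") as fh:            # hypothetical export of the gathered logs
    for line in fh:
        event = json.loads(line)
        name = event.get("Event", "unknown")
        event_counts[name] += 1
        if name == "SparkListenerTaskEnd":
            stage_id = event.get("Stage ID")
            metrics = event.get("Task Metrics") or {}
            stage_runtime_ms[stage_id] += metrics.get("Executor Run Time", 0)

# Point out likely weak spots: the stages that consumed the most executor time.
for stage_id, runtime in sorted(stage_runtime_ms.items(), key=lambda kv: -kv[1])[:5]:
    print(f"stage {stage_id}: {runtime / 1000:.1f}s executor time")
```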

Master Data Engineering (FMCG)


• Migrated SAP-based ETL to Microsoft Azure.
• Built a data processing engine from scratch (Databricks + Airflow + ADLS + Docker).
• Built REST APIs connecting the engine's components (API sketch below).

Environment: Azure (ADLS, Azure Functions), Databricks


Tools: Python, git, docker, Azure Repos, Azure Pipelines, Apache Spark
Other: scrum methodology
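For illustration, a sketch of the kind of thin REST endpoint that could connect the engine's components, assuming a Flask app and the Databricks Jobs run-now endpoint; the real service, auth handling, and job IDs are not part of this resume.

```python
# Illustrative Flask endpoint that triggers a Databricks job for the engine.
# Workspace URL, token handling, and job id are hypothetical placeholders.
import os

import requests
from flask import Flask, jsonify, request

app = Flask(__name__)

DATABRICKS_HOST = os.environ["DATABRICKS_HOST"]    # e.g. https://adb-....azuredatabricks.net
DATABRICKS_TOKEN = os.environ["DATABRICKS_TOKEN"]


@app.post("/runs")
def trigger_run():
    payload = {
        "job_id": int(request.json["job_id"]),
        "notebook_params": request.json.get("params", {}),
    }
    resp = requests.post(
        f"{DATABRICKS_HOST}/api/2.1/jobs/run-now",
        headers={"Authorization": f"Bearer {DATABRICKS_TOKEN}"},
        json=payload,
        timeout=30,
    )
    resp.raise_for_status()
    return jsonify(resp.json()), 202
```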

OCTOBER 2019 – JUNE 2020

Data Engineer (Senior Associate) – PwC Advisory (Data Analytics)

Big Data Engineering (Financial Services)
• Developed a solution responsible for orchestrating workflows from data vendors (public and private sources,
both structured and unstructured) to a machine learning engine.
• Reviewed pull requests, assigned tasks to team members, and supervised their work.
• Planned and executed data migration from HDFS to Azure Blob Storage (migration sketch below).
• Optimized Apache Spark jobs and HDFS storage.

Environment: Azure (Azure Storage), on-premises


Tools: Python, Apache Spark, Scala, Hadoop, Hive, Kafka, Airflow
Other: scrum methodology
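For illustration, a sketch of one migration step of the kind described above, copying a dataset from HDFS to Azure Blob Storage with PySpark; the paths, container, and storage account are assumptions.

```python
# Illustrative PySpark copy of one dataset from HDFS to Azure Blob Storage.
# Source/target paths are hypothetical; in practice the cluster must be
# configured with the wasbs:// storage-account key or SAS token.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

source = "hdfs:///data/transactions/2020/"                                            # assumption
target = "wasbs://migrated@examplestorage.blob.core.windows.net/transactions/2020/"   # assumption

(
    spark.read.parquet(source)
    .repartition(64)                 # keep output file sizes reasonable
    .write.mode("overwrite")
    .parquet(target)
)
```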

APRIL 2018 – SEPTEMBER 2019

Data Engineer (Associate) – PwC Advisory (Data Analytics)

Created a store chain expansion model (Retail):
Designed and implemented a machine learning workflow for predicting store income from geographical and internal data (model sketch below).
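For illustration, a minimal sketch of such a prediction workflow using scikit-learn; the input file, feature names, and model choice are assumptions, not the project's actual pipeline.

```python
# Illustrative store-income regression on geographical and internal features.
# Column names, the input file, and the model choice are hypothetical.
import pandas as pd
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

stores = pd.read_csv("stores.csv")                      # hypothetical input
features = ["population_1km", "competitors_1km", "floor_area", "staff_count"]

X_train, X_test, y_train, y_test = train_test_split(
    stores[features], stores["monthly_income"], test_size=0.2, random_state=42
)

model = GradientBoostingRegressor(random_state=42).fit(X_train, y_train)
print("MAE:", mean_absolute_error(y_test, model.predict(X_test)))
```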

Cost to Serve and SCM network optimization (Retail)
Cloudera Hadoop cluster administration:
• Configured nodes / roles, installed / updated software.
• Monitored performance and troubleshot issues.
• Prepared and maintained a working environment for Data Scientists (JupyterHub, Cloudera Data Science
Workbench, mlflow, RStudio Server, etc.)
• Completed Cloudera Administrator Training for Apache Hadoop

Environment: on-premises
Tools: Linux, Ansible, Hadoop, Apache Spark, Hive, Impala, Kafka, Nifi, Flume

MAY 2016 – MARCH 2018

Business Analyst – Creamfinance Poland


Developed a KPI-tracking Shiny application.
Developed a process handling the assignment of loans to external debt collectors.
Refactored an LGD calculation model from an Excel-based tool into a standalone Shiny dashboard.

JULY 2015 – SEPTEMBER 2015

Intern – Citi Service Center

Education
OCTOBER 2018 –

Big Data – MSc / Warsaw School of Economics


OCTOBER 2014 – JULY 2017

Econometrics – BSc / University of Warsaw


BSc thesis: Analysis of dependencies between S&P 500, DAX and WIG20 changes.

OCTOBER 2013 – SEPTEMBER 2016

Mathematics – BSc / University of Warsaw

I hereby give consent for my personal data included in my application to be processed for the purposes of the recruitment process under the Regulation (EU) 2016/679
of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free
movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation).
