Sherbin W - Resume
Sherbin W - Resume
Sherbin W - Resume
SUMMARY
Data Engineer with around 5 years of experience in emerging technologies like Databricks, AWS-Serverless Computing, Snowflake,
Airflow, DBT and with hands on expertise in design and development of data modeling, ETL Processes, Data Integration, and Data
Warehouse. Adopt in working quickly and efficiently in close collaboration with analytic, engineering and other stakeholders.
EXPERIENCE
Senior Data Engineer, LatentView Analytics, Chennai, India. May 2022 – Present
Client: Procore Technologies, Carpinteria, CA, USA. Feb 2023 - Present
➢ Led bidirectional migration projects between Databricks and Snowflake, optimizing for efficiency and performance.
➢ Implemented strategies for capturing delta changes, reducing computational costs, and streamlining processes to enhance
data processing efficiency.
➢ Designed and developed APIs for Tableau data extraction, facilitating seamless integration with visualization tools.
➢ Engineered APIs for Alation data extraction to establish and track lineage between dashboards and data sources.
➢ Spearheaded the development of an automation script for updating dbt files, including the identification and removal of
redundant columns to enhance file accuracy and efficiency.
Client: Autodesk, San Francisco, CA, USA. May 2022 – Jan 2023
➢ Proficient in job scheduling and adept at building pipelines within Databricks for efficient data processing.
➢ Led the migration of the entire data stack from Star schema to Snowflake schema, resulting in significant
improvements in business report loading.
➢ Developed external tables as Delta tables in Snowflake using Databricks and implemented scheduling through MWAA
➢ Automated quality assurance scripts using Pyspark and MWAA, ensuring data reliability and integrity in processing.
Data Engineer, Mindsbeam Technologies Private Limited, Chennai, India. Sep 2020 – April 2022
➢ Developed data generation and reports using Python pandas, deployed on a serverless Lambda framework.
➢ Automated and scheduled the entire process by creating a data pipeline with Apache Airflow.
➢ Transformed an existing data generation script from Python pandas to Pyspark, leveraging enhanced capabilities.
➢ Migrated a serverless framework from AWS data services to Databricks for improved scalability and efficiency.
➢ Implemented Slowly Changing Dimension Type 2 methodology in Delta tables using Pyspark, ensuring effective
historical data tracking and management.
Integration Engineer, ABE Semiconductor Designs, Chennai, India May 2019 – Aug 2020
➢ Implemented various machine learning algorithms in Python for R & D.
➢ Developed dashboard using python matplotlib for the data from IoT devices.
➢ Developed the Data Mart for IoT projects
EDUCATION
Bachelors in Electronics and Communication Engineering from Agni College of Technology, Chennai, India, Aug 2015 – April 2019
SKILLSET
CERTIFICATIONS