Career Objective Experienced in data engineering, I am actively seeking a challenging role that allows me to leverage my expertise in crafting efficient data pipelines, orchestrating ETL processes, and harnessing big data technologies. My goal is to provide innovative solutions that propel business growth and success. PROFESSIONAL EXPERIENCE Adfolks – LLC (A Zaintech Company) Remote, India Data Engineer January 2024 – Present Engineered scalable data pipelines and optimized ETL processes, create Data Warehouses, Data Mart, enhancing data quality and processing efficiency in Telecommunications, finance, sales, Oil & Gas, Real State and human resource data domains. Collaborated with data scientists to create tailored data structures for machine learning and analytics. Implemented strategies for optimized query performance, emphasizing schema design, data partitioning, and performance tuning. Developed and executed ETL jobs to seamlessly migrate and integrate data from various sources into ADLS and Delta Lake within healthcare, finance, sales, and human resource sectors. Provided mentorship to junior data engineers, devops and data scientists, conducting training on best practices and emerging technologies across diverse business domains. Celebal Technologies Jaipur, Rajasthan, India Data Engineer September 2021 – December 2023 Engineered scalable data pipelines and optimized ETL processes, enhancing data quality and processing efficiency in healthcare, finance, sales, Airlines, Aviation, Real State and human resource data domains. Collaborated with data scientists to create tailored data structures for machine learning and analytics across diverse industries. Implemented strategies for optimized query performance, emphasizing schema design, data partitioning, and performance tuning. Developed and executed ETL jobs to seamlessly migrate and integrate data from various sources into ADLS and Delta Lake within healthcare, finance, sales, and human resource sectors. Leveraged Azure services (Azure Data Factory, ADLS, Databricks, SQL) to architect scalable and cost-effective data storage and analysis solutions for multiple industries. Ensured data integrity through meticulous quality assessments, implementing robust cleansing and validation techniques tailored to the unique requirements of healthcare, finance, sales, and human resource datasets. Provided mentorship to junior data engineers and data scientists, conducting training on best practices and emerging technologies across diverse business domains. Strivemindz Jaipur, Rajasthan, India Data Scientist March 2021 – August 2021 Highly skilled and dedicated data scientist with expertise in implementing diverse algorithms and approaches in data science with effective communication skills, enabling effective collaboration with cross-functional teams and delivering successful outcomes. Proficient in utilizing a wide range of libraries and tools, including Numpy, Pandas, Matplotlib, Seaborn, Scikit-learn, NLTK, NLP, Spark, Python, Statistics and Probability. Experienced in working on various data science projects, employing these libraries to extract insights, analyze data, and develop predictive models. SKILLS Programming Skills: Python Databases: SQL, NoSQL Data Skills: Databricks, Data Pipeline Development, Azure Data Factory, ETL Development, Data Integration, Data Lake, Spark, Pyspark, Exploratory data analysis(EDA), Spark SQL, ADLS gen2, JIRA, Delta Lake (Delta Tables), real-time and batch data processing. EDUCATION Jaipur National University, School of Engineering and Technology Jaipur, Rajasthan, India B.Tech in Computer Science June 2021 Cumulative Percentage : 70.25 % Relevant Coursework: Data Engineering, Machine Learning, Data Science, Statistics, Big Data and Software Development CERTIFICATIONS Databricks Certified Data Engineer Associate Microsoft Certified Data Engineer Associate (DP-203) Microsoft Azure Data Fundamental (DP-900) Microsoft Azure Fundamental (AZ-900) Databricks Solution Architect
Mastering Data Engineering and Analytics with Databricks: A Hands-on Guide to Build Scalable Pipelines Using Databricks, Delta Lake, and MLflow (English Edition)