Az Data Eng20

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Himanshu Khanduri

ƒ +91 6395497880 # [email protected] ï linkedin.com/in/himanshukhanduri20


§ github.com/himanshukhanduri20
Software Engineer
Software Engineer with 2+ year of professional experience in Azure cloud Data engineering services, proficient in
distributed data transformation using pyspark, program optimization and Data Modeling.

Education
University of Petroleum and Energy Studies April. 2018 – May 2022
B.Tech in Computer Science CGPA-8.5 Dehradun, Uttrakhand

Technical Skills
Languages: Python, C++, C,Pyspark
Cloud Services: Azure Storage Service, ADLS, ADF, Azure Databricks, Synapse analytics (Data warehouse)
Big Data: Hadoop, Spark, Hive, Nifi, kafka, Data Warehousing modeling, RDD, DataFrame
DevOps Tools: Linux, GitHub, Docker .
Data Base: SQL, NoSQL.

Experience
Numantra Technology Nov 2023 – Present
Azure Data Engineer Mumbai, India
• Proficient in processing data utilizing PySpark, adhering to the Medallion architecture framework, and proficient in

managing Delta tables for efficient data management and analysis.


• Skilled in handling both structured and unstructured data processing, proficiently applying appropriate schema for

accurate data ingestion and analysis.


• Experienced in optimizing PySpark programs for enhanced performance and efficiency, adept at implementing robust

error handling mechanisms to ensure smooth execution.


• Collaborated closely with the finance team to handle data pipelines, ensuring smooth data flow and accurate financial

analysis.
• Collaborated with differnet teams to gather requirements, assess existing workspaces and worked on consolidation

project, streamlining and centralizing financial operations.


• Worked on creating comprehensive plan, CI CD flow and ARM template to merge disparate finance workspaces into a

unified system, resulting in improved efficiency and data integrity.


Reliance Jio Jan 2022 – Nov 2023
Big Data Engineer Mumbai, India
• Collaborated with the security team to process unstructured log data from security devices using PySpark and performed

data ingestion to the warehouse.


• Extensive hands-on experience with Azure Cloud Services, including Azure Data Factory (ADF), Azure Databricks,

Azure Storage,Azure Data Lake Storage (ADLS) and Synapse Analytics


• Extensive experience in query optimization and fine-tuning, utilizing advanced techniques to enhance database

performance, streamline data retrieval, and optimize query execution plans for efficient and rapid data processing.
• Hands-on experience in working with distributed systems, demonstrating proficiency in designing, implementing, and

optimizing distributed architectures for efficient data processing and scalable applications.
• Designed and developed a Spark application to efficiently process unstructured data from a Kafka stream, transform it

into a structured format, and ingest the data into a Hive table for further analysis, enabling real-time data processing
and insights.
• Conduct data analysis by querying a MySQL database with SQL and HIVE Warehouse with HQL .Wrote SQL queries to

extract and transform data from MySQL to support business intelligence and decision-making processes.Utilized HQL to
gather data from HIVE warehouse for further analysis and reporting.
• Proficient in writing Python and shell scripts on Linux for efficient task automation, streamlining processes and

improving productivity.
• Developed WebSocket functionality from scratch in Python to expose limited data.

• With the help of Python and its framework Developed a WAF Compliacne Portal for getting detailed overview of WAF

rules enabled on devices and also developed Firewall + WAF log export portal for getting particular period log statistics
and to apply search operation on logs
• Adept at handling ambiguous or undefined problems as well as ability to think abstractly
Certificates
• Azure Associate Data Engineering certificate
• IBM Data Science Professional Certificate.
• Tools for Data Science.
• Python for Data Science, AI Development.
• Data Analysis with Python.

Languages
• English
• Hindi

You might also like