Az Data Eng20
Az Data Eng20
Az Data Eng20
Education
University of Petroleum and Energy Studies April. 2018 – May 2022
B.Tech in Computer Science CGPA-8.5 Dehradun, Uttrakhand
Technical Skills
Languages: Python, C++, C,Pyspark
Cloud Services: Azure Storage Service, ADLS, ADF, Azure Databricks, Synapse analytics (Data warehouse)
Big Data: Hadoop, Spark, Hive, Nifi, kafka, Data Warehousing modeling, RDD, DataFrame
DevOps Tools: Linux, GitHub, Docker .
Data Base: SQL, NoSQL.
Experience
Numantra Technology Nov 2023 – Present
Azure Data Engineer Mumbai, India
• Proficient in processing data utilizing PySpark, adhering to the Medallion architecture framework, and proficient in
analysis.
• Collaborated with differnet teams to gather requirements, assess existing workspaces and worked on consolidation
performance, streamline data retrieval, and optimize query execution plans for efficient and rapid data processing.
• Hands-on experience in working with distributed systems, demonstrating proficiency in designing, implementing, and
optimizing distributed architectures for efficient data processing and scalable applications.
• Designed and developed a Spark application to efficiently process unstructured data from a Kafka stream, transform it
into a structured format, and ingest the data into a Hive table for further analysis, enabling real-time data processing
and insights.
• Conduct data analysis by querying a MySQL database with SQL and HIVE Warehouse with HQL .Wrote SQL queries to
extract and transform data from MySQL to support business intelligence and decision-making processes.Utilized HQL to
gather data from HIVE warehouse for further analysis and reporting.
• Proficient in writing Python and shell scripts on Linux for efficient task automation, streamlining processes and
improving productivity.
• Developed WebSocket functionality from scratch in Python to expose limited data.
• With the help of Python and its framework Developed a WAF Compliacne Portal for getting detailed overview of WAF
rules enabled on devices and also developed Firewall + WAF log export portal for getting particular period log statistics
and to apply search operation on logs
• Adept at handling ambiguous or undefined problems as well as ability to think abstractly
Certificates
• Azure Associate Data Engineering certificate
• IBM Data Science Professional Certificate.
• Tools for Data Science.
• Python for Data Science, AI Development.
• Data Analysis with Python.
Languages
• English
• Hindi