Rajveer Data Science Resume
Rajveer Data Science Resume
EDUCATION
• The National Institute of Engineering, Mysore Aug 2021- June 2025
B.E. in Electrical & Electronics Engineering: 7.8/10
SKILLS
• DS/ Programming & Querying: Python, C++, SQL
• Cloud Platforms: Azure: Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake, AWS (EC2, S3, Lambda, Glue)
• Data Engineering: ETL & ELT Pipelines, Data Ingestion, Data Warehousing, Data Transformation, Data Quality, Workflow Automation, Real-
time Data Processing
• Frameworks & Libraries: Django, Apache Spark (PySpark), RESTful APIs, Pandas
• Tools & DevOps: Git, GitHub, Azure DevOps, VS Code, Docker (Basic)
• Databases: PostgreSQL, Azure SQL Database
EXPERIENCE
Python Developer -Vedav Tech Private limited Feb 2025 - May 2025
• Developed a production-grade AI chatbot for a Dubai-based client using Python, Django, and Large Language Models (LLMs), efficiently
managing 10,000+ daily user interactions.
• Built and deployed RESTful APIs to integrate the chatbot with a PostgreSQL database, enhancing data retrieval speed by 20%
• Deployed the chatbot on AWS using Docker, achieving 99.9% uptime and ensuring high availability and fault tolerance.
• Optimized critical SQL queries, reducing data retrieval time by 15% and improving backend performance for analytics.
PROJECTS
End-to-End Airlines Data Ingestion Pipeline | AWS, Redshift, PySpark, Python April 2025
• Built scalable ETL pipelines using AWS Glue and PySpark to process 100GB+ airline data into Amazon S3.
• Extracted data from APIs and CSVs, transformed with Python, improving processing efficiency by 20%
• Queried and optimized reports in Amazon Redshift, reducing execution time by 15%.
• Implemented data validation with AWS Glue Data Quality and CloudWatch, achieving 99% accuracy.
• Automated workflows using Step Functions and EventBridge, cutting manual effort by 30%.
Insurance Management System – Django Web App -Group Project Dec 2024
• End-to-End Policy Management: Designed and deployed a fully functional insurance management platform using Django, ensuring seamless
policy handling.
• Role-Based Multi-User Access: Implemented secure authentication for admins and customers, enabling efficient policy tracking and
management.
• Dynamic CRUD Operations: Developed an intuitive dashboard for policy creation, updates, deletion, and user account management.
• Real-Time Customer Support: Integrated automated query handling for admin-customer interactions, enhancing response time
and service efficiency.
• Cloud-Optimized Deployment: Deployed on Heroku with scalable architecture, ensuring high availability and real-time database
synchronization.
Data Engineering: YouTube Data Analytics | Python, AWS, Data Ingestion, ETL May 2025
• Developed and maintained ETL processes using SQL and Python to load data into the data warehouse
• Created data lakes and data warehouses using AWS technologies such as S3 and Redshift
• Partitioned different tables and written ETL jobs using AWS spark, used AWS glue for cataloging
• Created dashboards and reports operating with BI tools such as AWS Quicksight to provide data-driven insights
• Worked with programming languages used for data manipulation, including: PySpark, Spark SQL
Certifications / Achievements
• Python Functions for Data Science – LinkedIn Learning
• Introduction to Microsoft Azure Cloud Services – Coursera
• Introduction to Big Data with Spark and Hadoop – IBM
• Solved 200+ coding problems across platforms like LeetCode, HackerRank, and GeeksForGeeks
• Appointed as Class Committee Head for 3 consecutive years, demonstrating leadership and coordination skills.