Gaurav Tailor is currently pursuing an MS in Data Science at the Indian Institute of Information Technology with a background in Mathematics. He has experience in data science and machine learning, having developed AI solutions, implemented a RAG system, and conducted data analysis during internships. His technical skills include proficiency in Python, machine learning frameworks, and ETL pipeline development, complemented by achievements in competitive exams and community service.
MSD 23010
EDUCATION
Indian Institute of Information Technology, Lucknow, Uttar Pradesh - CGPA: 8.2
MS - Data Science, Aug 2023 - June 2025
University of Kota, Kota, Rajasthan - CGPA: 7.0
Bachelor of Science - Mathematics, July 2018 - June 2021

EXPERIENCE AND KEY COURSES
Innometic Research Lab - Data Science and Gen AI LOR
• Applied Data Science Techniques and Developed Generative AI Solutions: Analyzed 3+ complex datasets using advanced preprocessing and visualization techniques, and developed and optimized 2+ AI models.
• Implemented RAG System: Developed a FastAPI and Docker-based Retrieval-Augmented Generation (RAG) system using Neo4j for graph management. Implemented advanced semantic search with Gemini and LangChain for optimal profile retrieval based on job descriptions.
YBI Foundation - Data Scientist and Machine Learning (Intern)
• Conducted Data Analysis: Performed in-depth analysis of key product KPIs using SQL, leveraging data mining and statistical techniques to inform product optimization.
• Machine Learning Implementation: Developed and applied supervised and unsupervised machine learning models using Python (NumPy, Scikit-learn) to enhance algorithmic efficiency and relevance.
DSMP 1.0 Machine Learning - CampusX Certificate
• Skills & Tools: Proficient in Python, Plotly, Flask, Tableau; algorithms including Linear Regression, Logistic Regression, KNN, Decision Trees, Gradient Boosting, PCA, SVM, and Clustering.
• Learning Outcomes: Mastered data analysis (100K+ records), visualization (50+ plots), model building (95%+ accuracy), feature engineering, and dashboard deployment (10+ dashboards).
PROJECT
Predictive Maintenance ML Model Web Application - View Repository, Oct 2024
∗ Description: Designed and deployed binary and multiclass predictive maintenance classification models to predict equipment failures and failure types using the AI4I 2020 dataset, achieving high precision, recall, F1 score, and AUC-ROC.
∗ Tech Stack: Utilized Python, Machine Learning, Docker, and Hugging Face Spaces, and implemented an end-to-end MLOps pipeline with automated CI/CD for seamless deployment.
Fine Tune, Classification using BERT - View Repository, Nov 2024
∗ Classified 82,500 emails as spam or legitimate using BERT models (DistilBERT, TinyBERT), leveraging tokenization, text cleaning, and preprocessing techniques.
∗ Fine-tuned the models and evaluated their performance: TinyBERT achieved fewer false negatives, while pruned DistilBERT delivered the best overall results. Enhanced model performance through pruning, quantization, and knowledge distillation, optimizing for speed and accuracy.
Developed Databricks, PySpark ETL Data Pipeline - View Repository, Aug 2024
∗ Description: Designed and implemented scalable ETL pipelines using PySpark and Databricks, ingesting data from various sources, transforming it with Spark SQL and the DataFrame API, and storing it in optimized formats like Parquet and Delta Tables.
∗ Techniques: Integrated Data Lake solutions for efficient storage and retrieval while ensuring high performance and scalability for large datasets.
TECHNICAL SKILLS
Languages & Tools: Python, PyTorch, Scikit-Learn, TensorFlow, Keras, SQL, LaTeX, Power BI, Docker, AWS
Libraries & Frameworks: Streamlit, LangChain, Ollama, Qdrant, Groq API, Hugging Face Transformers, FastAPI
Coursework: Machine Learning, Deep Learning, Artificial Intelligence, Big Data, Data Science, Statistical Analysis
Soft Skills: Decision Making, Time Management, Team Collaboration, Technical Communication, Problem-Solving

ACHIEVEMENTS & VOLUNTEERING
IIT JAM 2023: Achieved All India Rank (AIR) 876 in Mathematics - 2023.
NSS Volunteer and Leader: Organized community service projects, led teams, and facilitated awareness campaigns.
Teacher Assistant: Assisted with teaching Probability and Statistics to undergraduates, including support in lectures and grading assignments.