Parag Resume
Parag Resume
[email protected]
+91 9821278883
Bengaluru, India 560035
Education
06/2015
Summary
B.E. IN ELECTRICAL ENGG: Guided multiple organizations through the journey of setting up their analytics and
PEC UNIVERSITY OF data science. Currently, focused extensively on the Gen AI space, fine-tuning LLMs
TECHNOLOGY for domain-specific tasks, building RAG systems for enhanced contextual responses
Chandigarh and leveraging advanced prompt engineering techniques to ensure structured
structured and actionable outputs. Having been part of several 0 to 1 initiatives, I
am proficient in creating robust frameworks tailored to each organization's specific
Skills needs.
• Hugging Face Transformers
• Finetuning LLMs
• OpenAI
Experience
• Perplexity Bridgetown Research - Senior Scientist (Consultant)
• Exa Search Bengaluru, India
• SQL 09/2024 - Current
• Python
• Model Finetuning: Used tagged data and synthetic generation to fine-tune a
• Beam
checkpoint of BART-large (pretrained on the MNLI dataset) in SageMaker to
• AWS Sagemaker
predict user concerns and question types, with recall of approximately 98%.
• Amazon Timestream
Deployed it as an endpoint on Beam.cloud's serverless GPU instance, achieving
• Deep Learning
inference latency below 300ms
• NLP
• Metrics For Interview: Created a script powered by GPT-4 to analyze interview
• AWS Redshift
data by tagging question type, evaluating answers quality and identifying
• PG Admin
concerns raised by respondents.
• Redash
• Quick Search: Developed a RAG system that would quickly search interview
• Metabase
transcripts to answer user queries. Embeddings of interview Q&A pairs were
• Dune Analytics
created using OpenAI's text-embedding-3-small. These were stored in Pinecone,
• Firebase
and then compared with the user query embedding to fetch the top 30 similar
• Docker
results, and then passed to GPT to provide a relevant summarized answer.
• Redis
• Automated Question Generation: Designed and implemented a three-step
• Streamlit
pipeline that takes research questions as input, identifies relevant domains, and
generates tailored interviewee personas and main interview questions by
Domains leveraging GPT's structured output feature. Used Firebase to store and sync data
in real time.
• Decision research • Question Bank Optimization: Developed a question adaptation system that
• Opinion Trading personalizes a fixed question bank based on research objectives and user
• Online Gaming personas to generate highly relevant interview questions.
• EdTech
• US healthcare
TRADEX - DATA SCIENCE LEAD
08/2022 - Current
Linkedin Profile • Worked closely with founders and aligned data strategies with core business
• https://www.linkedin.com objectives. Empowered them to leverage data as a strategic asset right from
/in/parag-jain-07583a167/ beginning.
• Lead growth experiments by clustering users through machine learning based on
their trading pattern, pocket size and in app behavior to improve retention by
25%.
• Sole creator and owner of complete data science layer of the organization.
Automated monitoring of key KPIs on daily, weekly and monthly level.
• Spearheaded automation across various trading markets. Identified bottlenecks
and implemented end-to-end logic which automatically created and settled
markets across finance, sports and media by pulling data through various apis.
• Improved the health of entire system by reconciling all trading and payment
activities and flagging mismatches. This alone helped in improving user
experience and brought withdrawal time from 1-2 days to couple of hours.
• End to end creation of different trading strategies to provide the market with
liquidity and depth while profiting from the difference in the bid-ask spread.
Deployed arbitrage strategy to take advantage of various bookmakers and
exchanges having different odds.