Rakesh Kumar - Data Scientist
Professional Summary:
Lead Data Scientist with extensive experience working with large-scale enterprises across multiple domains.
Expertise in driving AI innovations from conception to full-scale implementation in areas such as NLP, time
series analysis, computer vision, and AI-driven marketing and recommendation systems. Broad understanding of
software engineering projects involving microservices and data science.
Education
Master of Data Science: The University of British Columbia, Vancouver
Bachelor of Technology: Information Technology, Maharshi Dayanand University - Rohtak
Certifications
Specialized Models: Time Series and Survival Analysis (Coursera)
DeepLearning.AI TensorFlow Developer (Coursera)
Deep Learning Specialization (Coursera, Andrew Ng)
Technical Skills:
Deep Learning: Transformers
• LLMs (BERT, BART, GPT-2, GPT-3, T5, XLM-RoBERTa, Longformer, and ELECTRA)
• Prompt engineering (GPT-3.5 DaVinci & Turbo; zero- and few-shot learning, chain-of-thought, expert prompting; LangChain)
• Chatbot
• Siamese Networks
• Dialog Management
• Conversational AI (Rasa, Dialogflow, and custom chatbots)
• CNN-based models (MobileNet, ResNet)
• OCR (Tesseract, EasyOCR, and PaddleOCR)
• Object Detection (R-CNN, Fast R-CNN, Faster R-CNN, RetinaNet, YOLO, and SSD)
• Image segmentation
• Image classification
• Object localization & detection
Languages/Frameworks: Python, Java, C, C++, R, JavaScript
• TensorFlow, PyTorch, SQL, Flask
• Spring Boot, Microservices
Time Series/AI-Marketing: ARIMA, SARIMA, FBProphet, Deep Learning Forecasting
Big Data Technologies: Pig, Hive, PySpark, Hadoop
• MongoDB, BigQuery, Snowflake, Apache Airflow
Additional: Personalized & Non-Personalized Recommendation Systems
• SQL, Vector DB
• Assessment, Reporting, Presentation
• Databricks, Snowflake
• Docker, Jenkins, Kubernetes, Ansible, Chef, Terraform
• Virtualization, Azure ML, AWS, GCP
• ETL, Tableau, Kafka, Redis
• Elasticsearch
• DeepSpeed, LoRA, Distillation, Quantization, ONNX, TensorRT, XLA Compiler
Professional Experience:
Ai Intelli, Canada 2021-08 - Present
Lead Data Scientist
Responsibilities
Leading the development of an intelligent document processing tool using Python and libraries like Pandas, NumPy,
spaCy, NLTK, and others to extract information from image, PDF, and HTML files.
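A minimal sketch of such an extraction step (the file name and entity labels are illustrative; assumes pdfplumber and spaCy's en_core_web_sm model are installed):

    import pdfplumber
    import spacy

    # Small English pipeline; assumed installed via `python -m spacy download en_core_web_sm`
    nlp = spacy.load("en_core_web_sm")

    def extract_entities_from_pdf(path: str) -> list[tuple[str, str]]:
        """Pull raw text from each PDF page, then run spaCy NER over it."""
        with pdfplumber.open(path) as pdf:
            text = "\n".join(page.extract_text() or "" for page in pdf.pages)
        doc = nlp(text)
        return [(ent.text, ent.label_) for ent in doc.ents]

    # Hypothetical usage: print(extract_entities_from_pdf("invoice.pdf"))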
Actively contributing to the implementation of YOLO for object detection, with a focus on optimizing its performance
for low latency using ONNX-TensorRT.
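A hedged sketch of the export step such an optimization typically starts from (input shape and tensor names are illustrative; assumes the detector is already loaded as a PyTorch nn.Module):

    import torch

    def export_to_onnx(model: torch.nn.Module, onnx_path: str = "yolo.onnx") -> None:
        """Trace the detector and write an ONNX graph for TensorRT to consume."""
        model.eval()
        dummy = torch.randn(1, 3, 640, 640)  # illustrative NCHW input shape
        torch.onnx.export(
            model,
            dummy,
            onnx_path,
            opset_version=13,
            input_names=["images"],
            output_names=["predictions"],
            dynamic_axes={"images": {0: "batch"}},  # allow variable batch size
        )

    # The resulting graph can then be built into a TensorRT engine, e.g.:
    #   trtexec --onnx=yolo.onnx --saveEngine=yolo.engine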
Leading the design and development of a Named Entity Recognition (NER) system using large language models
(GPT-NER, BERT, and RoBERTa); a character-based spell checker built on a sequence-to-sequence transformer
architecture; and fine-tuned BERT, RoBERTa, and word-embedding techniques (Word2Vec, GloVe, FastText) for
semantic similarity.
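A minimal sketch of the semantic-similarity piece (the base checkpoint is illustrative; in practice a fine-tuned model would be loaded; assumes the Hugging Face transformers library):

    import torch
    from transformers import AutoTokenizer, AutoModel

    MODEL = "bert-base-uncased"  # illustrative; swap in a fine-tuned checkpoint
    tokenizer = AutoTokenizer.from_pretrained(MODEL)
    model = AutoModel.from_pretrained(MODEL)

    def embed(sentence: str) -> torch.Tensor:
        """Mean-pool the last hidden states into one sentence vector."""
        inputs = tokenizer(sentence, return_tensors="pt", truncation=True)
        with torch.no_grad():
            hidden = model(**inputs).last_hidden_state   # (1, seq_len, dim)
        mask = inputs["attention_mask"].unsqueeze(-1)    # zero out padding
        return (hidden * mask).sum(1) / mask.sum(1)

    sim = torch.nn.functional.cosine_similarity(embed("refund request"), embed("money-back inquiry"))
    print(float(sim))  # closer to 1.0 means more semantically similar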
Collaborating with DevOps to establish a Continuous Integration and Continuous Deployment (CI/CD) pipeline for
machine learning model training and deployment.
Utilizing various AWS cloud services, including SageMaker, EC2 instances, and S3 buckets, for model tuning, training,
and storage.
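On the storage side, a small boto3 sketch (bucket and key names are hypothetical):

    import boto3

    s3 = boto3.client("s3")

    # Upload a trained model artifact to S3 (hypothetical bucket/key)
    s3.upload_file("model.tar.gz", "my-ml-artifacts", "models/ner/model.tar.gz")

    # Pull it back down for local evaluation
    s3.download_file("my-ml-artifacts", "models/ner/model.tar.gz", "model.tar.gz")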
Conducting data visualization and analysis using Tableau and integrating it with GCP's BigQuery for enhanced data
processing.
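A minimal google-cloud-bigquery sketch of the kind of aggregation feeding such dashboards (project, dataset, and table names are hypothetical):

    from google.cloud import bigquery

    client = bigquery.Client()  # uses default GCP credentials

    query = """
        SELECT DATE(created_at) AS day, COUNT(*) AS documents_processed
        FROM `my_project.my_dataset.documents`
        GROUP BY day
        ORDER BY day
    """
    for row in client.query(query).result():
        print(row.day, row.documents_processed)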
Led the development and optimization of complex deep learning models using Keras to achieve state-of-the-art
performance on relevant business problems.
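A hedged Keras sketch of the pattern (the architecture and hyperparameters are illustrative, not the production models):

    import tensorflow as tf

    # Illustrative binary classifier; input shape is inferred from the data at fit time.
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dropout(0.2),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    # model.fit(X_train, y_train, validation_split=0.1, epochs=10)  # hypothetical data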
Working on improving PaddleOCR and Tesseract 4 OCR accuracy by creating training datasets and fine-tuning
the models.
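An inference-side sketch running both engines on the same image (the file name is hypothetical; assumes pytesseract and the paddleocr 2.x API; fine-tuning itself happens offline with each engine's training tooling):

    import pytesseract
    from PIL import Image
    from paddleocr import PaddleOCR

    image_path = "scanned_form.png"  # hypothetical input

    # Tesseract: plain text extraction
    print(pytesseract.image_to_string(Image.open(image_path)))

    # PaddleOCR: detection + recognition with confidence scores
    ocr = PaddleOCR(lang="en")
    for box, (text, confidence) in ocr.ocr(image_path)[0]:
        print(text, confidence)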
Collaborated with solution architects and software developers to integrate microservices into existing systems,
leading the team in extensive testing and debugging.