0% found this document useful (0 votes)
237 views3 pages

Rakesh Kumar - Data Scientist

Rakesh Kumar has extensive experience as a lead data scientist with expertise in areas such as NLP, computer vision, and recommendation systems. He holds a Master's degree in Data Science and a Bachelor's degree in Information Technology. He has worked on projects involving chatbots, OCR, object detection, and time series forecasting. His professional experience includes developing machine learning models for various companies and leading teams as a data scientist.

Uploaded by

aditya25jain2003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
237 views3 pages

Rakesh Kumar - Data Scientist

Rakesh Kumar has extensive experience as a lead data scientist with expertise in areas such as NLP, computer vision, and recommendation systems. He holds a Master's degree in Data Science and a Bachelor's degree in Information Technology. He has worked on projects involving chatbots, OCR, object detection, and time series forecasting. His professional experience includes developing machine learning models for various companies and leading teams as a data scientist.

Uploaded by

aditya25jain2003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Rakesh Kumar

Professional Summary:
 As a Lead Data Scientist, he has extensive experience working with large-scale enterprises across various domains.
Expertise in driving and developing AI-related innovations and ideas, from conception to full-scale implementation,
in areas such as NLP, time series analysis, computer vision, and AI marketing and recommendation systems. Broad
understanding of software engineering projects related to micro-services and data science.

Education
 Master of Data Science: The University of British Columbia, Vancouver
 Bachelor Of Technology: Information Technology, Maharshi Dayanand University - Rohtak

Certifications
 Specialized Models: Time Series and Survival Analysis (Coursera) DeepLearning.AI TensorFlow Developer (Coursera)
 Deep Learning Specialization (Coursera Andrew NG)

Technical Skills:
 Deep Learning: Transformer
• LLM(BERT, BART, GPT-2, GPT-3, T5, XML ROBERTA, Longformer, and Electra)
• Prompt engineering(GPT-3.5 DaVinci & Turbo, Zero & Few shot learning + COT + Experts, LangChain)
• Chatbot
• Siamese Networks
• Dialog Management
• Conversational AI (RASA, Dialogflow and Custom Chatbots)
• CNN-based models (MobileNet, ResNet)
• OCR (Tesseract, EasyOCR, and PaddleOCR)
• Object Detection (R-CNN, Fast R-CNN, Faster R-CNN, RetinaNet, YOLO, and SSD)
• Image Segmentation
• Image classification
• Object localization & detection
Languages/Framework: Python, Java, C, C++, R, Javascript
• TensorFlow, Pytorch, SQL, Flask
• Spring Boot, Microservices
Time Series/AI-Marketing: ARIMA, SARIMA, FBProphet, Deep Learning Forecasting
Big Data Technologies: PIG, Hive, PySpark, Hadoop
• MongoDB, BigQuery, Snowflake, Apache Airflow
Additional: Personalized & Non-Personalized Recommendations Systems
• SQL, Vector DB
• Assessment, Reporting, Presentation
• Databricks, Snowflake
• Docker, Jenkins, Kubernetes, Ansible, Chef, Terraform
• Virtualization, Azure ML, AWS, GCP
• ETL, Tableau, Kafka, Redis
• Elasticsearch
• DeepSpeed, LORA, Distillation, Quantization, ONNX, TensorRT, XLA Compiler

Professional Experience:
Ai Intelli, Canada 2021-08 - Present
Lead Data Scientist
Responsibilities
 Leading the development of an intelligent document processing tool using Python and libraries like Pandas, NumPy,
spaCy, NLTK, and others to extract information from image, PDF, and HTML files.
 Actively contributing to the implementation of YOLO for object detection, with a focus on optimizing its performance
for low latency using ONNX-TensorRT
 Leading the design and development of a Named Entity Recognition (NER) system using the Large Language Model
GPT- NER, BERT, and RoBERTa, a character-based spell checker utilizing a seq-2-seq model using transformer
architecture, and fine-tuned BERT, RoBERTa, word embedding techniques (Word2Vec, GloVe, FastText) for semantic
similarity.
 Collaborating with DevOps to establish a Continuous Integration and Continuous Deployment (CICD) pipeline for
machine learning model training and deployment.
 Utilizing various AWS cloud services, including SageMaker, EC2 instances, and S3 bucket, for model tuning, training,
and storage.
 Conducting data visualization and analysis using Tableau and integrating it with GCP's Big Query for enhanced data
processing.
 Led the development and optimization of complex deep learning models using Keras to achieve state-of-the-art
performance on relevant business problems.
 Working the improvement of Paddle OCR and Tesseract 4 OCR accuracy by creating a training dataset and fine-tuning
the model.
 Collaborated with solution architects and software developers to integrate microservices into existing systems,
leading the team in extensive testing and debugging.

Samsung SDS, Organization : Altran Technologies, Gurgaon, IN 2019-05 – 2021-08


Data Scientist(ML)
Responsibilities:
 Leading a team of data engineers and scientists responsible for designing and developing advanced chatbot and NLP
models using Python and its libraries, establishing ETL pipelines, and implementing data warehousing solutions.
 Achieved a significant accomplishment by delivering two advanced NLP chatbot projects within a short time frame,
including one for internal use and one for customer-facing purposes.
 Designed a large language processing framework utilizing BERT, RoBERTa, XML-RoBERTa, and Electra models.
Developed an advanced Sentiment Analyzer for chatbots using RoBERTa.
 Utilized the RoBERTa, BERT, BART, ELECTRA model to its full potential by fine-tuning it for extractive summarization
and using it to convert large text responses from APIs into concise summaries.
 Developed intents and entities classifier using ELECTRA architecture and completed POCs on GPT-2 and BERT for the
same task.
 Applied innovative solutions and problem-solving skills to address challenges in data preprocessing, feature
engineering, and model tuning within the Keras framework.
 Demonstrated expertise in natural language processing by designing a highly effective spell checker for English using
a cutting-edge sequence-to-sequence long short-term memory model.
 Developed a state-of-the-art model for identifying similar questions using cutting-edge technologies: BERT, RoBERTa,
and the Siamese Manhattan LSTM model.

Organization : Tavant technologies, Noida, IN 2016-03 - 2019-05


Project : MLBAM(Beat the Streak),
Data Scientist(ML)
Responsibilities:
 Managed and led a team of 10 developers and engineers for a mission-critical MLP project, responsible for delivery
and development tasks, while utilizing Python, Java, Oracle database to develop high-performance and scalable
backend services, providing reliable and efficient communication between the front-end and back-end systems
 Developed advanced player selection algorithms using predictive modeling and machine learning in Python to
enhance participant performance and increase streak lengths in the "Beat the Streak" contest for Major League
Baseball (MLB).
 Developed and maintained comprehensive documentation for Keras-based models, ensuring that best practices,
standards, and insights are shared with the broader data science team.
 Employed data analytics and statistical analysis on MLB player performance data to design a personalized
recommendation system for the "Beat the Streak" contest, leveraging Python, pandas, and scikit-learn to deliver
tailored player suggestions for an optimized gaming experience.
 Utilized Java Spring Boot JPA to develop high-performance and scalable backend services, providing reliable and
efficient communication between the front-end and back-end systems.
 Employed Spring Boot JPA's transaction management capabilities to ensure data consistency and reliability, reducing
the risk of errors and data corruption.
 Worked with Spring Boot JPA's caching mechanisms to optimize database access and reduce response times,
improving overall system performance and user experience.
 Worked on various module involving Redis and SQL Server, delivering high-quality solutions that met the client's
needs and expectations.

HCL technologies, Noida, IN 2015-02 - 2016-03


Project : Western Union, Organization
Lead Engineer
Responsibilities:
 Managed project and personal schedule by consistently meeting deadlines.
 Implemented and adopted cloud technologies and best practices for automation, configuration, monitoring and
platform scalability.
 Improved DevOps across deployment and testing strategies and application monitoring.
 Served as subject matter expert on Backend and Frontend technologies for both clients and internal team members.
 Provided technical support for Oracle database-related issues and resolved them in a timely manner
 Worked on improving database scalability and availability by implementing load balancing and redundancy solutions.

Kazzam Software Solution, Noida, IN 2012-07 - 2015-02


Software Developer
Responsibilities:
 Designed and implemented Code in Spring MVC using coding best practices. Developed and maintained REST APIs
for integrating applications with the database and other services.
 Participated in Agile software development processes, including Scrum and Kanban methodologies.
 Collaborated with cross-functional teams to ensure quality delivery of software releases.

You might also like